
Diff of /markup/html/whatpm/Whatpm/ChangeLog


revision 1.292 by wakaba, Sat Sep 13 04:19:55 2008 UTC revision 1.330 by wakaba, Sun Sep 21 05:08:15 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
4            unsupported charset="" parameter value (HTML5 revision 2131).
5    
6    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: Remaining places where U+000B was allowed as a
9            space character are fixed (cf. HTML5 revision 1738).
10    
11            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
12            space characters (HTML5 revision 1738).
13    
14    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: The "anything else" case for the "after after body"
17            insertion mode was not updated to switch to the "in body"
18            insertion mode.  U+000B is no longer a space character for the
19            purpose of tree construction phase (HTML5 revision 1738).
20    
21    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: U+000B is no longer a space character (HTML5
24            revision 1738).
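
For illustration, the space character set after this change can be sketched as below. This is a minimal Python sketch, not the Whatpm (Perl) code; the names SPACE_CHARACTERS, is_space_character and strip_leading_space are hypothetical, and the set shown is the HTML5 definition of space characters with U+000B left out.

```python
# Hypothetical helper names; the set is the HTML5 "space characters"
# (U+0009, U+000A, U+000C, U+000D, U+0020). U+000B is deliberately absent.
SPACE_CHARACTERS = frozenset("\t\n\f\r ")


def is_space_character(ch: str) -> bool:
    """Return True if ch is a single HTML5 space character."""
    return ch in SPACE_CHARACTERS


def strip_leading_space(text: str) -> str:
    """Drop leading space characters; U+000B stops the scan like any other character."""
    i = 0
    while i < len(text) and is_space_character(text[i]):
        i += 1
    return text[i:]
```

With this definition a leading U+000B is kept as ordinary character data rather than skipped as white space.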
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * ContentType.pm: 0x0B is no longer a space character (HTML5
29            revision 1738).
30    
31            * HTML.pm.src: U+000B is no longer a space character for the
32            algorithm for extracting an encoding from a Content-Type (HTML5
33            revision 1738).
34    
35    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
36    
37            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
38    
39    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
40    
41            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
42    
43            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
44            matching.
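
One way sorting 'Prefix' values by length can ease matching is sketched below in Python. The entry does not say whether the sort is ascending or descending; this sketch assumes longest first, so that the first hit is the most specific prefix. The function names and the sample prefix values are illustrative only and are not taken from mklangreg.pl.

```python
def sort_prefixes(prefixes):
    """Sort prefixes longest first (assumed order); ties are broken alphabetically."""
    return sorted(prefixes, key=lambda p: (-len(p), p))


def first_matching_prefix(tag, prefixes_sorted):
    """Return the first prefix that equals the tag or is followed by '-' in it."""
    tag_l = tag.lower()
    for prefix in prefixes_sorted:
        prefix_l = prefix.lower()
        if tag_l == prefix_l or tag_l.startswith(prefix_l + "-"):
            return prefix
    return None


# Illustrative data only.
prefixes = sort_prefixes(["sl", "sl-rozaj", "zh"])
best = first_matching_prefix("sl-rozaj-biske", prefixes)
```

Because the list is ordered by length, a linear scan can stop at the first match without comparing candidates against each other.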
45    
46    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
47    
48            * LangTag.pm: Warn for private use language subtags.  Error level
49            typos fixed.  Support for Suppress-Script field.
50    
51            * mklangreg.pl: Support for dumping of nested structure.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
56            recommended case as per RFC 4646.
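
The recommended-case check can be illustrated with the simplified Python sketch below. It is not the check_rfc4646_langtag implementation. It assumes the usual convention (language subtags in lowercase, four-letter script subtags in title case, two-letter region subtags in uppercase, everything after a singleton in lowercase); the function names are hypothetical.

```python
def recommended_case(tag: str) -> str:
    """Return tag rewritten in the conventional case (simplified rule set)."""
    result = []
    after_singleton = False
    for index, subtag in enumerate(tag.split("-")):
        is_alpha = subtag.isascii() and subtag.isalpha()
        if index > 0 and not after_singleton and is_alpha and len(subtag) == 4:
            result.append(subtag.capitalize())  # script, e.g. "Latn"
        elif index > 0 and not after_singleton and is_alpha and len(subtag) == 2:
            result.append(subtag.upper())       # region, e.g. "RS"
        else:
            result.append(subtag.lower())       # language, variants, extensions
        if len(subtag) == 1:
            after_singleton = True              # e.g. "x" starts private use
    return "-".join(result)


def is_recommended_case(tag: str) -> bool:
    """True if the tag is already written in the conventional case."""
    return tag == recommended_case(tag)
```

A checker built this way would only need to warn when is_recommended_case returns False, since case is not significant for matching.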
57    
58    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
59    
60            * LangTag.pm (check_rfc4646_langtag): New method.
61    
62    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
63    
64            * mklangreg.pl: New script.
65    
66            * Makefile: Updated for creation of the module for language subtag
67            registry.
68            
69    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * Makefile: WebIDL.html added.
72    
73            * WebIDL.pod: New documentation.
74    
75    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
76    
77            * WebIDL.pm: Checker's error types are redefined.
78    
79    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
80    
81            * WebIDL.pm: Parser's error types are redefined.  Some forward
82            compatible parsing bugs are fixed.  Some unreachable code is
83            commented out.
84    
85    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
86    
87            * WebIDL.pm: Support for the remaining extended attributes is
88            added.  It does not satisfy the definition that a forward
89            interface declaration has an extended attribute.  It seems that,
90            unless explicitly allowed, multiple extended attributes with the
91            same name are not allowed, though it is not explicitly mentioned in
92            the spec.
93    
94    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
95    
96            * WebIDL.pm: Unescape extended attribute names and extended
97            attribute identifiers.  Preserve whether an extended attribute has
98            an argument list or not.  Support for extended attributes:
99            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
100            NameGetter, NameSetter, and Null.
101            (has_argument_list): New attribute.
102            (idl_text): Stringify argument lists, if any, even if they are
103            empty.
104    
105    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
108            |{s_kwd}| in DATA_STATE as default.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
113            by |{s_kwd}| in DATA_STATE.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Shorten keys.
118    
119    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: Remove checking for control character, surrogate
122            pair, or noncharacter code points and non-Unicode code
123            points (they should be handled by Whatpm::Charset::UnicodeChecker).
124            (parse_char_stream): Support for the |$get_wrapper| argument and
125            character stream error handlers.
126    
127    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
128    
129            * ContentChecker.pm: Don't call |load_ns_module|
130            for null-namespace elements/attributes.
131    
132            * HTML.pm.src: Factor out $disallowed_control_chars
133            as a hash.
134    
135    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
138            and |{next_char}| initializations are moved to initialization
139            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
140            with |parse_char_stream|.
141    
142    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src (parse_char_stream): Make |set_next_char|
145            invoke |manakai_read_until|, not only |read|, where
146            possible, to decrease the number of |read| method calls.
147    
148            * mkhtmlparser.pl: Related changes to the aforementioned
149            modification.
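
The idea of reading a run of characters in one call, rather than one character per |read|, can be sketched as follows. This is a hypothetical Python illustration and not the |manakai_read_until| interface; the class name, method names and the chunk_size parameter are assumptions made for the example.

```python
import io


class BufferedCharReader:
    """Reads from a text stream in chunks and hands out runs of characters."""

    def __init__(self, stream, chunk_size=1024):
        self._stream = stream
        self._chunk_size = chunk_size
        self._buf = ""
        self.read_calls = 0  # counts calls to the underlying read()

    def _fill(self):
        """Refill the buffer if it is empty; return False at end of input."""
        if not self._buf:
            self._buf = self._stream.read(self._chunk_size)
            self.read_calls += 1
        return bool(self._buf)

    def read_char(self):
        """Return the next character, or '' at end of input."""
        if not self._fill():
            return ""
        ch, self._buf = self._buf[0], self._buf[1:]
        return ch

    def read_until(self, stop_chars):
        """Return buffered characters up to, not including, the first stop character."""
        if not self._fill():
            return ""
        n = 0
        while n < len(self._buf) and self._buf[n] not in stop_chars:
            n += 1
        run, self._buf = self._buf[:n], self._buf[n:]
        return run


reader = BufferedCharReader(io.StringIO("hello <b>"))
text_run = reader.read_until("<&")  # plain text before the markup character
```

A tokenizer in a data-like state can append the whole run to the current character token and only fall back to per-character handling at the stop characters.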
150    
151    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
154            now reports character errors.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src: Character tokens consisting of leading white space
159            followed by non-white-space characters were not correctly handled
160            in the "before head" insertion mode.
161            (set_inner_html): Reimplemented using CharString decodehandle
162            class.  Support for $get_wrapper argument.  Support
163            for |{read_until}| feature.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Make a "bare ero" error for unknown
168            entities point to the "&" character.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: It turns out that U+FFFD doesn't have to
173            be added to the list of excluded characters.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
178            and |column| higher priority than those set by the
179            tokenizer's input handler.
180            ($self->{read_until}): Exclude U+FFFD (but this might
181            not be necessary, since now we do line/column fixup in
182            the character decode handle).
183    
184    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: Use |{read_until}| where possible.
187    
188    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
191            and |manakai_getc_until| to |manakai_read_until| to
192            reduce the number of string copies.
193    
194    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src (parse_char_string): Use newly created
197            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
198            standard feature to |open| a string as a filehandle,
199            since Perl's string filehandle does not seem to support the
200            |ungetc| method correctly.
201            (parse_char_stream): Define |{getc_until}| method.
202            (DATA_STATE): Experimental support for |getc_until| feature.
203    
204    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: Check points added to newly added branches.
207    
208    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Remove |{char}|, which is no longer used.
211            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
212            replacing them with |{prev_state}|.
213    
214            * mkhtmlparser.pl: Remove |{char}| feature.
215            Remove |!!!back-next-input-character;| macro.
216    
217    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
218    
219            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
220            entity related tokenizer states in favor of new states
221            implementing the consume character reference algorithm.
222    
223    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: "Consume a character reference" algorithm is
226            now implemented as a tokenizer's state, rather than
227            a method, with minimum changes (more changes will
228            be made, in due course).  "Bogus comment state"'s inner
229            loop gets removed.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
234            into their own tokenizer states.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
239            is split into three states.
240    
241    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
244            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
245            the tokenizer no longer has to push back next input
246            characters in those states.
247    
248    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken

