/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.291 by wakaba, Thu Sep 11 09:12:27 2008 UTC revision 1.349 by wakaba, Mon Oct 13 06:18:30 2008 UTC
# Line 1  Line 1 
1    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
4            construction stage synced with the spec (HTML5 revisions 2139 and
5            2302).
6    
7    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pm: New error level "html5_fact" added, which
10            should be tentatively used until all of requirements are properly
11            specced as RFC 2119 "MUST" in HTML5.
12    
13    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
14    
15            * ContentChecker.pod: Note on internal flags is added.
16    
17    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src: An AAA bug fixed.
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: If another node is inserted by the parser, don't
24            reuse existing Text node to append a character (HTML5 revision
25            2124).
26    
27    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
30            revisions 1731 and 2128).
31    
32    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
35            Support for end tags of camelCase SVG elements were broken.  A
36            wrong error type text fixed.
37    
38    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Support for new definition of |param| and |source|
45            start tag parsing (HTML5 revision 1731).
46    
47    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
48    
49            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
54            revisions 1731 and 1831).
55    
56    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
59            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
60            revision 1731).
61    
62    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
65            revision 1731).  End tags of |option| and |optgroup| elements are
66            now optional (HTML5 revision 1731).
67    
68    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: New "special" elements added to the list (HTML5
71            revision 1778).  "strile" -> "strike".
72    
73    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
74    
75            * ContentType.pm (get_sniffed_type): Support for the "better"
76            content sniffing (HTML5 revision 1927).  In a case the official
77            type was not returned when the method is invoked in the list
78            context.
79    
80    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Character references for non-space C0 characters,
83            including U+000B VT, DEL character, noncharacter code points, are
84            now converted to the U+FFFD character (cf. HTML5 revision 2138).
85    
86    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
87    
88            * ContentChecker.pm: |form=""| check support added.
89    
90    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pm: |contextmenu| validness is now checked using
93            |id| and |id_type| properties, and |menu| property is removed.
94    
95    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
96    
97            * ContentChecker.pm: Prepare for |form| |name| attribute's
98            duplication checking.
99    
100    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
103            unsupported charset="" parameter value (HTML5 revision 2131).
104    
105    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Reminding places where U+000B is allowed as a space
108            character is fixed (cf. HTML5 revision 1738).
109    
110            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
111            space characters (HTML5 revision 1738).
112    
113    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: The "anything else" case for the "after after body"
116            insertion mode was not updated to swtich to the "in body"
117            insertion mode.  U+000B is no longer a space character for the
118            purpose of tree construction phase (HTML5 revision 1738).
119    
120    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: U+000B is no longer a space character (HTML5
123            revision 1738).
124    
125    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
126    
127            * ContentType.pm: 0x0B is no longer a space character (HTML5
128            revision 1738).
129    
130            * HTML.pm.src: U+000B is no longer a space character for the
131            algorithm for extracting an encoding from a Content-Type (HTML5
132            revision 1738).
133    
134    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
135    
136            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
137    
138    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
139    
140            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
141    
142            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
143            matching.
144    
145    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
146    
147            * LangTag.pm: Warn for private use language subtags.  Error level
148            typos fixed.  Support for Suppress-Script field.
149    
150            * mklangreg.pl: Support for dumping of nested structure.
151    
152    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
153    
154            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
155            recommended case as per RFC 4646.
156    
157    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
158    
159            * LangTag.pm (check_rfc4646_langtag): New method.
160    
161    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
162    
163            * mklangreg.pl: New script.
164    
165            * Makefile: Updated for creation of the module for language subtag
166            registry.
167            
168    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
169    
170            * Makefile: WebIDL.html added.
171    
172            * WebIDL.pod: New documentation.
173    
174    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
175    
176            * WebIDL.pm: Checker's error types are redefined.
177    
178    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
179    
180            * WebIDL.pm: Parser's error types are redefined.  Some forward
181            compatible parsing bugs are fixed.  Some unreachable codes are
182            commented out.
183    
184    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
185    
186            * WebIDL.pm: Support for the reminding extended attributes are
187            added.  It does not satisfy the definition that a forward
188            interface declaration has an extended attribute.  It seems that
189            unless explicitly allowed multiple extended attributes with the
190            same name is not allowed, though it is not explicitly mentioned in
191            the spec.
192    
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * WebIDL.pm: Unescapes extended attribute names and extended
196            attribute identifiers.  Preserve whether an extended attribute has
197            an argument list of not.  Support for extended attributes:
198            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
199            NameGetter, NameSetter, and Null.
200            (has_argument_list): New attribute.
201            (idl_text): Stringifies argument lists, if any, even if it is
202            empty.
203    
204    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
207            |{s_kwd}| in DATA_STATE as default.
208    
209    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
212            by |{s_kwd}| in DATA_STATE.
213    
214    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Shorten keys.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Remove checking for control character, surrogate
221            pair, or noncharacter code points and non-Unicode code
222            points (they should be handled by Whatpm::Charset::UnicodeChecker).
223            (parse_char_stream): Support for the |$get_wrapper| argument and
224            character stream error handlers.
225    
226    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm: Don't call |loda_ns_module|
229            for null-namespace elements/attributes.
230    
231            * HTML.pm.src: Fact out $disallowed_control_chars
232            as a hash.
233    
234    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
237            and |{next_char}| initializations are moved to initialization
238            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
239            with |parse_char_stream|.
240    
241    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src (parse_char_stream): Make |set_next_char|
244            invoke |manakai_read_until|, not only |read|, where
245            possible, to decrease the number of |read| method calls.
246    
247            * mkhtmlparser.pl: Related changes to the aforementioned
248            modification.
249    
250    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
253            would report character error from now.
254    
255    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: White-space-leaded non-white-space character
258            tokens in "before head insertion mode" was not
259            correctly handled.
260            (set_inner_html): Reimplemented using CharString decodehandle
261            class.  Support for $get_wrapper argument.  Support
262            for |{read_until}| feature.
263    
264    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Make a "bare ero" error for unknown
267            entities point the "&" character.
268    
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: It turns out that U+FFFD don't have to
272            be added to the list of excluded characters.
273    
274    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src ($char_onerror): Have character decoder's |line|
277            and |column| a higher priority than the one set by the
278            tokenizer's input handler.
279            ($self->{read_until}): Exclude U+FFFD (but this might
280            not be necessary, since now we do line/column fixup in
281            the character decode handle).
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Use |{read_until}| where possible.
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
290            and |manakai_getc_until| to |manakai_read_until| to
291            reduce the number of string copies.
292    
293    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src (parse_char_string): Use newly created
296            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
297            standard feature to |open| a string as a filehandle,
298            since Perl's string filehandle seems not supporting |ungetc|
299            method correctly.
300            (parse_char_stream): Define |{getc_until}| method.
301            (DATA_STATE): Experimental support for |getc_until| feature.
302    
303    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: Check points added to newly added branches.
306    
307    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src: Remove |{char}|, which is no longer used.
310            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
311            and replaced by |{prev_state}|.
312    
313            * mkhtmlparser.pl: Remove |{char}| feature.
314            Remove |!!!back-next-input-character;| macro.
315    
316    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
319            entity related tokenizer states in favor of new states
320            implementing the consume character reference algorithm.
321    
322    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: "Consume a character reference" algorithm is
325            now implemented as a tokenizer's state, rather than
326            a method, with minimum changes (more changes will
327            be made, in due course).  "Bogus comment state"'s inner
328            loop gets removed.
329    
330    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
333            into their own tokenizer states.
334    
335    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
338            is split into three states.
339    
340    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
343            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
344            no longer does the tokenizer have to push back next input
345            characters in those states.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
350            into four states so that no longer does the tokenizer have to push
351            back next input characters in that state.
352    
353  2008-09-11  Wakaba  <wakaba@suika.fam.cx>  2008-09-11  Wakaba  <wakaba@suika.fam.cx>
354    
355          * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,          * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,

Legend:
Removed from v.1.291  
changed lines
  Added in v.1.349

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24