Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.276 by wakaba, Sun Aug 17 05:09:12 2008 UTC revision 1.352 by wakaba, Tue Oct 14 02:27:54 2008 UTC
# Line 1  Line 1 
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * Makefile: New rule to make HTML/Tokenizer.pm is added.
4    
5            * HTML.pm.src: Tokenizer part moved to another file.
6    
7    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
10    
11    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Element category constants redefined.
14            
15    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
18            construction stage synced with the spec (HTML5 revisions 2139 and
19            2302).
20    
21    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
22    
23            * ContentChecker.pm: New error level "html5_fact" added, which
24            should be tentatively used until all of the requirements are properly
25            specced as RFC 2119 "MUST" in HTML5.
26    
27    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
28    
29            * ContentChecker.pod: Note on internal flags is added.
30    
31    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: An AAA bug fixed.
34    
35    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: If another node is inserted by the parser, don't
38            reuse existing Text node to append a character (HTML5 revision
39            2124).
40    
41    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
44            revisions 1731 and 2128).
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
49            Support for end tags of camelCase SVG elements was broken.  A
50            wrong error type text fixed.
51    
52    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
55    
56    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for new definition of |param| and |source|
59            start tag parsing (HTML5 revision 1731).
60    
61    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
62    
63            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
64    
65    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
68            revisions 1731 and 1831).
69    
70    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
73            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
74            revision 1731).
75    
76    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
79            revision 1731).  End tags of |option| and |optgroup| elements are
80            now optional (HTML5 revision 1731).
81    
82    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: New "special" elements added to the list (HTML5
85            revision 1778).  "strile" -> "strike".
86    
87    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
88    
89            * ContentType.pm (get_sniffed_type): Support for the "better"
90            content sniffing (HTML5 revision 1927).  In one case the official
91            type was not returned when the method was invoked in list
92            context.
93    
94    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Character references for non-space C0 characters,
97            including U+000B VT, DEL character, noncharacter code points, are
98            now converted to the U+FFFD character (cf. HTML5 revision 2138).
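
A minimal sketch of the replacement rule described in this entry, written in Python rather than the project's Perl. The names `sanitize_char_ref` and `C0_SPACES` are hypothetical, and the choice of which C0 characters count as "space" (tab, LF, FF, CR) is an assumption; the actual code in HTML.pm.src may differ.

```python
REPLACEMENT = 0xFFFD  # U+FFFD REPLACEMENT CHARACTER

# Assumption: the C0 characters treated as "space" and therefore kept.
C0_SPACES = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for Unicode noncharacters: U+FDD0..U+FDEF and U+xxFFFE/U+xxFFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def sanitize_char_ref(cp):
    """Map the code point of a numeric character reference to its result."""
    if cp < 0x20 and cp not in C0_SPACES:
        return REPLACEMENT  # non-space C0 control, including U+000B VT
    if cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT  # DEL or a noncharacter code point
    return cp
```

The sketch covers only the cases named in the entry; surrogates and values above U+10FFFF are left out on purpose.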
99    
100    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
101    
102            * ContentChecker.pm: |form=""| check support added.
103    
104    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
105    
106            * ContentChecker.pm: |contextmenu| validness is now checked using
107            |id| and |id_type| properties, and |menu| property is removed.
108    
109    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
110    
111            * ContentChecker.pm: Prepare for |form| |name| attribute's
112            duplication checking.
113    
114    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
117            unsupported charset="" parameter value (HTML5 revision 2131).
118    
119    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: Remaining places where U+000B was allowed as a space
122            character are fixed (cf. HTML5 revision 1738).
123    
124            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
125            space characters (HTML5 revision 1738).
126    
127    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src: The "anything else" case for the "after after body"
130            insertion mode was not updated to switch to the "in body"
131            insertion mode.  U+000B is no longer a space character for the
132            purpose of tree construction phase (HTML5 revision 1738).
133    
134    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
135    
136            * HTML.pm.src: U+000B is no longer a space character (HTML5
137            revision 1738).
138    
139    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
140    
141            * ContentType.pm: 0x0B is no longer a space character (HTML5
142            revision 1738).
143    
144            * HTML.pm.src: U+000B is no longer a space character for the
145            algorithm for extracting an encoding from a Content-Type (HTML5
146            revision 1738).
147    
148    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
149    
150            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
151    
152    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
153    
154            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
155    
156            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
157            matching.
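
One way sorting by length can ease matching, sketched in Python. It assumes a longest-first order, so that the first prefix that matches is also the most specific one; the entry does not state the direction, and `sort_prefixes` and `first_matching_prefix` are hypothetical names, not functions from mklangreg.pl.

```python
def sort_prefixes(prefixes):
    """Return prefixes ordered longest first (assumed direction)."""
    return sorted(prefixes, key=len, reverse=True)


def first_matching_prefix(tag, prefixes):
    """Return the most specific prefix that matches tag, or None."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        p = prefix.lower()
        # A prefix matches the whole tag or a leading run of whole subtags.
        if tag == p or tag.startswith(p + "-"):
            return prefix
    return None
```

With the list sorted this way, a caller can stop at the first hit instead of comparing all candidates.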
158    
159    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
160    
161            * LangTag.pm: Warn for private use language subtags.  Error level
162            typos fixed.  Support for Suppress-Script field.
163    
164            * mklangreg.pl: Support for dumping of nested structure.
165    
166    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
167    
168            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
169            recommended case as per RFC 4646.
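
A rough Python sketch of a recommended-case check. It uses a position-based simplification (two-letter subtags uppercase and four-letter subtags title case, except at the start of the tag or after a singleton; everything else lowercase), which is how the later RFC 5646 spells out the convention. Whether `check_rfc4646_langtag` works this way is an assumption, and both function names are hypothetical.

```python
def recommended_case(tag):
    """Return tag with each subtag converted to its conventional case."""
    out = []
    after_singleton = False
    for i, sub in enumerate(tag.split("-")):
        if i > 0 and not after_singleton and len(sub) == 2:
            out.append(sub.upper())        # region, e.g. "MN"
        elif i > 0 and not after_singleton and len(sub) == 4:
            out.append(sub.capitalize())   # script, e.g. "Cyrl"
        else:
            out.append(sub.lower())        # language, variants, extensions
        if len(sub) == 1:
            after_singleton = True         # "x" or an extension singleton
    return "-".join(out)


def is_recommended_case(tag):
    """True if tag is already written in the conventional case."""
    return tag == recommended_case(tag)
```

A checker built on this would emit a warning rather than an error, since tags are case-insensitive.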
170    
171    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
172    
173            * LangTag.pm (check_rfc4646_langtag): New method.
174    
175    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
176    
177            * mklangreg.pl: New script.
178    
179            * Makefile: Updated for creation of the module for language subtag
180            registry.
181            
182    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
183    
184            * Makefile: WebIDL.html added.
185    
186            * WebIDL.pod: New documentation.
187    
188    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
189    
190            * WebIDL.pm: Checker's error types are redefined.
191    
192    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
193    
194            * WebIDL.pm: Parser's error types are redefined.  Some forward
195            compatible parsing bugs are fixed.  Some unreachable code is
196            commented out.
197    
198    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
199    
200            * WebIDL.pm: Support for the remaining extended attributes is
201            added.  It does not satisfy the definition that a forward
202            interface declaration has an extended attribute.  It seems that
203            unless explicitly allowed, multiple extended attributes with the
204            same name are not allowed, though it is not explicitly mentioned in
205            the spec.
206    
207    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
208    
209            * WebIDL.pm: Unescapes extended attribute names and extended
210            attribute identifiers.  Preserve whether an extended attribute has
211            an argument list or not.  Support for extended attributes:
212            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
213            NameGetter, NameSetter, and Null.
214            (has_argument_list): New attribute.
215            (idl_text): Stringifies the argument list, if any, even if it is
216            empty.
217    
218    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
221            |{s_kwd}| in DATA_STATE as default.
222    
223    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
226            by |{s_kwd}| in DATA_STATE.
227    
228    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: Shorten keys.
231    
232    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
233    
234            * HTML.pm.src: Remove checking for control character, surrogate
235            pair, or noncharacter code points and non-Unicode code
236            points (they should be handled by Whatpm::Charset::UnicodeChecker).
237            (parse_char_stream): Support for the |$get_wrapper| argument and
238            character stream error handlers.
239    
240    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
241    
242            * ContentChecker.pm: Don't call |load_ns_module|
243            for null-namespace elements/attributes.
244    
245            * HTML.pm.src: Factor out $disallowed_control_chars
246            as a hash.
247    
248    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
251            and |{next_char}| initializations are moved to initialization
252            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
253            with |parse_char_stream|.
254    
255    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src (parse_char_stream): Make |set_next_char|
258            invoke |manakai_read_until|, not only |read|, where
259            possible, to decrease the number of |read| method calls.
260    
261            * mkhtmlparser.pl: Related changes to the aforementioned
262            modification.
263    
264    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
267            now reports character errors.
268    
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: White-space-led non-white-space character
272            tokens in the "before head" insertion mode were not
273            correctly handled.
274            (set_inner_html): Reimplemented using CharString decodehandle
275            class.  Support for $get_wrapper argument.  Support
276            for |{read_until}| feature.
277    
278    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
279    
280            * HTML.pm.src: Make a "bare ero" error for unknown
281            entities point to the "&" character.
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: It turns out that U+FFFD doesn't have to
286            be added to the list of excluded characters.
287    
288    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
289    
290            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
291            and |column| a higher priority than the ones set by the
292            tokenizer's input handler.
293            ($self->{read_until}): Exclude U+FFFD (but this might
294            not be necessary, since now we do line/column fixup in
295            the character decode handle).
296    
297    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
298    
299            * HTML.pm.src: Use |{read_until}| where possible.
300    
301    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
304            and |manakai_getc_until| to |manakai_read_until| to
305            reduce the number of string copies.
306    
307    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
308    
309            * HTML.pm.src (parse_char_string): Use newly created
310            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
311            standard feature to |open| a string as a filehandle,
312            since Perl's string filehandle does not seem to support the |ungetc|
313            method correctly.
314            (parse_char_stream): Define |{getc_until}| method.
315            (DATA_STATE): Experimental support for |getc_until| feature.
316    
317    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
318    
319            * HTML.pm.src: Check points added to newly added branches.
320    
321    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
322    
323            * HTML.pm.src: Remove |{char}|, which is no longer used.
324            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
325            by |{prev_state}|.
326    
327            * mkhtmlparser.pl: Remove |{char}| feature.
328            Remove |!!!back-next-input-character;| macro.
329    
330    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
331    
332            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
333            entity related tokenizer states in favor of new states
334            implementing the consume character reference algorithm.
335    
336    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: "Consume a character reference" algorithm is
339            now implemented as a tokenizer's state, rather than
340            a method, with minimum changes (more changes will
341            be made, in due course).  "Bogus comment state"'s inner
342            loop gets removed.
343    
344    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
345    
346            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
347            into their own tokenizer states.
348    
349    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
352            is split into three states.
353    
354    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
357            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
358            no longer does the tokenizer have to push back next input
359            characters in those states.
360    
361    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
364            into four states so that no longer does the tokenizer have to push
365            back next input characters in that state.
366    
367    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
368    
369            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
370            which can be used to insert some wrapper between the character
371            stream handle and the tokenizer.  (It is currently not supported
372            for |set_inner_html| for |Element|s).
373    
374    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: Ignore punctuation in charset names.
377    
378    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
379    
380            * ContentChecker.pm: Support for charset-layer error levels.
381    
382            * HTML.pm.src: Don't specify |text| argument for the
383            |chardecode:fallback| error, since it is not the encoding
384            being used alternatively.
385    
386    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
387    
388            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
389    
390    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
391    
392            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
393    
394    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src: Bug fix and sync with the spec with regard
397            to after after frameset insertion mode processing (HTML5
398            revision 1909).  Note that the implementation was wrong
399            per the old spec before the r1909 changes.
400    
401    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
402    
403            * HTMLTable.pm: scope=auto algorithm fix synced with the
404            spec (HTML5 revision 2093).
405            ($process_row): Algorithm step numbers synced with the
406            spec (HTML5 revision 2092).
407    
408    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
409    
410            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
411            revision 2094).
412    
413    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
414    
415            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
416    
417    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: '"' and "'" at the end of attribute
420            name (after another attribute) now raise a parse error (HTML5
421            revision 2123).  Empty unquoted attribute values are no
422            longer allowed (HTML5 revision 2122).
423    
424    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
425    
426            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
427            revision 2130).
428    
429    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
430    
431            * ContentChecker.pm: |xml:lang| attribute value must be the same
432            as |lang| attribute value for HTML elements (HTML5 revision 2062
433            and so on).
434    
435    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
436    
437            * ContentChecker.pm: Error level definition for |xml_id_error|
438            was missing.
439    
440            * URIChecker.pm: The end of the URL should be marked as the
441            error location for an empty path error.  The position
442            between the userinfo and the port components should be
443            marked as the error location for an empty host error.
444    
445    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
446    
447            * URIChecker.pm: Set parameters representing where in the
448            value the error occurs for errors.  Report unknown
449            address format error in warning level, since address
450            formats are rarely added.  Path segments starting with "/.."
451            were misinterpreted as dot-segments.
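
To illustrate the last fix in this entry: under RFC 3986 a path segment is a dot-segment only when it is exactly "." or "..", so a segment that merely begins with ".." is ordinary. The Python sketch below shows that test; `has_dot_segment` is a hypothetical name and is not taken from URIChecker.pm.

```python
def has_dot_segment(path):
    """True if any "/"-separated segment of path is exactly "." or ".."."""
    return any(segment in (".", "..") for segment in path.split("/"))
```

Comparing whole segments, instead of searching for the substring "/..", is what keeps a path such as "/a/..b/c" from being flagged.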
452    
453    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
454    
455            * URIChecker.pm (check_iri_reference): Requires
456            |Message::DOM::DOMImplementation|.
457    
458    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
459    
460            * IMTChecker.pm: Updated for the new error reporting architecture.
461    
462            * ContentChecker.pm: Error levels for IMTs are added.
463    
464    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
465    
466            * H2H.pm (_shift_token): Support for unquoted HTML attribute
