/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.57 by wakaba, Sun Jul 1 04:46:48 2007 UTC revision 1.158 by wakaba, Tue Mar 4 00:03:12 2008 UTC
# Line 1  Line 1 
1    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (_tree_construction_initial): Some limited quirks
4            doctypes were not uppercased for comparison.
5    
6    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
9            are added.
10    
11    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
14            are set.  Cases that are unlikely reached are noted as so.
15    
16    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Checkpoints for debugging are added.
19    
20            * mkhtmlparser.pl: Support for |!!!cp| syntax.
21    
22    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
25            for simplicity.
26    
27    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
30            revision 1307).
31    
32    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
35            is set are changed (HTML5 revision 1305).
36    
37    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
40            1292).  Entities are not parsed in comment-like part in RCDATA
41            elements (HTML5 revision 1294).  Allow bare & at the end
42            of attribute value literals (HTML5 revision 1296).  More
43            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
44            between attributes and ban attribute names or unquoted
45            attribute values containing single or double quotes (HTML5
46            revision 1303).
47    
48    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
49    
50            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
51            and related errors unless it is an HTML document (though the spec
52            is unclear on whether it is applied to XHTML document).
53    
54            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
55            are added.
56    
57    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: s/local_name/manakai_local_name/g.
60    
61    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
62    
63            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
64    
65            * HTML.pm.src: |charset| in |content| attribute is
66            case-insensitive (HTML5 revision 1270).
67    
68    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
69    
70            * ContentChecker.pm: New status constants are added.
71            ($ElementDefault): |status| added.
72            (check_element): Err for non-standard or deprecated elements.
73            (_attr_status_info): For non-standard or deprecated attributes.
74    
75    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
76    
77            * ContentChecker.pm (_attr_status_info): New internal method.
78    
79    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
80    
81            * ContentChecker.pm (check_element): Element standardized
82            status information is now dispatched.
83    
84    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentChecker.pm (check_element): Fix |del|-and-significant
87            problem by adding some more arguments.
88    
89    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
90    
91            * ContentChecker.pm (check_element): Use context of
92            container-for-the-purpose-of-content-model element (not transparent
93            element) for |check_child_element| calling and significant
94            text flag marking.  This reintroduces |<del>|-and-significant
95            problem again.
96    
97    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
98    
99            * ContentChecker.pm (check_element): Make semi-transparent
100            elements ignored for the purpose of phase changes in
101            content model checking.
102    
103    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
104    
105            * ContentChecker.pm (check_element): In-element state
106            was not properly managed for transparent cases.
107    
108    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
109    
110            * ContentChecker.pm (check_element): Support for |video|
111            and |audio| as semi-transparent elements.
112    
113    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
114    
115            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
116            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
117            Support for |html:object| as a semi-transparent element.
118    
119    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
120    
121            * ContentChecker.pm (check_element): The way to traverse
122            the tree is entirely revised to make it easier to track
123            the state of ancestors/descendants.  As a result of this
124            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
125            support for content model checking for HTML elements |figure|,
126            |object|, |video|, and |audio| and checking for XML elements (and
127            some XMLNS checkings) are dropped for now.  They will be
128            reimplemented in due cource.
129    
130    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
133            token (HTML5 revision 1225).
134    
135    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
136    
137            * ContentChecker.pm ({unsupported_level}): New value.
138    
139            * HTML.pm.src: Save whether |meta| |content| attribute
140            contains character references or not.
141    
142    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
143    
144            * ContentChecker.pm (_get_children): (Incompleted) attempt to
145            imlement significant content checking for contents
146            with |del| elements.
147    
148    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
149    
150            * ContenteChecker.pm ($HTMLTransparentElements): More
151            elements are added.
152            (_get_children): HTML |object| elements are now semi-transparent.
153    
154            * NanoDOM.pm (manakai_html, manakai_head): New methods.
155    
156    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
157    
158            * CacheManifest.pm: HTML5 revision 1211 implemented.
159    
160            * CacheManifest.pod: Updated.
161    
162    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
163    
164            * ContentChecker.pm (check_document, check_element): Support
165            for second argument ($onsubdoc).
166            (_get_css_parser): Removed (now it is part of WDCC).
167    
168    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
169    
170            * ContentChecker.pm (_get_css_parser): New.
171    
172    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
173    
174            * ContentChecker.pm ($AnyChecker): Old way to add child elements
175            for checking had been used.
176    
177    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
178    
179            * ContentChecker.pm (check_element): New todo item type |descendant|.
180    
181    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
182    
183            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
184            specced in RFC 2046 and RFC 4288.
185            (application/atom+xml): Definition added.
186    
187    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
188    
189            * URIChecker.pm: Make RFC 3986 should-level errors
190            warnings (rather than SHOULD-level errors).
191    
192    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
193    
194            * NanoDOM.pm (get_user_data, set_user_data): New methods.
195    
196            * HTML.pm.src: A flag for character references in attribute
197            values are added.  Set |manakai_has_reference| user data
198            to |charset| attribute.
199    
200    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
201    
202            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
203            attributes.
204    
205            * ContentChecker.pm (check_document): Warn if charset requirements
206            cannot be tested.
207    
208    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src (parse_byte_string): Detect charset
211            by universalchardet if charset parameter is not specified.
212    
213            * Makefile (Charset-all, Charset-clean): New rules.
214    
215    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
216    
217            * ContentChecker.pm (check_document): Check the existence
218            of character encoding declaration and selection of encoding
219            for HTML document.
220    
221    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
222    
223            * ContentType.pm (get_sniffed_type): Return also the official
224            type in list context.
225    
226    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentType.pm: Sniffing with leading white space
229            ignoring (HTML5 revisions 1013 and 1016).
230    
231    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
232    
233            * ContentType.pm: HTML5 revision 1013 changes, except for leading
234            white spaces, are implemented.
235    
236    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src (parse_char_string): Set |inner_encoding|
239            attribute if possible.
240    
241    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src (parse_byte_string): New method.
244            (parse_char_string): New alias for |parse_string|.
245            (main phase): Invoking "change the encoding" algorithm if desired.
246    
247            * HTML.pod: Updated.
248    
249    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pod (get_inner_html): Removed.
252    
253            * Makefile (HTML-all, HTML-clean): New.
254    
255    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
258    
259    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
260    
261            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
262    
263    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
264    
265            * Makefile: |CacheManifest.html| is added.
266    
267            * CacheManifest.pod: New file.
268    
269    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
270    
271            * CacheManifest.pm: New module.
272    
273    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src: Support for application cache selection algorithm
276            callback.
277    
278    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
279    
280            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
281    
282    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
283    
284            * Makefile (clean): New rule.
285    
286            * NanoDOM.pm (public_id, system_id): New attributes.
287    
288    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
289    
290            * Makefile (CSS-all, CSS-clean, clean): New rules.
291    
292    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
293    
294            * ContentChecker.pm (check_document): Support for
295            new |is_xml_root| flag.
296            (check_element): Support for new |pluses| state.
297            (_add_pluses): New method.
298            (_remove_minuses): Support for new |minus| item.
299    
300    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
301    
302            * ContentChecker.pm: Raise specific error for invalid
303            root element.
304    
305    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
306    
307            * ContentChecker.pm: Set level values for later uses.
308    
309    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
310    
311            * ContentChecker.pm: Support for language tag validation.
312    
313    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
314    
315            * LangTag.pm (check_rfc3066_language_tag): New method.
316    
317    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
318    
319            * LangTag.pm: New module.
320    
321    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
322    
323            * HTML.pm.src: Some error types were wrong.
324    
325    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
326    
327            * CSS/: New directory.
328    
329    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
330    
331            * ContentChecker.pm (_check_get_children): Support
332            for |noscript| in |head|.
333    
334    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
335    
336            * URI/: New directory.
337    
338    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src: Tokenizer's states are now represented in
341            number.
342    
343    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
344    
345            * HTML.pm.src: |or|s for insertion modes are replaced
346            by |&|s.
347    
348    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
349    
350            * HTML.pm.src: Token types are now represented in number.
351    
352    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
353    
354            * ContentType.pm (SEE ALSO): Updated.
355    
356            * HTML.pm.src: Insertion modes are now represented in number.
357    
358    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
361            999) is implemented.
362    
363    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
364    
365            * ContentType.pm: Sniffing for |<script| (HTML5 revision
366            983) is implemented.
367    
368    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
369    
370            * ContentChecker.pod: New documentation.
371    
372            * Makefile: A rule for |ContentChecker.html| is added.
373    
374            * ContentChecker.pm: A pod "LICENSE" section is added.
375    
376            * NanoDOM.pm ($VERSION): New variable.
377    
378    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
379    
380            * H2H.pm: |b|, |i|, and |sub| are added to the
381            list of allowed HTML elements.
382    
383    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
384    
385            * H2H.pm: |samp| is added to the list of allowed
386            HTML elements.
387    
388            * URIChecker.pm (check_iri): New.
389            (check_iri_reference): Error type for IRI reference
390            syntax error is changed.
391    
392    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
393    
394            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
395            is added.
396            (check_document): Load appropriate module before validation.
397    
398    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
399    
400            * ContentChecker/: New directory.
401    
402    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
403    
404            * ContentChecker.pm: HTML |time| element is implemented.
405    
406            * HTMLTable.pm: Comments are updated as HTML5 is revised.
407    
408    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
409    
410            * ContentChecker.pm (check_document): Return value
411            even if no document element is found.
412    
413    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTML.pm.src: |$in_body| is no longer a function.
416    
417    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: The |$in_body| code has been moved down.
420    
421    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
422    
423            * HTML.pm.src: The "trailing end" insertion mode
424            is split into "after html body" and "after html frameset"
425            insertion modes.  Their codes are merged with "after body"
426            and "after frameset" codes.  |$previous_insertion_mode|
427            has been removed.  "after frameset" code is
428            merged with "in frameset" code.
429    
430    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: The "before head" insertion mode is
433            merged with the "in head" insertion mode.
434    
435    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
436    
437            * HTML.pm.src: Cases in "in head" insertion mode are
438            reorganized.
439    
440    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src: Some cases in "in table" insertion mode
443            are merged.
444    
445    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src: The "in row" insertion mode is merged
448            with "in table" insertion mode.
449    
450    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src: The "in table" and "in table body" insertion
453            modes are merged.
454    
455    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
456    
457            * HTML.pm.src: There is no "in table head" or "in table foot"
458            insertion mode!
459    
460    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
461    
462            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
463            now directly invoke the handler.
464    
465    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
466    
467            * HTML.pm.src: Codes for "in cell" insertion mode
468            is merged to the "in body" insertion mode code.
469    
470    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src: Codes for "in body" and "in caption"
473            insertion modes are merged.
474    
475    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
476    
477            * HTML.pm.src: Two |!!!next-token|s were missing.
478    
479    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTML.pm.src: Use numeric constant for |{content_mode}|
482            instead of string constant for |{content_model_flag}|.
483    
484    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
485    
486            * HTML.pm.src: Add the name of the attribute
487            to the "duplicate attribute" error.
488    
489    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
490    
491            * ContentChecker.pm: Return the |class| node list.
492    
493    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
494    
495            * ContentChecker.pm: Return the |id| node list.
496    
497            * HTML.pm.src: A typo is fixed.
498    
499    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
500    
501            * ContentChecker.pm: Drop wrong |level => 'error'| specification
502            from "in HTML:xml:lang" error.  Character position
503            is now the last part of the error type in the URI error
504            description.  Report "unsupported" status for language
505            tags, media queries, script codes, and style sheets.
506    
507    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
508    
509            * ContentChecker.pm: Report error if |xml:lang|
510            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
511            in XML.
512    
513            * NanoDOM.pm (Attr.owner_document): New attribute.
514    
515    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
516    
517            * HTML.pm.src: The character immediately following
518            a bare |hcro| was discarded.  Fix handling of
519            entity references in attribute values.
520    
521    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
522    
523            * HTML.pm.src (main and trailin end phases): Token
524            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
525            factored out.  Error types |in html:#DOCTYPE|
526            and |after html:#DOCTYPE| are merged
527            into |DOCTYPE in the middle|.  |</frameset>|
528            in fragment parsing mode changed the
529            insertion mode.
530    
531    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
532    
533            * HTML.pm.src: |$phase| has been removed; The |trailing end|
534            phase is now an insertion mode.  Treatments for white
535            space character tokens were incorrect for some
536            insertion modes.  An old |meta| case was not removed.
537    
538    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
539    
540            * HTML.pm.src: |meta| charset declaration extraction
541            implemented (but changing the encoding is not yet:-).
542    
543    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
544    
545            * Charset/: New directory.
546    
547    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
548    
549            * H2H.pm: New Perl module (created from
550            manakai's H2H.dis).
551    
552    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
553    
554            * XMLSerializer.pm: New Perl module (created from
555            manakai's SimpleLS.dis).
556    
557    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
558    
559            * HTML.pm.src: |<!---x-->| was not processed correctly.
560    
561    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTML.pm.src: Report correct error message
564            for |<body></div></body>|.
565    
566  2007-07-01  Wakaba  <wakaba@suika.fam.cx>  2007-07-01  Wakaba  <wakaba@suika.fam.cx>
567    
568          * HTMLTable.pm: An error description was incorrect.          * HTMLTable.pm: An error description was incorrect.

Legend:
Removed from v.1.57  
changed lines
  Added in v.1.158

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24