Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.54 by wakaba, Mon Jun 25 12:39:11 2007 UTC revision 1.157 by wakaba, Mon Mar 3 13:15:54 2008 UTC
# Line 1  Line 1 
1    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
4            are added.
5    
6    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
9            are set.  Cases that are unlikely to be reached are noted as such.
10    
11    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Checkpoints for debugging are added.
14    
15            * mkhtmlparser.pl: Support for |!!!cp| syntax.
16    
17    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
20            for simplicity.
21    
22    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
25            revision 1307).
26    
27    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src (_get_next_token): The places where the /incorrect/
30            flag is set are changed (HTML5 revision 1305).
31    
32    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
35            1292).  Entities are not parsed in comment-like part in RCDATA
36            elements (HTML5 revision 1294).  Allow bare & at the end
37            of attribute value literals (HTML5 revision 1296).  More
38            quirks mode doctypes (HTML5 revision 1302).  Require spaces
39            between attributes and ban attribute names or unquoted
40            attribute values containing single or double quotes (HTML5
41            revision 1303).
42    
43    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
44    
45            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
46            and related errors unless the document is an HTML document (though
47            the spec is unclear on whether they apply to XHTML documents).
48    
49            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
50            are added.
51    
52    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: s/local_name/manakai_local_name/g.
55    
56    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
57    
58            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
59    
60            * HTML.pm.src: |charset| in |content| attribute is
61            case-insensitive (HTML5 revision 1270).
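
The case-insensitive |charset| match described above can be sketched as follows. This is a minimal illustration in Python, not the code in HTML.pm.src; the function name `extract_charset` and the exact pattern are assumptions made for the example.

```python
import re

# Assumed pattern: the keyword "charset" in any letter case, optional white
# space around "=", then a double-quoted, single-quoted, or unquoted value.
_CHARSET_RE = re.compile(
    r"""charset\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s;"']+))""",
    re.IGNORECASE,
)

def extract_charset(content):
    """Return the charset value from a meta content string, or None."""
    match = _CHARSET_RE.search(content)
    if match is None:
        return None
    # Exactly one of the three groups participates in a successful match.
    return next(g for g in match.groups() if g is not None)
```

With this sketch, `CHARSET=utf-8`, `Charset = "Shift_JIS"`, and `charset=utf-8` are all recognized, and the value is returned with its original letter case.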
62    
63    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
64    
65            * ContentChecker.pm: New status constants are added.
66            ($ElementDefault): |status| added.
67            (check_element): Err for non-standard or deprecated elements.
68            (_attr_status_info): For non-standard or deprecated attributes.
69    
70    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
71    
72            * ContentChecker.pm (_attr_status_info): New internal method.
73    
74    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
75    
76            * ContentChecker.pm (check_element): Element standardized
77            status information is now dispatched.
78    
79    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
80    
81            * ContentChecker.pm (check_element): Fix |del|-and-significant
82            problem by adding some more arguments.
83    
84    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentChecker.pm (check_element): Use context of
87            container-for-the-purpose-of-content-model element (not transparent
88            element) for |check_child_element| calling and significant
89            text flag marking.  This reintroduces the |<del>|-and-significant
90            problem.
91    
92    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
93    
94            * ContentChecker.pm (check_element): Make semi-transparent
95            elements ignored for the purpose of phase changes in
96            content model checking.
97    
98    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
99    
100            * ContentChecker.pm (check_element): In-element state
101            was not properly managed for transparent cases.
102    
103    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
104    
105            * ContentChecker.pm (check_element): Support for |video|
106            and |audio| as semi-transparent elements.
107    
108    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
109    
110            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
111            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
112            Support for |html:object| as a semi-transparent element.
113    
114    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
115    
116            * ContentChecker.pm (check_element): The way to traverse
117            the tree is entirely revised to make it easier to track
118            the state of ancestors/descendants.  As a result of this
119            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
120            support for content model checking for HTML elements |figure|,
121            |object|, |video|, and |audio| and checking for XML elements (and
122            some XMLNS checks) are dropped for now.  They will be
123            reimplemented in due course.
124    
125    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
128            token (HTML5 revision 1225).
129    
130    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
131    
132            * ContentChecker.pm ({unsupported_level}): New value.
133    
134            * HTML.pm.src: Save whether |meta| |content| attribute
135            contains character references or not.
136    
137    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
138    
139            * ContentChecker.pm (_get_children): (Incomplete) attempt to
140            implement significant content checking for contents
141            with |del| elements.
142    
143    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
144    
145            * ContentChecker.pm ($HTMLTransparentElements): More
146            elements are added.
147            (_get_children): HTML |object| elements are now semi-transparent.
148    
149            * NanoDOM.pm (manakai_html, manakai_head): New methods.
150    
151    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
152    
153            * CacheManifest.pm: HTML5 revision 1211 implemented.
154    
155            * CacheManifest.pod: Updated.
156    
157    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
158    
159            * ContentChecker.pm (check_document, check_element): Support
160            for second argument ($onsubdoc).
161            (_get_css_parser): Removed (now it is part of WDCC).
162    
163    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm (_get_css_parser): New.
166    
167    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm ($AnyChecker): The old way of adding child
170            elements for checking was still in use.
171    
172    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
173    
174            * ContentChecker.pm (check_element): New todo item type |descendant|.
175    
176    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
177    
178            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
179            specced in RFC 2046 and RFC 4288.
180            (application/atom+xml): Definition added.
181    
182    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
183    
184            * URIChecker.pm: Make RFC 3986 should-level errors
185            warnings (rather than SHOULD-level errors).
186    
187    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
188    
189            * NanoDOM.pm (get_user_data, set_user_data): New methods.
190    
191            * HTML.pm.src: A flag for character references in attribute
192            values is added.  Set |manakai_has_reference| user data
193            to |charset| attribute.
194    
195    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
196    
197            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
198            attributes.
199    
200            * ContentChecker.pm (check_document): Warn if charset requirements
201            cannot be tested.
202    
203    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src (parse_byte_string): Detect charset
206            by universalchardet if charset parameter is not specified.
207    
208            * Makefile (Charset-all, Charset-clean): New rules.
209    
210    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
211    
212            * ContentChecker.pm (check_document): Check the existence
213            of character encoding declaration and selection of encoding
214            for HTML document.
215    
216    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
217    
218            * ContentType.pm (get_sniffed_type): Return also the official
219            type in list context.
220    
221    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
222    
223            * ContentType.pm: Sniffing now ignores leading white
224            space (HTML5 revisions 1013 and 1016).
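
A rough sketch of sniffing that skips leading white space, written in Python for illustration only. The signature table here is hypothetical and much shorter than the one in the HTML5 draft, and `sniff` is not the name of any ContentType.pm method.

```python
# Hypothetical, abbreviated signature table (upper-cased for comparison).
HTML_SIGNATURES = (b"<!DOCTYPE HTML", b"<HTML", b"<SCRIPT")

# Bytes treated as leading white space in this sketch: TAB, LF, FF, CR, SPACE.
WHITESPACE = b"\t\n\x0c\r "

def sniff(data, default="text/plain"):
    """Guess a media type from the first bytes of a resource."""
    # Drop leading white space before looking for a signature.
    head = data.lstrip(WHITESPACE)[:32].upper()
    for signature in HTML_SIGNATURES:
        if head.startswith(signature):
            return "text/html"
    return default
```

Upper-casing the head makes the comparison ASCII case-insensitive, so `<html`, `<HTML`, and `<Html` are treated alike in this sketch.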
225    
226    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentType.pm: HTML5 revision 1013 changes, except for leading
229            white spaces, are implemented.
230    
231    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src (parse_char_string): Set |inner_encoding|
234            attribute if possible.
235    
236    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src (parse_byte_string): New method.
239            (parse_char_string): New alias for |parse_string|.
240            (main phase): Invoking "change the encoding" algorithm if desired.
241    
242            * HTML.pod: Updated.
243    
244    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pod (get_inner_html): Removed.
247    
248            * Makefile (HTML-all, HTML-clean): New.
249    
250    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
253    
254    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
255    
256            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
257    
258    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
259    
260            * Makefile: |CacheManifest.html| is added.
261    
262            * CacheManifest.pod: New file.
263    
264    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
265    
266            * CacheManifest.pm: New module.
267    
268    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Support for application cache selection algorithm
271            callback.
272    
273    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
274    
275            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
276    
277    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
278    
279            * Makefile (clean): New rule.
280    
281            * NanoDOM.pm (public_id, system_id): New attributes.
282    
283    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
284    
285            * Makefile (CSS-all, CSS-clean, clean): New rules.
286    
287    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * ContentChecker.pm (check_document): Support for
290            new |is_xml_root| flag.
291            (check_element): Support for new |pluses| state.
292            (_add_pluses): New method.
293            (_remove_minuses): Support for new |minus| item.
294    
295    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
296    
297            * ContentChecker.pm: Raise specific error for invalid
298            root element.
299    
300    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
301    
302            * ContentChecker.pm: Set level values for later uses.
303    
304    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
305    
306            * ContentChecker.pm: Support for language tag validation.
307    
308    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
309    
310            * LangTag.pm (check_rfc3066_language_tag): New method.
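
For orientation, the RFC 3066 tag syntax (a primary subtag of 1 to 8 letters, followed by any number of hyphen-separated subtags of 1 to 8 letters or digits) can be checked with a single pattern. The Python sketch below covers syntax only; the name `is_rfc3066_tag` is hypothetical, and the actual |check_rfc3066_language_tag| method may perform further checks that are not shown here.

```python
import re

# Language-Tag   = Primary-subtag *( "-" Subtag )
# Primary-subtag = 1*8ALPHA
# Subtag         = 1*8(ALPHA / DIGIT)
_RFC3066_RE = re.compile(r"[A-Za-z]{1,8}(?:-[A-Za-z0-9]{1,8})*")

def is_rfc3066_tag(tag):
    """Return True if tag matches the RFC 3066 Language-Tag syntax."""
    return _RFC3066_RE.fullmatch(tag) is not None
```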
311    
312    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
313    
314            * LangTag.pm: New module.
315    
316    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
317    
318            * HTML.pm.src: Some error types were wrong.
319    
320    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
321    
322            * CSS/: New directory.
323    
324    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
325    
326            * ContentChecker.pm (_check_get_children): Support
327            for |noscript| in |head|.
328    
329    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
330    
331            * URI/: New directory.
332    
333    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src: Tokenizer states are now represented as
336            numbers.
337    
338    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src: |or|s for insertion modes are replaced
341            by |&|s.
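
The idea behind replacing chains of |or| with |&| can be illustrated as follows, assuming each insertion mode is given its own bit. The constant names and values in this Python sketch are hypothetical and are not taken from HTML.pm.src.

```python
# Hypothetical insertion-mode constants, one bit per mode.
IN_BODY = 0x01
IN_CAPTION = 0x02
IN_CELL = 0x04
IN_TABLE = 0x08
IN_ROW = 0x10

# Composite masks group the modes that share handling code.
BODY_LIKE = IN_BODY | IN_CAPTION | IN_CELL
TABLE_LIKE = IN_TABLE | IN_ROW

def is_body_like(mode):
    """One bitwise AND replaces `mode == A or mode == B or mode == C`."""
    return bool(mode & BODY_LIKE)

def is_table_like(mode):
    """True if mode is one of the table-related insertion modes."""
    return bool(mode & TABLE_LIKE)
```

A test such as `is_body_like(IN_CELL)` then costs a single integer operation regardless of how many modes the mask covers.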
342    
343    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
344    
345            * HTML.pm.src: Token types are now represented as numbers.
346    
347    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
348    
349            * ContentType.pm (SEE ALSO): Updated.
350    
351            * HTML.pm.src: Insertion modes are now represented as numbers.
352    
353    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
354    
355            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
356            999) is implemented.
357    
358    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentType.pm: Sniffing for |<script| (HTML5 revision
361            983) is implemented.
362    
363    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
364    
365            * ContentChecker.pod: New documentation.
366    
367            * Makefile: A rule for |ContentChecker.html| is added.
368    
369            * ContentChecker.pm: A pod "LICENSE" section is added.
370    
371            * NanoDOM.pm ($VERSION): New variable.
372    
373    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
374    
375            * H2H.pm: |b|, |i|, and |sub| are added to the
376            list of allowed HTML elements.
377    
378    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
379    
380            * H2H.pm: |samp| is added to the list of allowed
381            HTML elements.
382    
383            * URIChecker.pm (check_iri): New.
384            (check_iri_reference): Error type for IRI reference
385            syntax error is changed.
386    
387    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
388    
389            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
390            is added.
391            (check_document): Load appropriate module before validation.
392    
393    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
394    
395            * ContentChecker/: New directory.
396    
397    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
398    
399            * ContentChecker.pm: HTML |time| element is implemented.
400    
401            * HTMLTable.pm: Comments are updated as HTML5 is revised.
402    
403    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
404    
405            * ContentChecker.pm (check_document): Return value
406            even if no document element is found.
407    
408    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
409    
410            * HTML.pm.src: |$in_body| is no longer a function.
411    
412    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTML.pm.src: The |$in_body| code has been moved down.
415    
416    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
417    
418            * HTML.pm.src: The "trailing end" insertion mode
419            is split into "after html body" and "after html frameset"
420            insertion modes.  Their codes are merged with "after body"
421            and "after frameset" codes.  |$previous_insertion_mode|
422            has been removed.  "after frameset" code is
423            merged with "in frameset" code.
424    
425    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
426    
427            * HTML.pm.src: The "before head" insertion mode is
428            merged with the "in head" insertion mode.
429    
430    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: Cases in "in head" insertion mode are
433            reorganized.
434    
435    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
436    
437            * HTML.pm.src: Some cases in "in table" insertion mode
438            are merged.
439    
440    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
441    
442            * HTML.pm.src: The "in row" insertion mode is merged
443            with "in table" insertion mode.
444    
445    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src: The "in table" and "in table body" insertion
448            modes are merged.
449    
450    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src: There is no "in table head" or "in table foot"
453            insertion mode!
454    
455    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
456    
457            * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
458            now directly invoke the handler.
459    
460    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
461    
462            * HTML.pm.src: Code for the "in cell" insertion mode
463            is merged into the "in body" insertion mode code.
464    
465    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
466    
467            * HTML.pm.src: Codes for "in body" and "in caption"
468            insertion modes are merged.
469    
470    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
471    
472            * HTML.pm.src: Two |!!!next-token|s were missing.
473    
474    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
475    
476            * HTML.pm.src: Use numeric constant for |{content_mode}|
477            instead of string constant for |{content_model_flag}|.
478    
479    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTML.pm.src: Add the name of the attribute
482            to the "duplicate attribute" error.
483    
484    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
485    
486            * ContentChecker.pm: Return the |class| node list.
487    
488    2007-07-17  Wakaba  <wakaba@suika.fam.cx>
489    
490            * ContentChecker.pm: Return the |id| node list.
491    
492            * HTML.pm.src: A typo is fixed.
493    
494    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
495    
496            * ContentChecker.pm: Drop wrong |level => 'error'| specification
497            from "in HTML:xml:lang" error.  Character position
498            is now the last part of the error type in the URI error
499            description.  Report "unsupported" status for language
500            tags, media queries, script codes, and style sheets.
501    
502    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
503    
504            * ContentChecker.pm: Report an error for |xml:lang|
505            in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
506            in XML.
507    
508            * NanoDOM.pm (Attr.owner_document): New attribute.
509    
510    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
511    
512            * HTML.pm.src: The character immediately following
513            a bare |hcro| was discarded.  Fix handling of
514            entity references in attribute values.
515    
516    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
517    
518            * HTML.pm.src (main and trailing end phases): Token
519            types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
520            factored out.  Error types |in html:#DOCTYPE|
521            and |after html:#DOCTYPE| are merged
522            into |DOCTYPE in the middle|.  |</frameset>|
523            in fragment parsing mode changed the
524            insertion mode.
525    
526    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
527    
528            * HTML.pm.src: |$phase| has been removed; The |trailing end|
529            phase is now an insertion mode.  Treatments for white
530            space character tokens were incorrect for some
531            insertion modes.  An old |meta| case was not removed.
532    
533    2007-07-16  Wakaba  <wakaba@suika.fam.cx>
534    
535            * HTML.pm.src: |meta| charset declaration extraction
536            implemented (but changing the encoding is not yet implemented :-).
537    
538    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
539    
540            * Charset/: New directory.
541    
542    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
543    
544            * H2H.pm: New Perl module (created from
545            manakai's H2H.dis).
546    
547    2007-07-15  Wakaba  <wakaba@suika.fam.cx>
548    
549            * XMLSerializer.pm: New Perl module (created from
550            manakai's SimpleLS.dis).
551    
552    2007-07-07  Wakaba  <wakaba@suika.fam.cx>
553    
554            * HTML.pm.src: |<!---x-->| was not processed correctly.
555    
556    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
557    
558            * HTML.pm.src: Report correct error message
559            for |<body></div></body>|.
560    
561    2007-07-01  Wakaba  <wakaba@suika.fam.cx>
562    
563            * HTMLTable.pm: An error description was incorrect.
564    
565    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
566    
567            * ContentChecker.pm: Return |{term}| list.
568    
569    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
570    
571            * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
572            nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
573            and <title> outside of head).
574    
575    2007-06-30  Wakaba  <wakaba@suika.fam.cx>
576    
577            * IMTChecker.pm: Report warning for unregistered
578            and private types/subtypes.
579    
580            * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
581            URIChecker.pm, HTMLTable.pm: Error messages are now
582            consistent; they are all listed in
583            <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
584    
585    2007-06-25  Wakaba  <wakaba@suika.fam.cx>
586    
587            * ContentChecker.pm: |<img ismap>| not in |<a></a>|
