Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.84 by wakaba, Sat Jul 21 12:27:22 2007 UTC
revision 1.190 by wakaba, Sun Mar 16 23:53:48 2008 UTC
1    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (_get_next_token): The first "<" character
4            in "<?", "<>", or "</>" should be the error point.
5    
6    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: Some more fixes on error position reporting.
9    
10    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Token-level precise error reporting.
13    
14    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: Preparation for more precise error point
17            reporting.
18    
19    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Error type revised.
22    
23    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
24    
25            * HTML.pm.src: Similar code is merged together, again.
26    
27    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: Similar code is merged together.
30    
31    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
32    
33            * mkhtmlparser.pl: Set "level" parameter to parse errors.
34    
35            * HTML.pm.src: Code refined.
36    
37    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: |</body>| treatment has been changed (HTML5
40            revision 1348).  Note that I really don't know whether this makes
41            any difference in the black-box behavior of the parser.
42    
43    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: New end-of-file token implementation (HTML5
46            revision 1348).
47    
48    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: |applet| support (HTML5 revision 1347).
51    
52    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
55    
56    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
57    
58            * HTML.pm.src: Support for |<input>| in the "in select" insertion
59            mode and support for the "in select in table" insertion mode (HTML5
60            revision 1342).
61    
62    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: No foster parenting for <style> and <script>
65            in non-tainted <table>s (HTML5 revision 1336).
66    
67    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Ignore white space characters between <html>
70            and <head> (HTML5 revision 1332).
71    
72    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: Treat <input type=hidden> as if it were a
75            white space (HTML5 revision 1331).
76    
77    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
80            element (HTML5 revision 1330).
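
The rule in the entry above can be illustrated with a minimal Python sketch. This is not the Whatpm code (which is Perl); the function name and the element set are assumptions made only for this example.

```python
# Hypothetical sketch: drop one U+000A that immediately follows the start tag
# of certain elements. Names are illustrative, not taken from HTML.pm.src.
NEWLINE_SKIPPING_ELEMENTS = {"pre", "listing"}  # assumed set for this sketch


def strip_leading_newline(element_name: str, text: str) -> str:
    """Return text with a single leading U+000A removed for the listed elements."""
    if element_name in NEWLINE_SKIPPING_ELEMENTS and text.startswith("\n"):
        return text[1:]  # only the first newline is dropped
    return text
```

Only the first newline is removed, so a blank line the author deliberately placed after it survives.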
81    
82    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: <title> is always appended to the current
85            element (HTML5 revision 1328).
86    
87    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: White space in tainted tables is moved
90            into foster parents (HTML5 revision 1326).
91    
92    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
93    
94            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
95            revision 1321).
96    
97    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: |</h/n/>| case code rearranged to align with
100            the spec (HTML5 revision 1320).  Note that we finally complete
101            all of HTML5 revision 1320 changes.
102    
103    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
104    
105            * HTML.pm.src: |</form>| now works similarly to |</div>| for unclosed
106            tags (HTML5 revision 1320).
107    
108    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
109    
110            * HTML.pm.src: |</p>| case rearranged, with no actual change in behavior.
111    
112    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
113    
114            * HTML.pm.src: A "generate implied end tags" code path (t409.1)
115            could not be reached, so it is now removed (HTML5 revision
116            1320).
117    
118    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Code for the case of |</div>| and so on is revised
121            to align with new spec text (HTML5 revision 1320).
122    
123    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src: Remove strange |if| condition; in theory, however,
126            it should have done no harm.
127    
128    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src (_tree_construction_main): The '</p>' in body
131            case is split from other end tags in preparation
132            for implementing HTML5 revision 1320.
133    
134    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
135    
136            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
137            revision 1320).
138    
139    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
142            and "div" (HTML5 revision 1318).  Add comments where
143            |form| pointer association code should be inserted (HTML5 revision
144            1319).
145    
146    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: <html> treatment refined (HTML5 revision 1314).
149    
150    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: Since the case t268 should never be reached (there
153            are no other token types), it is replaced by a |die| statement.
154    
155    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: Typo fixed.
158    
159    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
160    
161            * HTML.pm.src (_tree_construction_initial): Some limited quirks
162            doctypes were not uppercased for comparison.
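
A minimal Python sketch of the kind of comparison this fix concerns follows. It is not the Whatpm code; the function name is hypothetical, and the two prefixes are an illustrative, incomplete list rather than the exact set used in HTML.pm.src.

```python
# Illustrative (incomplete) list of public identifier prefixes, stored in
# uppercase. The exact list in HTML.pm.src is not shown in this ChangeLog.
LIMITED_QUIRKS_PREFIXES = (
    "-//W3C//DTD XHTML 1.0 FRAMESET//",
    "-//W3C//DTD XHTML 1.0 TRANSITIONAL//",
)


def is_limited_quirks(public_id: str) -> bool:
    """Compare case-insensitively by uppercasing the input first."""
    # Without .upper(), a mixed-case identifier would never match the
    # uppercase prefixes above, which is the class of bug the entry describes.
    return public_id.upper().startswith(LIMITED_QUIRKS_PREFIXES)
```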
163    
164    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
165    
166            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
167            are added.
168    
169    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
172            are set.  Cases that are unlikely to be reached are noted as such.
173    
174    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src: Checkpoints for debugging are added.
177    
178            * mkhtmlparser.pl: Support for |!!!cp| syntax.
179    
180    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
183            for simplicity.
184    
185    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
188            revision 1307).
189    
190    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src (_get_next_token): The places where the /incorrect/
193            flag is set are changed (HTML5 revision 1305).
194    
195    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
198            1292).  Entities are not parsed in comment-like part in RCDATA
199            elements (HTML5 revision 1294).  Allow bare & at the end
200            of attribute value literals (HTML5 revision 1296).  More
201            quirks mode doctypes (HTML5 revision 1302).  Require spaces
202            between attributes and ban attribute names or unquoted
203            attribute values containing single or double quotes (HTML5
204            revision 1303).
205    
206    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
207    
208            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
209            and related errors unless it is an HTML document (though the spec
210            is unclear on whether they apply to XHTML documents).
211    
212            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
213            are added.
214    
215    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src: s/local_name/manakai_local_name/g.
218    
219    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
220    
221            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
222    
223            * HTML.pm.src: |charset| in |content| attribute is
224            case-insensitive (HTML5 revision 1270).
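
As a hedged illustration of case-insensitive |charset| matching, here is a small Python sketch. The helper name and the regular expression are assumptions made for this example and simplify the algorithm in the HTML5 draft; they are not taken from HTML.pm.src.

```python
import re

# Hypothetical helper: locate the charset parameter in a <meta content="...">
# value, matching the parameter name without regard to case.
_CHARSET_RE = re.compile(r'charset\s*=\s*["\']?([^\s;"\']+)', re.IGNORECASE)


def charset_from_content(content: str):
    """Return the charset value found in content, or None if there is none."""
    m = _CHARSET_RE.search(content)
    return m.group(1) if m else None
```

With this sketch, `CHARSET=utf-8` and `charset=utf-8` are treated the same way.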
225    
226    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm: New status constants are added.
229            ($ElementDefault): |status| added.
230            (check_element): Err for non-standard or deprecated elements.
231            (_attr_status_info): For non-standard or deprecated attributes.
232    
233    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
234    
235            * ContentChecker.pm (_attr_status_info): New internal method.
236    
237    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
238    
239            * ContentChecker.pm (check_element): Element standardized
240            status information is now dispatched.
241    
242    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
243    
244            * ContentChecker.pm (check_element): Fix |del|-and-significant
245            problem by adding some more arguments.
246    
247    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
248    
249            * ContentChecker.pm (check_element): Use context of
250            container-for-the-purpose-of-content-model element (not transparent
251            element) for |check_child_element| calling and significant
252            text flag marking.  This reintroduces the |<del>|-and-significant
253            problem.
254    
255    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
256    
257            * ContentChecker.pm (check_element): Make semi-transparent
258            elements ignored for the purpose of phase changes in
259            content model checking.
260    
261    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
262    
263            * ContentChecker.pm (check_element): In-element state
264            was not properly managed for transparent cases.
265    
266    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
267    
268            * ContentChecker.pm (check_element): Support for |video|
269            and |audio| as semi-transparent elements.
270    
271    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
272    
273            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
274            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
275            Support for |html:object| as a semi-transparent element.
276    
277    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
278    
279            * ContentChecker.pm (check_element): The way to traverse
280            the tree is entirely revised to make it easier to track
281            the state of ancestors/descendants.  As a result of this
282            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
283            support for content model checking for HTML elements |figure|,
284            |object|, |video|, and |audio| and checking for XML elements (and
285            some XMLNS checkings) are dropped for now.  They will be
286            reimplemented in due course.
287    
288    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
289    
290            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
291            token (HTML5 revision 1225).
292    
293    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
294    
295            * ContentChecker.pm ({unsupported_level}): New value.
296    
297            * HTML.pm.src: Save whether |meta| |content| attribute
298            contains character references or not.
299    
300    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
301    
302            * ContentChecker.pm (_get_children): (Incomplete) attempt to
303            implement significant content checking for contents
304            with |del| elements.
305    
306    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
307    
308            * ContentChecker.pm ($HTMLTransparentElements): More
309            elements are added.
310            (_get_children): HTML |object| elements are now semi-transparent.
311    
312            * NanoDOM.pm (manakai_html, manakai_head): New methods.
313    
314    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
315    
316            * CacheManifest.pm: HTML5 revision 1211 implemented.
317    
318            * CacheManifest.pod: Updated.
319    
320    2008-02-10  Wakaba  <wakaba@suika.fam.cx>
321    
322            * ContentChecker.pm (check_document, check_element): Support
323            for second argument ($onsubdoc).
324            (_get_css_parser): Removed (now it is part of WDCC).
325    
326    2008-02-09  Wakaba  <wakaba@suika.fam.cx>
327    
328            * ContentChecker.pm (_get_css_parser): New.
329    
330    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
331    
332            * ContentChecker.pm ($AnyChecker): The old way to add child elements
333            for checking was still being used.
334    
335    2007-11-25  Wakaba  <wakaba@suika.fam.cx>
336    
337            * ContentChecker.pm (check_element): New todo item type |descendant|.
338    
339    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
340    
341            * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
342            specced in RFC 2046 and RFC 4288.
343            (application/atom+xml): Definition added.
344    
345    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
346    
347            * URIChecker.pm: Make RFC 3986 should-level errors
348            warnings (rather than SHOULD-level errors).
349    
350    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
351    
352            * NanoDOM.pm (get_user_data, set_user_data): New methods.
353    
354            * HTML.pm.src: A flag for character references in attribute
355            values is added.  Set |manakai_has_reference| user data
356            to |charset| attribute.
357    
358    2007-11-23  Wakaba  <wakaba@suika.fam.cx>
359    
360            * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
361            attributes.
362    
363            * ContentChecker.pm (check_document): Warn if charset requirements
364            cannot be tested.
365    
366    2007-11-19  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src (parse_byte_string): Detect charset
369            by universalchardet if charset parameter is not specified.
370    
371            * Makefile (Charset-all, Charset-clean): New rules.
372    
373    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
374    
375            * ContentChecker.pm (check_document): Check the existence
376            of character encoding declaration and selection of encoding
377            for HTML documents.
378    
379    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
380    
381            * ContentType.pm (get_sniffed_type): Return also the official
382            type in list context.
383    
384    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
385    
386            * ContentType.pm: Sniffing with leading white space
387            ignoring (HTML5 revisions 1013 and 1016).
388    
389    2007-11-18  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentType.pm: HTML5 revision 1013 changes, except for leading
392            white spaces, are implemented.
393    
394    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src (parse_char_string): Set |inner_encoding|
397            attribute if possible.
398    
399    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
400    
401            * HTML.pm.src (parse_byte_string): New method.
402            (parse_char_string): New alias for |parse_string|.
403            (main phase): Invoking "change the encoding" algorithm if desired.
404    
405            * HTML.pod: Updated.
406    
407    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pod (get_inner_html): Removed.
410    
411            * Makefile (HTML-all, HTML-clean): New.
412    
413    2007-11-11  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
416    
417    2007-11-08  Wakaba  <wakaba@suika.fam.cx>
418    
419            * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
420    
421    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
422    
423            * Makefile: |CacheManifest.html| is added.
424    
425            * CacheManifest.pod: New file.
426    
427    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
428    
429            * CacheManifest.pm: New module.
430    
431    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
432    
433            * HTML.pm.src: Support for application cache selection algorithm
434            callback.
435    
436    2007-11-04  Wakaba  <wakaba@suika.fam.cx>
437    
438            * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
439    
440    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
441    
442            * Makefile (clean): New rule.
443    
444            * NanoDOM.pm (public_id, system_id): New attributes.
445    
446    2007-10-17  Wakaba  <wakaba@suika.fam.cx>
447    
448            * Makefile (CSS-all, CSS-clean, clean): New rules.
449    
450    2007-10-14  Wakaba  <wakaba@suika.fam.cx>
451    
452            * ContentChecker.pm (check_document): Support for
453            new |is_xml_root| flag.
454            (check_element): Support for new |pluses| state.
455            (_add_pluses): New method.
456            (_remove_minuses): Support for new |minus| item.
457    
458    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
459    
460            * ContentChecker.pm: Raise specific error for invalid
461            root element.
462    
463    2007-09-24  Wakaba  <wakaba@suika.fam.cx>
464    
465            * ContentChecker.pm: Set level values for later uses.
466    
467    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
468    
469            * ContentChecker.pm: Support for language tag validation.
470    
471    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
472    
473            * LangTag.pm (check_rfc3066_language_tag): New method.
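
For orientation, the RFC 3066 tag syntax can be checked with a short Python sketch. It covers the grammar only (a primary subtag of 1 to 8 letters, followed by subtags of 1 to 8 letters or digits); the function name is hypothetical, and the errors and warnings actually raised by |check_rfc3066_language_tag| are not described in this ChangeLog.

```python
import re

# RFC 3066 grammar: Primary-subtag = 1*8ALPHA, Subtag = 1*8(ALPHA / DIGIT),
# with subtags separated by "-".
_RFC3066_RE = re.compile(r"[A-Za-z]{1,8}(?:-[A-Za-z0-9]{1,8})*")


def is_rfc3066_syntax(tag: str) -> bool:
    """Return True if tag matches the RFC 3066 Language-Tag grammar."""
    return _RFC3066_RE.fullmatch(tag) is not None
```

A syntactically valid tag is not necessarily a registered one; registry lookups are outside the scope of this sketch.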
474    
475    2007-09-09  Wakaba  <wakaba@suika.fam.cx>
476    
477            * LangTag.pm: New module.
478    
479    2007-09-04  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTML.pm.src: Some error types were wrong.
482    
483    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
484    
485            * CSS/: New directory.
486    
487    2007-08-17  Wakaba  <wakaba@suika.fam.cx>
488    
489            * ContentChecker.pm (_check_get_children): Support
490            for |noscript| in |head|.
491    
492    2007-08-12  Wakaba  <wakaba@suika.fam.cx>
493    
494            * URI/: New directory.
495    
496    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
497    
498            * HTML.pm.src: Tokenizer's states are now represented as
499            numbers.
500    
501    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
502    
503            * HTML.pm.src: |or|s for insertion modes are replaced
504            by |&|s.
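
One plausible reading of this entry is that chains of comparisons joined by |or| were replaced by a single bitwise test against a mask. The Python sketch below illustrates that idea under this assumption; the constant names and values are invented for the example and are not those of HTML.pm.src.

```python
# Hypothetical numeric insertion modes, one bit each.
IN_BODY_IM = 0x01
IN_TABLE_IM = 0x02
IN_ROW_IM = 0x04
IN_CELL_IM = 0x08

# A group of modes is a single mask built once with bitwise OR.
TABLE_IMS = IN_TABLE_IM | IN_ROW_IM | IN_CELL_IM


def in_table_family(mode: int) -> bool:
    """One bitwise AND replaces a chain of equality tests joined by 'or'."""
    return bool(mode & TABLE_IMS)
```

The trade-off is that each mode needs its own bit, which limits the number of modes to the width of the integer type in languages with fixed-size integers.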
505    
506    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
507    
508            * HTML.pm.src: Token types are now represented as numbers.
509    
510    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
511    
512            * ContentType.pm (SEE ALSO): Updated.
513    
514            * HTML.pm.src: Insertion modes are now represented as numbers.
515    
516    2007-08-11  Wakaba  <wakaba@suika.fam.cx>
517    
518            * ContentType.pm: Sniffing for bitmap images (HTML5 revision
519            999) is implemented.
520    
521    2007-08-08  Wakaba  <wakaba@suika.fam.cx>
522    
523            * ContentType.pm: Sniffing for |<script| (HTML5 revision
524            983) is implemented.
525    
526    2007-08-06  Wakaba  <wakaba@suika.fam.cx>
527    
528            * ContentChecker.pod: New documentation.
529    
530            * Makefile: A rule for |ContentChecker.html| is added.
531    
532            * ContentChecker.pm: A pod "LICENSE" section is added.
533    
534            * NanoDOM.pm ($VERSION): New variable.
535    
536    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
537    
538            * H2H.pm: |b|, |i|, and |sub| are added to the
539            list of allowed HTML elements.
540    
541    2007-08-05  Wakaba  <wakaba@suika.fam.cx>
542    
543            * H2H.pm: |samp| is added to the list of allowed
544            HTML elements.
545    
546            * URIChecker.pm (check_iri): New.
547            (check_iri_reference): Error type for IRI reference
548            syntax error is changed.
549    
550    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
551    
552            * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
553            is added.
554            (check_document): Load appropriate module before validation.
555    
556    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
557    
558            * ContentChecker/: New directory.
559    
560    2007-08-04  Wakaba  <wakaba@suika.fam.cx>
561    
562            * ContentChecker.pm: HTML |time| element is implemented.
563    
564            * HTMLTable.pm: Comments are updated as HTML5 is revised.
565    
566    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
567    
568            * ContentChecker.pm (check_document): Return value
569            even if no document element is found.
570    
571    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
572    
573            * HTML.pm.src: |$in_body| is no longer a function.
574    
575    2007-07-21  Wakaba  <wakaba@suika.fam.cx>
576    
577            * HTML.pm.src: The |$in_body| code has been moved down.