/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.132 by wakaba, Sun Feb 10 04:09:57 2008 UTC revision 1.233 by wakaba, Sat May 17 04:54:10 2008 UTC
# Line 1  Line 1 
1    2008-05-17  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
4            algorithm, except for the actual sniffing, is implemented
5            with new framework with Message::Charset::Info.
6    
7    2008-05-16  Wakaba  <wakaba@suika.fam.cx>
8    
9            * CacheManifest.pm (_parse): Drop fragment identifiers from
10            URIs in fallback section (HTML5 revision 1596).
11    
12    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
13    
14            * Makefile (Entities.html): URI has changed.
15    
16    2008-05-10  Wakaba  <wakaba@suika.fam.cx>
17    
18            * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
19            1553).
20    
21    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
22    
23            * ContentChecker.pm: Noted that those returned in |table| are
24            no longer table elements, but table objects returned
25            by Whatpm::HTMLTable.
26    
27            * HTMLTable.pm (form_table): Return table element node
28            as |$table->{element}|.
29            (assign_header): Support for the |headers=""| attribute.
30    
31    2008-05-06  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTMLTable.pm (assign_header): New function; first version
34            with no support for headers="".
35            (form_table): Include table width and height to the returned
36            table object for covenience.  Indexing in column assignement was wrong.
37            Set whether a data cell is empty or not for convenience.
38    
39    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
42    
43    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTMLTable.pm: How table model errors are detected is
46            changed (HTML5 revision 1387).
47    
48    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTMLTable.pm: The algorithm now moves |tfoot| elements
51            to the end of the table (HTML5 revision 1380).
52    
53    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTMLTable.pm: The algorithm is now 0-based indexing, instead
56            of 1-based (HTML5 revision 1376).
57    
58    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
59    
60            * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
61            and "Content-Encoding" no longer prevent sniffing (HTML5
62            revision 1288).
63    
64    2008-05-05  Wakaba  <wakaba@suika.fam.cx>
65    
66            * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
67            1282).
68    
69    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
70    
71            * ContentChecker.pm: Support for global attributes.
72            Status of XML specs are added.
73    
74    2008-05-03  Wakaba  <wakaba@suika.fam.cx>
75    
76            * ContentChecker.pm (check_element): Support for |template=""|
77            and |ref=""| attribute (referent element type checking).
78    
79    2008-04-29  Wakaba  <wakaba@suika.fam.cx>
80    
81            * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
82            1500) is implemented (except for IDNA part and URI-scheme-specific
83            knowledge).  Line number counting was wrong for LF-only
84            documents.
85    
86    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: Raise an parse error for any disallowed
89            character (HTML5 revision 1263).
90    
91    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
92    
93            * mkentitylist.pl: Support for new HTML5 entity table format (the
94            definition for |AElig;| was missing).
95    
96    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
97    
98            * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
99            name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
100    
101    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: List of element names that close foreign content
104            insertion mode is added (HTML5 revisions 1412 and 1418).
105    
106    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Support for |mglyph| and |malignmark|
109            elements (HTML5 revision 1410).
110    
111    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: Support for new long MathML entities (HTML5
114            revision 1406).
115    
116    2008-04-13  Wakaba  <wakaba@suika.fam.cx>
117    
118            * HTML.pm.src: CDATA section support for MathML and SVG
119            elements (HTML5 revision 1404 and 1420).
120    
121    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
124            elements (HTML5 revision 1404).  Unused !!!macro definitions
125            are removed.
126    
127    2008-04-12  Wakaba  <wakaba@suika.fam.cx>
128    
129            * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
130            are raised is changed (HTML5 revision 1404).
131    
132    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: Category sets that are no longer used are removed.
135    
136    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: The ->[1] property of stack entries are now
139            replaced by constants representing element category.
140    
141    2008-04-06  Wakaba  <wakaba@suika.fam.cx>
142    
143            * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
144            for error reporting.  (This is a preparation for using constant
145            value for ->[1].)
146    
147    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
148    
149            * RDFXML.pm: Typo fixed.
150    
151    2008-03-22  Wakaba  <wakaba@suika.fam.cx>
152    
153            * ContentChecker.pm: |fact_level| is now treated
154            as same as |must_level|, i.e. level = |m|.
155            (check_element): Make list of URIs in the DOM.
156    
157    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
158    
159            * RDFXML.pm: Language accessor implemented.
160            Local (null-namespace) attribute support.
161    
162    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
163    
164            * RDFXML.pm: Factored out ID checking code.
165    
166    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
167    
168            * RDFXML.pm: TODO items noted.  Validation of ID
169            and URI attributes is implemented.  Warn if unknown
170            value is used in rdf:parseType="" attribute.
171    
172            * URIChecker.pm (check_rdf_uri_reference): New function.
173    
174    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
175    
176            * RDFXML.pm: bnodeid implemented.  Relative references
177            are now resolved.
178    
179    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
180    
181            * ContentChecker.pm: RDF reification implemented.
182    
183            * RDFXML.pm: undef vs false bug fixed.
184            Reification implemented.
185    
186    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
187    
188            * RDFXML.pm: s/id/ID/ for attribute name.
189            The |node| arguments are added for |ontriple| calls.
190            Too many "attribute not allowed" errors were raised.
191    
192            * ContentChecker.pm: Initial experimental support for rdf:RDF
193            element.
194    
195    2008-03-21  Wakaba  <wakaba@suika.fam.cx>
196    
197            * RDFXML.pm: New module.
198    
199    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src (set_inner_html): Line/column number
202            code was old one yet.
203    
204    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: Better line/column reporting for "duplicate attribute"
207            errors.  Line/column markings for DOCTYPE, comment, and
208            character tokens are reintroduced; otherwise,
209            error location for "not HTML5" error and errors
210            for implied elements are not attached.
211    
212    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Set line/column numbers to attributes.
215    
216            * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
217            (value): Setter implemented.
218    
219            * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
220    
221    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src: Unused line/column markings are removed.
224    
225    2008-03-20  Wakaba  <wakaba@suika.fam.cx>
226    
227            * HTML.pm.src (_get_next_token): Remove |first_start_tag|
228            flag, which is no longer used.
229    
230    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
231    
232            * HTML.pm.src: Set line/column information to element nodes.
233    
234            * mkhtmlparser.pl (!!!create-element, !!!insert-element,
235            and !!!insert-element-t): Set line/column information to
236            element nodes.
237    
238    2008-03-17  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src (_get_next_token): The first "<" character
241            in "<?", "<>", or "</>" should be the error point.
242    
243    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Some more fixes on error position reporting.
246    
247    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
248    
249            * HTML.pm.src: Token-level precious error reporting.
250    
251    2008-03-16  Wakaba  <wakaba@suika.fam.cx>
252    
253            * HTML.pm.src: Preparation for more precious error point
254            reporting.
255    
256    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src: Error type revised.
259    
260    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
261    
262            * HTML.pm.src: Similar codes are merged together, again.
263    
264    2008-03-11  Wakaba  <wakaba@suika.fam.cx>
265    
266            * HTML.pm.src: Similar codes are merged together.
267    
268    2008-03-10  Wakaba  <wakaba@suika.fam.cx>
269    
270            * mkhtmlparser.pl: Set "level" parameter to parse errors.
271    
272            * HTML.pm.src: Code refined.
273    
274    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: |</body>| treatement has been changed (HTML5
277            revision 1348).  Note that I really don't know this makes
278            any difference in the black-box behavior of the parser.
279    
280    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: New end-of-file token implementation (HTML5
283            revision 1348).
284    
285    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: |applet| support (HTML5 revision 1347).
288    
289    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
292    
293    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src: Support for |<input>| in the "in select" insertion
296            mode and support for the "in select in table" insertion mode (HTML5
297            revision 1342).
298    
299    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: No foster parenting for <script> and <script>
302            in non-tainted <table>s (HTML5 revision 1336).
303    
304    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: Ignore white space characters between <html>
307            and <head> (HTML5 revision 1332).
308    
309    2008-03-09  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: Treat <input type=hidden> as if it were a
312            white space (HTML5 revision 1331).
313    
314    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
317            element (HTML5 revision 1330).
318    
319    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: <title> is always appended to the current
322            element (HTML5 revision 1328).
323    
324    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src: White space in tainted tables are moved
327            into foster parents (HTML5 revision 1326).
328    
329    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
330    
331            * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
332            revision 1321).
333    
334    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: |</h/n/>| case code rearranged to align with
337            the spec (HTML5 revision 1320).  Note that we finally complete
338            all of HTML5 revision 1320 changes.
339    
340    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
343            tags (HTML5 revision 1320).
344    
345    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
346    
347            * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
348    
349    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: A "generate implied end tags" code (t409.1)
352            could not be reached so that it is now removed (HTML5 revision
353            1320).
354    
355    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
356    
357            * HTML.pm.src: Code for the case of |</div>| and so on are revised
358            to align with new spec text (HTML5 revision 1320).
359    
360    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Remove strange |if| condition; however, it should
363            have had no harm in theory.
364    
365    2008-03-08  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src (_tree_construction_main): '</p>' in body
368            case is split from other end tags for the preparation
369            of implementing HTML revision 1320.
370    
371    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
372    
373            * HTML.pm.src: Simplified "generate implied end tag" (HTML5
374            revision 1320).
375    
376    2008-03-07  Wakaba  <wakaba@suika.fam.cx>
377    
378            * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
379            and "div" (HTML5 revision 1318).  Add comments to where
380            |form| pointer association codes should be inserted (HTML5 revision
381            1319).
382    
383    2008-03-06  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
386    
387    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
388    
389            * HTML.pm.src: Since the case t268 should never be reached (no
390            other token type, there are), it is replaced by a |die| statement.
391    
392    2008-03-05  Wakaba  <wakaba@suika.fam.cx>
393    
394            * HTML.pm.src: Typo fixed.
395    
396    2008-03-04  Wakaba  <wakaba@suika.fam.cx>
397    
398            * HTML.pm.src (_tree_construction_initial): Some limited quirks
399            doctypes were not uppercased for comparison.
400    
401    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
402    
403            * HTML.pm.src (tree construction and set_inner_html): Checkpoints
404            are added.
405    
406    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
409            are set.  Cases that are unlikely reached are noted as so.
410    
411    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
412    
413            * HTML.pm.src: Checkpoints for debugging are added.
414    
415            * mkhtmlparser.pl: Support for |!!!cp| syntax.
416    
417    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
420            for simplicity.
421    
422    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
425            revision 1307).
426    
427    2008-03-03  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
430            is set are changed (HTML5 revision 1305).
431    
432    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
435            1292).  Entities are not parsed in comment-like part in RCDATA
436            elements (HTML5 revision 1294).  Allow bare & at the end
437            of attribute value literals (HTML5 revision 1296).  More
438            quirks mode doctypes (HTML5 revision 1302).  Requires spaces
439            between attributes and ban attribute names or unquoted
440            attribute values containing single or double quotes (HTML5
441            revision 1303).
442    
443    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
444    
445            * ContentChecker.pm: Typo fixed.  Don't raise "character encoding"
446            and related errors unless it is an HTML document (though the spec
447            is unclear on whether it is applied to XHTML document).
448    
449            * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
450            are added.
451    
452    2008-03-02  Wakaba  <wakaba@suika.fam.cx>
453    
454            * HTML.pm.src: s/local_name/manakai_local_name/g.
455    
456    2008-03-01  Wakaba  <wakaba@suika.fam.cx>
457    
458            * _NamedEntityList.pm: Updated (HTML5 revision 1286).
459    
460            * HTML.pm.src: |charset| in |content| attribute is
461            case-insensitive (HTML5 revision 1270).
462    
463    2008-02-26  Wakaba  <wakaba@suika.fam.cx>
464    
465            * ContentChecker.pm: New status constants are added.
466            ($ElementDefault): |status| added.
467            (check_element): Err for non-standard or deprecated elements.
468            (_attr_status_info): For non-standard or deprecated attributes.
469    
470    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
471    
472            * ContentChecker.pm (_attr_status_info): New internal method.
473    
474    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
475    
476            * ContentChecker.pm (check_element): Element standardized
477            status information is now dispatched.
478    
479    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
480    
481            * ContentChecker.pm (check_element): Fix |del|-and-significant
482            problem by adding some more arguments.
483    
484    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
485    
486            * ContentChecker.pm (check_element): Use context of
487            container-for-the-purpose-of-content-model element (not transparent
488            element) for |check_child_element| calling and significant
489            text flag marking.  This reintroduces |<del>|-and-significant
490            problem again.
491    
492    2008-02-24  Wakaba  <wakaba@suika.fam.cx>
493    
494            * ContentChecker.pm (check_element): Make semi-transparent
495            elements ignored for the purpose of phase changes in
496            content model checking.
497    
498    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
499    
500            * ContentChecker.pm (check_element): In-element state
501            was not properly managed for transparent cases.
502    
503    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
504    
505            * ContentChecker.pm (check_element): Support for |video|
506            and |audio| as semi-transparent elements.
507    
508    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
509    
510            * ContentChecker.pm ($HTMLSemiTransparentElements): New.
511            (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
512            Support for |html:object| as a semi-transparent element.
513    
514    2008-02-23  Wakaba  <wakaba@suika.fam.cx>
515    
516            * ContentChecker.pm (check_element): The way to traverse
517            the tree is entirely revised to make it easier to track
518            the state of ancestors/descendants.  As a result of this
519            revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
520            support for content model checking for HTML elements |figure|,
521            |object|, |video|, and |audio| and checking for XML elements (and
522            some XMLNS checkings) are dropped for now.  They will be
523            reimplemented in due cource.
524    
525    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
526    
527            * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
528            token (HTML5 revision 1225).
529    
530    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
531    
532            * ContentChecker.pm ({unsupported_level}): New value.
533    
534            * HTML.pm.src: Save whether |meta| |content| attribute
535            contains character references or not.
536    
537    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
538    
539            * ContentChecker.pm (_get_children): (Incompleted) attempt to
540            imlement significant content checking for contents
541            with |del| elements.
542    
543    2008-02-17  Wakaba  <wakaba@suika.fam.cx>
544    
545            * ContenteChecker.pm ($HTMLTransparentElements): More
546            elements are added.
547            (_get_children): HTML |object| elements are now semi-transparent.
548    
549            * NanoDOM.pm (manakai_html, manakai_head): New methods.
550    
551    2008-02-16  Wakaba  <wakaba@suika.fam.cx>
552    
553            * CacheManifest.pm: HTML5 revision 1211 implemented.
554    
555            * CacheManifest.pod: Updated.
556    
557  2008-02-10  Wakaba  <wakaba@suika.fam.cx>  2008-02-10  Wakaba  <wakaba@suika.fam.cx>
558    
559          * ContentChecker.pm (check_document, check_element): Support          * ContentChecker.pm (check_document, check_element): Support

Legend:
Removed from v.1.132  
changed lines
  Added in v.1.233

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24