/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.236 - (hide annotations) (download)
Sat May 17 12:29:23 2008 UTC (16 years, 11 months ago) by wakaba
Branch: MAIN
Changes since 1.235: +7 -0 lines
++ whatpm/Whatpm/ChangeLog	17 May 2008 12:28:47 -0000
	* HTML.pm.src (parse_byte_string): Use streaming decoder
	rather than converting the whole byte string and then parsing.
	Propagate errors in character encoding layer.
	(get_next_token): Precise error reporting for |bare stago| error.

2008-05-17  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	17 May 2008 12:29:09 -0000
2008-05-17  Wakaba  <wakaba@suika.fam.cx>

	* DecodeHandle.pm (ByteBuffer): New class.

1 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
4     rather than converting the whole byte string and then parsing.
5     Propagate errors in character encoding layer.
6     (get_next_token): Precise error reporting for |bare stago| error.
7    
8     2008-05-17 Wakaba <wakaba@suika.fam.cx>
9    
10 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
11     (parse_char_string): This method is now defined as an invocation
12     of the |parse_char_stream| method.
13    
14     2008-05-17 Wakaba <wakaba@suika.fam.cx>
15    
16 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
17     of the sniffing as info-level errors. Support for new
18     decoding framework in parser resestting.
19     (new): Various default error levels were not set.
20    
21     2008-05-17 Wakaba <wakaba@suika.fam.cx>
22    
23 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
24     algorithm, except for the actual sniffing, is implemented
25     with new framework with Message::Charset::Info.
26    
27 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
28    
29     * CacheManifest.pm (_parse): Drop fragment identifiers from
30     URIs in fallback section (HTML5 revision 1596).
31    
32 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
33    
34 wakaba 1.231 * Makefile (Entities.html): URI has changed.
35    
36     2008-05-10 Wakaba <wakaba@suika.fam.cx>
37    
38 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
39     1553).
40    
41 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
42    
43 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
44     no longer table elements, but table objects returned
45     by Whatpm::HTMLTable.
46    
47     * HTMLTable.pm (form_table): Return table element node
48     as |$table->{element}|.
49     (assign_header): Support for the |headers=""| attribute.
50    
51     2008-05-06 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
54     with no support for headers="".
55     (form_table): Include table width and height to the returned
56     table object for covenience. Indexing in column assignement was wrong.
57     Set whether a data cell is empty or not for convenience.
58    
59 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
60    
61 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
62    
63     2008-05-05 Wakaba <wakaba@suika.fam.cx>
64    
65 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
66     changed (HTML5 revision 1387).
67    
68     2008-05-05 Wakaba <wakaba@suika.fam.cx>
69    
70 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
71     to the end of the table (HTML5 revision 1380).
72    
73     2008-05-05 Wakaba <wakaba@suika.fam.cx>
74    
75 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
76     of 1-based (HTML5 revision 1376).
77    
78     2008-05-05 Wakaba <wakaba@suika.fam.cx>
79    
80 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
81     and "Content-Encoding" no longer prevent sniffing (HTML5
82     revision 1288).
83    
84     2008-05-05 Wakaba <wakaba@suika.fam.cx>
85    
86 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
87     1282).
88    
89 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
90    
91 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
92     Status of XML specs are added.
93    
94     2008-05-03 Wakaba <wakaba@suika.fam.cx>
95    
96 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
97     and |ref=""| attribute (referent element type checking).
98    
99 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
100    
101 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
102     1500) is implemented (except for IDNA part and URI-scheme-specific
103 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
104     documents.
105    
106 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
107    
108 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
109     character (HTML5 revision 1263).
110    
111     2008-04-13 Wakaba <wakaba@suika.fam.cx>
112    
113 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
114     definition for |AElig;| was missing).
115    
116     2008-04-13 Wakaba <wakaba@suika.fam.cx>
117    
118 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
119     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
120    
121     2008-04-13 Wakaba <wakaba@suika.fam.cx>
122    
123 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
124     insertion mode is added (HTML5 revisions 1412 and 1418).
125    
126     2008-04-13 Wakaba <wakaba@suika.fam.cx>
127    
128 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
129     elements (HTML5 revision 1410).
130    
131     2008-04-13 Wakaba <wakaba@suika.fam.cx>
132    
133 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
134     revision 1406).
135    
136     2008-04-13 Wakaba <wakaba@suika.fam.cx>
137    
138 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
139     elements (HTML5 revision 1404 and 1420).
140    
141 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
142    
143 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
144     elements (HTML5 revision 1404). Unused !!!macro definitions
145     are removed.
146    
147     2008-04-12 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
150     are raised is changed (HTML5 revision 1404).
151    
152 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
155    
156     2008-04-06 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
159     replaced by constants representing element category.
160    
161     2008-04-06 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
164     for error reporting. (This is a preparation for using constant
165     value for ->[1].)
166    
167 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
168    
169 wakaba 1.205 * RDFXML.pm: Typo fixed.
170    
171     2008-03-22 Wakaba <wakaba@suika.fam.cx>
172    
173 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
174     as same as |must_level|, i.e. level = |m|.
175     (check_element): Make list of URIs in the DOM.
176    
177 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
180     Local (null-namespace) attribute support.
181    
182     2008-03-21 Wakaba <wakaba@suika.fam.cx>
183    
184 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
185    
186     2008-03-21 Wakaba <wakaba@suika.fam.cx>
187    
188 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
189     and URI attributes is implemented. Warn if unknown
190     value is used in rdf:parseType="" attribute.
191    
192     * URIChecker.pm (check_rdf_uri_reference): New function.
193    
194     2008-03-21 Wakaba <wakaba@suika.fam.cx>
195    
196 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
197     are now resolved.
198    
199     2008-03-21 Wakaba <wakaba@suika.fam.cx>
200    
201 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
202    
203     * RDFXML.pm: undef vs false bug fixed.
204     Reification implemented.
205    
206     2008-03-21 Wakaba <wakaba@suika.fam.cx>
207    
208 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
209     The |node| arguments are added for |ontriple| calls.
210     Too many "attribute not allowed" errors were raised.
211    
212     * ContentChecker.pm: Initial experimental support for rdf:RDF
213     element.
214    
215     2008-03-21 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.197 * RDFXML.pm: New module.
218    
219 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
220    
221 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
222     code was old one yet.
223    
224     2008-03-20 Wakaba <wakaba@suika.fam.cx>
225    
226 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
227     errors. Line/column markings for DOCTYPE, comment, and
228     character tokens are reintroduced; otherwise,
229     error location for "not HTML5" error and errors
230     for implied elements are not attached.
231    
232     2008-03-20 Wakaba <wakaba@suika.fam.cx>
233    
234 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
235    
236     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
237     (value): Setter implemented.
238    
239     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
240    
241     2008-03-20 Wakaba <wakaba@suika.fam.cx>
242    
243 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
244    
245     2008-03-20 Wakaba <wakaba@suika.fam.cx>
246    
247 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
248     flag, which is no longer used.
249    
250 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
251    
252 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
253    
254     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
255     and !!!insert-element-t): Set line/column information to
256     element nodes.
257    
258     2008-03-17 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
261     in "<?", "<>", or "</>" should be the error point.
262    
263 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
264    
265 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
266    
267     2008-03-16 Wakaba <wakaba@suika.fam.cx>
268    
269 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
270    
271     2008-03-16 Wakaba <wakaba@suika.fam.cx>
272    
273 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
274     reporting.
275    
276 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
277    
278 wakaba 1.186 * HTML.pm.src: Error type revised.
279    
280     2008-03-11 Wakaba <wakaba@suika.fam.cx>
281    
282 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
283    
284     2008-03-11 Wakaba <wakaba@suika.fam.cx>
285    
286 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
287    
288 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
289    
290     * mkhtmlparser.pl: Set "level" parameter to parse errors.
291    
292     * HTML.pm.src: Code refined.
293    
294 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
295    
296 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
297     revision 1348). Note that I really don't know this makes
298     any difference in the black-box behavior of the parser.
299    
300     2008-03-09 Wakaba <wakaba@suika.fam.cx>
301    
302 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
303     revision 1348).
304    
305     2008-03-09 Wakaba <wakaba@suika.fam.cx>
306    
307 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
308    
309     2008-03-09 Wakaba <wakaba@suika.fam.cx>
310    
311 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
312    
313     2008-03-09 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
316     mode and support for the "in select in table" insertion mode (HTML5
317     revision 1342).
318    
319     2008-03-09 Wakaba <wakaba@suika.fam.cx>
320    
321 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
322     in non-tainted <table>s (HTML5 revision 1336).
323    
324     2008-03-09 Wakaba <wakaba@suika.fam.cx>
325    
326 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
327     and <head> (HTML5 revision 1332).
328    
329     2008-03-09 Wakaba <wakaba@suika.fam.cx>
330    
331 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
332     white space (HTML5 revision 1331).
333    
334 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
337     element (HTML5 revision 1330).
338    
339     2008-03-08 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
342     element (HTML5 revision 1328).
343    
344     2008-03-08 Wakaba <wakaba@suika.fam.cx>
345    
346 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
347     into foster parents (HTML5 revision 1326).
348    
349     2008-03-08 Wakaba <wakaba@suika.fam.cx>
350    
351 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
352     revision 1321).
353    
354     2008-03-08 Wakaba <wakaba@suika.fam.cx>
355    
356 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
357     the spec (HTML5 revision 1320). Note that we finally complete
358     all of HTML5 revision 1320 changes.
359    
360     2008-03-08 Wakaba <wakaba@suika.fam.cx>
361    
362 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
363     tags (HTML5 revision 1320).
364    
365     2008-03-08 Wakaba <wakaba@suika.fam.cx>
366    
367 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
368    
369     2008-03-08 Wakaba <wakaba@suika.fam.cx>
370    
371 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
372     could not be reached so that it is now removed (HTML5 revision
373     1320).
374    
375     2008-03-08 Wakaba <wakaba@suika.fam.cx>
376    
377 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
378     to align with new spec text (HTML5 revision 1320).
379    
380     2008-03-08 Wakaba <wakaba@suika.fam.cx>
381    
382 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
383     have had no harm in theory.
384    
385     2008-03-08 Wakaba <wakaba@suika.fam.cx>
386    
387 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
388     case is split from other end tags for the preparation
389     of implementing HTML revision 1320.
390    
391 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
392    
393 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
394     revision 1320).
395    
396     2008-03-07 Wakaba <wakaba@suika.fam.cx>
397    
398 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
399     and "div" (HTML5 revision 1318). Add comments to where
400     |form| pointer association codes should be inserted (HTML5 revision
401     1319).
402    
403 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
404    
405     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
406    
407 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
408    
409 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
410     other token type, there are), it is replaced by a |die| statement.
411    
412     2008-03-05 Wakaba <wakaba@suika.fam.cx>
413    
414 wakaba 1.159 * HTML.pm.src: Typo fixed.
415    
416 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
417    
418     * HTML.pm.src (_tree_construction_initial): Some limited quirks
419     doctypes were not uppercased for comparison.
420    
421 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
422    
423 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
424     are added.
425    
426     2008-03-03 Wakaba <wakaba@suika.fam.cx>
427    
428 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
429     are set. Cases that are unlikely reached are noted as so.
430    
431     2008-03-03 Wakaba <wakaba@suika.fam.cx>
432    
433 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
434    
435     * mkhtmlparser.pl: Support for |!!!cp| syntax.
436    
437     2008-03-03 Wakaba <wakaba@suika.fam.cx>
438    
439 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
440     for simplicity.
441    
442     2008-03-03 Wakaba <wakaba@suika.fam.cx>
443    
444 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
445     revision 1307).
446    
447     2008-03-03 Wakaba <wakaba@suika.fam.cx>
448    
449 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
450     is set are changed (HTML5 revision 1305).
451    
452 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
453    
454 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
455     1292). Entities are not parsed in comment-like part in RCDATA
456     elements (HTML5 revision 1294). Allow bare & at the end
457     of attribute value literals (HTML5 revision 1296). More
458     quirks mode doctypes (HTML5 revision 1302). Requires spaces
459     between attributes and ban attribute names or unquoted
460     attribute values containing single or double quotes (HTML5
461     revision 1303).
462    
463     2008-03-02 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
466     and related errors unless it is an HTML document (though the spec
467     is unclear on whether it is applied to XHTML document).
468    
469     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
470     are added.
471    
472     2008-03-02 Wakaba <wakaba@suika.fam.cx>
473    
474 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
475    
476 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
477    
478     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
479    
480     * HTML.pm.src: |charset| in |content| attribute is
481     case-insensitive (HTML5 revision 1270).
482    
483 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
484    
485     * ContentChecker.pm: New status constants are added.
486     ($ElementDefault): |status| added.
487     (check_element): Err for non-standard or deprecated elements.
488     (_attr_status_info): For non-standard or deprecated attributes.
489    
490 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
491    
492 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
493    
494     2008-02-24 Wakaba <wakaba@suika.fam.cx>
495    
496 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
497     status information is now dispatched.
498    
499     2008-02-24 Wakaba <wakaba@suika.fam.cx>
500    
501 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
502     problem by adding some more arguments.
503    
504     2008-02-24 Wakaba <wakaba@suika.fam.cx>
505    
506 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
507     container-for-the-purpose-of-content-model element (not transparent
508     element) for |check_child_element| calling and significant
509     text flag marking. This reintroduces |<del>|-and-significant
510     problem again.
511    
512     2008-02-24 Wakaba <wakaba@suika.fam.cx>
513    
514 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
515     elements ignored for the purpose of phase changes in
516     content model checking.
517    
518 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
519    
520 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
521     was not properly managed for transparent cases.
522    
523     2008-02-23 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
526     and |audio| as semi-transparent elements.
527    
528     2008-02-23 Wakaba <wakaba@suika.fam.cx>
529    
530 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
531     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
532     Support for |html:object| as a semi-transparent element.
533    
534     2008-02-23 Wakaba <wakaba@suika.fam.cx>
535    
536 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
537     the tree is entirely revised to make it easier to track
538     the state of ancestors/descendants. As a result of this
539     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
540     support for content model checking for HTML elements |figure|,
541     |object|, |video|, and |audio| and checking for XML elements (and
542     some XMLNS checkings) are dropped for now. They will be
543     reimplemented in due cource.
544    
545 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
546    
547 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
548     token (HTML5 revision 1225).
549    
550     2008-02-17 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
553    
554     * HTML.pm.src: Save whether |meta| |content| attribute
555     contains character references or not.
556    
557     2008-02-17 Wakaba <wakaba@suika.fam.cx>
558    
559 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
560     imlement significant content checking for contents
561     with |del| elements.
562    
563     2008-02-17 Wakaba <wakaba@suika.fam.cx>
564    
565 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
566     elements are added.
567     (_get_children): HTML |object| elements are now semi-transparent.
568    
569     * NanoDOM.pm (manakai_html, manakai_head): New methods.
570    
571 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
572    
573     * CacheManifest.pm: HTML5 revision 1211 implemented.
574    
575     * CacheManifest.pod: Updated.
576    
577 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
578    
579     * ContentChecker.pm (check_document, check_element): Support
580     for second argument ($onsubdoc).
581     (_get_css_parser): Removed (now it is part of WDCC).
582    
583 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
584    
585     * ContentChecker.pm (_get_css_parser): New.
586    
587 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
588    
589 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
590     for checking had been used.
591    
592     2007-11-25 Wakaba <wakaba@suika.fam.cx>
593    
594 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
595    
596 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
597    
598 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
599     specced in RFC 2046 and RFC 4288.
600     (application/atom+xml): Definition added.
601    
602     2007-11-23 Wakaba <wakaba@suika.fam.cx>
603    
604 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
605     warnings (rather than SHOULD-level errors).
606    
607     2007-11-23 Wakaba <wakaba@suika.fam.cx>
608    
609 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
610    
611     * HTML.pm.src: A flag for character references in attribute
612     values are added. Set |manakai_has_reference| user data
613     to |charset| attribute.
614    
615     2007-11-23 Wakaba <wakaba@suika.fam.cx>
616    
617 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
618     attributes.
619    
620     * ContentChecker.pm (check_document): Warn if charset requirements
621     cannot be tested.
622    
623 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
624    
625     * HTML.pm.src (parse_byte_string): Detect charset
626     by universalchardet if charset parameter is not specified.
627    
628     * Makefile (Charset-all, Charset-clean): New rules.
629    
630 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
631    
632 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
633     of character encoding declaration and selection of encoding
634     for HTML document.
635    
636     2007-11-18 Wakaba <wakaba@suika.fam.cx>
637    
638 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
639     type in list context.
640    
641     2007-11-18 Wakaba <wakaba@suika.fam.cx>
642    
643 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
644     ignoring (HTML5 revisions 1013 and 1016).
645    
646     2007-11-18 Wakaba <wakaba@suika.fam.cx>
647    
648 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
649     white spaces, are implemented.
650    
651 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
652    
653 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
654     attribute if possible.
655    
656     2007-11-11 Wakaba <wakaba@suika.fam.cx>
657    
658 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
659     (parse_char_string): New alias for |parse_string|.
660     (main phase): Invoking "change the encoding" algorithm if desired.
661    
662     * HTML.pod: Updated.
663    
664     2007-11-11 Wakaba <wakaba@suika.fam.cx>
665    
666 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
667    
668     * Makefile (HTML-all, HTML-clean): New.
669    
670     2007-11-11 Wakaba <wakaba@suika.fam.cx>
671    
672     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
673    
674 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
675    
676     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
677    
678 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
681    
682     * CacheManifest.pod: New file.
683    
684     2007-11-04 Wakaba <wakaba@suika.fam.cx>
685    
686 wakaba 1.114 * CacheManifest.pm: New module.
687    
688     2007-11-04 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
691     callback.
692    
693     2007-11-04 Wakaba <wakaba@suika.fam.cx>
694    
695 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
696    
697 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
698    
699 wakaba 1.111 * Makefile (clean): New rule.
700    
701     * NanoDOM.pm (public_id, system_id): New attributes.
702    
703     2007-10-17 Wakaba <wakaba@suika.fam.cx>
704    
705 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
706    
707 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
708    
709     * ContentChecker.pm (check_document): Support for
710     new |is_xml_root| flag.
711     (check_element): Support for new |pluses| state.
712     (_add_pluses): New method.
713     (_remove_minuses): Support for new |minus| item.
714    
715 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
716    
717 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
718     root element.
719    
720     2007-09-24 Wakaba <wakaba@suika.fam.cx>
721    
722 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
723    
724 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
725    
726 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
727    
728     2007-09-09 Wakaba <wakaba@suika.fam.cx>
729    
730 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
731    
732     2007-09-09 Wakaba <wakaba@suika.fam.cx>
733    
734 wakaba 1.103 * LangTag.pm: New module.
735    
736 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
737    
738     * HTML.pm.src: Some error types were wrong.
739    
740 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
741    
742 wakaba 1.101 * CSS/: New directory.
743    
744     2007-08-17 Wakaba <wakaba@suika.fam.cx>
745    
746 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
747     for |noscript| in |head|.
748    
749 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
750    
751     * URI/: New directory.
752    
753 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
754    
755 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
756     number.
757    
758     2007-08-11 Wakaba <wakaba@suika.fam.cx>
759    
760 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
761     by |&|s.
762    
763     2007-08-11 Wakaba <wakaba@suika.fam.cx>
764    
765 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
766    
767     2007-08-11 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
770    
771     * HTML.pm.src: Insertion modes are now represented in number.
772    
773     2007-08-11 Wakaba <wakaba@suika.fam.cx>
774    
775 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
776     999) is implemented.
777    
778 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
779    
780     * ContentType.pm: Sniffing for |<script| (HTML5 revision
781     983) is implemented.
782    
783 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
784    
785     * ContentChecker.pod: New documentation.
786    
787     * Makefile: A rule for |ContentChecker.html| is added.
788    
789     * ContentChecker.pm: A pod "LICENSE" section is added.
790    
791     * NanoDOM.pm ($VERSION): New variable.
792    
793 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
796     list of allowed HTML elements.
797    
798     2007-08-05 Wakaba <wakaba@suika.fam.cx>
799    
800 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
801     HTML elements.
802    
803     * URIChecker.pm (check_iri): New.
804     (check_iri_reference): Error type for IRI reference
805     syntax error is changed.
806    
807 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
808    
809 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
810     is added.
811     (check_document): Load appropriate module before validation.
812    
813     2007-08-04 Wakaba <wakaba@suika.fam.cx>
814    
815 wakaba 1.88 * ContentChecker/: New directory.
816    
817     2007-08-04 Wakaba <wakaba@suika.fam.cx>
818    
819 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
820    
821     * HTMLTable.pm: Comments are updated as HTML5 is revised.
822    
823 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.86 * ContentChecker.pm (check_document): Return value
826     even if no document element is found.
827    
828     2007-07-21 Wakaba <wakaba@suika.fam.cx>
829    
830 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
831    
832     2007-07-21 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
835    
836     2007-07-21 Wakaba <wakaba@suika.fam.cx>
837    
838 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
839     is split into "after html body" and "after html frameset"
840     insertion modes. Their codes are merged with "after body"
841     and "after frameset" codes. |$previous_insertion_mode|
842     has been removed. "after frameset" code is
843     merged with "in frameset" code.
844    
845     2007-07-21 Wakaba <wakaba@suika.fam.cx>
846    
847 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
848     merged with the "in head" insertion mode.
849    
850     2007-07-21 Wakaba <wakaba@suika.fam.cx>
851    
852 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
853     reorganized.
854    
855     2007-07-21 Wakaba <wakaba@suika.fam.cx>
856    
857 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
858     are merged.
859    
860     2007-07-21 Wakaba <wakaba@suika.fam.cx>
861    
862 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
863     with "in table" insertion mode.
864    
865     2007-07-21 Wakaba <wakaba@suika.fam.cx>
866    
867 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
868     modes are merged.
869    
870     2007-07-21 Wakaba <wakaba@suika.fam.cx>
871    
872 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
873     insertion mode!
874    
875     2007-07-21 Wakaba <wakaba@suika.fam.cx>
876    
877 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
878     now directly invoke the handler.
879    
880     2007-07-21 Wakaba <wakaba@suika.fam.cx>
881    
882 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
883     is merged to the "in body" insertion mode code.
884    
885     2007-07-21 Wakaba <wakaba@suika.fam.cx>
886    
887 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
888     insertion modes are merged.
889    
890     2007-07-21 Wakaba <wakaba@suika.fam.cx>
891    
892 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
893    
894     2007-07-21 Wakaba <wakaba@suika.fam.cx>
895    
896 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
897     instead of string constant for |{content_model_flag}|.
898    
899     2007-07-21 Wakaba <wakaba@suika.fam.cx>
900    
901 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
902     to the "duplicate attribute" error.
903    
904 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
907    
908     2007-07-17 Wakaba <wakaba@suika.fam.cx>
909    
910 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
911    
912     * HTML.pm.src: A typo is fixed.
913    
914 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
917     from "in HTML:xml:lang" error. Character position
918     is now the last part of the error type in the URI error
919     description. Report "unsupported" status for language
920     tags, media queries, script codes, and style sheets.
921    
922     2007-07-16 Wakaba <wakaba@suika.fam.cx>
923    
924 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
925     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
926     in XML.
927    
928     * NanoDOM.pm (Attr.owner_document): New attribute.
929    
930     2007-07-16 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.66 * HTML.pm.src: The character immediately following
933     a bare |hcro| was discarded. Fix handling of
934     entity references in attribute values.
935    
936     2007-07-16 Wakaba <wakaba@suika.fam.cx>
937    
938 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
939     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
940     factored out. Error types |in html:#DOCTYPE|
941     and |after html:#DOCTYPE| are merged
942     into |DOCTYPE in the middle|. |</frameset>|
943     in fragment parsing mode changed the
944     insertion mode.
945    
946     2007-07-16 Wakaba <wakaba@suika.fam.cx>
947    
948 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
949     phase is now an insertion mode. Treatments for white
950     space character tokens were incorrect for some
951     insertion modes. An old |meta| case was not removed.
952    
953     2007-07-16 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
956     implemented (but changing the encoding is not yet:-).
957    
958 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
959    
960 wakaba 1.62 * Charset/: New directory.
961    
962     2007-07-15 Wakaba <wakaba@suika.fam.cx>
963    
964 wakaba 1.61 * H2H.pm: New Perl module (created from
965     manakai's H2H.dis).
966    
967     2007-07-15 Wakaba <wakaba@suika.fam.cx>
968    
969 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
970     manakai's SimpleLS.dis).
971    
972 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
973    
974     * HTML.pm.src: |<!---x-->| was not processed correctly.
975    
976 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
977    
978 wakaba 1.58 * HTML.pm.src: Report correct error message
979     for |<body></div></body>|.
980    
981     2007-07-01 Wakaba <wakaba@suika.fam.cx>
982    
983 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
984    
985     2007-06-30 Wakaba <wakaba@suika.fam.cx>
986    
987     * ContentChecker.pm: Return |{term}| list.
988    
989 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
990    
991 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
992     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
993     and <title> outside of head).
994    
995     2007-06-30 Wakaba <wakaba@suika.fam.cx>
996    
997 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
998     and private types/subtypes.
999    
1000     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1001     URIChecker.pm, HTMLTable.pm: Error messages are now
1002     consistent; they are all listed in
1003     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1004    
1005 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1006    
1007 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1008     is now erred. |<datalist>| is implemented.
1009     Attribute checker for |<command>| and |<menu>| are
1010     added. Support for |contextmenu| global attribute
1011     is added.
1012    
1013     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1014    
1015 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1016     of Step 3 has been changed.
1017    
1018     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1019    
1020 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1021    
1022 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1023    
1024 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1025     incorrect.
1026    
1027     * ContentChecker.pm: Support for the |footer| element.
1028     Check URI syntax for space-separated URI attributes.
1029     Support for the |tabindex| attribute. Support
1030     for |datetime| attribute.
1031    
1032     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1033    
1034 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1035     character references). HTML5 revision 1.146 (white space
1036     characters before root start tag). HTML5 revision
1037     1.148 (named character references in attribute values).
1038     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1039    
1040     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1041    
1042     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1043 wakaba 1.49 in <head>).
1044    
1045     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1048     in body).
1049    
1050     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1051    
1052     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1053    
1054 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1055     or RCDATA in fragment parsing mode). HTML5 revision
1056     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1057    
1058     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1059    
1060     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1061     for <pre> and <textarea>).
1062    
1063 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1064    
1065 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1066    
1067     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1070    
1071     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1072    
1073 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1074     HTML5 revision 917 (conforming bare &).
1075    
1076     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1077    
1078 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1079    
1080     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1081     HTML5 revision 915 (<nobr>).
1082    
1083     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1084    
1085 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1086    
1087     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1088     New attributes.
1089    
1090     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1091    
1092 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1093     in tags).
1094    
1095     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1096    
1097 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1098    
1099     * HTML.pm.src: |$entity_char| is removed and
1100     requires |Whatpm::_NamedEntityList| instead.
1101     HTML5 revision 898 (refc), except that lack of refc
1102     is parse error.
1103    
1104     * mkentitylist.pl: New script.
1105    
1106     * Makefile (all): |_NamedEntityList.pm| is added.
1107     (_NamedEntityList.pm, Entities.html): New rules.
1108    
1109     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1112     were ignored and U+000D immediately following another
1113     U+000D was not converted to U+000A.
1114    
1115     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1116    
1117 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1118     nodes before appended). Parser was not ready for NULL
1119     parse error and escape flag.
1120    
1121     * NanoDOM.pm (adopt_node): New.
1122    
1123     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1124    
1125 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1126     CDATA and RCDATA). Note that current implementation
1127     is simply repeating what the spec says and it is maybe not
1128     a best way to do it.
1129    
1130     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1131    
1132 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1133     the form element if a descendant element without implied end tag has
1134     still been open).
1135    
1136     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1137    
1138 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1139     space characters non-conforming).
1140    
1141     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1142    
1143 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1144     HTML5 revision 869 (C1 character references).
1145    
1146     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1147    
1148 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1149     a |textarea| is removed).
1150    
1151 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1152    
1153     * NanoDOM.pm (get_attribute_node_ns): New method.
1154    
1155     * ContentChecker.pm: |script| |async| and |defer|
1156     no longer require |src|. |async| MUST NOT be
1157     specified if |defer|. (HTML5 revision 858).
1158    
1159 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1160    
1161     * HTML.pm.src: |<form><form>| went to inifinite loop.
1162    
1163 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1164    
1165 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1166     as a document element) flag on.
1167     (new): Removed.
1168     (check_document): New method.
1169    
1170     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1171    
1172 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1173     were incorrect.
1174    
1175     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1178     error position in the |type| option of the error.
1179    
1180     * HTMLTable.pm (form_table): The |$onerror| parameter
1181     is now optional. Some bugs are fixed.
1182    
1183     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1184    
1185 wakaba 1.27 * HTMLTable.pm: New module.
1186    
1187     * ContentChecker.pm (table): Invoke table model error checker.
1188    
1189     * NanoDOM.pm (first_child, get_attribute_ns): New.
1190    
1191 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1194     (link/@rel, a/@rel, area/@rel): Use new checker.
1195    
1196     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1197    
1198     * _LinkTypeList.pm: New file.
1199    
1200     * mklinktypelist.pl: New file.
1201    
1202     * .cvsignore: |RelExtensions.html| added.
1203    
1204     * NanoDOM.pm (child_nodes): Returns an empty array
1205     for non-child-containing node types.
1206     (text_content): New attribute.
1207    
1208     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1209    
1210 wakaba 1.25 * IMTChecker.pm: New module.
1211    
1212     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1213     to test parameter value validity.
1214    
1215     * HTML.pm.src ($style_start_tag): Attributes were
1216     discarded.
1217    
1218 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1219    
1220 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1221    
1222     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1223    
1224 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1225     and RFC 3987 are listed (not all testable items are checked yet).
1226    
1227     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1228    
1229 wakaba 1.22 * URIChecker.pm: New module.
1230    
1231 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1232    
1233 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1234     or associated to some placeholder.
1235     ($ElementDefault): Warn unknown attributes
1236     for unknown elements as "attribute not supported".
1237     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1238     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1239     (|a| attribute checker): Reimplemented.
1240    
1241     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1242    
1243 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1244     ($HTMLAttrChecker): Event handler content attributes
1245     are added.
1246     (link, embed): Required attribute is now checked.
1247     (embed): Unknown local attributes are no longer warned.
1248    
1249     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1250    
1251 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1252     ($HTMLIMTAttrChecker): New checker.
1253     (link@rel, link@href, link@type, style@type,
1254     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1255     object@data, object@type, source@src, source@type, area@alt,
1256     area@shape, area@coords, area@href,
1257     area@ping, area@rel, area@type, script@src,
1258     script@defer, script@async, script@type): Checkers added.
1259    
1260     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1263    
1264 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1265    
1266 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1267     attributes. Report an error if |Element.prefix|
1268     is |xmlns|.
1269    
1270     * NanoDOM.pm (prefix): New attribute.
1271    
1272     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1275     mode, action for |<iframe>| was missing.
1276    
1277     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1278    
1279 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1280     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1281    
1282     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1283    
1284 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1285     ($HTMLURIAttrChecker): Placeholder.
1286     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1287     New.
1288    
1289     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1290    
1291 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1292     for global attributes, |html|, |base|, |style|, and |meta|.
1293    
1294     * NanoDOM.pm (insert_before): Weaken reference
1295     to the parent node.
1296     (Attr::new): Set |owner_element| attribute.
1297     (namespace_uri, manakai_local_name): New attribute implementations.
1298     (owner_element): New attribute.
1299    
1300     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1301    
1302 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1303     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1304     $Element->{$HTML_NS}->{''}): New.
1305     (check_element): Invoke attrs_checker for each element.
1306    
1307 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1308    
1309 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1310    
1311     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1312    
1313 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1314     element type testings.
1315    
1316     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1317    
1318 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1319     error when an element type is put in the "minus" list
1320     and the element type is not allowed explicitly in the particular
1321     element content model.
1322     (html:a checker): New checker.
1323     (html:details, html:datagrid): New checkers.
1324     (html:legend): New checker.
1325    
1326     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1329    
1330     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1331    
1332 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1333     checker.
1334     (html:dd checker): New checker.
1335     (html:q, html:em, html:strong, html:small,
1336     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1337    
1338     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1339    
1340 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1341     from |$ElementDefault->{checker}|.
1342     ($ElementDefault->{checker}): Throw an error that
1343     the element type is not supported by the checker.
1344     ($HTMLMetadataElement): |html:base| was missing.
1345     ($HTMLEmptyChecker): Don't throw an error
1346     for inter-element whitespace nodes.
1347     (html:html checker): Errors were not
1348     thrown even if |html:head| and/or |html:body|
1349     children were missing.
1350     (html:head checker): An error was not
1351     thrown if <meta charset> appered after other
1352     elements.
1353    
1354 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1355    
1356     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1357     and |noscript| elements are implemented.
1358     (new): New method.
1359    
1360 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1361    
1362 wakaba 1.4 * ContentChecker.pm: New module.
1363    
1364     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1365    
1366 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1367     document_element, manakai_local_name,
1368     manakai_element_type_match): New method.
1369    
1370     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1371    
1372     * HTML.pm.src: Replace decimal and hexadecimal numeric
1373     entities in C1 range using Windows-1252 mapping. Bare LF
1374     did not count as new line for error reporting.
1375    
1376 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1377    
1378     * NanoDOM.pm (DOMImplementation): New class.
1379     (append_child): Weaken the |parent_node| reference.
1380     (create_element_ns, Element new): Set the |owner_document|
1381     reference.
1382     (implementation): New attribute.
1383     (owner_document, local_name, namespace_uri): New attributes.
1384    
1385     * HTML.pm.src (parse_string): Line and column numbers
1386     are now provided to error handler.
1387     (!!!parse-error): Short descriptions are added.
1388     (_construct_tree): Split into three methods; support
1389     for innerHTML mode.
1390     (set_inner_html): New method.
1391    
1392 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1393    
1394     * NanoDOM.html: Documentation is added.
1395    
1396     * HTML.pod, ContentType.html: Documentation is revised.
1397    
1398     * .cvsignore: Pod2html temporary files are added.
1399    
1400     * Makefile: Make |NanoDOM.html|.
1401    
1402     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1403    
1404     * HTML.pm.src (parse_string): New method.
1405     (get_inner_html): Renamed from |inner_html|.
1406    
1407     * Makefile: A rule for |HTML.html| is added.
1408    
1409     * HTML.pod: New documentation.
1410    
1411     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1412    
1413     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1414     (clone_node): Attribute nodes were not completely copied.
1415    
1416     * HTML.pm.src: Many bugs are fixed.
1417    
1418     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1419    
1420     * HTML.pm.src: Some typos are fixed.
1421    
1422     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1423    
1424     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1425     are removed.
1426    
1427     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1428    
1429     * HTML-consume-entity.src: Removed.
1430    
1431     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1432    
1433     * HTML.pm.src: Tokenizer's handling on hexadecimal
1434     numeric entities are rewritten.
1435    
1436     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1437    
1438     * HTML.pm.src: Some tokenizer bugs are fixed.
1439    
1440     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1441    
1442     * HTML.pm.src: The tree construction stage is implemented.
1443    
1444     * mkhtmlparser.pl: New macros are added.
1445    
1446     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1447    
1448     * ContentType.pm: A note on bug in the specification
1449     is removed since it's been now fixed.
1450    
1451     * .cvsignore: New file.
1452    
1453     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1454    
1455     * HTML.pm.src, HTML-consume-entity.src: New files.
1456    
1457     * Makefile (HTML.pm): New rule.
1458    
1459     * mkhtmlparser.pl: New script.
1460    
1461     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1462    
1463     * Makefile: New file.
1464    
1465     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1466    
1467     * ContentType.pm: An error in pod is fixed.
1468    
1469     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1470    
1471     * ContentType.pm: Documentation is added.
1472    
1473     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1474    
1475     * ContentType.pm: New Perl module.
1476    
1477     * ChangeLog: New file.
1478    
1479    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24