/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.305 - (hide annotations) (download)
Sun Sep 14 07:19:47 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.304: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 07:19:17 -0000
	* HTML.pm.src: Make a "bare ero" error for unknown
	entities point the "&" character.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 07:19:40 -0000
	* DecodeHandle.pm: Merge the ShiftJIS class into the Encode class.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
4     entities point the "&" character.
5    
6     2008-09-14 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
9     be added to the list of excluded characters.
10    
11     2008-09-14 Wakaba <wakaba@suika.fam.cx>
12    
13 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
14     and |column| a higher priority than the one set by the
15     tokenizer's input handler.
16     ($self->{read_until}): Exclude U+FFFD (but this might
17     not be necessary, since now we do line/column fixup in
18     the character decode handle).
19    
20     2008-09-14 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
23    
24     2008-09-14 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
27     and |manakai_getc_until| to |manakai_read_until| to
28     reduce the number of string copies.
29    
30     2008-09-14 Wakaba <wakaba@suika.fam.cx>
31    
32 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
33     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
34     standard feature to |open| a string as a filehandle,
35     since Perl's string filehandle seems not supporting |ungetc|
36     method correctly.
37     (parse_char_stream): Define |{getc_until}| method.
38     (DATA_STATE): Experimental support for |getc_until| feature.
39    
40 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
41    
42 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
43    
44     2008-09-13 Wakaba <wakaba@suika.fam.cx>
45    
46 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
47     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
48     and replaced by |{prev_state}|.
49    
50     * mkhtmlparser.pl: Remove |{char}| feature.
51     Remove |!!!back-next-input-character;| macro.
52    
53     2008-09-13 Wakaba <wakaba@suika.fam.cx>
54    
55 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
56     entity related tokenizer states in favor of new states
57     implementing the consume character reference algorithm.
58    
59     2008-09-13 Wakaba <wakaba@suika.fam.cx>
60    
61     * HTML.pm.src: "Consume a character reference" algorithm is
62 wakaba 1.296 now implemented as a tokenizer's state, rather than
63     a method, with minimum changes (more changes will
64     be made, in due course). "Bogus comment state"'s inner
65     loop gets removed.
66    
67     2008-09-13 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
70 wakaba 1.295 into their own tokenizer states.
71    
72     2008-09-13 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
75     is split into three states.
76    
77     2008-09-13 Wakaba <wakaba@suika.fam.cx>
78    
79 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
80     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
81     no longer does the tokenizer have to push back next input
82     characters in those states.
83    
84     2008-09-13 Wakaba <wakaba@suika.fam.cx>
85    
86 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
87     into four states so that no longer does the tokenizer have to push
88     back next input characters in that state.
89    
90 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
91    
92     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
93     which can be used to insert some wrapper between the character
94     stream handle and the tokenizer. (It is currently not supported
95     for |set_inner_html| for |Element|s).
96    
97 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
98    
99 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
100    
101     2008-09-10 Wakaba <wakaba@suika.fam.cx>
102    
103 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
104    
105     * HTML.pm.src: Don't specify |text| argument for the
106     |chardecode:fallback| error, since it is not the encoding
107     being used alternatively.
108    
109 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
110    
111     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
112    
113 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
114    
115 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
116    
117     2008-08-31 Wakaba <wakaba@suika.fam.cx>
118    
119 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
120     to after after frameset insertion mode processing (HTML5
121     revision 1909). Note that the implementation was wrong
122     per the old spec before the r1909 changes.
123    
124 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
125    
126 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
127     spec (HTML5 revision 2093).
128     ($process_row): Algorithm step numbers synced with the
129     spec (HTML5 revision 2092).
130    
131     2008-08-30 Wakaba <wakaba@suika.fam.cx>
132    
133 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
134     revision 2094).
135    
136     2008-08-30 Wakaba <wakaba@suika.fam.cx>
137    
138     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
139    
140     2008-08-30 Wakaba <wakaba@suika.fam.cx>
141    
142 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
143     name (after another attribute) now raise parse error (HTML5
144     revision 2123). Empty unquoted attribute values are no
145     longer allowed (HTML5 revision 2122).
146    
147     2008-08-30 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
150     revision 2130).
151    
152     2008-08-30 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
155     as |lang| attribute value for HTML elements (HTML5 revision 2062
156     and so on).
157    
158     2008-08-30 Wakaba <wakaba@suika.fam.cx>
159    
160 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
161     was missing.
162    
163     * URIChecker.pm: The end of the URL should be marked as the
164     error location for an empty path error. The position
165     between the userinfo and the port components should be
166     marked as the error location for an empty host error.
167    
168     2008-08-30 Wakaba <wakaba@suika.fam.cx>
169    
170 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
171     value the error occurs for errors. Report unknown
172     address format error in warning level, since address
173     formats are rarely added. Path segments starting with "/.."
174     were misinterpreted as a dot-segment.
175    
176     2008-08-30 Wakaba <wakaba@suika.fam.cx>
177    
178 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
179     |Message::DOM::DOMImplementation|.
180    
181 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
182    
183     * IMTChecker.pm: Updated for the new error reporting architecture.
184    
185     * ContentChecker.pm: Error levels for IMTs are added.
186    
187 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
188    
189     * H2H.pm (_shift_token): Support for unquoted HTML attribute
190     values.
191    
192 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
193    
194 wakaba 1.275 * CacheManifest.pm: Support for new style of error
195     reports.
196    
197     * HTML.pm.src: Set line=1, column=1 to the document node.
198    
199     2008-08-16 Wakaba <wakaba@suika.fam.cx>
200    
201 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
202     and URL checkers. Support for more error levels for bogus
203     langauge tag and URL "standards".
204    
205     * LangTag.pm, URIChecker.pm: Support for new style error
206     level reporting.
207    
208 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
209    
210 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
211    
212     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
213     specifying. Error types are revised.
214    
215     2008-08-15 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
218     renewed.
219    
220     2008-08-15 Wakaba <wakaba@suika.fam.cx>
221    
222 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
223     are revised. Use new style for "level" specification.
224    
225     * mkhtmlparser.pl: Use new style for "level" specification.
226    
227 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
228    
229 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
230     reporting process for broken ignored valuetype definition.
231     (Valuetype idl_text): Support for special "DOMString" name.
232    
233     2008-08-03 Wakaba <wakaba@suika.fam.cx>
234    
235 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
236     terminal of the ScopedName is "DOMString", such that whether
237     the last part of the scoped name is "DOMString" or "_DOMString"
238     later. It is necessary to determine whether a |typedef|
239     definition should be ignored or not.
240     (parse_char_string): Unescape the identifier of
241     exception members.
242     ($resolve): Return undef for builtin types and sequence<T>
243     types (we might not have to do this, however...).
244     (check): Support checking for Exceptions, Valuetypes,
245     and Typedefs.
246     ($serialize_type): Support for "DOMString::::" syntax.
247     (Typedef idl_text): Output Type as "DOMString" if it
248     is really "DOMString" (i.e. its internal representation
249     is "::DOMString::").
250    
251     2008-08-03 Wakaba <wakaba@suika.fam.cx>
252    
253 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
254     for constant types in the |check| method.
255     (check): Support for checking of attributes, operations, and
256     arguments.
257     (Attribute/Operation idl_text): Exception names in getraises,
258     setraises, and raises clauses is serizlied by |$serialize_type|
259     code.
260    
261 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
262    
263 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
264     hexadecimal numbers (the original pattern, taken from the spec,
265     was not work for hexadecimal numbers, because the "0" prefix
266     matches to the [0-7]* part (as an empty string) and therefore
267     it does not match with remaining "x..." part of a "0x..." integer
268     literal.
269     ($get_type): It now returns a string, not an array reference,
270     for regular types and |sequence| types (i.e. it in any case
271     returns a string).
272     ($get_next_token): The second item in the array that represents
273     a integer or float token is now a Perl number value, not the
274     original string representation of the number.
275     (check): Support for const value consistency checking.
276     No extended attribute is defined for constants.
277     (Node subclasses): Use simple strings rather than array references
278     for default data type values.
279     ($serialize_type): Type values are now simple strings.
280     (value): If the new attribute value is a false value, then
281     a FALSE value is set to the attribute.
282    
283     2008-08-02 Wakaba <wakaba@suika.fam.cx>
284    
285 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
286     in its stringified format ("scoped name" as defined in the
287     spec). Note that future version of this module should not use
288     array references for type values and the |type_text| attribute
289     should be made obsolete.
290     (parse_char_string): Unescape attribute names.
291     (check): Support for checking of whether inherited interfaces
292     are actually defined or not. Support for checking of whether
293     interface member identifiers are duplicated or not.
294     ($serialize_type): Scoped names are returned as is. A future
295     version of this code should escape identifiers other than "DOMString",
296     otherwise the idl_text would be non-conforming.
297    
298     2008-08-02 Wakaba <wakaba@suika.fam.cx>
299    
300 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
301 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
302     for Definition's were ignored.
303     (append_child): Set |parent_node| attribute.
304     (parent_node): New attribute.
305     (check): Support interface/exception members. Support
306     extended attributes. Support definition identifier uniqueness
307     constraint.
308     (qualified_name): New attribute.
309     (Interface/Exception idl_text): Extended attributes were
310     not prepended to the returned text.
311    
312     2008-08-02 Wakaba <wakaba@suika.fam.cx>
313    
314     * WebIDL.pm (parse_char_string): Set line/column numbers
315 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
316     would make things wrong. Support for interface forward
317     declarations was missing. Broken interface declarations
318     with no block were not ignored entirely.
319     (Whatpm::WebIDL::Node): New abstract class. This class
320     makes things easier.
321     (child_nodes): New attribute. Unlike DOM's attribute with
322     same name, this attribute returns a dead list of nodes for
323     simplicity.
324     (get_user_data, set_user_data): New methods.
325     (Module idl_text): A SPACE character should be inserted
326     before the |{| character.
327     (Interface idl_text): Support for interface forward declarations.
328     (is_forward_declaration): New attribute.
329    
330 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
331    
332 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
333    
334     2008-07-19 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
337     it now can handle broken extended attributes and as such.
338    
339     2008-07-19 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
342     Support for extended attributes with arguments.
343    
344     2008-07-19 Wakaba <wakaba@suika.fam.cx>
345    
346 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
347     (Interface->idl_text): Tentative support for inheritances.
348    
349     2008-07-19 Wakaba <wakaba@suika.fam.cx>
350    
351 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
352     Support for raises, setraises, and getraises syntaxes.
353    
354 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
355    
356     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
357     proper support for types, extended attributes, and exceptions yet).
358     WebIDL parser, version 1 (no support for exceptions yet,
359     no proper support for extended attributes yet).
360    
361 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
362    
363     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
364    
365 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
366    
367 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
368    
369     2008-06-29 Wakaba <wakaba@suika.fam.cx>
370    
371 wakaba 1.255 * WebIDL.pm: New module.
372    
373 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
374    
375     * Makefile (Entities.html): URI changed.
376    
377 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
378    
379     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
380    
381 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
382    
383 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
384    
385     2008-06-01 Wakaba <wakaba@suika.fam.cx>
386    
387 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
388     spec (commented out). (HTML5 revision 1711).
389    
390 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
393    
394     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
395     as UTF-16 (HTML5 revision 1701).
396    
397     2008-05-25 Wakaba <wakaba@suika.fam.cx>
398    
399 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
400     1692).
401    
402     2008-05-25 Wakaba <wakaba@suika.fam.cx>
403    
404     * HTML.pm.src: The secondary insertion mode used when switching
405     to foreign content is the "in body" insertion mode (HTML5 revision
406     1696).
407    
408     2008-05-25 Wakaba <wakaba@suika.fam.cx>
409    
410 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
411     revision 1697).
412    
413 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
414    
415 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
416     insertion mode (HTML5 revision 1693). Update SVG camelCase
417     attribute list (HTML5 revision 1700). <textarea> closes
418     </select> (HTML5 revision 1699). More start tags close in
419     foreign content insertion mode (HTML5 revision 1698).
420    
421     2008-05-24 Wakaba <wakaba@suika.fam.cx>
422    
423 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
424    
425     2008-05-24 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
428     revision 1674).
429    
430     2008-05-24 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
433     revision 1676).
434    
435     2008-05-24 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
438     quriks mode detection (HTML5 revision 1679).
439    
440     2008-05-24 Wakaba <wakaba@suika.fam.cx>
441    
442 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
443     revision 1685).
444    
445     2008-05-24 Wakaba <wakaba@suika.fam.cx>
446    
447 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
448     revision 1684).
449    
450     2008-05-24 Wakaba <wakaba@suika.fam.cx>
451    
452 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
453     work (HTML5 revision 1690).
454    
455     2008-05-24 Wakaba <wakaba@suika.fam.cx>
456    
457 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
458     end tags in head insertion modes (HTML5 revision 1686).
459     (parse_char_string): Bug fix for non-utf8 character string handlings.
460     (parse_char_stream): |ungetc| does not work well for this context.
461    
462 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
463    
464 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
465     |parse_byte_stream|.
466     (parse_byte_stream): New method.
467    
468     2008-05-18 Wakaba <wakaba@suika.fam.cx>
469    
470 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
471     by encoding layer error reporter.
472    
473 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
474    
475 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
476     rather than converting the whole byte string and then parsing.
477     Propagate errors in character encoding layer.
478     (get_next_token): Precise error reporting for |bare stago| error.
479    
480     2008-05-17 Wakaba <wakaba@suika.fam.cx>
481    
482 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
483     (parse_char_string): This method is now defined as an invocation
484     of the |parse_char_stream| method.
485    
486     2008-05-17 Wakaba <wakaba@suika.fam.cx>
487    
488 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
489     of the sniffing as info-level errors. Support for new
490     decoding framework in parser resestting.
491     (new): Various default error levels were not set.
492    
493     2008-05-17 Wakaba <wakaba@suika.fam.cx>
494    
495 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
496     algorithm, except for the actual sniffing, is implemented
497     with new framework with Message::Charset::Info.
498    
499 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
500    
501     * CacheManifest.pm (_parse): Drop fragment identifiers from
502     URIs in fallback section (HTML5 revision 1596).
503    
504 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
505    
506 wakaba 1.231 * Makefile (Entities.html): URI has changed.
507    
508     2008-05-10 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
511     1553).
512    
513 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
514    
515 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
516     no longer table elements, but table objects returned
517     by Whatpm::HTMLTable.
518    
519     * HTMLTable.pm (form_table): Return table element node
520     as |$table->{element}|.
521     (assign_header): Support for the |headers=""| attribute.
522    
523     2008-05-06 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
526     with no support for headers="".
527     (form_table): Include table width and height to the returned
528     table object for covenience. Indexing in column assignement was wrong.
529     Set whether a data cell is empty or not for convenience.
530    
531 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
532    
533 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
534    
535     2008-05-05 Wakaba <wakaba@suika.fam.cx>
536    
537 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
538     changed (HTML5 revision 1387).
539    
540     2008-05-05 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
543     to the end of the table (HTML5 revision 1380).
544    
545     2008-05-05 Wakaba <wakaba@suika.fam.cx>
546    
547 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
548     of 1-based (HTML5 revision 1376).
549    
550     2008-05-05 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
553     and "Content-Encoding" no longer prevent sniffing (HTML5
554     revision 1288).
555    
556     2008-05-05 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
559     1282).
560    
561 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
564     Status of XML specs are added.
565    
566     2008-05-03 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
569     and |ref=""| attribute (referent element type checking).
570    
571 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
572    
573 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
574     1500) is implemented (except for IDNA part and URI-scheme-specific
575 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
576     documents.
577    
578 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
579    
580 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
581     character (HTML5 revision 1263).
582    
583     2008-04-13 Wakaba <wakaba@suika.fam.cx>
584    
585 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
586     definition for |AElig;| was missing).
587    
588     2008-04-13 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
591     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
592    
593     2008-04-13 Wakaba <wakaba@suika.fam.cx>
594    
595 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
596     insertion mode is added (HTML5 revisions 1412 and 1418).
597    
598     2008-04-13 Wakaba <wakaba@suika.fam.cx>
599    
600 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
601     elements (HTML5 revision 1410).
602    
603     2008-04-13 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
606     revision 1406).
607    
608     2008-04-13 Wakaba <wakaba@suika.fam.cx>
609    
610 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
611     elements (HTML5 revision 1404 and 1420).
612    
613 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
614    
615 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
616     elements (HTML5 revision 1404). Unused !!!macro definitions
617     are removed.
618    
619     2008-04-12 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
622     are raised is changed (HTML5 revision 1404).
623    
624 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
625    
626 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
627    
628     2008-04-06 Wakaba <wakaba@suika.fam.cx>
629    
630 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
631     replaced by constants representing element category.
632    
633     2008-04-06 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
636     for error reporting. (This is a preparation for using constant
637     value for ->[1].)
638    
639 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
640    
641 wakaba 1.205 * RDFXML.pm: Typo fixed.
642    
643     2008-03-22 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
646     as same as |must_level|, i.e. level = |m|.
647     (check_element): Make list of URIs in the DOM.
648    
649 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
652     Local (null-namespace) attribute support.
653    
654     2008-03-21 Wakaba <wakaba@suika.fam.cx>
655    
656 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
657    
658     2008-03-21 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
661     and URI attributes is implemented. Warn if unknown
662     value is used in rdf:parseType="" attribute.
663    
664     * URIChecker.pm (check_rdf_uri_reference): New function.
665    
666     2008-03-21 Wakaba <wakaba@suika.fam.cx>
667    
668 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
669     are now resolved.
670    
671     2008-03-21 Wakaba <wakaba@suika.fam.cx>
672    
673 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
674    
675     * RDFXML.pm: undef vs false bug fixed.
676     Reification implemented.
677    
678     2008-03-21 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
681     The |node| arguments are added for |ontriple| calls.
682     Too many "attribute not allowed" errors were raised.
683    
684     * ContentChecker.pm: Initial experimental support for rdf:RDF
685     element.
686    
687     2008-03-21 Wakaba <wakaba@suika.fam.cx>
688    
689 wakaba 1.197 * RDFXML.pm: New module.
690    
691 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
692    
693 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
694     code was old one yet.
695    
696     2008-03-20 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
699     errors. Line/column markings for DOCTYPE, comment, and
700     character tokens are reintroduced; otherwise,
701     error location for "not HTML5" error and errors
702     for implied elements are not attached.
703    
704     2008-03-20 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
707    
708     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
709     (value): Setter implemented.
710    
711     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
712    
713     2008-03-20 Wakaba <wakaba@suika.fam.cx>
714    
715 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
716    
717     2008-03-20 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
720     flag, which is no longer used.
721    
722 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
723    
724 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
725    
726     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
727     and !!!insert-element-t): Set line/column information to
728     element nodes.
729    
730     2008-03-17 Wakaba <wakaba@suika.fam.cx>
731    
732 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
733     in "<?", "<>", or "</>" should be the error point.
734    
735 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
738    
739     2008-03-16 Wakaba <wakaba@suika.fam.cx>
740    
741 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
742    
743     2008-03-16 Wakaba <wakaba@suika.fam.cx>
744    
745 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
746     reporting.
747    
748 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
749    
750 wakaba 1.186 * HTML.pm.src: Error type revised.
751    
752     2008-03-11 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
755    
756     2008-03-11 Wakaba <wakaba@suika.fam.cx>
757    
758 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
759    
760 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
761    
762     * mkhtmlparser.pl: Set "level" parameter to parse errors.
763    
764     * HTML.pm.src: Code refined.
765    
766 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
767    
768 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
769     revision 1348). Note that I really don't know this makes
770     any difference in the black-box behavior of the parser.
771    
772     2008-03-09 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
775     revision 1348).
776    
777     2008-03-09 Wakaba <wakaba@suika.fam.cx>
778    
779 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
780    
781     2008-03-09 Wakaba <wakaba@suika.fam.cx>
782    
783 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
784    
785     2008-03-09 Wakaba <wakaba@suika.fam.cx>
786    
787 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
788     mode and support for the "in select in table" insertion mode (HTML5
789     revision 1342).
790    
791     2008-03-09 Wakaba <wakaba@suika.fam.cx>
792    
793 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
794     in non-tainted <table>s (HTML5 revision 1336).
795    
796     2008-03-09 Wakaba <wakaba@suika.fam.cx>
797    
798 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
799     and <head> (HTML5 revision 1332).
800    
801     2008-03-09 Wakaba <wakaba@suika.fam.cx>
802    
803 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
804     white space (HTML5 revision 1331).
805    
806 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
807    
808 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
809     element (HTML5 revision 1330).
810    
811     2008-03-08 Wakaba <wakaba@suika.fam.cx>
812    
813 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
814     element (HTML5 revision 1328).
815    
816     2008-03-08 Wakaba <wakaba@suika.fam.cx>
817    
818 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
819     into foster parents (HTML5 revision 1326).
820    
821     2008-03-08 Wakaba <wakaba@suika.fam.cx>
822    
823 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
824     revision 1321).
825    
826     2008-03-08 Wakaba <wakaba@suika.fam.cx>
827    
828 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
829     the spec (HTML5 revision 1320). Note that we finally complete
830     all of HTML5 revision 1320 changes.
831    
832     2008-03-08 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
835     tags (HTML5 revision 1320).
836    
837     2008-03-08 Wakaba <wakaba@suika.fam.cx>
838    
839 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
840    
841     2008-03-08 Wakaba <wakaba@suika.fam.cx>
842    
843 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
844     could not be reached so that it is now removed (HTML5 revision
845     1320).
846    
847     2008-03-08 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
850     to align with new spec text (HTML5 revision 1320).
851    
852     2008-03-08 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
855     have had no harm in theory.
856    
857     2008-03-08 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
860     case is split from other end tags for the preparation
861     of implementing HTML revision 1320.
862    
863 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
864    
865 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
866     revision 1320).
867    
868     2008-03-07 Wakaba <wakaba@suika.fam.cx>
869    
870 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
871     and "div" (HTML5 revision 1318). Add comments to where
872     |form| pointer association codes should be inserted (HTML5 revision
873     1319).
874    
875 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
876    
877     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
878    
879 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
880    
881 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
882     other token type, there are), it is replaced by a |die| statement.
883    
884     2008-03-05 Wakaba <wakaba@suika.fam.cx>
885    
886 wakaba 1.159 * HTML.pm.src: Typo fixed.
887    
888 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
889    
890     * HTML.pm.src (_tree_construction_initial): Some limited quirks
891     doctypes were not uppercased for comparison.
892    
893 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
896     are added.
897    
898     2008-03-03 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
901     are set. Cases that are unlikely reached are noted as so.
902    
903     2008-03-03 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
906    
907     * mkhtmlparser.pl: Support for |!!!cp| syntax.
908    
909     2008-03-03 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
912     for simplicity.
913    
914     2008-03-03 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
917     revision 1307).
918    
919     2008-03-03 Wakaba <wakaba@suika.fam.cx>
920    
921 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
922     is set are changed (HTML5 revision 1305).
923    
924 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
925    
926 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
927     1292). Entities are not parsed in comment-like part in RCDATA
928     elements (HTML5 revision 1294). Allow bare & at the end
929     of attribute value literals (HTML5 revision 1296). More
930     quirks mode doctypes (HTML5 revision 1302). Requires spaces
931     between attributes and ban attribute names or unquoted
932     attribute values containing single or double quotes (HTML5
933     revision 1303).
934    
935     2008-03-02 Wakaba <wakaba@suika.fam.cx>
936    
937 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
938     and related errors unless it is an HTML document (though the spec
939     is unclear on whether it is applied to XHTML document).
940    
941     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
942     are added.
943    
944     2008-03-02 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
947    
948 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
949    
950     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
951    
952     * HTML.pm.src: |charset| in |content| attribute is
953     case-insensitive (HTML5 revision 1270).
954    
955 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
956    
957     * ContentChecker.pm: New status constants are added.
958     ($ElementDefault): |status| added.
959     (check_element): Err for non-standard or deprecated elements.
960     (_attr_status_info): For non-standard or deprecated attributes.
961    
962 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
963    
964 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
965    
966     2008-02-24 Wakaba <wakaba@suika.fam.cx>
967    
968 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
969     status information is now dispatched.
970    
971     2008-02-24 Wakaba <wakaba@suika.fam.cx>
972    
973 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
974     problem by adding some more arguments.
975    
976     2008-02-24 Wakaba <wakaba@suika.fam.cx>
977    
978 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
979     container-for-the-purpose-of-content-model element (not transparent
980     element) for |check_child_element| calling and significant
981     text flag marking. This reintroduces |<del>|-and-significant
982     problem again.
983    
984     2008-02-24 Wakaba <wakaba@suika.fam.cx>
985    
986 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
987     elements ignored for the purpose of phase changes in
988     content model checking.
989    
990 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
993     was not properly managed for transparent cases.
994    
995     2008-02-23 Wakaba <wakaba@suika.fam.cx>
996    
997 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
998     and |audio| as semi-transparent elements.
999    
1000     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1001    
1002 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1003     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1004     Support for |html:object| as a semi-transparent element.
1005    
1006     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1009     the tree is entirely revised to make it easier to track
1010     the state of ancestors/descendants. As a result of this
1011     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1012     support for content model checking for HTML elements |figure|,
1013     |object|, |video|, and |audio| and checking for XML elements (and
1014     some XMLNS checkings) are dropped for now. They will be
1015     reimplemented in due cource.
1016    
1017 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1018    
1019 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1020     token (HTML5 revision 1225).
1021    
1022     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1023    
1024 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1025    
1026     * HTML.pm.src: Save whether |meta| |content| attribute
1027     contains character references or not.
1028    
1029     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1030    
1031 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1032     imlement significant content checking for contents
1033     with |del| elements.
1034    
1035     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1036    
1037 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1038     elements are added.
1039     (_get_children): HTML |object| elements are now semi-transparent.
1040    
1041     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1042    
1043 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1044    
1045     * CacheManifest.pm: HTML5 revision 1211 implemented.
1046    
1047     * CacheManifest.pod: Updated.
1048    
1049 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1050    
1051     * ContentChecker.pm (check_document, check_element): Support
1052     for second argument ($onsubdoc).
1053     (_get_css_parser): Removed (now it is part of WDCC).
1054    
1055 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1056    
1057     * ContentChecker.pm (_get_css_parser): New.
1058    
1059 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1060    
1061 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1062     for checking had been used.
1063    
1064     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1065    
1066 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1067    
1068 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1069    
1070 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1071     specced in RFC 2046 and RFC 4288.
1072     (application/atom+xml): Definition added.
1073    
1074     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1075    
1076 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1077     warnings (rather than SHOULD-level errors).
1078    
1079     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1080    
1081 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1082    
1083     * HTML.pm.src: A flag for character references in attribute
1084     values are added. Set |manakai_has_reference| user data
1085     to |charset| attribute.
1086    
1087     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1088    
1089 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1090     attributes.
1091    
1092     * ContentChecker.pm (check_document): Warn if charset requirements
1093     cannot be tested.
1094    
1095 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1096    
1097     * HTML.pm.src (parse_byte_string): Detect charset
1098     by universalchardet if charset parameter is not specified.
1099    
1100     * Makefile (Charset-all, Charset-clean): New rules.
1101    
1102 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1103    
1104 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1105     of character encoding declaration and selection of encoding
1106     for HTML document.
1107    
1108     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1109    
1110 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1111     type in list context.
1112    
1113     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1116     ignoring (HTML5 revisions 1013 and 1016).
1117    
1118     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1121     white spaces, are implemented.
1122    
1123 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1124    
1125 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1126     attribute if possible.
1127    
1128     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1129    
1130 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1131     (parse_char_string): New alias for |parse_string|.
1132     (main phase): Invoking "change the encoding" algorithm if desired.
1133    
1134     * HTML.pod: Updated.
1135    
1136     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1137    
1138 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1139    
1140     * Makefile (HTML-all, HTML-clean): New.
1141    
1142     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1143    
1144     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1145    
1146 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1147    
1148     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1149    
1150 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1151    
1152 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1153    
1154     * CacheManifest.pod: New file.
1155    
1156     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1157    
1158 wakaba 1.114 * CacheManifest.pm: New module.
1159    
1160     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1161    
1162 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1163     callback.
1164    
1165     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1166    
1167 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1168    
1169 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.111 * Makefile (clean): New rule.
1172    
1173     * NanoDOM.pm (public_id, system_id): New attributes.
1174    
1175     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1178    
1179 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1180    
1181     * ContentChecker.pm (check_document): Support for
1182     new |is_xml_root| flag.
1183     (check_element): Support for new |pluses| state.
1184     (_add_pluses): New method.
1185     (_remove_minuses): Support for new |minus| item.
1186    
1187 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1188    
1189 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1190     root element.
1191    
1192     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1193    
1194 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1195    
1196 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1197    
1198 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1199    
1200     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1201    
1202 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1203    
1204     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1205    
1206 wakaba 1.103 * LangTag.pm: New module.
1207    
1208 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1209    
1210     * HTML.pm.src: Some error types were wrong.
1211    
1212 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1213    
1214 wakaba 1.101 * CSS/: New directory.
1215    
1216     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1217    
1218 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1219     for |noscript| in |head|.
1220    
1221 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1222    
1223     * URI/: New directory.
1224    
1225 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1228     number.
1229    
1230     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1233     by |&|s.
1234    
1235     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1236    
1237 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1238    
1239     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1240    
1241 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1242    
1243     * HTML.pm.src: Insertion modes are now represented in number.
1244    
1245     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1246    
1247 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1248     999) is implemented.
1249    
1250 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1251    
1252     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1253     983) is implemented.
1254    
1255 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1256    
1257     * ContentChecker.pod: New documentation.
1258    
1259     * Makefile: A rule for |ContentChecker.html| is added.
1260    
1261     * ContentChecker.pm: A pod "LICENSE" section is added.
1262    
1263     * NanoDOM.pm ($VERSION): New variable.
1264    
1265 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1266    
1267 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1268     list of allowed HTML elements.
1269    
1270     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1271    
1272 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1273     HTML elements.
1274    
1275     * URIChecker.pm (check_iri): New.
1276     (check_iri_reference): Error type for IRI reference
1277     syntax error is changed.
1278    
1279 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1282     is added.
1283     (check_document): Load appropriate module before validation.
1284    
1285     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.88 * ContentChecker/: New directory.
1288    
1289     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1290    
1291 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1292    
1293     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1294    
1295 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1296    
1297 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1298     even if no document element is found.
1299    
1300     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1301    
1302 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1303    
1304     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1307    
1308     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1309    
1310 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1311     is split into "after html body" and "after html frameset"
1312     insertion modes. Their codes are merged with "after body"
1313     and "after frameset" codes. |$previous_insertion_mode|
1314     has been removed. "after frameset" code is
1315     merged with "in frameset" code.
1316    
1317     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1318    
1319 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1320     merged with the "in head" insertion mode.
1321    
1322     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1323    
1324 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1325     reorganized.
1326    
1327     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1328    
1329 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1330     are merged.
1331    
1332     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1333    
1334 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1335     with "in table" insertion mode.
1336    
1337     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1340     modes are merged.
1341    
1342     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1345     insertion mode!
1346    
1347     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1350     now directly invoke the handler.
1351    
1352     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1355     is merged to the "in body" insertion mode code.
1356    
1357     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1360     insertion modes are merged.
1361    
1362     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1363    
1364 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1365    
1366     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1369     instead of string constant for |{content_model_flag}|.
1370    
1371     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1374     to the "duplicate attribute" error.
1375    
1376 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1377    
1378 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1379    
1380     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1381    
1382 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1383    
1384     * HTML.pm.src: A typo is fixed.
1385    
1386 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1387    
1388 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1389     from "in HTML:xml:lang" error. Character position
1390     is now the last part of the error type in the URI error
1391     description. Report "unsupported" status for language
1392     tags, media queries, script codes, and style sheets.
1393    
1394     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1395    
1396 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1397     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1398     in XML.
1399    
1400     * NanoDOM.pm (Attr.owner_document): New attribute.
1401    
1402     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1403    
1404 wakaba 1.66 * HTML.pm.src: The character immediately following
1405     a bare |hcro| was discarded. Fix handling of
1406     entity references in attribute values.
1407    
1408     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1409    
1410 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1411     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1412     factored out. Error types |in html:#DOCTYPE|
1413     and |after html:#DOCTYPE| are merged
1414     into |DOCTYPE in the middle|. |</frameset>|
1415     in fragment parsing mode changed the
1416     insertion mode.
1417    
1418     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1419    
1420 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1421     phase is now an insertion mode. Treatments for white
1422     space character tokens were incorrect for some
1423     insertion modes. An old |meta| case was not removed.
1424    
1425     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1426    
1427 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1428     implemented (but changing the encoding is not yet:-).
1429    
1430 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.62 * Charset/: New directory.
1433    
1434     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.61 * H2H.pm: New Perl module (created from
1437     manakai's H2H.dis).
1438    
1439     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1442     manakai's SimpleLS.dis).
1443    
1444 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1445    
1446     * HTML.pm.src: |<!---x-->| was not processed correctly.
1447    
1448 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.58 * HTML.pm.src: Report correct error message
1451     for |<body></div></body>|.
1452    
1453     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1454    
1455 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1456    
1457     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1458    
1459     * ContentChecker.pm: Return |{term}| list.
1460    
1461 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1462    
1463 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1464     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1465     and <title> outside of head).
1466    
1467     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1468    
1469 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1470     and private types/subtypes.
1471    
1472     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1473     URIChecker.pm, HTMLTable.pm: Error messages are now
1474     consistent; they are all listed in
1475     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1476    
1477 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1478    
1479 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1480     is now erred. |<datalist>| is implemented.
1481     Attribute checker for |<command>| and |<menu>| are
1482     added. Support for |contextmenu| global attribute
1483     is added.
1484    
1485     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1486    
1487 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1488     of Step 3 has been changed.
1489    
1490     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1491    
1492 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1493    
1494 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1497     incorrect.
1498    
1499     * ContentChecker.pm: Support for the |footer| element.
1500     Check URI syntax for space-separated URI attributes.
1501     Support for the |tabindex| attribute. Support
1502     for |datetime| attribute.
1503    
1504     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1505    
1506 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1507     character references). HTML5 revision 1.146 (white space
1508     characters before root start tag). HTML5 revision
1509     1.148 (named character references in attribute values).
1510     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1511    
1512     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1513    
1514     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1515 wakaba 1.49 in <head>).
1516    
1517     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1518    
1519 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1520     in body).
1521    
1522     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1523    
1524     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1525    
1526 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1527     or RCDATA in fragment parsing mode). HTML5 revision
1528     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1529    
1530     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1531    
1532     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1533     for <pre> and <textarea>).
1534    
1535 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1536    
1537 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1538    
1539     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1540    
1541 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1542    
1543     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1544    
1545 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1546     HTML5 revision 917 (conforming bare &).
1547    
1548     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1549    
1550 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1551    
1552     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1553     HTML5 revision 915 (<nobr>).
1554    
1555     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1556    
1557 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1558    
1559     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1560     New attributes.
1561    
1562     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1563    
1564 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1565     in tags).
1566    
1567     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1568    
1569 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1570    
1571     * HTML.pm.src: |$entity_char| is removed and
1572     requires |Whatpm::_NamedEntityList| instead.
1573     HTML5 revision 898 (refc), except that lack of refc
1574     is parse error.
1575    
1576     * mkentitylist.pl: New script.
1577    
1578     * Makefile (all): |_NamedEntityList.pm| is added.
1579     (_NamedEntityList.pm, Entities.html): New rules.
1580    
1581     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1582    
1583 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1584     were ignored and U+000D immediately following another
1585     U+000D was not converted to U+000A.
1586    
1587     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1588    
1589 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1590     nodes before appended). Parser was not ready for NULL
1591     parse error and escape flag.
1592    
1593     * NanoDOM.pm (adopt_node): New.
1594    
1595     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1596    
1597 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1598     CDATA and RCDATA). Note that current implementation
1599     is simply repeating what the spec says and it is maybe not
1600     a best way to do it.
1601    
1602     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1603    
1604 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1605     the form element if a descendant element without implied end tag has
1606     still been open).
1607    
1608     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1611     space characters non-conforming).
1612    
1613     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1616     HTML5 revision 869 (C1 character references).
1617    
1618     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1619    
1620 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1621     a |textarea| is removed).
1622    
1623 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1624    
1625     * NanoDOM.pm (get_attribute_node_ns): New method.
1626    
1627     * ContentChecker.pm: |script| |async| and |defer|
1628     no longer require |src|. |async| MUST NOT be
1629     specified if |defer|. (HTML5 revision 858).
1630    
1631 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1632    
1633     * HTML.pm.src: |<form><form>| went to inifinite loop.
1634    
1635 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1636    
1637 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1638     as a document element) flag on.
1639     (new): Removed.
1640     (check_document): New method.
1641    
1642     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1643    
1644 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1645     were incorrect.
1646    
1647     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1648    
1649 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1650     error position in the |type| option of the error.
1651    
1652     * HTMLTable.pm (form_table): The |$onerror| parameter
1653     is now optional. Some bugs are fixed.
1654    
1655     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1656    
1657 wakaba 1.27 * HTMLTable.pm: New module.
1658    
1659     * ContentChecker.pm (table): Invoke table model error checker.
1660    
1661     * NanoDOM.pm (first_child, get_attribute_ns): New.
1662    
1663 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1664    
1665 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1666     (link/@rel, a/@rel, area/@rel): Use new checker.
1667    
1668     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1669    
1670     * _LinkTypeList.pm: New file.
1671    
1672     * mklinktypelist.pl: New file.
1673    
1674     * .cvsignore: |RelExtensions.html| added.
1675    
1676     * NanoDOM.pm (child_nodes): Returns an empty array
1677     for non-child-containing node types.
1678     (text_content): New attribute.
1679    
1680     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1681    
1682 wakaba 1.25 * IMTChecker.pm: New module.
1683    
1684     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1685     to test parameter value validity.
1686    
1687     * HTML.pm.src ($style_start_tag): Attributes were
1688     discarded.
1689    
1690 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1691    
1692 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1693    
1694     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1695    
1696 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1697     and RFC 3987 are listed (not all testable items are checked yet).
1698    
1699     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1700    
1701 wakaba 1.22 * URIChecker.pm: New module.
1702    
1703 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1704    
1705 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1706     or associated to some placeholder.
1707     ($ElementDefault): Warn unknown attributes
1708     for unknown elements as "attribute not supported".
1709     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1710     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1711     (|a| attribute checker): Reimplemented.
1712    
1713     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1716     ($HTMLAttrChecker): Event handler content attributes
1717     are added.
1718     (link, embed): Required attribute is now checked.
1719     (embed): Unknown local attributes are no longer warned.
1720    
1721     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1722    
1723 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1724     ($HTMLIMTAttrChecker): New checker.
1725     (link@rel, link@href, link@type, style@type,
1726     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1727     object@data, object@type, source@src, source@type, area@alt,
1728     area@shape, area@coords, area@href,
1729     area@ping, area@rel, area@type, script@src,
1730     script@defer, script@async, script@type): Checkers added.
1731    
1732     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1733    
1734 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1735    
1736 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1739     attributes. Report an error if |Element.prefix|
1740     is |xmlns|.
1741    
1742     * NanoDOM.pm (prefix): New attribute.
1743    
1744     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1745    
1746 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1747     mode, action for |<iframe>| was missing.
1748    
1749     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1752     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1753    
1754     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1755    
1756 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1757     ($HTMLURIAttrChecker): Placeholder.
1758     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1759     New.
1760    
1761     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1762    
1763 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1764     for global attributes, |html|, |base|, |style|, and |meta|.
1765    
1766     * NanoDOM.pm (insert_before): Weaken reference
1767     to the parent node.
1768     (Attr::new): Set |owner_element| attribute.
1769     (namespace_uri, manakai_local_name): New attribute implementations.
1770     (owner_element): New attribute.
1771    
1772     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1773    
1774 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1775     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1776     $Element->{$HTML_NS}->{''}): New.
1777     (check_element): Invoke attrs_checker for each element.
1778    
1779 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1780    
1781 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1782    
1783     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1784    
1785 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1786     element type testings.
1787    
1788     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1789    
1790 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1791     error when an element type is put in the "minus" list
1792     and the element type is not allowed explicitly in the particular
1793     element content model.
1794     (html:a checker): New checker.
1795     (html:details, html:datagrid): New checkers.
1796     (html:legend): New checker.
1797    
1798     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1799    
1800 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1801    
1802     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1803    
1804 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1805     checker.
1806     (html:dd checker): New checker.
1807     (html:q, html:em, html:strong, html:small,
1808     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1809    
1810     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1811    
1812 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1813     from |$ElementDefault->{checker}|.
1814     ($ElementDefault->{checker}): Throw an error that
1815     the element type is not supported by the checker.
1816     ($HTMLMetadataElement): |html:base| was missing.
1817     ($HTMLEmptyChecker): Don't throw an error
1818     for inter-element whitespace nodes.
1819     (html:html checker): Errors were not
1820     thrown even if |html:head| and/or |html:body|
1821     children were missing.
1822     (html:head checker): An error was not
1823     thrown if <meta charset> appered after other
1824     elements.
1825    
1826 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1827    
1828     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1829     and |noscript| elements are implemented.
1830     (new): New method.
1831    
1832 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1833    
1834 wakaba 1.4 * ContentChecker.pm: New module.
1835    
1836     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1837    
1838 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1839     document_element, manakai_local_name,
1840     manakai_element_type_match): New method.
1841    
1842     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1843    
1844     * HTML.pm.src: Replace decimal and hexadecimal numeric
1845     entities in C1 range using Windows-1252 mapping. Bare LF
1846     did not count as new line for error reporting.
1847    
1848 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1849    
1850     * NanoDOM.pm (DOMImplementation): New class.
1851     (append_child): Weaken the |parent_node| reference.
1852     (create_element_ns, Element new): Set the |owner_document|
1853     reference.
1854     (implementation): New attribute.
1855     (owner_document, local_name, namespace_uri): New attributes.
1856    
1857     * HTML.pm.src (parse_string): Line and column numbers
1858     are now provided to error handler.
1859     (!!!parse-error): Short descriptions are added.
1860     (_construct_tree): Split into three methods; support
1861     for innerHTML mode.
1862     (set_inner_html): New method.
1863    
1864 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1865    
1866     * NanoDOM.html: Documentation is added.
1867    
1868     * HTML.pod, ContentType.html: Documentation is revised.
1869    
1870     * .cvsignore: Pod2html temporary files are added.
1871    
1872     * Makefile: Make |NanoDOM.html|.
1873    
1874     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1875    
1876     * HTML.pm.src (parse_string): New method.
1877     (get_inner_html): Renamed from |inner_html|.
1878    
1879     * Makefile: A rule for |HTML.html| is added.
1880    
1881     * HTML.pod: New documentation.
1882    
1883     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1884    
1885     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1886     (clone_node): Attribute nodes were not completely copied.
1887    
1888     * HTML.pm.src: Many bugs are fixed.
1889    
1890     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1891    
1892     * HTML.pm.src: Some typos are fixed.
1893    
1894     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1895    
1896     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1897     are removed.
1898    
1899     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1900    
1901     * HTML-consume-entity.src: Removed.
1902    
1903     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1904    
1905     * HTML.pm.src: Tokenizer's handling on hexadecimal
1906     numeric entities are rewritten.
1907    
1908     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1909    
1910     * HTML.pm.src: Some tokenizer bugs are fixed.
1911    
1912     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1913    
1914     * HTML.pm.src: The tree construction stage is implemented.
1915    
1916     * mkhtmlparser.pl: New macros are added.
1917    
1918     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1919    
1920     * ContentType.pm: A note on bug in the specification
1921     is removed since it's been now fixed.
1922    
1923     * .cvsignore: New file.
1924    
1925     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1926    
1927     * HTML.pm.src, HTML-consume-entity.src: New files.
1928    
1929     * Makefile (HTML.pm): New rule.
1930    
1931     * mkhtmlparser.pl: New script.
1932    
1933     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1934    
1935     * Makefile: New file.
1936    
1937     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1938    
1939     * ContentType.pm: An error in pod is fixed.
1940    
1941     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1942    
1943     * ContentType.pm: Documentation is added.
1944    
1945     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1946    
1947     * ContentType.pm: New Perl module.
1948    
1949     * ChangeLog: New file.
1950    
1951    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24