/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.324 - (hide annotations) (download)
Thu Sep 18 14:32:48 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.323: +7 -0 lines
++ whatpm/t/ChangeLog	18 Sep 2008 14:32:17 -0000
	* langtag-1.dat: More test data from RFC 4646 are added.

2008-09-18  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	18 Sep 2008 14:31:58 -0000
	* LangTag.pm: Add checks for remaining requirements from RFC 4646.

	* mklangreg.pl: Sort 'Prefix' values by their length, to ease
	matching.

2008-09-18  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
4    
5     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
6     matching.
7    
8     2008-09-18 Wakaba <wakaba@suika.fam.cx>
9    
10 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
11     typos fixed. Support for Suppress-Script field.
12    
13     * mklangreg.pl: Support for dumping of nested structure.
14    
15     2008-09-18 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
18     recommended case as per RFC 4646.
19    
20     2008-09-18 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
23    
24     2008-09-18 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.320 * mklangreg.pl: New script.
27    
28     * Makefile: Updated for creation of the module for language subtag
29     registry.
30    
31 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
32    
33 wakaba 1.319 * Makefile: WebIDL.html added.
34    
35     * WebIDL.pod: New documentation.
36    
37     2008-09-16 Wakaba <wakaba@suika.fam.cx>
38    
39 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
40    
41     2008-09-16 Wakaba <wakaba@suika.fam.cx>
42    
43 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
44     compatible parsing bugs are fixed. Some unreachable codes are
45     commented out.
46    
47     2008-09-16 Wakaba <wakaba@suika.fam.cx>
48    
49 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
50     added. It does not satisfy the definition that a forward
51     interface declaration has an extended attribute. It seems that
52     unless explicitly allowed multiple extended attributes with the
53     same name is not allowed, though it is not explicitly mentioned in
54     the spec.
55    
56     2008-09-16 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
59     attribute identifiers. Preserve whether an extended attribute has
60     an argument list of not. Support for extended attributes:
61     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
62     NameGetter, NameSetter, and Null.
63     (has_argument_list): New attribute.
64     (idl_text): Stringifies argument lists, if any, even if it is
65     empty.
66    
67 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
70     |{s_kwd}| in DATA_STATE as default.
71    
72     2008-09-15 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
75     by |{s_kwd}| in DATA_STATE.
76    
77     2008-09-15 Wakaba <wakaba@suika.fam.cx>
78    
79 wakaba 1.312 * HTML.pm.src: Shorten keys.
80    
81     2008-09-15 Wakaba <wakaba@suika.fam.cx>
82    
83 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
84     pair, or noncharacter code points and non-Unicode code
85     points (they should be handled by Whatpm::Charset::UnicodeChecker).
86     (parse_char_stream): Support for the |$get_wrapper| argument and
87     character stream error handlers.
88    
89     2008-09-15 Wakaba <wakaba@suika.fam.cx>
90    
91 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
92     for null-namespace elements/attributes.
93    
94     * HTML.pm.src: Fact out $disallowed_control_chars
95     as a hash.
96    
97 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
98    
99 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
100     and |{next_char}| initializations are moved to initialization
101     method. |{read_until}| now supports buffering. Sync |set_inner_html|
102     with |parse_char_stream|.
103    
104     2008-09-14 Wakaba <wakaba@suika.fam.cx>
105    
106 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
107     invoke |manakai_read_until|, not only |read|, where
108     possible, to decrease the number of |read| method calls.
109    
110     * mkhtmlparser.pl: Related changes to the aforementioned
111     modification.
112    
113     2008-09-14 Wakaba <wakaba@suika.fam.cx>
114    
115 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
116     would report character error from now.
117    
118     2008-09-14 Wakaba <wakaba@suika.fam.cx>
119    
120 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
121     tokens in "before head insertion mode" was not
122     correctly handled.
123     (set_inner_html): Reimplemented using CharString decodehandle
124     class. Support for $get_wrapper argument. Support
125     for |{read_until}| feature.
126    
127     2008-09-14 Wakaba <wakaba@suika.fam.cx>
128    
129 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
130     entities point the "&" character.
131    
132     2008-09-14 Wakaba <wakaba@suika.fam.cx>
133    
134 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
135     be added to the list of excluded characters.
136    
137     2008-09-14 Wakaba <wakaba@suika.fam.cx>
138    
139 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
140     and |column| a higher priority than the one set by the
141     tokenizer's input handler.
142     ($self->{read_until}): Exclude U+FFFD (but this might
143     not be necessary, since now we do line/column fixup in
144     the character decode handle).
145    
146     2008-09-14 Wakaba <wakaba@suika.fam.cx>
147    
148 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
149    
150     2008-09-14 Wakaba <wakaba@suika.fam.cx>
151    
152 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
153     and |manakai_getc_until| to |manakai_read_until| to
154     reduce the number of string copies.
155    
156     2008-09-14 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
159     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
160     standard feature to |open| a string as a filehandle,
161     since Perl's string filehandle seems not supporting |ungetc|
162     method correctly.
163     (parse_char_stream): Define |{getc_until}| method.
164     (DATA_STATE): Experimental support for |getc_until| feature.
165    
166 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
169    
170     2008-09-13 Wakaba <wakaba@suika.fam.cx>
171    
172 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
173     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
174     and replaced by |{prev_state}|.
175    
176     * mkhtmlparser.pl: Remove |{char}| feature.
177     Remove |!!!back-next-input-character;| macro.
178    
179     2008-09-13 Wakaba <wakaba@suika.fam.cx>
180    
181 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
182     entity related tokenizer states in favor of new states
183     implementing the consume character reference algorithm.
184    
185     2008-09-13 Wakaba <wakaba@suika.fam.cx>
186    
187     * HTML.pm.src: "Consume a character reference" algorithm is
188 wakaba 1.296 now implemented as a tokenizer's state, rather than
189     a method, with minimum changes (more changes will
190     be made, in due course). "Bogus comment state"'s inner
191     loop gets removed.
192    
193     2008-09-13 Wakaba <wakaba@suika.fam.cx>
194    
195 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
196 wakaba 1.295 into their own tokenizer states.
197    
198     2008-09-13 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
201     is split into three states.
202    
203     2008-09-13 Wakaba <wakaba@suika.fam.cx>
204    
205 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
206     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
207     no longer does the tokenizer have to push back next input
208     characters in those states.
209    
210     2008-09-13 Wakaba <wakaba@suika.fam.cx>
211    
212 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
213     into four states so that no longer does the tokenizer have to push
214     back next input characters in that state.
215    
216 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
217    
218     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
219     which can be used to insert some wrapper between the character
220     stream handle and the tokenizer. (It is currently not supported
221     for |set_inner_html| for |Element|s).
222    
223 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
224    
225 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
226    
227     2008-09-10 Wakaba <wakaba@suika.fam.cx>
228    
229 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
230    
231     * HTML.pm.src: Don't specify |text| argument for the
232     |chardecode:fallback| error, since it is not the encoding
233     being used alternatively.
234    
235 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
236    
237     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
238    
239 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
240    
241 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
242    
243     2008-08-31 Wakaba <wakaba@suika.fam.cx>
244    
245 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
246     to after after frameset insertion mode processing (HTML5
247     revision 1909). Note that the implementation was wrong
248     per the old spec before the r1909 changes.
249    
250 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
251    
252 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
253     spec (HTML5 revision 2093).
254     ($process_row): Algorithm step numbers synced with the
255     spec (HTML5 revision 2092).
256    
257     2008-08-30 Wakaba <wakaba@suika.fam.cx>
258    
259 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
260     revision 2094).
261    
262     2008-08-30 Wakaba <wakaba@suika.fam.cx>
263    
264     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
265    
266     2008-08-30 Wakaba <wakaba@suika.fam.cx>
267    
268 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
269     name (after another attribute) now raise parse error (HTML5
270     revision 2123). Empty unquoted attribute values are no
271     longer allowed (HTML5 revision 2122).
272    
273     2008-08-30 Wakaba <wakaba@suika.fam.cx>
274    
275 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
276     revision 2130).
277    
278     2008-08-30 Wakaba <wakaba@suika.fam.cx>
279    
280 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
281     as |lang| attribute value for HTML elements (HTML5 revision 2062
282     and so on).
283    
284     2008-08-30 Wakaba <wakaba@suika.fam.cx>
285    
286 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
287     was missing.
288    
289     * URIChecker.pm: The end of the URL should be marked as the
290     error location for an empty path error. The position
291     between the userinfo and the port components should be
292     marked as the error location for an empty host error.
293    
294     2008-08-30 Wakaba <wakaba@suika.fam.cx>
295    
296 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
297     value the error occurs for errors. Report unknown
298     address format error in warning level, since address
299     formats are rarely added. Path segments starting with "/.."
300     were misinterpreted as a dot-segment.
301    
302     2008-08-30 Wakaba <wakaba@suika.fam.cx>
303    
304 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
305     |Message::DOM::DOMImplementation|.
306    
307 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
308    
309     * IMTChecker.pm: Updated for the new error reporting architecture.
310    
311     * ContentChecker.pm: Error levels for IMTs are added.
312    
313 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
314    
315     * H2H.pm (_shift_token): Support for unquoted HTML attribute
316     values.
317    
318 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
319    
320 wakaba 1.275 * CacheManifest.pm: Support for new style of error
321     reports.
322    
323     * HTML.pm.src: Set line=1, column=1 to the document node.
324    
325     2008-08-16 Wakaba <wakaba@suika.fam.cx>
326    
327 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
328     and URL checkers. Support for more error levels for bogus
329     langauge tag and URL "standards".
330    
331     * LangTag.pm, URIChecker.pm: Support for new style error
332     level reporting.
333    
334 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
337    
338     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
339     specifying. Error types are revised.
340    
341     2008-08-15 Wakaba <wakaba@suika.fam.cx>
342    
343 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
344     renewed.
345    
346     2008-08-15 Wakaba <wakaba@suika.fam.cx>
347    
348 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
349     are revised. Use new style for "level" specification.
350    
351     * mkhtmlparser.pl: Use new style for "level" specification.
352    
353 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
354    
355 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
356     reporting process for broken ignored valuetype definition.
357     (Valuetype idl_text): Support for special "DOMString" name.
358    
359     2008-08-03 Wakaba <wakaba@suika.fam.cx>
360    
361 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
362     terminal of the ScopedName is "DOMString", such that whether
363     the last part of the scoped name is "DOMString" or "_DOMString"
364     later. It is necessary to determine whether a |typedef|
365     definition should be ignored or not.
366     (parse_char_string): Unescape the identifier of
367     exception members.
368     ($resolve): Return undef for builtin types and sequence<T>
369     types (we might not have to do this, however...).
370     (check): Support checking for Exceptions, Valuetypes,
371     and Typedefs.
372     ($serialize_type): Support for "DOMString::::" syntax.
373     (Typedef idl_text): Output Type as "DOMString" if it
374     is really "DOMString" (i.e. its internal representation
375     is "::DOMString::").
376    
377     2008-08-03 Wakaba <wakaba@suika.fam.cx>
378    
379 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
380     for constant types in the |check| method.
381     (check): Support for checking of attributes, operations, and
382     arguments.
383     (Attribute/Operation idl_text): Exception names in getraises,
384     setraises, and raises clauses is serizlied by |$serialize_type|
385     code.
386    
387 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
388    
389 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
390     hexadecimal numbers (the original pattern, taken from the spec,
391     was not work for hexadecimal numbers, because the "0" prefix
392     matches to the [0-7]* part (as an empty string) and therefore
393     it does not match with remaining "x..." part of a "0x..." integer
394     literal.
395     ($get_type): It now returns a string, not an array reference,
396     for regular types and |sequence| types (i.e. it in any case
397     returns a string).
398     ($get_next_token): The second item in the array that represents
399     a integer or float token is now a Perl number value, not the
400     original string representation of the number.
401     (check): Support for const value consistency checking.
402     No extended attribute is defined for constants.
403     (Node subclasses): Use simple strings rather than array references
404     for default data type values.
405     ($serialize_type): Type values are now simple strings.
406     (value): If the new attribute value is a false value, then
407     a FALSE value is set to the attribute.
408    
409     2008-08-02 Wakaba <wakaba@suika.fam.cx>
410    
411 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
412     in its stringified format ("scoped name" as defined in the
413     spec). Note that future version of this module should not use
414     array references for type values and the |type_text| attribute
415     should be made obsolete.
416     (parse_char_string): Unescape attribute names.
417     (check): Support for checking of whether inherited interfaces
418     are actually defined or not. Support for checking of whether
419     interface member identifiers are duplicated or not.
420     ($serialize_type): Scoped names are returned as is. A future
421     version of this code should escape identifiers other than "DOMString",
422     otherwise the idl_text would be non-conforming.
423    
424     2008-08-02 Wakaba <wakaba@suika.fam.cx>
425    
426 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
427 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
428     for Definition's were ignored.
429     (append_child): Set |parent_node| attribute.
430     (parent_node): New attribute.
431     (check): Support interface/exception members. Support
432     extended attributes. Support definition identifier uniqueness
433     constraint.
434     (qualified_name): New attribute.
435     (Interface/Exception idl_text): Extended attributes were
436     not prepended to the returned text.
437    
438     2008-08-02 Wakaba <wakaba@suika.fam.cx>
439    
440     * WebIDL.pm (parse_char_string): Set line/column numbers
441 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
442     would make things wrong. Support for interface forward
443     declarations was missing. Broken interface declarations
444     with no block were not ignored entirely.
445     (Whatpm::WebIDL::Node): New abstract class. This class
446     makes things easier.
447     (child_nodes): New attribute. Unlike DOM's attribute with
448     same name, this attribute returns a dead list of nodes for
449     simplicity.
450     (get_user_data, set_user_data): New methods.
451     (Module idl_text): A SPACE character should be inserted
452     before the |{| character.
453     (Interface idl_text): Support for interface forward declarations.
454     (is_forward_declaration): New attribute.
455    
456 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
457    
458 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
459    
460     2008-07-19 Wakaba <wakaba@suika.fam.cx>
461    
462 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
463     it now can handle broken extended attributes and as such.
464    
465     2008-07-19 Wakaba <wakaba@suika.fam.cx>
466    
467 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
468     Support for extended attributes with arguments.
469    
470     2008-07-19 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
473     (Interface->idl_text): Tentative support for inheritances.
474    
475     2008-07-19 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
478     Support for raises, setraises, and getraises syntaxes.
479    
480 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
481    
482     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
483     proper support for types, extended attributes, and exceptions yet).
484     WebIDL parser, version 1 (no support for exceptions yet,
485     no proper support for extended attributes yet).
486    
487 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
488    
489     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
490    
491 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
492    
493 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
494    
495     2008-06-29 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.255 * WebIDL.pm: New module.
498    
499 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
500    
501     * Makefile (Entities.html): URI changed.
502    
503 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
504    
505     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
506    
507 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
508    
509 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
510    
511     2008-06-01 Wakaba <wakaba@suika.fam.cx>
512    
513 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
514     spec (commented out). (HTML5 revision 1711).
515    
516 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
517    
518 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
519    
520     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
521     as UTF-16 (HTML5 revision 1701).
522    
523     2008-05-25 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
526     1692).
527    
528     2008-05-25 Wakaba <wakaba@suika.fam.cx>
529    
530     * HTML.pm.src: The secondary insertion mode used when switching
531     to foreign content is the "in body" insertion mode (HTML5 revision
532     1696).
533    
534     2008-05-25 Wakaba <wakaba@suika.fam.cx>
535    
536 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
537     revision 1697).
538    
539 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
540    
541 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
542     insertion mode (HTML5 revision 1693). Update SVG camelCase
543     attribute list (HTML5 revision 1700). <textarea> closes
544     </select> (HTML5 revision 1699). More start tags close in
545     foreign content insertion mode (HTML5 revision 1698).
546    
547     2008-05-24 Wakaba <wakaba@suika.fam.cx>
548    
549 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
550    
551     2008-05-24 Wakaba <wakaba@suika.fam.cx>
552    
553 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
554     revision 1674).
555    
556     2008-05-24 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
559     revision 1676).
560    
561     2008-05-24 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
564     quriks mode detection (HTML5 revision 1679).
565    
566     2008-05-24 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
569     revision 1685).
570    
571     2008-05-24 Wakaba <wakaba@suika.fam.cx>
572    
573 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
574     revision 1684).
575    
576     2008-05-24 Wakaba <wakaba@suika.fam.cx>
577    
578 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
579     work (HTML5 revision 1690).
580    
581     2008-05-24 Wakaba <wakaba@suika.fam.cx>
582    
583 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
584     end tags in head insertion modes (HTML5 revision 1686).
585     (parse_char_string): Bug fix for non-utf8 character string handlings.
586     (parse_char_stream): |ungetc| does not work well for this context.
587    
588 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
591     |parse_byte_stream|.
592     (parse_byte_stream): New method.
593    
594     2008-05-18 Wakaba <wakaba@suika.fam.cx>
595    
596 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
597     by encoding layer error reporter.
598    
599 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
602     rather than converting the whole byte string and then parsing.
603     Propagate errors in character encoding layer.
604     (get_next_token): Precise error reporting for |bare stago| error.
605    
606     2008-05-17 Wakaba <wakaba@suika.fam.cx>
607    
608 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
609     (parse_char_string): This method is now defined as an invocation
610     of the |parse_char_stream| method.
611    
612     2008-05-17 Wakaba <wakaba@suika.fam.cx>
613    
614 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
615     of the sniffing as info-level errors. Support for new
616     decoding framework in parser resestting.
617     (new): Various default error levels were not set.
618    
619     2008-05-17 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
622     algorithm, except for the actual sniffing, is implemented
623     with new framework with Message::Charset::Info.
624    
625 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
626    
627     * CacheManifest.pm (_parse): Drop fragment identifiers from
628     URIs in fallback section (HTML5 revision 1596).
629    
630 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
631    
632 wakaba 1.231 * Makefile (Entities.html): URI has changed.
633    
634     2008-05-10 Wakaba <wakaba@suika.fam.cx>
635    
636 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
637     1553).
638    
639 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
640    
641 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
642     no longer table elements, but table objects returned
643     by Whatpm::HTMLTable.
644    
645     * HTMLTable.pm (form_table): Return table element node
646     as |$table->{element}|.
647     (assign_header): Support for the |headers=""| attribute.
648    
649     2008-05-06 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
652     with no support for headers="".
653     (form_table): Include table width and height to the returned
654     table object for covenience. Indexing in column assignement was wrong.
655     Set whether a data cell is empty or not for convenience.
656    
657 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
658    
659 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
660    
661     2008-05-05 Wakaba <wakaba@suika.fam.cx>
662    
663 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
664     changed (HTML5 revision 1387).
665    
666     2008-05-05 Wakaba <wakaba@suika.fam.cx>
667    
668 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
669     to the end of the table (HTML5 revision 1380).
670    
671     2008-05-05 Wakaba <wakaba@suika.fam.cx>
672    
673 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
674     of 1-based (HTML5 revision 1376).
675    
676     2008-05-05 Wakaba <wakaba@suika.fam.cx>
677    
678 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
679     and "Content-Encoding" no longer prevent sniffing (HTML5
680     revision 1288).
681    
682     2008-05-05 Wakaba <wakaba@suika.fam.cx>
683    
684 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
685     1282).
686    
687 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
688    
689 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
690     Status of XML specs are added.
691    
692     2008-05-03 Wakaba <wakaba@suika.fam.cx>
693    
694 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
695     and |ref=""| attribute (referent element type checking).
696    
697 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
698    
699 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
700     1500) is implemented (except for IDNA part and URI-scheme-specific
701 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
702     documents.
703    
704 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
707     character (HTML5 revision 1263).
708    
709     2008-04-13 Wakaba <wakaba@suika.fam.cx>
710    
711 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
712     definition for |AElig;| was missing).
713    
714     2008-04-13 Wakaba <wakaba@suika.fam.cx>
715    
716 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
717     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
718    
719     2008-04-13 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
722     insertion mode is added (HTML5 revisions 1412 and 1418).
723    
724     2008-04-13 Wakaba <wakaba@suika.fam.cx>
725    
726 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
727     elements (HTML5 revision 1410).
728    
729     2008-04-13 Wakaba <wakaba@suika.fam.cx>
730    
731 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
732     revision 1406).
733    
734     2008-04-13 Wakaba <wakaba@suika.fam.cx>
735    
736 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
737     elements (HTML5 revision 1404 and 1420).
738    
739 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
740    
741 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
742     elements (HTML5 revision 1404). Unused !!!macro definitions
743     are removed.
744    
745     2008-04-12 Wakaba <wakaba@suika.fam.cx>
746    
747 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
748     are raised is changed (HTML5 revision 1404).
749    
750 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
751    
752 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
753    
754     2008-04-06 Wakaba <wakaba@suika.fam.cx>
755    
756 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
757     replaced by constants representing element category.
758    
759     2008-04-06 Wakaba <wakaba@suika.fam.cx>
760    
761 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
762     for error reporting. (This is a preparation for using constant
763     value for ->[1].)
764    
765 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
766    
767 wakaba 1.205 * RDFXML.pm: Typo fixed.
768    
769     2008-03-22 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
772     as same as |must_level|, i.e. level = |m|.
773     (check_element): Make list of URIs in the DOM.
774    
775 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
776    
777 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
778     Local (null-namespace) attribute support.
779    
780     2008-03-21 Wakaba <wakaba@suika.fam.cx>
781    
782 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
783    
784     2008-03-21 Wakaba <wakaba@suika.fam.cx>
785    
786 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
787     and URI attributes is implemented. Warn if unknown
788     value is used in rdf:parseType="" attribute.
789    
790     * URIChecker.pm (check_rdf_uri_reference): New function.
791    
792     2008-03-21 Wakaba <wakaba@suika.fam.cx>
793    
794 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
795     are now resolved.
796    
797     2008-03-21 Wakaba <wakaba@suika.fam.cx>
798    
799 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
800    
801     * RDFXML.pm: undef vs false bug fixed.
802     Reification implemented.
803    
804     2008-03-21 Wakaba <wakaba@suika.fam.cx>
805    
806 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
807     The |node| arguments are added for |ontriple| calls.
808     Too many "attribute not allowed" errors were raised.
809    
810     * ContentChecker.pm: Initial experimental support for rdf:RDF
811     element.
812    
813     2008-03-21 Wakaba <wakaba@suika.fam.cx>
814    
815 wakaba 1.197 * RDFXML.pm: New module.
816    
817 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
818    
819 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
820     code was old one yet.
821    
822     2008-03-20 Wakaba <wakaba@suika.fam.cx>
823    
824 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
825     errors. Line/column markings for DOCTYPE, comment, and
826     character tokens are reintroduced; otherwise,
827     error location for "not HTML5" error and errors
828     for implied elements are not attached.
829    
830     2008-03-20 Wakaba <wakaba@suika.fam.cx>
831    
832 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
833    
834     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
835     (value): Setter implemented.
836    
837     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
838    
839     2008-03-20 Wakaba <wakaba@suika.fam.cx>
840    
841 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
842    
843     2008-03-20 Wakaba <wakaba@suika.fam.cx>
844    
845 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
846     flag, which is no longer used.
847    
848 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
849    
850 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
851    
852     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
853     and !!!insert-element-t): Set line/column information to
854     element nodes.
855    
856     2008-03-17 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
859     in "<?", "<>", or "</>" should be the error point.
860    
861 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
864    
865     2008-03-16 Wakaba <wakaba@suika.fam.cx>
866    
867 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
868    
869     2008-03-16 Wakaba <wakaba@suika.fam.cx>
870    
871 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
872     reporting.
873    
874 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
875    
876 wakaba 1.186 * HTML.pm.src: Error type revised.
877    
878     2008-03-11 Wakaba <wakaba@suika.fam.cx>
879    
880 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
881    
882     2008-03-11 Wakaba <wakaba@suika.fam.cx>
883    
884 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
885    
886 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
887    
888     * mkhtmlparser.pl: Set "level" parameter to parse errors.
889    
890     * HTML.pm.src: Code refined.
891    
892 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
893    
894 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
895     revision 1348). Note that I really don't know this makes
896     any difference in the black-box behavior of the parser.
897    
898     2008-03-09 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
901     revision 1348).
902    
903     2008-03-09 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
906    
907     2008-03-09 Wakaba <wakaba@suika.fam.cx>
908    
909 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
910    
911     2008-03-09 Wakaba <wakaba@suika.fam.cx>
912    
913 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
914     mode and support for the "in select in table" insertion mode (HTML5
915     revision 1342).
916    
917     2008-03-09 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
920     in non-tainted <table>s (HTML5 revision 1336).
921    
922     2008-03-09 Wakaba <wakaba@suika.fam.cx>
923    
924 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
925     and <head> (HTML5 revision 1332).
926    
927     2008-03-09 Wakaba <wakaba@suika.fam.cx>
928    
929 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
930     white space (HTML5 revision 1331).
931    
932 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
933    
934 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
935     element (HTML5 revision 1330).
936    
937     2008-03-08 Wakaba <wakaba@suika.fam.cx>
938    
939 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
940     element (HTML5 revision 1328).
941    
942     2008-03-08 Wakaba <wakaba@suika.fam.cx>
943    
944 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
945     into foster parents (HTML5 revision 1326).
946    
947     2008-03-08 Wakaba <wakaba@suika.fam.cx>
948    
949 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
950     revision 1321).
951    
952     2008-03-08 Wakaba <wakaba@suika.fam.cx>
953    
954 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
955     the spec (HTML5 revision 1320). Note that we finally complete
956     all of HTML5 revision 1320 changes.
957    
958     2008-03-08 Wakaba <wakaba@suika.fam.cx>
959    
960 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
961     tags (HTML5 revision 1320).
962    
963     2008-03-08 Wakaba <wakaba@suika.fam.cx>
964    
965 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
966    
967     2008-03-08 Wakaba <wakaba@suika.fam.cx>
968    
969 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
970     could not be reached so that it is now removed (HTML5 revision
971     1320).
972    
973     2008-03-08 Wakaba <wakaba@suika.fam.cx>
974    
975 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
976     to align with new spec text (HTML5 revision 1320).
977    
978     2008-03-08 Wakaba <wakaba@suika.fam.cx>
979    
980 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
981     have had no harm in theory.
982    
983     2008-03-08 Wakaba <wakaba@suika.fam.cx>
984    
985 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
986     case is split from other end tags for the preparation
987     of implementing HTML revision 1320.
988    
989 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
990    
991 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
992     revision 1320).
993    
994     2008-03-07 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
997     and "div" (HTML5 revision 1318). Add comments to where
998     |form| pointer association codes should be inserted (HTML5 revision
999     1319).
1000    
1001 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1002    
1003     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1004    
1005 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1006    
1007 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1008     other token type, there are), it is replaced by a |die| statement.
1009    
1010     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1011    
1012 wakaba 1.159 * HTML.pm.src: Typo fixed.
1013    
1014 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1015    
1016     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1017     doctypes were not uppercased for comparison.
1018    
1019 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1020    
1021 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1022     are added.
1023    
1024     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1025    
1026 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1027     are set. Cases that are unlikely reached are noted as so.
1028    
1029     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1030    
1031 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1032    
1033     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1034    
1035     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1036    
1037 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1038     for simplicity.
1039    
1040     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1041    
1042 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1043     revision 1307).
1044    
1045     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1048     is set are changed (HTML5 revision 1305).
1049    
1050 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1051    
1052 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1053     1292). Entities are not parsed in comment-like part in RCDATA
1054     elements (HTML5 revision 1294). Allow bare & at the end
1055     of attribute value literals (HTML5 revision 1296). More
1056     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1057     between attributes and ban attribute names or unquoted
1058     attribute values containing single or double quotes (HTML5
1059     revision 1303).
1060    
1061     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1062    
1063 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1064     and related errors unless it is an HTML document (though the spec
1065     is unclear on whether it is applied to XHTML document).
1066    
1067     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1068     are added.
1069    
1070     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1071    
1072 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1073    
1074 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1075    
1076     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1077    
1078     * HTML.pm.src: |charset| in |content| attribute is
1079     case-insensitive (HTML5 revision 1270).
1080    
1081 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1082    
1083     * ContentChecker.pm: New status constants are added.
1084     ($ElementDefault): |status| added.
1085     (check_element): Err for non-standard or deprecated elements.
1086     (_attr_status_info): For non-standard or deprecated attributes.
1087    
1088 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1089    
1090 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1091    
1092     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1093    
1094 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1095     status information is now dispatched.
1096    
1097     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1098    
1099 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1100     problem by adding some more arguments.
1101    
1102     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1103    
1104 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1105     container-for-the-purpose-of-content-model element (not transparent
1106     element) for |check_child_element| calling and significant
1107     text flag marking. This reintroduces |<del>|-and-significant
1108     problem again.
1109    
1110     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1111    
1112 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1113     elements ignored for the purpose of phase changes in
1114     content model checking.
1115    
1116 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1117    
1118 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1119     was not properly managed for transparent cases.
1120    
1121     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1122    
1123 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1124     and |audio| as semi-transparent elements.
1125    
1126     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1127    
1128 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1129     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1130     Support for |html:object| as a semi-transparent element.
1131    
1132     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1133    
1134 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1135     the tree is entirely revised to make it easier to track
1136     the state of ancestors/descendants. As a result of this
1137     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1138     support for content model checking for HTML elements |figure|,
1139     |object|, |video|, and |audio| and checking for XML elements (and
1140     some XMLNS checkings) are dropped for now. They will be
1141     reimplemented in due cource.
1142    
1143 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1144    
1145 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1146     token (HTML5 revision 1225).
1147    
1148     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1149    
1150 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1151    
1152     * HTML.pm.src: Save whether |meta| |content| attribute
1153     contains character references or not.
1154    
1155     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1156    
1157 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1158     imlement significant content checking for contents
1159     with |del| elements.
1160    
1161     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1162    
1163 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1164     elements are added.
1165     (_get_children): HTML |object| elements are now semi-transparent.
1166    
1167     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1168    
1169 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1170    
1171     * CacheManifest.pm: HTML5 revision 1211 implemented.
1172    
1173     * CacheManifest.pod: Updated.
1174    
1175 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1176    
1177     * ContentChecker.pm (check_document, check_element): Support
1178     for second argument ($onsubdoc).
1179     (_get_css_parser): Removed (now it is part of WDCC).
1180    
1181 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1182    
1183     * ContentChecker.pm (_get_css_parser): New.
1184    
1185 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1186    
1187 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1188     for checking had been used.
1189    
1190     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1191    
1192 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1193    
1194 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1195    
1196 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1197     specced in RFC 2046 and RFC 4288.
1198     (application/atom+xml): Definition added.
1199    
1200     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1201    
1202 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1203     warnings (rather than SHOULD-level errors).
1204    
1205     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1206    
1207 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1208    
1209     * HTML.pm.src: A flag for character references in attribute
1210     values are added. Set |manakai_has_reference| user data
1211     to |charset| attribute.
1212    
1213     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1214    
1215 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1216     attributes.
1217    
1218     * ContentChecker.pm (check_document): Warn if charset requirements
1219     cannot be tested.
1220    
1221 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1222    
1223     * HTML.pm.src (parse_byte_string): Detect charset
1224     by universalchardet if charset parameter is not specified.
1225    
1226     * Makefile (Charset-all, Charset-clean): New rules.
1227    
1228 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1229    
1230 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1231     of character encoding declaration and selection of encoding
1232     for HTML document.
1233    
1234     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1235    
1236 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1237     type in list context.
1238    
1239     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1240    
1241 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1242     ignoring (HTML5 revisions 1013 and 1016).
1243    
1244     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1245    
1246 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1247     white spaces, are implemented.
1248    
1249 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1250    
1251 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1252     attribute if possible.
1253    
1254     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1255    
1256 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1257     (parse_char_string): New alias for |parse_string|.
1258     (main phase): Invoking "change the encoding" algorithm if desired.
1259    
1260     * HTML.pod: Updated.
1261    
1262     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1265    
1266     * Makefile (HTML-all, HTML-clean): New.
1267    
1268     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1269    
1270     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1271    
1272 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1273    
1274     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1275    
1276 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1279    
1280     * CacheManifest.pod: New file.
1281    
1282     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1283    
1284 wakaba 1.114 * CacheManifest.pm: New module.
1285    
1286     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1287    
1288 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1289     callback.
1290    
1291     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1292    
1293 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1294    
1295 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1296    
1297 wakaba 1.111 * Makefile (clean): New rule.
1298    
1299     * NanoDOM.pm (public_id, system_id): New attributes.
1300    
1301     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1302    
1303 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1304    
1305 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1306    
1307     * ContentChecker.pm (check_document): Support for
1308     new |is_xml_root| flag.
1309     (check_element): Support for new |pluses| state.
1310     (_add_pluses): New method.
1311     (_remove_minuses): Support for new |minus| item.
1312    
1313 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1314    
1315 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1316     root element.
1317    
1318     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1319    
1320 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1321    
1322 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1323    
1324 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1325    
1326     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1329    
1330     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1331    
1332 wakaba 1.103 * LangTag.pm: New module.
1333    
1334 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1335    
1336     * HTML.pm.src: Some error types were wrong.
1337    
1338 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1339    
1340 wakaba 1.101 * CSS/: New directory.
1341    
1342     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1345     for |noscript| in |head|.
1346    
1347 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1348    
1349     * URI/: New directory.
1350    
1351 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1354     number.
1355    
1356     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1357    
1358 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1359     by |&|s.
1360    
1361     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1364    
1365     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1368    
1369     * HTML.pm.src: Insertion modes are now represented in number.
1370    
1371     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1374     999) is implemented.
1375    
1376 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1377    
1378     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1379     983) is implemented.
1380    
1381 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1382    
1383     * ContentChecker.pod: New documentation.
1384    
1385     * Makefile: A rule for |ContentChecker.html| is added.
1386    
1387     * ContentChecker.pm: A pod "LICENSE" section is added.
1388    
1389     * NanoDOM.pm ($VERSION): New variable.
1390    
1391 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1392    
1393 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1394     list of allowed HTML elements.
1395    
1396     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1399     HTML elements.
1400    
1401     * URIChecker.pm (check_iri): New.
1402     (check_iri_reference): Error type for IRI reference
1403     syntax error is changed.
1404    
1405 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1406    
1407 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1408     is added.
1409     (check_document): Load appropriate module before validation.
1410    
1411     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1412    
1413 wakaba 1.88 * ContentChecker/: New directory.
1414    
1415     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1416    
1417 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1418    
1419     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1420    
1421 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1422    
1423 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1424     even if no document element is found.
1425    
1426     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1427    
1428 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1429    
1430     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1433    
1434     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1437     is split into "after html body" and "after html frameset"
1438     insertion modes. Their codes are merged with "after body"
1439     and "after frameset" codes. |$previous_insertion_mode|
1440     has been removed. "after frameset" code is
1441     merged with "in frameset" code.
1442    
1443     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1446     merged with the "in head" insertion mode.
1447    
1448     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1451     reorganized.
1452    
1453     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1454    
1455 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1456     are merged.
1457    
1458     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1459    
1460 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1461     with "in table" insertion mode.
1462    
1463     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1464    
1465 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1466     modes are merged.
1467    
1468     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1469    
1470 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1471     insertion mode!
1472    
1473     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1474    
1475 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1476     now directly invoke the handler.
1477    
1478     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1481     is merged to the "in body" insertion mode code.
1482    
1483     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1484    
1485 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1486     insertion modes are merged.
1487    
1488     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1489    
1490 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1491    
1492     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1493    
1494 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1495     instead of string constant for |{content_model_flag}|.
1496    
1497     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1498    
1499 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1500     to the "duplicate attribute" error.
1501    
1502 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1503    
1504 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1505    
1506     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1507    
1508 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1509    
1510     * HTML.pm.src: A typo is fixed.
1511    
1512 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1513    
1514 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1515     from "in HTML:xml:lang" error. Character position
1516     is now the last part of the error type in the URI error
1517     description. Report "unsupported" status for language
1518     tags, media queries, script codes, and style sheets.
1519    
1520     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1523     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1524     in XML.
1525    
1526     * NanoDOM.pm (Attr.owner_document): New attribute.
1527    
1528     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1529    
1530 wakaba 1.66 * HTML.pm.src: The character immediately following
1531     a bare |hcro| was discarded. Fix handling of
1532     entity references in attribute values.
1533    
1534     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1535    
1536 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1537     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1538     factored out. Error types |in html:#DOCTYPE|
1539     and |after html:#DOCTYPE| are merged
1540     into |DOCTYPE in the middle|. |</frameset>|
1541     in fragment parsing mode changed the
1542     insertion mode.
1543    
1544     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1545    
1546 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1547     phase is now an insertion mode. Treatments for white
1548     space character tokens were incorrect for some
1549     insertion modes. An old |meta| case was not removed.
1550    
1551     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1552    
1553 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1554     implemented (but changing the encoding is not yet:-).
1555    
1556 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1557    
1558 wakaba 1.62 * Charset/: New directory.
1559    
1560     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1561    
1562 wakaba 1.61 * H2H.pm: New Perl module (created from
1563     manakai's H2H.dis).
1564    
1565     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1568     manakai's SimpleLS.dis).
1569    
1570 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1571    
1572     * HTML.pm.src: |<!---x-->| was not processed correctly.
1573    
1574 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1575    
1576 wakaba 1.58 * HTML.pm.src: Report correct error message
1577     for |<body></div></body>|.
1578    
1579     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1580    
1581 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1582    
1583     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1584    
1585     * ContentChecker.pm: Return |{term}| list.
1586    
1587 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1588    
1589 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1590     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1591     and <title> outside of head).
1592    
1593     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1594    
1595 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1596     and private types/subtypes.
1597    
1598     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1599     URIChecker.pm, HTMLTable.pm: Error messages are now
1600     consistent; they are all listed in
1601     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1602    
1603 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1604    
1605 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1606     is now erred. |<datalist>| is implemented.
1607     Attribute checker for |<command>| and |<menu>| are
1608     added. Support for |contextmenu| global attribute
1609     is added.
1610    
1611     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1612    
1613 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1614     of Step 3 has been changed.
1615    
1616     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1617    
1618 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1619    
1620 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1621    
1622 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1623     incorrect.
1624    
1625     * ContentChecker.pm: Support for the |footer| element.
1626     Check URI syntax for space-separated URI attributes.
1627     Support for the |tabindex| attribute. Support
1628     for |datetime| attribute.
1629    
1630     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1631    
1632 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1633     character references). HTML5 revision 1.146 (white space
1634     characters before root start tag). HTML5 revision
1635     1.148 (named character references in attribute values).
1636     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1637    
1638     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1639    
1640     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1641 wakaba 1.49 in <head>).
1642    
1643     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1644    
1645 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1646     in body).
1647    
1648     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1649    
1650     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1651    
1652 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1653     or RCDATA in fragment parsing mode). HTML5 revision
1654     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1655    
1656     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1657    
1658     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1659     for <pre> and <textarea>).
1660    
1661 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1662    
1663 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1664    
1665     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1666    
1667 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1668    
1669     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1670    
1671 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1672     HTML5 revision 917 (conforming bare &).
1673    
1674     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1675    
1676 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1677    
1678     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1679     HTML5 revision 915 (<nobr>).
1680    
1681     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1682    
1683 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1684    
1685     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1686     New attributes.
1687    
1688     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1689    
1690 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1691     in tags).
1692    
1693     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1696    
1697     * HTML.pm.src: |$entity_char| is removed and
1698     requires |Whatpm::_NamedEntityList| instead.
1699     HTML5 revision 898 (refc), except that lack of refc
1700     is parse error.
1701    
1702     * mkentitylist.pl: New script.
1703    
1704     * Makefile (all): |_NamedEntityList.pm| is added.
1705     (_NamedEntityList.pm, Entities.html): New rules.
1706    
1707     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1708    
1709 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1710     were ignored and U+000D immediately following another
1711     U+000D was not converted to U+000A.
1712    
1713     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1716     nodes before appended). Parser was not ready for NULL
1717     parse error and escape flag.
1718    
1719     * NanoDOM.pm (adopt_node): New.
1720    
1721     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1722    
1723 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1724     CDATA and RCDATA). Note that current implementation
1725     is simply repeating what the spec says and it is maybe not
1726     a best way to do it.
1727    
1728     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1729    
1730 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1731     the form element if a descendant element without implied end tag has
1732     still been open).
1733    
1734     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1735    
1736 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1737     space characters non-conforming).
1738    
1739     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1740    
1741 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1742     HTML5 revision 869 (C1 character references).
1743    
1744     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1745    
1746 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1747     a |textarea| is removed).
1748    
1749 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1750    
1751     * NanoDOM.pm (get_attribute_node_ns): New method.
1752    
1753     * ContentChecker.pm: |script| |async| and |defer|
1754     no longer require |src|. |async| MUST NOT be
1755     specified if |defer|. (HTML5 revision 858).
1756    
1757 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1758    
1759     * HTML.pm.src: |<form><form>| went to inifinite loop.
1760    
1761 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1762    
1763 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1764     as a document element) flag on.
1765     (new): Removed.
1766     (check_document): New method.
1767    
1768     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1769    
1770 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1771     were incorrect.
1772    
1773     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1774    
1775 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1776     error position in the |type| option of the error.
1777    
1778     * HTMLTable.pm (form_table): The |$onerror| parameter
1779     is now optional. Some bugs are fixed.
1780    
1781     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1782    
1783 wakaba 1.27 * HTMLTable.pm: New module.
1784    
1785     * ContentChecker.pm (table): Invoke table model error checker.
1786    
1787     * NanoDOM.pm (first_child, get_attribute_ns): New.
1788    
1789 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1790    
1791 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1792     (link/@rel, a/@rel, area/@rel): Use new checker.
1793    
1794     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1795    
1796     * _LinkTypeList.pm: New file.
1797    
1798     * mklinktypelist.pl: New file.
1799    
1800     * .cvsignore: |RelExtensions.html| added.
1801    
1802     * NanoDOM.pm (child_nodes): Returns an empty array
1803     for non-child-containing node types.
1804     (text_content): New attribute.
1805    
1806     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1807    
1808 wakaba 1.25 * IMTChecker.pm: New module.
1809    
1810     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1811     to test parameter value validity.
1812    
1813     * HTML.pm.src ($style_start_tag): Attributes were
1814     discarded.
1815    
1816 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1817    
1818 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1819    
1820     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1821    
1822 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1823     and RFC 3987 are listed (not all testable items are checked yet).
1824    
1825     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1826    
1827 wakaba 1.22 * URIChecker.pm: New module.
1828    
1829 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1832     or associated to some placeholder.
1833     ($ElementDefault): Warn unknown attributes
1834     for unknown elements as "attribute not supported".
1835     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1836     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1837     (|a| attribute checker): Reimplemented.
1838    
1839     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1840    
1841 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1842     ($HTMLAttrChecker): Event handler content attributes
1843     are added.
1844     (link, embed): Required attribute is now checked.
1845     (embed): Unknown local attributes are no longer warned.
1846    
1847     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1848    
1849 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1850     ($HTMLIMTAttrChecker): New checker.
1851     (link@rel, link@href, link@type, style@type,
1852     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1853     object@data, object@type, source@src, source@type, area@alt,
1854     area@shape, area@coords, area@href,
1855     area@ping, area@rel, area@type, script@src,
1856     script@defer, script@async, script@type): Checkers added.
1857    
1858     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1859    
1860 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1861    
1862 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1863    
1864 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1865     attributes. Report an error if |Element.prefix|
1866     is |xmlns|.
1867    
1868     * NanoDOM.pm (prefix): New attribute.
1869    
1870     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1871    
1872 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1873     mode, action for |<iframe>| was missing.
1874    
1875     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1876    
1877 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1878     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1879    
1880     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1881    
1882 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1883     ($HTMLURIAttrChecker): Placeholder.
1884     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1885     New.
1886    
1887     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1888    
1889 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1890     for global attributes, |html|, |base|, |style|, and |meta|.
1891    
1892     * NanoDOM.pm (insert_before): Weaken reference
1893     to the parent node.
1894     (Attr::new): Set |owner_element| attribute.
1895     (namespace_uri, manakai_local_name): New attribute implementations.
1896     (owner_element): New attribute.
1897    
1898     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1899    
1900 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1901     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1902     $Element->{$HTML_NS}->{''}): New.
1903     (check_element): Invoke attrs_checker for each element.
1904    
1905 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1906    
1907 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1908    
1909     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1910    
1911 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1912     element type testings.
1913    
1914     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1915    
1916 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1917     error when an element type is put in the "minus" list
1918     and the element type is not allowed explicitly in the particular
1919     element content model.
1920     (html:a checker): New checker.
1921     (html:details, html:datagrid): New checkers.
1922     (html:legend): New checker.
1923    
1924     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1925    
1926 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1927    
1928     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1929    
1930 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1931     checker.
1932     (html:dd checker): New checker.
1933     (html:q, html:em, html:strong, html:small,
1934     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1935    
1936     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1937    
1938 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1939     from |$ElementDefault->{checker}|.
1940     ($ElementDefault->{checker}): Throw an error that
1941     the element type is not supported by the checker.
1942     ($HTMLMetadataElement): |html:base| was missing.
1943     ($HTMLEmptyChecker): Don't throw an error
1944     for inter-element whitespace nodes.
1945     (html:html checker): Errors were not
1946     thrown even if |html:head| and/or |html:body|
1947     children were missing.
1948     (html:head checker): An error was not
1949     thrown if <meta charset> appered after other
1950     elements.
1951    
1952 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1953    
1954     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1955     and |noscript| elements are implemented.
1956     (new): New method.
1957    
1958 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1959    
1960 wakaba 1.4 * ContentChecker.pm: New module.
1961    
1962     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1963    
1964 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1965     document_element, manakai_local_name,
1966     manakai_element_type_match): New method.
1967    
1968     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1969    
1970     * HTML.pm.src: Replace decimal and hexadecimal numeric
1971     entities in C1 range using Windows-1252 mapping. Bare LF
1972     did not count as new line for error reporting.
1973    
1974 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1975    
1976     * NanoDOM.pm (DOMImplementation): New class.
1977     (append_child): Weaken the |parent_node| reference.
1978     (create_element_ns, Element new): Set the |owner_document|
1979     reference.
1980     (implementation): New attribute.
1981     (owner_document, local_name, namespace_uri): New attributes.
1982    
1983     * HTML.pm.src (parse_string): Line and column numbers
1984     are now provided to error handler.
1985     (!!!parse-error): Short descriptions are added.
1986     (_construct_tree): Split into three methods; support
1987     for innerHTML mode.
1988     (set_inner_html): New method.
1989    
1990 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1991    
1992     * NanoDOM.html: Documentation is added.
1993    
1994     * HTML.pod, ContentType.html: Documentation is revised.
1995    
1996     * .cvsignore: Pod2html temporary files are added.
1997    
1998     * Makefile: Make |NanoDOM.html|.
1999    
2000     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2001    
2002     * HTML.pm.src (parse_string): New method.
2003     (get_inner_html): Renamed from |inner_html|.
2004    
2005     * Makefile: A rule for |HTML.html| is added.
2006    
2007     * HTML.pod: New documentation.
2008    
2009     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2010    
2011     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2012     (clone_node): Attribute nodes were not completely copied.
2013    
2014     * HTML.pm.src: Many bugs are fixed.
2015    
2016     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2017    
2018     * HTML.pm.src: Some typos are fixed.
2019    
2020     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2021    
2022     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2023     are removed.
2024    
2025     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2026    
2027     * HTML-consume-entity.src: Removed.
2028    
2029     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2030    
2031     * HTML.pm.src: Tokenizer's handling on hexadecimal
2032     numeric entities are rewritten.
2033    
2034     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2035    
2036     * HTML.pm.src: Some tokenizer bugs are fixed.
2037    
2038     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2039    
2040     * HTML.pm.src: The tree construction stage is implemented.
2041    
2042     * mkhtmlparser.pl: New macros are added.
2043    
2044     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2045    
2046     * ContentType.pm: A note on bug in the specification
2047     is removed since it's been now fixed.
2048    
2049     * .cvsignore: New file.
2050    
2051     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2052    
2053     * HTML.pm.src, HTML-consume-entity.src: New files.
2054    
2055     * Makefile (HTML.pm): New rule.
2056    
2057     * mkhtmlparser.pl: New script.
2058    
2059     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2060    
2061     * Makefile: New file.
2062    
2063     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2064    
2065     * ContentType.pm: An error in pod is fixed.
2066    
2067     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2068    
2069     * ContentType.pm: Documentation is added.
2070    
2071     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2072    
2073     * ContentType.pm: New Perl module.
2074    
2075     * ChangeLog: New file.
2076    
2077    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24