/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.331 - (hide annotations) (download)
Sun Sep 21 09:45:02 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.330: +5 -0 lines
++ whatpm/t/ChangeLog	21 Sep 2008 09:32:48 -0000
2008-09-21  Wakaba  <wakaba@suika.fam.cx>

	* content-model-2.dat: Test data for |form| |name| are added.

++ whatpm/Whatpm/ChangeLog	21 Sep 2008 09:33:18 -0000
	* ContentChecker.pm: Prepare for |form| |name| attribute's
	duplication checking.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	21 Sep 2008 09:32:26 -0000
2008-09-21  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm: |form| element's |name| attribute is implemented.

1 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
4     duplication checking.
5    
6     2008-09-21 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
9     unsupported charset="" parameter value (HTML5 revision 2131).
10    
11 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
12    
13 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
14     character is fixed (cf. HTML5 revision 1738).
15    
16     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
17     space characters (HTML5 revision 1738).
18    
19     2008-09-20 Wakaba <wakaba@suika.fam.cx>
20    
21 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
22     insertion mode was not updated to swtich to the "in body"
23     insertion mode. U+000B is no longer a space character for the
24     purpose of tree construction phase (HTML5 revision 1738).
25    
26     2008-09-20 Wakaba <wakaba@suika.fam.cx>
27    
28 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
29     revision 1738).
30    
31     2008-09-20 Wakaba <wakaba@suika.fam.cx>
32    
33 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
34     revision 1738).
35    
36     * HTML.pm.src: U+000B is no longer a space character for the
37     algorithm for extracting an encoding from a Content-Type (HTML5
38     revision 1738).
39    
40     2008-09-20 Wakaba <wakaba@suika.fam.cx>
41    
42 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
43    
44 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
45    
46 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
47    
48     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
49     matching.
50    
51     2008-09-18 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
54     typos fixed. Support for Suppress-Script field.
55    
56     * mklangreg.pl: Support for dumping of nested structure.
57    
58     2008-09-18 Wakaba <wakaba@suika.fam.cx>
59    
60 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
61     recommended case as per RFC 4646.
62    
63     2008-09-18 Wakaba <wakaba@suika.fam.cx>
64    
65 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
66    
67     2008-09-18 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.320 * mklangreg.pl: New script.
70    
71     * Makefile: Updated for creation of the module for language subtag
72     registry.
73    
74 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.319 * Makefile: WebIDL.html added.
77    
78     * WebIDL.pod: New documentation.
79    
80     2008-09-16 Wakaba <wakaba@suika.fam.cx>
81    
82 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
83    
84     2008-09-16 Wakaba <wakaba@suika.fam.cx>
85    
86 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
87     compatible parsing bugs are fixed. Some unreachable codes are
88     commented out.
89    
90     2008-09-16 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
93     added. It does not satisfy the definition that a forward
94     interface declaration has an extended attribute. It seems that
95     unless explicitly allowed multiple extended attributes with the
96     same name is not allowed, though it is not explicitly mentioned in
97     the spec.
98    
99     2008-09-16 Wakaba <wakaba@suika.fam.cx>
100    
101 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
102     attribute identifiers. Preserve whether an extended attribute has
103     an argument list of not. Support for extended attributes:
104     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
105     NameGetter, NameSetter, and Null.
106     (has_argument_list): New attribute.
107     (idl_text): Stringifies argument lists, if any, even if it is
108     empty.
109    
110 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
111    
112 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
113     |{s_kwd}| in DATA_STATE as default.
114    
115     2008-09-15 Wakaba <wakaba@suika.fam.cx>
116    
117 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
118     by |{s_kwd}| in DATA_STATE.
119    
120     2008-09-15 Wakaba <wakaba@suika.fam.cx>
121    
122 wakaba 1.312 * HTML.pm.src: Shorten keys.
123    
124     2008-09-15 Wakaba <wakaba@suika.fam.cx>
125    
126 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
127     pair, or noncharacter code points and non-Unicode code
128     points (they should be handled by Whatpm::Charset::UnicodeChecker).
129     (parse_char_stream): Support for the |$get_wrapper| argument and
130     character stream error handlers.
131    
132     2008-09-15 Wakaba <wakaba@suika.fam.cx>
133    
134 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
135     for null-namespace elements/attributes.
136    
137     * HTML.pm.src: Fact out $disallowed_control_chars
138     as a hash.
139    
140 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
141    
142 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
143     and |{next_char}| initializations are moved to initialization
144     method. |{read_until}| now supports buffering. Sync |set_inner_html|
145     with |parse_char_stream|.
146    
147     2008-09-14 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
150     invoke |manakai_read_until|, not only |read|, where
151     possible, to decrease the number of |read| method calls.
152    
153     * mkhtmlparser.pl: Related changes to the aforementioned
154     modification.
155    
156     2008-09-14 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
159     would report character error from now.
160    
161     2008-09-14 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
164     tokens in "before head insertion mode" was not
165     correctly handled.
166     (set_inner_html): Reimplemented using CharString decodehandle
167     class. Support for $get_wrapper argument. Support
168     for |{read_until}| feature.
169    
170     2008-09-14 Wakaba <wakaba@suika.fam.cx>
171    
172 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
173     entities point the "&" character.
174    
175     2008-09-14 Wakaba <wakaba@suika.fam.cx>
176    
177 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
178     be added to the list of excluded characters.
179    
180     2008-09-14 Wakaba <wakaba@suika.fam.cx>
181    
182 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
183     and |column| a higher priority than the one set by the
184     tokenizer's input handler.
185     ($self->{read_until}): Exclude U+FFFD (but this might
186     not be necessary, since now we do line/column fixup in
187     the character decode handle).
188    
189     2008-09-14 Wakaba <wakaba@suika.fam.cx>
190    
191 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
192    
193     2008-09-14 Wakaba <wakaba@suika.fam.cx>
194    
195 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
196     and |manakai_getc_until| to |manakai_read_until| to
197     reduce the number of string copies.
198    
199     2008-09-14 Wakaba <wakaba@suika.fam.cx>
200    
201 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
202     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
203     standard feature to |open| a string as a filehandle,
204     since Perl's string filehandle seems not supporting |ungetc|
205     method correctly.
206     (parse_char_stream): Define |{getc_until}| method.
207     (DATA_STATE): Experimental support for |getc_until| feature.
208    
209 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
210    
211 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
212    
213     2008-09-13 Wakaba <wakaba@suika.fam.cx>
214    
215 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
216     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
217     and replaced by |{prev_state}|.
218    
219     * mkhtmlparser.pl: Remove |{char}| feature.
220     Remove |!!!back-next-input-character;| macro.
221    
222     2008-09-13 Wakaba <wakaba@suika.fam.cx>
223    
224 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
225     entity related tokenizer states in favor of new states
226     implementing the consume character reference algorithm.
227    
228     2008-09-13 Wakaba <wakaba@suika.fam.cx>
229    
230     * HTML.pm.src: "Consume a character reference" algorithm is
231 wakaba 1.296 now implemented as a tokenizer's state, rather than
232     a method, with minimum changes (more changes will
233     be made, in due course). "Bogus comment state"'s inner
234     loop gets removed.
235    
236     2008-09-13 Wakaba <wakaba@suika.fam.cx>
237    
238 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
239 wakaba 1.295 into their own tokenizer states.
240    
241     2008-09-13 Wakaba <wakaba@suika.fam.cx>
242    
243 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
244     is split into three states.
245    
246     2008-09-13 Wakaba <wakaba@suika.fam.cx>
247    
248 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
249     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
250     no longer does the tokenizer have to push back next input
251     characters in those states.
252    
253     2008-09-13 Wakaba <wakaba@suika.fam.cx>
254    
255 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
256     into four states so that no longer does the tokenizer have to push
257     back next input characters in that state.
258    
259 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
260    
261     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
262     which can be used to insert some wrapper between the character
263     stream handle and the tokenizer. (It is currently not supported
264     for |set_inner_html| for |Element|s).
265    
266 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
267    
268 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
269    
270     2008-09-10 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
273    
274     * HTML.pm.src: Don't specify |text| argument for the
275     |chardecode:fallback| error, since it is not the encoding
276     being used alternatively.
277    
278 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
279    
280     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
281    
282 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
283    
284 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
285    
286     2008-08-31 Wakaba <wakaba@suika.fam.cx>
287    
288 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
289     to after after frameset insertion mode processing (HTML5
290     revision 1909). Note that the implementation was wrong
291     per the old spec before the r1909 changes.
292    
293 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
294    
295 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
296     spec (HTML5 revision 2093).
297     ($process_row): Algorithm step numbers synced with the
298     spec (HTML5 revision 2092).
299    
300     2008-08-30 Wakaba <wakaba@suika.fam.cx>
301    
302 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
303     revision 2094).
304    
305     2008-08-30 Wakaba <wakaba@suika.fam.cx>
306    
307     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
308    
309     2008-08-30 Wakaba <wakaba@suika.fam.cx>
310    
311 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
312     name (after another attribute) now raise parse error (HTML5
313     revision 2123). Empty unquoted attribute values are no
314     longer allowed (HTML5 revision 2122).
315    
316     2008-08-30 Wakaba <wakaba@suika.fam.cx>
317    
318 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
319     revision 2130).
320    
321     2008-08-30 Wakaba <wakaba@suika.fam.cx>
322    
323 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
324     as |lang| attribute value for HTML elements (HTML5 revision 2062
325     and so on).
326    
327     2008-08-30 Wakaba <wakaba@suika.fam.cx>
328    
329 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
330     was missing.
331    
332     * URIChecker.pm: The end of the URL should be marked as the
333     error location for an empty path error. The position
334     between the userinfo and the port components should be
335     marked as the error location for an empty host error.
336    
337     2008-08-30 Wakaba <wakaba@suika.fam.cx>
338    
339 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
340     value the error occurs for errors. Report unknown
341     address format error in warning level, since address
342     formats are rarely added. Path segments starting with "/.."
343     were misinterpreted as a dot-segment.
344    
345     2008-08-30 Wakaba <wakaba@suika.fam.cx>
346    
347 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
348     |Message::DOM::DOMImplementation|.
349    
350 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
351    
352     * IMTChecker.pm: Updated for the new error reporting architecture.
353    
354     * ContentChecker.pm: Error levels for IMTs are added.
355    
356 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
357    
358     * H2H.pm (_shift_token): Support for unquoted HTML attribute
359     values.
360    
361 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
362    
363 wakaba 1.275 * CacheManifest.pm: Support for new style of error
364     reports.
365    
366     * HTML.pm.src: Set line=1, column=1 to the document node.
367    
368     2008-08-16 Wakaba <wakaba@suika.fam.cx>
369    
370 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
371     and URL checkers. Support for more error levels for bogus
372     langauge tag and URL "standards".
373    
374     * LangTag.pm, URIChecker.pm: Support for new style error
375     level reporting.
376    
377 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
378    
379 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
380    
381     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
382     specifying. Error types are revised.
383    
384     2008-08-15 Wakaba <wakaba@suika.fam.cx>
385    
386 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
387     renewed.
388    
389     2008-08-15 Wakaba <wakaba@suika.fam.cx>
390    
391 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
392     are revised. Use new style for "level" specification.
393    
394     * mkhtmlparser.pl: Use new style for "level" specification.
395    
396 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
397    
398 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
399     reporting process for broken ignored valuetype definition.
400     (Valuetype idl_text): Support for special "DOMString" name.
401    
402     2008-08-03 Wakaba <wakaba@suika.fam.cx>
403    
404 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
405     terminal of the ScopedName is "DOMString", such that whether
406     the last part of the scoped name is "DOMString" or "_DOMString"
407     later. It is necessary to determine whether a |typedef|
408     definition should be ignored or not.
409     (parse_char_string): Unescape the identifier of
410     exception members.
411     ($resolve): Return undef for builtin types and sequence<T>
412     types (we might not have to do this, however...).
413     (check): Support checking for Exceptions, Valuetypes,
414     and Typedefs.
415     ($serialize_type): Support for "DOMString::::" syntax.
416     (Typedef idl_text): Output Type as "DOMString" if it
417     is really "DOMString" (i.e. its internal representation
418     is "::DOMString::").
419    
420     2008-08-03 Wakaba <wakaba@suika.fam.cx>
421    
422 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
423     for constant types in the |check| method.
424     (check): Support for checking of attributes, operations, and
425     arguments.
426     (Attribute/Operation idl_text): Exception names in getraises,
427     setraises, and raises clauses is serizlied by |$serialize_type|
428     code.
429    
430 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
433     hexadecimal numbers (the original pattern, taken from the spec,
434     was not work for hexadecimal numbers, because the "0" prefix
435     matches to the [0-7]* part (as an empty string) and therefore
436     it does not match with remaining "x..." part of a "0x..." integer
437     literal.
438     ($get_type): It now returns a string, not an array reference,
439     for regular types and |sequence| types (i.e. it in any case
440     returns a string).
441     ($get_next_token): The second item in the array that represents
442     a integer or float token is now a Perl number value, not the
443     original string representation of the number.
444     (check): Support for const value consistency checking.
445     No extended attribute is defined for constants.
446     (Node subclasses): Use simple strings rather than array references
447     for default data type values.
448     ($serialize_type): Type values are now simple strings.
449     (value): If the new attribute value is a false value, then
450     a FALSE value is set to the attribute.
451    
452     2008-08-02 Wakaba <wakaba@suika.fam.cx>
453    
454 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
455     in its stringified format ("scoped name" as defined in the
456     spec). Note that future version of this module should not use
457     array references for type values and the |type_text| attribute
458     should be made obsolete.
459     (parse_char_string): Unescape attribute names.
460     (check): Support for checking of whether inherited interfaces
461     are actually defined or not. Support for checking of whether
462     interface member identifiers are duplicated or not.
463     ($serialize_type): Scoped names are returned as is. A future
464     version of this code should escape identifiers other than "DOMString",
465     otherwise the idl_text would be non-conforming.
466    
467     2008-08-02 Wakaba <wakaba@suika.fam.cx>
468    
469 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
470 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
471     for Definition's were ignored.
472     (append_child): Set |parent_node| attribute.
473     (parent_node): New attribute.
474     (check): Support interface/exception members. Support
475     extended attributes. Support definition identifier uniqueness
476     constraint.
477     (qualified_name): New attribute.
478     (Interface/Exception idl_text): Extended attributes were
479     not prepended to the returned text.
480    
481     2008-08-02 Wakaba <wakaba@suika.fam.cx>
482    
483     * WebIDL.pm (parse_char_string): Set line/column numbers
484 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
485     would make things wrong. Support for interface forward
486     declarations was missing. Broken interface declarations
487     with no block were not ignored entirely.
488     (Whatpm::WebIDL::Node): New abstract class. This class
489     makes things easier.
490     (child_nodes): New attribute. Unlike DOM's attribute with
491     same name, this attribute returns a dead list of nodes for
492     simplicity.
493     (get_user_data, set_user_data): New methods.
494     (Module idl_text): A SPACE character should be inserted
495     before the |{| character.
496     (Interface idl_text): Support for interface forward declarations.
497     (is_forward_declaration): New attribute.
498    
499 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
500    
501 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
502    
503     2008-07-19 Wakaba <wakaba@suika.fam.cx>
504    
505 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
506     it now can handle broken extended attributes and as such.
507    
508     2008-07-19 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
511     Support for extended attributes with arguments.
512    
513     2008-07-19 Wakaba <wakaba@suika.fam.cx>
514    
515 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
516     (Interface->idl_text): Tentative support for inheritances.
517    
518     2008-07-19 Wakaba <wakaba@suika.fam.cx>
519    
520 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
521     Support for raises, setraises, and getraises syntaxes.
522    
523 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
524    
525     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
526     proper support for types, extended attributes, and exceptions yet).
527     WebIDL parser, version 1 (no support for exceptions yet,
528     no proper support for extended attributes yet).
529    
530 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
531    
532     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
533    
534 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
535    
536 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
537    
538     2008-06-29 Wakaba <wakaba@suika.fam.cx>
539    
540 wakaba 1.255 * WebIDL.pm: New module.
541    
542 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
543    
544     * Makefile (Entities.html): URI changed.
545    
546 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
547    
548     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
549    
550 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
551    
552 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
553    
554     2008-06-01 Wakaba <wakaba@suika.fam.cx>
555    
556 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
557     spec (commented out). (HTML5 revision 1711).
558    
559 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
562    
563     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
564     as UTF-16 (HTML5 revision 1701).
565    
566     2008-05-25 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
569     1692).
570    
571     2008-05-25 Wakaba <wakaba@suika.fam.cx>
572    
573     * HTML.pm.src: The secondary insertion mode used when switching
574     to foreign content is the "in body" insertion mode (HTML5 revision
575     1696).
576    
577     2008-05-25 Wakaba <wakaba@suika.fam.cx>
578    
579 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
580     revision 1697).
581    
582 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
583    
584 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
585     insertion mode (HTML5 revision 1693). Update SVG camelCase
586     attribute list (HTML5 revision 1700). <textarea> closes
587     </select> (HTML5 revision 1699). More start tags close in
588     foreign content insertion mode (HTML5 revision 1698).
589    
590     2008-05-24 Wakaba <wakaba@suika.fam.cx>
591    
592 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
593    
594     2008-05-24 Wakaba <wakaba@suika.fam.cx>
595    
596 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
597     revision 1674).
598    
599     2008-05-24 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
602     revision 1676).
603    
604     2008-05-24 Wakaba <wakaba@suika.fam.cx>
605    
606 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
607     quriks mode detection (HTML5 revision 1679).
608    
609     2008-05-24 Wakaba <wakaba@suika.fam.cx>
610    
611 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
612     revision 1685).
613    
614     2008-05-24 Wakaba <wakaba@suika.fam.cx>
615    
616 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
617     revision 1684).
618    
619     2008-05-24 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
622     work (HTML5 revision 1690).
623    
624     2008-05-24 Wakaba <wakaba@suika.fam.cx>
625    
626 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
627     end tags in head insertion modes (HTML5 revision 1686).
628     (parse_char_string): Bug fix for non-utf8 character string handlings.
629     (parse_char_stream): |ungetc| does not work well for this context.
630    
631 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
632    
633 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
634     |parse_byte_stream|.
635     (parse_byte_stream): New method.
636    
637     2008-05-18 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
640     by encoding layer error reporter.
641    
642 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
643    
644 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
645     rather than converting the whole byte string and then parsing.
646     Propagate errors in character encoding layer.
647     (get_next_token): Precise error reporting for |bare stago| error.
648    
649     2008-05-17 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
652     (parse_char_string): This method is now defined as an invocation
653     of the |parse_char_stream| method.
654    
655     2008-05-17 Wakaba <wakaba@suika.fam.cx>
656    
657 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
658     of the sniffing as info-level errors. Support for new
659     decoding framework in parser resestting.
660     (new): Various default error levels were not set.
661    
662     2008-05-17 Wakaba <wakaba@suika.fam.cx>
663    
664 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
665     algorithm, except for the actual sniffing, is implemented
666     with new framework with Message::Charset::Info.
667    
668 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
669    
670     * CacheManifest.pm (_parse): Drop fragment identifiers from
671     URIs in fallback section (HTML5 revision 1596).
672    
673 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.231 * Makefile (Entities.html): URI has changed.
676    
677     2008-05-10 Wakaba <wakaba@suika.fam.cx>
678    
679 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
680     1553).
681    
682 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
683    
684 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
685     no longer table elements, but table objects returned
686     by Whatpm::HTMLTable.
687    
688     * HTMLTable.pm (form_table): Return table element node
689     as |$table->{element}|.
690     (assign_header): Support for the |headers=""| attribute.
691    
692     2008-05-06 Wakaba <wakaba@suika.fam.cx>
693    
694 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
695     with no support for headers="".
696     (form_table): Include table width and height to the returned
697     table object for covenience. Indexing in column assignement was wrong.
698     Set whether a data cell is empty or not for convenience.
699    
700 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
701    
702 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
703    
704     2008-05-05 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
707     changed (HTML5 revision 1387).
708    
709     2008-05-05 Wakaba <wakaba@suika.fam.cx>
710    
711 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
712     to the end of the table (HTML5 revision 1380).
713    
714     2008-05-05 Wakaba <wakaba@suika.fam.cx>
715    
716 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
717     of 1-based (HTML5 revision 1376).
718    
719     2008-05-05 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
722     and "Content-Encoding" no longer prevent sniffing (HTML5
723     revision 1288).
724    
725     2008-05-05 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
728     1282).
729    
730 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
731    
732 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
733     Status of XML specs are added.
734    
735     2008-05-03 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
738     and |ref=""| attribute (referent element type checking).
739    
740 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
741    
742 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
743     1500) is implemented (except for IDNA part and URI-scheme-specific
744 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
745     documents.
746    
747 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
748    
749 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
750     character (HTML5 revision 1263).
751    
752     2008-04-13 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
755     definition for |AElig;| was missing).
756    
757     2008-04-13 Wakaba <wakaba@suika.fam.cx>
758    
759 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
760     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
761    
762     2008-04-13 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
765     insertion mode is added (HTML5 revisions 1412 and 1418).
766    
767     2008-04-13 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
770     elements (HTML5 revision 1410).
771    
772     2008-04-13 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
775     revision 1406).
776    
777     2008-04-13 Wakaba <wakaba@suika.fam.cx>
778    
779 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
780     elements (HTML5 revision 1404 and 1420).
781    
782 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
783    
784 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
785     elements (HTML5 revision 1404). Unused !!!macro definitions
786     are removed.
787    
788     2008-04-12 Wakaba <wakaba@suika.fam.cx>
789    
790 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
791     are raised is changed (HTML5 revision 1404).
792    
793 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
796    
797     2008-04-06 Wakaba <wakaba@suika.fam.cx>
798    
799 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
800     replaced by constants representing element category.
801    
802     2008-04-06 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
805     for error reporting. (This is a preparation for using constant
806     value for ->[1].)
807    
808 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
809    
810 wakaba 1.205 * RDFXML.pm: Typo fixed.
811    
812     2008-03-22 Wakaba <wakaba@suika.fam.cx>
813    
814 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
815     as same as |must_level|, i.e. level = |m|.
816     (check_element): Make list of URIs in the DOM.
817    
818 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
819    
820 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
821     Local (null-namespace) attribute support.
822    
823     2008-03-21 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
826    
827     2008-03-21 Wakaba <wakaba@suika.fam.cx>
828    
829 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
830     and URI attributes is implemented. Warn if unknown
831     value is used in rdf:parseType="" attribute.
832    
833     * URIChecker.pm (check_rdf_uri_reference): New function.
834    
835     2008-03-21 Wakaba <wakaba@suika.fam.cx>
836    
837 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
838     are now resolved.
839    
840     2008-03-21 Wakaba <wakaba@suika.fam.cx>
841    
842 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
843    
844     * RDFXML.pm: undef vs false bug fixed.
845     Reification implemented.
846    
847     2008-03-21 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
850     The |node| arguments are added for |ontriple| calls.
851     Too many "attribute not allowed" errors were raised.
852    
853     * ContentChecker.pm: Initial experimental support for rdf:RDF
854     element.
855    
856     2008-03-21 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.197 * RDFXML.pm: New module.
859    
860 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
861    
862 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
863     code was old one yet.
864    
865     2008-03-20 Wakaba <wakaba@suika.fam.cx>
866    
867 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
868     errors. Line/column markings for DOCTYPE, comment, and
869     character tokens are reintroduced; otherwise,
870     error location for "not HTML5" error and errors
871     for implied elements are not attached.
872    
873     2008-03-20 Wakaba <wakaba@suika.fam.cx>
874    
875 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
876    
877     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
878     (value): Setter implemented.
879    
880     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
881    
882     2008-03-20 Wakaba <wakaba@suika.fam.cx>
883    
884 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
885    
886     2008-03-20 Wakaba <wakaba@suika.fam.cx>
887    
888 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
889     flag, which is no longer used.
890    
891 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
892    
893 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
894    
895     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
896     and !!!insert-element-t): Set line/column information to
897     element nodes.
898    
899     2008-03-17 Wakaba <wakaba@suika.fam.cx>
900    
901 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
902     in "<?", "<>", or "</>" should be the error point.
903    
904 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
907    
908     2008-03-16 Wakaba <wakaba@suika.fam.cx>
909    
910 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
911    
912     2008-03-16 Wakaba <wakaba@suika.fam.cx>
913    
914 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
915     reporting.
916    
917 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.186 * HTML.pm.src: Error type revised.
920    
921     2008-03-11 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
924    
925     2008-03-11 Wakaba <wakaba@suika.fam.cx>
926    
927 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
928    
929 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
930    
931     * mkhtmlparser.pl: Set "level" parameter to parse errors.
932    
933     * HTML.pm.src: Code refined.
934    
935 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
936    
937 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
938     revision 1348). Note that I really don't know this makes
939     any difference in the black-box behavior of the parser.
940    
941     2008-03-09 Wakaba <wakaba@suika.fam.cx>
942    
943 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
944     revision 1348).
945    
946     2008-03-09 Wakaba <wakaba@suika.fam.cx>
947    
948 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
949    
950     2008-03-09 Wakaba <wakaba@suika.fam.cx>
951    
952 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
953    
954     2008-03-09 Wakaba <wakaba@suika.fam.cx>
955    
956 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
957     mode and support for the "in select in table" insertion mode (HTML5
958     revision 1342).
959    
960     2008-03-09 Wakaba <wakaba@suika.fam.cx>
961    
962 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
963     in non-tainted <table>s (HTML5 revision 1336).
964    
965     2008-03-09 Wakaba <wakaba@suika.fam.cx>
966    
967 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
968     and <head> (HTML5 revision 1332).
969    
970     2008-03-09 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
973     white space (HTML5 revision 1331).
974    
975 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
976    
977 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
978     element (HTML5 revision 1330).
979    
980     2008-03-08 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
983     element (HTML5 revision 1328).
984    
985     2008-03-08 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
988     into foster parents (HTML5 revision 1326).
989    
990     2008-03-08 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
993     revision 1321).
994    
995     2008-03-08 Wakaba <wakaba@suika.fam.cx>
996    
997 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
998     the spec (HTML5 revision 1320). Note that we finally complete
999     all of HTML5 revision 1320 changes.
1000    
1001     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1002    
1003 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1004     tags (HTML5 revision 1320).
1005    
1006     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1009    
1010     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1011    
1012 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1013     could not be reached so that it is now removed (HTML5 revision
1014     1320).
1015    
1016     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1017    
1018 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1019     to align with new spec text (HTML5 revision 1320).
1020    
1021     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1024     have had no harm in theory.
1025    
1026     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1027    
1028 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1029     case is split from other end tags for the preparation
1030     of implementing HTML revision 1320.
1031    
1032 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1033    
1034 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1035     revision 1320).
1036    
1037     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1038    
1039 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1040     and "div" (HTML5 revision 1318). Add comments to where
1041     |form| pointer association codes should be inserted (HTML5 revision
1042     1319).
1043    
1044 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1045    
1046     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1047    
1048 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1049    
1050 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1051     other token type, there are), it is replaced by a |die| statement.
1052    
1053     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1054    
1055 wakaba 1.159 * HTML.pm.src: Typo fixed.
1056    
1057 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1058    
1059     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1060     doctypes were not uppercased for comparison.
1061    
1062 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1063    
1064 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1065     are added.
1066    
1067     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1070     are set. Cases that are unlikely reached are noted as so.
1071    
1072     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1073    
1074 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1075    
1076     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1077    
1078     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1079    
1080 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1081     for simplicity.
1082    
1083     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1084    
1085 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1086     revision 1307).
1087    
1088     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1089    
1090 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1091     is set are changed (HTML5 revision 1305).
1092    
1093 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1094    
1095 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1096     1292). Entities are not parsed in comment-like part in RCDATA
1097     elements (HTML5 revision 1294). Allow bare & at the end
1098     of attribute value literals (HTML5 revision 1296). More
1099     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1100     between attributes and ban attribute names or unquoted
1101     attribute values containing single or double quotes (HTML5
1102     revision 1303).
1103    
1104     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1105    
1106 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1107     and related errors unless it is an HTML document (though the spec
1108     is unclear on whether it is applied to XHTML document).
1109    
1110     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1111     are added.
1112    
1113     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1116    
1117 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1118    
1119     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1120    
1121     * HTML.pm.src: |charset| in |content| attribute is
1122     case-insensitive (HTML5 revision 1270).
1123    
1124 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1125    
1126     * ContentChecker.pm: New status constants are added.
1127     ($ElementDefault): |status| added.
1128     (check_element): Err for non-standard or deprecated elements.
1129     (_attr_status_info): For non-standard or deprecated attributes.
1130    
1131 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1132    
1133 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1134    
1135     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1136    
1137 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1138     status information is now dispatched.
1139    
1140     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1141    
1142 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1143     problem by adding some more arguments.
1144    
1145     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1146    
1147 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1148     container-for-the-purpose-of-content-model element (not transparent
1149     element) for |check_child_element| calling and significant
1150     text flag marking. This reintroduces |<del>|-and-significant
1151     problem again.
1152    
1153     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1154    
1155 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1156     elements ignored for the purpose of phase changes in
1157     content model checking.
1158    
1159 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1162     was not properly managed for transparent cases.
1163    
1164     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1167     and |audio| as semi-transparent elements.
1168    
1169     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1172     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1173     Support for |html:object| as a semi-transparent element.
1174    
1175     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1178     the tree is entirely revised to make it easier to track
1179     the state of ancestors/descendants. As a result of this
1180     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1181     support for content model checking for HTML elements |figure|,
1182     |object|, |video|, and |audio| and checking for XML elements (and
1183     some XMLNS checkings) are dropped for now. They will be
1184     reimplemented in due cource.
1185    
1186 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1189     token (HTML5 revision 1225).
1190    
1191     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1194    
1195     * HTML.pm.src: Save whether |meta| |content| attribute
1196     contains character references or not.
1197    
1198     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1199    
1200 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1201     imlement significant content checking for contents
1202     with |del| elements.
1203    
1204     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1205    
1206 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1207     elements are added.
1208     (_get_children): HTML |object| elements are now semi-transparent.
1209    
1210     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1211    
1212 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1213    
1214     * CacheManifest.pm: HTML5 revision 1211 implemented.
1215    
1216     * CacheManifest.pod: Updated.
1217    
1218 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1219    
1220     * ContentChecker.pm (check_document, check_element): Support
1221     for second argument ($onsubdoc).
1222     (_get_css_parser): Removed (now it is part of WDCC).
1223    
1224 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1225    
1226     * ContentChecker.pm (_get_css_parser): New.
1227    
1228 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1229    
1230 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1231     for checking had been used.
1232    
1233     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1234    
1235 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1236    
1237 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1238    
1239 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1240     specced in RFC 2046 and RFC 4288.
1241     (application/atom+xml): Definition added.
1242    
1243     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1244    
1245 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1246     warnings (rather than SHOULD-level errors).
1247    
1248     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1249    
1250 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1251    
1252     * HTML.pm.src: A flag for character references in attribute
1253     values are added. Set |manakai_has_reference| user data
1254     to |charset| attribute.
1255    
1256     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1257    
1258 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1259     attributes.
1260    
1261     * ContentChecker.pm (check_document): Warn if charset requirements
1262     cannot be tested.
1263    
1264 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1265    
1266     * HTML.pm.src (parse_byte_string): Detect charset
1267     by universalchardet if charset parameter is not specified.
1268    
1269     * Makefile (Charset-all, Charset-clean): New rules.
1270    
1271 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1272    
1273 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1274     of character encoding declaration and selection of encoding
1275     for HTML document.
1276    
1277     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1278    
1279 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1280     type in list context.
1281    
1282     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1283    
1284 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1285     ignoring (HTML5 revisions 1013 and 1016).
1286    
1287     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1288    
1289 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1290     white spaces, are implemented.
1291    
1292 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1295     attribute if possible.
1296    
1297     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1298    
1299 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1300     (parse_char_string): New alias for |parse_string|.
1301     (main phase): Invoking "change the encoding" algorithm if desired.
1302    
1303     * HTML.pod: Updated.
1304    
1305     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1306    
1307 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1308    
1309     * Makefile (HTML-all, HTML-clean): New.
1310    
1311     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1312    
1313     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1314    
1315 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1316    
1317     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1318    
1319 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1322    
1323     * CacheManifest.pod: New file.
1324    
1325     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1326    
1327 wakaba 1.114 * CacheManifest.pm: New module.
1328    
1329     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1330    
1331 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1332     callback.
1333    
1334     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1337    
1338 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1339    
1340 wakaba 1.111 * Makefile (clean): New rule.
1341    
1342     * NanoDOM.pm (public_id, system_id): New attributes.
1343    
1344     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1345    
1346 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1347    
1348 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1349    
1350     * ContentChecker.pm (check_document): Support for
1351     new |is_xml_root| flag.
1352     (check_element): Support for new |pluses| state.
1353     (_add_pluses): New method.
1354     (_remove_minuses): Support for new |minus| item.
1355    
1356 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1357    
1358 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1359     root element.
1360    
1361     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1364    
1365 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1368    
1369     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1370    
1371 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1372    
1373     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1374    
1375 wakaba 1.103 * LangTag.pm: New module.
1376    
1377 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1378    
1379     * HTML.pm.src: Some error types were wrong.
1380    
1381 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1382    
1383 wakaba 1.101 * CSS/: New directory.
1384    
1385     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1386    
1387 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1388     for |noscript| in |head|.
1389    
1390 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1391    
1392     * URI/: New directory.
1393    
1394 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1395    
1396 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1397     number.
1398    
1399     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1400    
1401 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1402     by |&|s.
1403    
1404     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1405    
1406 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1407    
1408     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1409    
1410 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1411    
1412     * HTML.pm.src: Insertion modes are now represented in number.
1413    
1414     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1415    
1416 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1417     999) is implemented.
1418    
1419 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1420    
1421     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1422     983) is implemented.
1423    
1424 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1425    
1426     * ContentChecker.pod: New documentation.
1427    
1428     * Makefile: A rule for |ContentChecker.html| is added.
1429    
1430     * ContentChecker.pm: A pod "LICENSE" section is added.
1431    
1432     * NanoDOM.pm ($VERSION): New variable.
1433    
1434 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1437     list of allowed HTML elements.
1438    
1439     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1442     HTML elements.
1443    
1444     * URIChecker.pm (check_iri): New.
1445     (check_iri_reference): Error type for IRI reference
1446     syntax error is changed.
1447    
1448 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1451     is added.
1452     (check_document): Load appropriate module before validation.
1453    
1454     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1455    
1456 wakaba 1.88 * ContentChecker/: New directory.
1457    
1458     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1459    
1460 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1461    
1462     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1463    
1464 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1465    
1466 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1467     even if no document element is found.
1468    
1469     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1470    
1471 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1472    
1473     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1474    
1475 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1476    
1477     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1478    
1479 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1480     is split into "after html body" and "after html frameset"
1481     insertion modes. Their codes are merged with "after body"
1482     and "after frameset" codes. |$previous_insertion_mode|
1483     has been removed. "after frameset" code is
1484     merged with "in frameset" code.
1485    
1486     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1487    
1488 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1489     merged with the "in head" insertion mode.
1490    
1491     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492    
1493 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1494     reorganized.
1495    
1496     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1497    
1498 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1499     are merged.
1500    
1501     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1502    
1503 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1504     with "in table" insertion mode.
1505    
1506     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1507    
1508 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1509     modes are merged.
1510    
1511     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1512    
1513 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1514     insertion mode!
1515    
1516     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1517    
1518 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1519     now directly invoke the handler.
1520    
1521     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1522    
1523 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1524     is merged to the "in body" insertion mode code.
1525    
1526     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1529     insertion modes are merged.
1530    
1531     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1532    
1533 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1534    
1535     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1536    
1537 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1538     instead of string constant for |{content_model_flag}|.
1539    
1540     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1541    
1542 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1543     to the "duplicate attribute" error.
1544    
1545 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1546    
1547 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1548    
1549     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1550    
1551 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1552    
1553     * HTML.pm.src: A typo is fixed.
1554    
1555 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1556    
1557 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1558     from "in HTML:xml:lang" error. Character position
1559     is now the last part of the error type in the URI error
1560     description. Report "unsupported" status for language
1561     tags, media queries, script codes, and style sheets.
1562    
1563     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1564    
1565 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1566     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1567     in XML.
1568    
1569     * NanoDOM.pm (Attr.owner_document): New attribute.
1570    
1571     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.66 * HTML.pm.src: The character immediately following
1574     a bare |hcro| was discarded. Fix handling of
1575     entity references in attribute values.
1576    
1577     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1578    
1579 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1580     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1581     factored out. Error types |in html:#DOCTYPE|
1582     and |after html:#DOCTYPE| are merged
1583     into |DOCTYPE in the middle|. |</frameset>|
1584     in fragment parsing mode changed the
1585     insertion mode.
1586    
1587     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1588    
1589 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1590     phase is now an insertion mode. Treatments for white
1591     space character tokens were incorrect for some
1592     insertion modes. An old |meta| case was not removed.
1593    
1594     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1595    
1596 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1597     implemented (but changing the encoding is not yet:-).
1598    
1599 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1600    
1601 wakaba 1.62 * Charset/: New directory.
1602    
1603     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1604    
1605 wakaba 1.61 * H2H.pm: New Perl module (created from
1606     manakai's H2H.dis).
1607    
1608     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1611     manakai's SimpleLS.dis).
1612    
1613 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1614    
1615     * HTML.pm.src: |<!---x-->| was not processed correctly.
1616    
1617 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1618    
1619 wakaba 1.58 * HTML.pm.src: Report correct error message
1620     for |<body></div></body>|.
1621    
1622     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1623    
1624 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1625    
1626     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1627    
1628     * ContentChecker.pm: Return |{term}| list.
1629    
1630 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1631    
1632 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1633     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1634     and <title> outside of head).
1635    
1636     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1637    
1638 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1639     and private types/subtypes.
1640    
1641     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1642     URIChecker.pm, HTMLTable.pm: Error messages are now
1643     consistent; they are all listed in
1644     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1645    
1646 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1647    
1648 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1649     is now erred. |<datalist>| is implemented.
1650     Attribute checker for |<command>| and |<menu>| are
1651     added. Support for |contextmenu| global attribute
1652     is added.
1653    
1654     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1655    
1656 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1657     of Step 3 has been changed.
1658    
1659     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1660    
1661 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1662    
1663 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1664    
1665 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1666     incorrect.
1667    
1668     * ContentChecker.pm: Support for the |footer| element.
1669     Check URI syntax for space-separated URI attributes.
1670     Support for the |tabindex| attribute. Support
1671     for |datetime| attribute.
1672    
1673     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1674    
1675 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1676     character references). HTML5 revision 1.146 (white space
1677     characters before root start tag). HTML5 revision
1678     1.148 (named character references in attribute values).
1679     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1680    
1681     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1682    
1683     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1684 wakaba 1.49 in <head>).
1685    
1686     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1687    
1688 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1689     in body).
1690    
1691     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1692    
1693     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1696     or RCDATA in fragment parsing mode). HTML5 revision
1697     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1698    
1699     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1700    
1701     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1702     for <pre> and <textarea>).
1703    
1704 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1705    
1706 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1707    
1708     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1709    
1710 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1711    
1712     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1713    
1714 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1715     HTML5 revision 917 (conforming bare &).
1716    
1717     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1718    
1719 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1720    
1721     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1722     HTML5 revision 915 (<nobr>).
1723    
1724     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1727    
1728     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1729     New attributes.
1730    
1731     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1732    
1733 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1734     in tags).
1735    
1736     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1739    
1740     * HTML.pm.src: |$entity_char| is removed and
1741     requires |Whatpm::_NamedEntityList| instead.
1742     HTML5 revision 898 (refc), except that lack of refc
1743     is parse error.
1744    
1745     * mkentitylist.pl: New script.
1746    
1747     * Makefile (all): |_NamedEntityList.pm| is added.
1748     (_NamedEntityList.pm, Entities.html): New rules.
1749    
1750     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1751    
1752 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1753     were ignored and U+000D immediately following another
1754     U+000D was not converted to U+000A.
1755    
1756     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1757    
1758 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1759     nodes before appended). Parser was not ready for NULL
1760     parse error and escape flag.
1761    
1762     * NanoDOM.pm (adopt_node): New.
1763    
1764     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1765    
1766 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1767     CDATA and RCDATA). Note that current implementation
1768     is simply repeating what the spec says and it is maybe not
1769     a best way to do it.
1770    
1771     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1772    
1773 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1774     the form element if a descendant element without implied end tag has
1775     still been open).
1776    
1777     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1778    
1779 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1780     space characters non-conforming).
1781    
1782     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1783    
1784 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1785     HTML5 revision 869 (C1 character references).
1786    
1787     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1788    
1789 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1790     a |textarea| is removed).
1791    
1792 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1793    
1794     * NanoDOM.pm (get_attribute_node_ns): New method.
1795    
1796     * ContentChecker.pm: |script| |async| and |defer|
1797     no longer require |src|. |async| MUST NOT be
1798     specified if |defer|. (HTML5 revision 858).
1799    
1800 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1801    
1802     * HTML.pm.src: |<form><form>| went to inifinite loop.
1803    
1804 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1805    
1806 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1807     as a document element) flag on.
1808     (new): Removed.
1809     (check_document): New method.
1810    
1811     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1812    
1813 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1814     were incorrect.
1815    
1816     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1817    
1818 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1819     error position in the |type| option of the error.
1820    
1821     * HTMLTable.pm (form_table): The |$onerror| parameter
1822     is now optional. Some bugs are fixed.
1823    
1824     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1825    
1826 wakaba 1.27 * HTMLTable.pm: New module.
1827    
1828     * ContentChecker.pm (table): Invoke table model error checker.
1829    
1830     * NanoDOM.pm (first_child, get_attribute_ns): New.
1831    
1832 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1833    
1834 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1835     (link/@rel, a/@rel, area/@rel): Use new checker.
1836    
1837     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1838    
1839     * _LinkTypeList.pm: New file.
1840    
1841     * mklinktypelist.pl: New file.
1842    
1843     * .cvsignore: |RelExtensions.html| added.
1844    
1845     * NanoDOM.pm (child_nodes): Returns an empty array
1846     for non-child-containing node types.
1847     (text_content): New attribute.
1848    
1849     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1850    
1851 wakaba 1.25 * IMTChecker.pm: New module.
1852    
1853     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1854     to test parameter value validity.
1855    
1856     * HTML.pm.src ($style_start_tag): Attributes were
1857     discarded.
1858    
1859 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1860    
1861 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1862    
1863     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1864    
1865 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1866     and RFC 3987 are listed (not all testable items are checked yet).
1867    
1868     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1869    
1870 wakaba 1.22 * URIChecker.pm: New module.
1871    
1872 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1873    
1874 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1875     or associated to some placeholder.
1876     ($ElementDefault): Warn unknown attributes
1877     for unknown elements as "attribute not supported".
1878     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1879     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1880     (|a| attribute checker): Reimplemented.
1881    
1882     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1883    
1884 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1885     ($HTMLAttrChecker): Event handler content attributes
1886     are added.
1887     (link, embed): Required attribute is now checked.
1888     (embed): Unknown local attributes are no longer warned.
1889    
1890     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1891    
1892 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1893     ($HTMLIMTAttrChecker): New checker.
1894     (link@rel, link@href, link@type, style@type,
1895     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1896     object@data, object@type, source@src, source@type, area@alt,
1897     area@shape, area@coords, area@href,
1898     area@ping, area@rel, area@type, script@src,
1899     script@defer, script@async, script@type): Checkers added.
1900    
1901     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1902    
1903 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1904    
1905 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1906    
1907 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1908     attributes. Report an error if |Element.prefix|
1909     is |xmlns|.
1910    
1911     * NanoDOM.pm (prefix): New attribute.
1912    
1913     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1914    
1915 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1916     mode, action for |<iframe>| was missing.
1917    
1918     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1919    
1920 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1921     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1922    
1923     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924    
1925 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1926     ($HTMLURIAttrChecker): Placeholder.
1927     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1928     New.
1929    
1930     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1931    
1932 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1933     for global attributes, |html|, |base|, |style|, and |meta|.
1934    
1935     * NanoDOM.pm (insert_before): Weaken reference
1936     to the parent node.
1937     (Attr::new): Set |owner_element| attribute.
1938     (namespace_uri, manakai_local_name): New attribute implementations.
1939     (owner_element): New attribute.
1940    
1941     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1942    
1943 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1944     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1945     $Element->{$HTML_NS}->{''}): New.
1946     (check_element): Invoke attrs_checker for each element.
1947    
1948 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1949    
1950 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1951    
1952     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1953    
1954 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1955     element type testings.
1956    
1957     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1958    
1959 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1960     error when an element type is put in the "minus" list
1961     and the element type is not allowed explicitly in the particular
1962     element content model.
1963     (html:a checker): New checker.
1964     (html:details, html:datagrid): New checkers.
1965     (html:legend): New checker.
1966    
1967     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1968    
1969 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1970    
1971     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1972    
1973 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1974     checker.
1975     (html:dd checker): New checker.
1976     (html:q, html:em, html:strong, html:small,
1977     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1978    
1979     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1980    
1981 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1982     from |$ElementDefault->{checker}|.
1983     ($ElementDefault->{checker}): Throw an error that
1984     the element type is not supported by the checker.
1985     ($HTMLMetadataElement): |html:base| was missing.
1986     ($HTMLEmptyChecker): Don't throw an error
1987     for inter-element whitespace nodes.
1988     (html:html checker): Errors were not
1989     thrown even if |html:head| and/or |html:body|
1990     children were missing.
1991     (html:head checker): An error was not
1992     thrown if <meta charset> appered after other
1993     elements.
1994    
1995 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1996    
1997     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1998     and |noscript| elements are implemented.
1999     (new): New method.
2000    
2001 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2002    
2003 wakaba 1.4 * ContentChecker.pm: New module.
2004    
2005     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2006    
2007 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2008     document_element, manakai_local_name,
2009     manakai_element_type_match): New method.
2010    
2011     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2012    
2013     * HTML.pm.src: Replace decimal and hexadecimal numeric
2014     entities in C1 range using Windows-1252 mapping. Bare LF
2015     did not count as new line for error reporting.
2016    
2017 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2018    
2019     * NanoDOM.pm (DOMImplementation): New class.
2020     (append_child): Weaken the |parent_node| reference.
2021     (create_element_ns, Element new): Set the |owner_document|
2022     reference.
2023     (implementation): New attribute.
2024     (owner_document, local_name, namespace_uri): New attributes.
2025    
2026     * HTML.pm.src (parse_string): Line and column numbers
2027     are now provided to error handler.
2028     (!!!parse-error): Short descriptions are added.
2029     (_construct_tree): Split into three methods; support
2030     for innerHTML mode.
2031     (set_inner_html): New method.
2032    
2033 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2034    
2035     * NanoDOM.html: Documentation is added.
2036    
2037     * HTML.pod, ContentType.html: Documentation is revised.
2038    
2039     * .cvsignore: Pod2html temporary files are added.
2040    
2041     * Makefile: Make |NanoDOM.html|.
2042    
2043     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2044    
2045     * HTML.pm.src (parse_string): New method.
2046     (get_inner_html): Renamed from |inner_html|.
2047    
2048     * Makefile: A rule for |HTML.html| is added.
2049    
2050     * HTML.pod: New documentation.
2051    
2052     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2053    
2054     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2055     (clone_node): Attribute nodes were not completely copied.
2056    
2057     * HTML.pm.src: Many bugs are fixed.
2058    
2059     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2060    
2061     * HTML.pm.src: Some typos are fixed.
2062    
2063     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2064    
2065     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2066     are removed.
2067    
2068     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2069    
2070     * HTML-consume-entity.src: Removed.
2071    
2072     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2073    
2074     * HTML.pm.src: Tokenizer's handling on hexadecimal
2075     numeric entities are rewritten.
2076    
2077     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2078    
2079     * HTML.pm.src: Some tokenizer bugs are fixed.
2080    
2081     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2082    
2083     * HTML.pm.src: The tree construction stage is implemented.
2084    
2085     * mkhtmlparser.pl: New macros are added.
2086    
2087     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2088    
2089     * ContentType.pm: A note on bug in the specification
2090     is removed since it's been now fixed.
2091    
2092     * .cvsignore: New file.
2093    
2094     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2095    
2096     * HTML.pm.src, HTML-consume-entity.src: New files.
2097    
2098     * Makefile (HTML.pm): New rule.
2099    
2100     * mkhtmlparser.pl: New script.
2101    
2102     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2103    
2104     * Makefile: New file.
2105    
2106     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2107    
2108     * ContentType.pm: An error in pod is fixed.
2109    
2110     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2111    
2112     * ContentType.pm: Documentation is added.
2113    
2114     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2115    
2116     * ContentType.pm: New Perl module.
2117    
2118     * ChangeLog: New file.
2119    
2120    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24