/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.328 - (hide annotations) (download)
Sat Sep 20 10:20:47 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.327: +7 -0 lines
++ whatpm/t/ChangeLog	20 Sep 2008 10:20:39 -0000
	* tokenizer-test-2.dat: Test result updated (cf. HTML5 revision
	1738).

	* tree-test-1.dat: Test results updated.  New test data for the
	treatement of U+000B are added (HTML5 revision 1738).

2008-09-20  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	20 Sep 2008 10:19:37 -0000
	* HTML.pm.src: The "anything else" case for the "after after body"
	insertion mode was not updated to swtich to the "in body"
	insertion mode.  U+000B is no longer a space character for the
	purpose of tree construction phase (HTML5 revision 1738).

2008-09-20  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
4     insertion mode was not updated to swtich to the "in body"
5     insertion mode. U+000B is no longer a space character for the
6     purpose of tree construction phase (HTML5 revision 1738).
7    
8     2008-09-20 Wakaba <wakaba@suika.fam.cx>
9    
10 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
11     revision 1738).
12    
13     2008-09-20 Wakaba <wakaba@suika.fam.cx>
14    
15 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
16     revision 1738).
17    
18     * HTML.pm.src: U+000B is no longer a space character for the
19     algorithm for extracting an encoding from a Content-Type (HTML5
20     revision 1738).
21    
22     2008-09-20 Wakaba <wakaba@suika.fam.cx>
23    
24 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
25    
26 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
27    
28 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
29    
30     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
31     matching.
32    
33     2008-09-18 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
36     typos fixed. Support for Suppress-Script field.
37    
38     * mklangreg.pl: Support for dumping of nested structure.
39    
40     2008-09-18 Wakaba <wakaba@suika.fam.cx>
41    
42 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
43     recommended case as per RFC 4646.
44    
45     2008-09-18 Wakaba <wakaba@suika.fam.cx>
46    
47 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
48    
49     2008-09-18 Wakaba <wakaba@suika.fam.cx>
50    
51 wakaba 1.320 * mklangreg.pl: New script.
52    
53     * Makefile: Updated for creation of the module for language subtag
54     registry.
55    
56 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.319 * Makefile: WebIDL.html added.
59    
60     * WebIDL.pod: New documentation.
61    
62     2008-09-16 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
65    
66     2008-09-16 Wakaba <wakaba@suika.fam.cx>
67    
68 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
69     compatible parsing bugs are fixed. Some unreachable codes are
70     commented out.
71    
72     2008-09-16 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
75     added. It does not satisfy the definition that a forward
76     interface declaration has an extended attribute. It seems that
77     unless explicitly allowed multiple extended attributes with the
78     same name is not allowed, though it is not explicitly mentioned in
79     the spec.
80    
81     2008-09-16 Wakaba <wakaba@suika.fam.cx>
82    
83 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
84     attribute identifiers. Preserve whether an extended attribute has
85     an argument list of not. Support for extended attributes:
86     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
87     NameGetter, NameSetter, and Null.
88     (has_argument_list): New attribute.
89     (idl_text): Stringifies argument lists, if any, even if it is
90     empty.
91    
92 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
93    
94 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
95     |{s_kwd}| in DATA_STATE as default.
96    
97     2008-09-15 Wakaba <wakaba@suika.fam.cx>
98    
99 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
100     by |{s_kwd}| in DATA_STATE.
101    
102     2008-09-15 Wakaba <wakaba@suika.fam.cx>
103    
104 wakaba 1.312 * HTML.pm.src: Shorten keys.
105    
106     2008-09-15 Wakaba <wakaba@suika.fam.cx>
107    
108 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
109     pair, or noncharacter code points and non-Unicode code
110     points (they should be handled by Whatpm::Charset::UnicodeChecker).
111     (parse_char_stream): Support for the |$get_wrapper| argument and
112     character stream error handlers.
113    
114     2008-09-15 Wakaba <wakaba@suika.fam.cx>
115    
116 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
117     for null-namespace elements/attributes.
118    
119     * HTML.pm.src: Fact out $disallowed_control_chars
120     as a hash.
121    
122 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
123    
124 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
125     and |{next_char}| initializations are moved to initialization
126     method. |{read_until}| now supports buffering. Sync |set_inner_html|
127     with |parse_char_stream|.
128    
129     2008-09-14 Wakaba <wakaba@suika.fam.cx>
130    
131 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
132     invoke |manakai_read_until|, not only |read|, where
133     possible, to decrease the number of |read| method calls.
134    
135     * mkhtmlparser.pl: Related changes to the aforementioned
136     modification.
137    
138     2008-09-14 Wakaba <wakaba@suika.fam.cx>
139    
140 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
141     would report character error from now.
142    
143     2008-09-14 Wakaba <wakaba@suika.fam.cx>
144    
145 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
146     tokens in "before head insertion mode" was not
147     correctly handled.
148     (set_inner_html): Reimplemented using CharString decodehandle
149     class. Support for $get_wrapper argument. Support
150     for |{read_until}| feature.
151    
152     2008-09-14 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
155     entities point the "&" character.
156    
157     2008-09-14 Wakaba <wakaba@suika.fam.cx>
158    
159 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
160     be added to the list of excluded characters.
161    
162     2008-09-14 Wakaba <wakaba@suika.fam.cx>
163    
164 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
165     and |column| a higher priority than the one set by the
166     tokenizer's input handler.
167     ($self->{read_until}): Exclude U+FFFD (but this might
168     not be necessary, since now we do line/column fixup in
169     the character decode handle).
170    
171     2008-09-14 Wakaba <wakaba@suika.fam.cx>
172    
173 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
174    
175     2008-09-14 Wakaba <wakaba@suika.fam.cx>
176    
177 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
178     and |manakai_getc_until| to |manakai_read_until| to
179     reduce the number of string copies.
180    
181     2008-09-14 Wakaba <wakaba@suika.fam.cx>
182    
183 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
184     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
185     standard feature to |open| a string as a filehandle,
186     since Perl's string filehandle seems not supporting |ungetc|
187     method correctly.
188     (parse_char_stream): Define |{getc_until}| method.
189     (DATA_STATE): Experimental support for |getc_until| feature.
190    
191 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
192    
193 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
194    
195     2008-09-13 Wakaba <wakaba@suika.fam.cx>
196    
197 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
198     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
199     and replaced by |{prev_state}|.
200    
201     * mkhtmlparser.pl: Remove |{char}| feature.
202     Remove |!!!back-next-input-character;| macro.
203    
204     2008-09-13 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
207     entity related tokenizer states in favor of new states
208     implementing the consume character reference algorithm.
209    
210     2008-09-13 Wakaba <wakaba@suika.fam.cx>
211    
212     * HTML.pm.src: "Consume a character reference" algorithm is
213 wakaba 1.296 now implemented as a tokenizer's state, rather than
214     a method, with minimum changes (more changes will
215     be made, in due course). "Bogus comment state"'s inner
216     loop gets removed.
217    
218     2008-09-13 Wakaba <wakaba@suika.fam.cx>
219    
220 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
221 wakaba 1.295 into their own tokenizer states.
222    
223     2008-09-13 Wakaba <wakaba@suika.fam.cx>
224    
225 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
226     is split into three states.
227    
228     2008-09-13 Wakaba <wakaba@suika.fam.cx>
229    
230 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
231     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
232     no longer does the tokenizer have to push back next input
233     characters in those states.
234    
235     2008-09-13 Wakaba <wakaba@suika.fam.cx>
236    
237 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
238     into four states so that no longer does the tokenizer have to push
239     back next input characters in that state.
240    
241 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
242    
243     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
244     which can be used to insert some wrapper between the character
245     stream handle and the tokenizer. (It is currently not supported
246     for |set_inner_html| for |Element|s).
247    
248 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
249    
250 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
251    
252     2008-09-10 Wakaba <wakaba@suika.fam.cx>
253    
254 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
255    
256     * HTML.pm.src: Don't specify |text| argument for the
257     |chardecode:fallback| error, since it is not the encoding
258     being used alternatively.
259    
260 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
261    
262     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
263    
264 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
265    
266 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
267    
268     2008-08-31 Wakaba <wakaba@suika.fam.cx>
269    
270 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
271     to after after frameset insertion mode processing (HTML5
272     revision 1909). Note that the implementation was wrong
273     per the old spec before the r1909 changes.
274    
275 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
276    
277 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
278     spec (HTML5 revision 2093).
279     ($process_row): Algorithm step numbers synced with the
280     spec (HTML5 revision 2092).
281    
282     2008-08-30 Wakaba <wakaba@suika.fam.cx>
283    
284 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
285     revision 2094).
286    
287     2008-08-30 Wakaba <wakaba@suika.fam.cx>
288    
289     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
290    
291     2008-08-30 Wakaba <wakaba@suika.fam.cx>
292    
293 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
294     name (after another attribute) now raise parse error (HTML5
295     revision 2123). Empty unquoted attribute values are no
296     longer allowed (HTML5 revision 2122).
297    
298     2008-08-30 Wakaba <wakaba@suika.fam.cx>
299    
300 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
301     revision 2130).
302    
303     2008-08-30 Wakaba <wakaba@suika.fam.cx>
304    
305 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
306     as |lang| attribute value for HTML elements (HTML5 revision 2062
307     and so on).
308    
309     2008-08-30 Wakaba <wakaba@suika.fam.cx>
310    
311 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
312     was missing.
313    
314     * URIChecker.pm: The end of the URL should be marked as the
315     error location for an empty path error. The position
316     between the userinfo and the port components should be
317     marked as the error location for an empty host error.
318    
319     2008-08-30 Wakaba <wakaba@suika.fam.cx>
320    
321 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
322     value the error occurs for errors. Report unknown
323     address format error in warning level, since address
324     formats are rarely added. Path segments starting with "/.."
325     were misinterpreted as a dot-segment.
326    
327     2008-08-30 Wakaba <wakaba@suika.fam.cx>
328    
329 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
330     |Message::DOM::DOMImplementation|.
331    
332 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
333    
334     * IMTChecker.pm: Updated for the new error reporting architecture.
335    
336     * ContentChecker.pm: Error levels for IMTs are added.
337    
338 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
339    
340     * H2H.pm (_shift_token): Support for unquoted HTML attribute
341     values.
342    
343 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
344    
345 wakaba 1.275 * CacheManifest.pm: Support for new style of error
346     reports.
347    
348     * HTML.pm.src: Set line=1, column=1 to the document node.
349    
350     2008-08-16 Wakaba <wakaba@suika.fam.cx>
351    
352 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
353     and URL checkers. Support for more error levels for bogus
354     langauge tag and URL "standards".
355    
356     * LangTag.pm, URIChecker.pm: Support for new style error
357     level reporting.
358    
359 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
360    
361 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
362    
363     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
364     specifying. Error types are revised.
365    
366     2008-08-15 Wakaba <wakaba@suika.fam.cx>
367    
368 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
369     renewed.
370    
371     2008-08-15 Wakaba <wakaba@suika.fam.cx>
372    
373 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
374     are revised. Use new style for "level" specification.
375    
376     * mkhtmlparser.pl: Use new style for "level" specification.
377    
378 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
379    
380 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
381     reporting process for broken ignored valuetype definition.
382     (Valuetype idl_text): Support for special "DOMString" name.
383    
384     2008-08-03 Wakaba <wakaba@suika.fam.cx>
385    
386 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
387     terminal of the ScopedName is "DOMString", such that whether
388     the last part of the scoped name is "DOMString" or "_DOMString"
389     later. It is necessary to determine whether a |typedef|
390     definition should be ignored or not.
391     (parse_char_string): Unescape the identifier of
392     exception members.
393     ($resolve): Return undef for builtin types and sequence<T>
394     types (we might not have to do this, however...).
395     (check): Support checking for Exceptions, Valuetypes,
396     and Typedefs.
397     ($serialize_type): Support for "DOMString::::" syntax.
398     (Typedef idl_text): Output Type as "DOMString" if it
399     is really "DOMString" (i.e. its internal representation
400     is "::DOMString::").
401    
402     2008-08-03 Wakaba <wakaba@suika.fam.cx>
403    
404 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
405     for constant types in the |check| method.
406     (check): Support for checking of attributes, operations, and
407     arguments.
408     (Attribute/Operation idl_text): Exception names in getraises,
409     setraises, and raises clauses is serizlied by |$serialize_type|
410     code.
411    
412 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
413    
414 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
415     hexadecimal numbers (the original pattern, taken from the spec,
416     was not work for hexadecimal numbers, because the "0" prefix
417     matches to the [0-7]* part (as an empty string) and therefore
418     it does not match with remaining "x..." part of a "0x..." integer
419     literal.
420     ($get_type): It now returns a string, not an array reference,
421     for regular types and |sequence| types (i.e. it in any case
422     returns a string).
423     ($get_next_token): The second item in the array that represents
424     a integer or float token is now a Perl number value, not the
425     original string representation of the number.
426     (check): Support for const value consistency checking.
427     No extended attribute is defined for constants.
428     (Node subclasses): Use simple strings rather than array references
429     for default data type values.
430     ($serialize_type): Type values are now simple strings.
431     (value): If the new attribute value is a false value, then
432     a FALSE value is set to the attribute.
433    
434     2008-08-02 Wakaba <wakaba@suika.fam.cx>
435    
436 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
437     in its stringified format ("scoped name" as defined in the
438     spec). Note that future version of this module should not use
439     array references for type values and the |type_text| attribute
440     should be made obsolete.
441     (parse_char_string): Unescape attribute names.
442     (check): Support for checking of whether inherited interfaces
443     are actually defined or not. Support for checking of whether
444     interface member identifiers are duplicated or not.
445     ($serialize_type): Scoped names are returned as is. A future
446     version of this code should escape identifiers other than "DOMString",
447     otherwise the idl_text would be non-conforming.
448    
449     2008-08-02 Wakaba <wakaba@suika.fam.cx>
450    
451 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
452 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
453     for Definition's were ignored.
454     (append_child): Set |parent_node| attribute.
455     (parent_node): New attribute.
456     (check): Support interface/exception members. Support
457     extended attributes. Support definition identifier uniqueness
458     constraint.
459     (qualified_name): New attribute.
460     (Interface/Exception idl_text): Extended attributes were
461     not prepended to the returned text.
462    
463     2008-08-02 Wakaba <wakaba@suika.fam.cx>
464    
465     * WebIDL.pm (parse_char_string): Set line/column numbers
466 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
467     would make things wrong. Support for interface forward
468     declarations was missing. Broken interface declarations
469     with no block were not ignored entirely.
470     (Whatpm::WebIDL::Node): New abstract class. This class
471     makes things easier.
472     (child_nodes): New attribute. Unlike DOM's attribute with
473     same name, this attribute returns a dead list of nodes for
474     simplicity.
475     (get_user_data, set_user_data): New methods.
476     (Module idl_text): A SPACE character should be inserted
477     before the |{| character.
478     (Interface idl_text): Support for interface forward declarations.
479     (is_forward_declaration): New attribute.
480    
481 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
482    
483 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
484    
485     2008-07-19 Wakaba <wakaba@suika.fam.cx>
486    
487 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
488     it now can handle broken extended attributes and as such.
489    
490     2008-07-19 Wakaba <wakaba@suika.fam.cx>
491    
492 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
493     Support for extended attributes with arguments.
494    
495     2008-07-19 Wakaba <wakaba@suika.fam.cx>
496    
497 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
498     (Interface->idl_text): Tentative support for inheritances.
499    
500     2008-07-19 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
503     Support for raises, setraises, and getraises syntaxes.
504    
505 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
506    
507     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
508     proper support for types, extended attributes, and exceptions yet).
509     WebIDL parser, version 1 (no support for exceptions yet,
510     no proper support for extended attributes yet).
511    
512 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
513    
514     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
515    
516 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
517    
518 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
519    
520     2008-06-29 Wakaba <wakaba@suika.fam.cx>
521    
522 wakaba 1.255 * WebIDL.pm: New module.
523    
524 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
525    
526     * Makefile (Entities.html): URI changed.
527    
528 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
529    
530     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
531    
532 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
533    
534 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
535    
536     2008-06-01 Wakaba <wakaba@suika.fam.cx>
537    
538 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
539     spec (commented out). (HTML5 revision 1711).
540    
541 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
542    
543 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
544    
545     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
546     as UTF-16 (HTML5 revision 1701).
547    
548     2008-05-25 Wakaba <wakaba@suika.fam.cx>
549    
550 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
551     1692).
552    
553     2008-05-25 Wakaba <wakaba@suika.fam.cx>
554    
555     * HTML.pm.src: The secondary insertion mode used when switching
556     to foreign content is the "in body" insertion mode (HTML5 revision
557     1696).
558    
559     2008-05-25 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
562     revision 1697).
563    
564 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
565    
566 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
567     insertion mode (HTML5 revision 1693). Update SVG camelCase
568     attribute list (HTML5 revision 1700). <textarea> closes
569     </select> (HTML5 revision 1699). More start tags close in
570     foreign content insertion mode (HTML5 revision 1698).
571    
572     2008-05-24 Wakaba <wakaba@suika.fam.cx>
573    
574 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
575    
576     2008-05-24 Wakaba <wakaba@suika.fam.cx>
577    
578 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
579     revision 1674).
580    
581     2008-05-24 Wakaba <wakaba@suika.fam.cx>
582    
583 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
584     revision 1676).
585    
586     2008-05-24 Wakaba <wakaba@suika.fam.cx>
587    
588 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
589     quriks mode detection (HTML5 revision 1679).
590    
591     2008-05-24 Wakaba <wakaba@suika.fam.cx>
592    
593 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
594     revision 1685).
595    
596     2008-05-24 Wakaba <wakaba@suika.fam.cx>
597    
598 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
599     revision 1684).
600    
601     2008-05-24 Wakaba <wakaba@suika.fam.cx>
602    
603 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
604     work (HTML5 revision 1690).
605    
606     2008-05-24 Wakaba <wakaba@suika.fam.cx>
607    
608 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
609     end tags in head insertion modes (HTML5 revision 1686).
610     (parse_char_string): Bug fix for non-utf8 character string handlings.
611     (parse_char_stream): |ungetc| does not work well for this context.
612    
613 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
614    
615 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
616     |parse_byte_stream|.
617     (parse_byte_stream): New method.
618    
619     2008-05-18 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
622     by encoding layer error reporter.
623    
624 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
625    
626 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
627     rather than converting the whole byte string and then parsing.
628     Propagate errors in character encoding layer.
629     (get_next_token): Precise error reporting for |bare stago| error.
630    
631     2008-05-17 Wakaba <wakaba@suika.fam.cx>
632    
633 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
634     (parse_char_string): This method is now defined as an invocation
635     of the |parse_char_stream| method.
636    
637     2008-05-17 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
640     of the sniffing as info-level errors. Support for new
641     decoding framework in parser resestting.
642     (new): Various default error levels were not set.
643    
644     2008-05-17 Wakaba <wakaba@suika.fam.cx>
645    
646 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
647     algorithm, except for the actual sniffing, is implemented
648     with new framework with Message::Charset::Info.
649    
650 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
651    
652     * CacheManifest.pm (_parse): Drop fragment identifiers from
653     URIs in fallback section (HTML5 revision 1596).
654    
655 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
656    
657 wakaba 1.231 * Makefile (Entities.html): URI has changed.
658    
659     2008-05-10 Wakaba <wakaba@suika.fam.cx>
660    
661 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
662     1553).
663    
664 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
665    
666 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
667     no longer table elements, but table objects returned
668     by Whatpm::HTMLTable.
669    
670     * HTMLTable.pm (form_table): Return table element node
671     as |$table->{element}|.
672     (assign_header): Support for the |headers=""| attribute.
673    
674     2008-05-06 Wakaba <wakaba@suika.fam.cx>
675    
676 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
677     with no support for headers="".
678     (form_table): Include table width and height to the returned
679     table object for covenience. Indexing in column assignement was wrong.
680     Set whether a data cell is empty or not for convenience.
681    
682 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
683    
684 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
685    
686     2008-05-05 Wakaba <wakaba@suika.fam.cx>
687    
688 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
689     changed (HTML5 revision 1387).
690    
691     2008-05-05 Wakaba <wakaba@suika.fam.cx>
692    
693 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
694     to the end of the table (HTML5 revision 1380).
695    
696     2008-05-05 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
699     of 1-based (HTML5 revision 1376).
700    
701     2008-05-05 Wakaba <wakaba@suika.fam.cx>
702    
703 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
704     and "Content-Encoding" no longer prevent sniffing (HTML5
705     revision 1288).
706    
707     2008-05-05 Wakaba <wakaba@suika.fam.cx>
708    
709 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
710     1282).
711    
712 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
715     Status of XML specs are added.
716    
717     2008-05-03 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
720     and |ref=""| attribute (referent element type checking).
721    
722 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
723    
724 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
725     1500) is implemented (except for IDNA part and URI-scheme-specific
726 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
727     documents.
728    
729 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
730    
731 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
732     character (HTML5 revision 1263).
733    
734     2008-04-13 Wakaba <wakaba@suika.fam.cx>
735    
736 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
737     definition for |AElig;| was missing).
738    
739     2008-04-13 Wakaba <wakaba@suika.fam.cx>
740    
741 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
742     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
743    
744     2008-04-13 Wakaba <wakaba@suika.fam.cx>
745    
746 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
747     insertion mode is added (HTML5 revisions 1412 and 1418).
748    
749     2008-04-13 Wakaba <wakaba@suika.fam.cx>
750    
751 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
752     elements (HTML5 revision 1410).
753    
754     2008-04-13 Wakaba <wakaba@suika.fam.cx>
755    
756 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
757     revision 1406).
758    
759     2008-04-13 Wakaba <wakaba@suika.fam.cx>
760    
761 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
762     elements (HTML5 revision 1404 and 1420).
763    
764 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
765    
766 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
767     elements (HTML5 revision 1404). Unused !!!macro definitions
768     are removed.
769    
770     2008-04-12 Wakaba <wakaba@suika.fam.cx>
771    
772 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
773     are raised is changed (HTML5 revision 1404).
774    
775 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
776    
777 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
778    
779     2008-04-06 Wakaba <wakaba@suika.fam.cx>
780    
781 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
782     replaced by constants representing element category.
783    
784     2008-04-06 Wakaba <wakaba@suika.fam.cx>
785    
786 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
787     for error reporting. (This is a preparation for using constant
788     value for ->[1].)
789    
790 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
791    
792 wakaba 1.205 * RDFXML.pm: Typo fixed.
793    
794     2008-03-22 Wakaba <wakaba@suika.fam.cx>
795    
796 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
797     as same as |must_level|, i.e. level = |m|.
798     (check_element): Make list of URIs in the DOM.
799    
800 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
801    
802 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
803     Local (null-namespace) attribute support.
804    
805     2008-03-21 Wakaba <wakaba@suika.fam.cx>
806    
807 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
808    
809     2008-03-21 Wakaba <wakaba@suika.fam.cx>
810    
811 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
812     and URI attributes is implemented. Warn if unknown
813     value is used in rdf:parseType="" attribute.
814    
815     * URIChecker.pm (check_rdf_uri_reference): New function.
816    
817     2008-03-21 Wakaba <wakaba@suika.fam.cx>
818    
819 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
820     are now resolved.
821    
822     2008-03-21 Wakaba <wakaba@suika.fam.cx>
823    
824 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
825    
826     * RDFXML.pm: undef vs false bug fixed.
827     Reification implemented.
828    
829     2008-03-21 Wakaba <wakaba@suika.fam.cx>
830    
831 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
832     The |node| arguments are added for |ontriple| calls.
833     Too many "attribute not allowed" errors were raised.
834    
835     * ContentChecker.pm: Initial experimental support for rdf:RDF
836     element.
837    
838     2008-03-21 Wakaba <wakaba@suika.fam.cx>
839    
840 wakaba 1.197 * RDFXML.pm: New module.
841    
842 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
845     code was old one yet.
846    
847     2008-03-20 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
850     errors. Line/column markings for DOCTYPE, comment, and
851     character tokens are reintroduced; otherwise,
852     error location for "not HTML5" error and errors
853     for implied elements are not attached.
854    
855     2008-03-20 Wakaba <wakaba@suika.fam.cx>
856    
857 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
858    
859     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
860     (value): Setter implemented.
861    
862     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
863    
864     2008-03-20 Wakaba <wakaba@suika.fam.cx>
865    
866 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
867    
868     2008-03-20 Wakaba <wakaba@suika.fam.cx>
869    
870 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
871     flag, which is no longer used.
872    
873 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
874    
875 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
876    
877     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
878     and !!!insert-element-t): Set line/column information to
879     element nodes.
880    
881     2008-03-17 Wakaba <wakaba@suika.fam.cx>
882    
883 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
884     in "<?", "<>", or "</>" should be the error point.
885    
886 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
887    
888 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
889    
890     2008-03-16 Wakaba <wakaba@suika.fam.cx>
891    
892 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
893    
894     2008-03-16 Wakaba <wakaba@suika.fam.cx>
895    
896 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
897     reporting.
898    
899 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
900    
901 wakaba 1.186 * HTML.pm.src: Error type revised.
902    
903     2008-03-11 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
906    
907     2008-03-11 Wakaba <wakaba@suika.fam.cx>
908    
909 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
910    
911 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
912    
913     * mkhtmlparser.pl: Set "level" parameter to parse errors.
914    
915     * HTML.pm.src: Code refined.
916    
917 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
920     revision 1348). Note that I really don't know this makes
921     any difference in the black-box behavior of the parser.
922    
923     2008-03-09 Wakaba <wakaba@suika.fam.cx>
924    
925 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
926     revision 1348).
927    
928     2008-03-09 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
931    
932     2008-03-09 Wakaba <wakaba@suika.fam.cx>
933    
934 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
935    
936     2008-03-09 Wakaba <wakaba@suika.fam.cx>
937    
938 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
939     mode and support for the "in select in table" insertion mode (HTML5
940     revision 1342).
941    
942     2008-03-09 Wakaba <wakaba@suika.fam.cx>
943    
944 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
945     in non-tainted <table>s (HTML5 revision 1336).
946    
947     2008-03-09 Wakaba <wakaba@suika.fam.cx>
948    
949 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
950     and <head> (HTML5 revision 1332).
951    
952     2008-03-09 Wakaba <wakaba@suika.fam.cx>
953    
954 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
955     white space (HTML5 revision 1331).
956    
957 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
958    
959 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
960     element (HTML5 revision 1330).
961    
962     2008-03-08 Wakaba <wakaba@suika.fam.cx>
963    
964 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
965     element (HTML5 revision 1328).
966    
967     2008-03-08 Wakaba <wakaba@suika.fam.cx>
968    
969 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
970     into foster parents (HTML5 revision 1326).
971    
972     2008-03-08 Wakaba <wakaba@suika.fam.cx>
973    
974 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
975     revision 1321).
976    
977     2008-03-08 Wakaba <wakaba@suika.fam.cx>
978    
979 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
980     the spec (HTML5 revision 1320). Note that we finally complete
981     all of HTML5 revision 1320 changes.
982    
983     2008-03-08 Wakaba <wakaba@suika.fam.cx>
984    
985 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
986     tags (HTML5 revision 1320).
987    
988     2008-03-08 Wakaba <wakaba@suika.fam.cx>
989    
990 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
991    
992     2008-03-08 Wakaba <wakaba@suika.fam.cx>
993    
994 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
995     could not be reached so that it is now removed (HTML5 revision
996     1320).
997    
998     2008-03-08 Wakaba <wakaba@suika.fam.cx>
999    
1000 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1001     to align with new spec text (HTML5 revision 1320).
1002    
1003     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1004    
1005 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1006     have had no harm in theory.
1007    
1008     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1009    
1010 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1011     case is split from other end tags for the preparation
1012     of implementing HTML revision 1320.
1013    
1014 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1015    
1016 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1017     revision 1320).
1018    
1019     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1020    
1021 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1022     and "div" (HTML5 revision 1318). Add comments to where
1023     |form| pointer association codes should be inserted (HTML5 revision
1024     1319).
1025    
1026 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1027    
1028     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1029    
1030 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1031    
1032 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1033     other token type, there are), it is replaced by a |die| statement.
1034    
1035     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1036    
1037 wakaba 1.159 * HTML.pm.src: Typo fixed.
1038    
1039 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1040    
1041     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1042     doctypes were not uppercased for comparison.
1043    
1044 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1045    
1046 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1047     are added.
1048    
1049     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1050    
1051 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1052     are set. Cases that are unlikely reached are noted as so.
1053    
1054     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1055    
1056 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1057    
1058     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1059    
1060     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1061    
1062 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1063     for simplicity.
1064    
1065     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1066    
1067 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1068     revision 1307).
1069    
1070     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1071    
1072 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1073     is set are changed (HTML5 revision 1305).
1074    
1075 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1076    
1077 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1078     1292). Entities are not parsed in comment-like part in RCDATA
1079     elements (HTML5 revision 1294). Allow bare & at the end
1080     of attribute value literals (HTML5 revision 1296). More
1081     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1082     between attributes and ban attribute names or unquoted
1083     attribute values containing single or double quotes (HTML5
1084     revision 1303).
1085    
1086     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1087    
1088 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1089     and related errors unless it is an HTML document (though the spec
1090     is unclear on whether it is applied to XHTML document).
1091    
1092     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1093     are added.
1094    
1095     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1096    
1097 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1098    
1099 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1100    
1101     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1102    
1103     * HTML.pm.src: |charset| in |content| attribute is
1104     case-insensitive (HTML5 revision 1270).
1105    
1106 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1107    
1108     * ContentChecker.pm: New status constants are added.
1109     ($ElementDefault): |status| added.
1110     (check_element): Err for non-standard or deprecated elements.
1111     (_attr_status_info): For non-standard or deprecated attributes.
1112    
1113 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1116    
1117     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1118    
1119 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1120     status information is now dispatched.
1121    
1122     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1123    
1124 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1125     problem by adding some more arguments.
1126    
1127     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1130     container-for-the-purpose-of-content-model element (not transparent
1131     element) for |check_child_element| calling and significant
1132     text flag marking. This reintroduces |<del>|-and-significant
1133     problem again.
1134    
1135     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1136    
1137 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1138     elements ignored for the purpose of phase changes in
1139     content model checking.
1140    
1141 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1142    
1143 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1144     was not properly managed for transparent cases.
1145    
1146     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1147    
1148 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1149     and |audio| as semi-transparent elements.
1150    
1151     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1152    
1153 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1154     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1155     Support for |html:object| as a semi-transparent element.
1156    
1157     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1158    
1159 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1160     the tree is entirely revised to make it easier to track
1161     the state of ancestors/descendants. As a result of this
1162     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1163     support for content model checking for HTML elements |figure|,
1164     |object|, |video|, and |audio| and checking for XML elements (and
1165     some XMLNS checkings) are dropped for now. They will be
1166     reimplemented in due cource.
1167    
1168 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1169    
1170 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1171     token (HTML5 revision 1225).
1172    
1173     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1174    
1175 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1176    
1177     * HTML.pm.src: Save whether |meta| |content| attribute
1178     contains character references or not.
1179    
1180     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1181    
1182 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1183     imlement significant content checking for contents
1184     with |del| elements.
1185    
1186     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1189     elements are added.
1190     (_get_children): HTML |object| elements are now semi-transparent.
1191    
1192     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1193    
1194 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1195    
1196     * CacheManifest.pm: HTML5 revision 1211 implemented.
1197    
1198     * CacheManifest.pod: Updated.
1199    
1200 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1201    
1202     * ContentChecker.pm (check_document, check_element): Support
1203     for second argument ($onsubdoc).
1204     (_get_css_parser): Removed (now it is part of WDCC).
1205    
1206 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1207    
1208     * ContentChecker.pm (_get_css_parser): New.
1209    
1210 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1211    
1212 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1213     for checking had been used.
1214    
1215     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1216    
1217 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1218    
1219 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1220    
1221 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1222     specced in RFC 2046 and RFC 4288.
1223     (application/atom+xml): Definition added.
1224    
1225     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1228     warnings (rather than SHOULD-level errors).
1229    
1230     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1233    
1234     * HTML.pm.src: A flag for character references in attribute
1235     values are added. Set |manakai_has_reference| user data
1236     to |charset| attribute.
1237    
1238     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1239    
1240 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1241     attributes.
1242    
1243     * ContentChecker.pm (check_document): Warn if charset requirements
1244     cannot be tested.
1245    
1246 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1247    
1248     * HTML.pm.src (parse_byte_string): Detect charset
1249     by universalchardet if charset parameter is not specified.
1250    
1251     * Makefile (Charset-all, Charset-clean): New rules.
1252    
1253 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1254    
1255 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1256     of character encoding declaration and selection of encoding
1257     for HTML document.
1258    
1259     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1260    
1261 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1262     type in list context.
1263    
1264     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1265    
1266 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1267     ignoring (HTML5 revisions 1013 and 1016).
1268    
1269     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1270    
1271 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1272     white spaces, are implemented.
1273    
1274 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1275    
1276 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1277     attribute if possible.
1278    
1279     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1282     (parse_char_string): New alias for |parse_string|.
1283     (main phase): Invoking "change the encoding" algorithm if desired.
1284    
1285     * HTML.pod: Updated.
1286    
1287     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1288    
1289 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1290    
1291     * Makefile (HTML-all, HTML-clean): New.
1292    
1293     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1294    
1295     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1296    
1297 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1298    
1299     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1300    
1301 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1302    
1303 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1304    
1305     * CacheManifest.pod: New file.
1306    
1307     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1308    
1309 wakaba 1.114 * CacheManifest.pm: New module.
1310    
1311     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1312    
1313 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1314     callback.
1315    
1316     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1317    
1318 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1319    
1320 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1321    
1322 wakaba 1.111 * Makefile (clean): New rule.
1323    
1324     * NanoDOM.pm (public_id, system_id): New attributes.
1325    
1326     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1327    
1328 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1329    
1330 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1331    
1332     * ContentChecker.pm (check_document): Support for
1333     new |is_xml_root| flag.
1334     (check_element): Support for new |pluses| state.
1335     (_add_pluses): New method.
1336     (_remove_minuses): Support for new |minus| item.
1337    
1338 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1339    
1340 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1341     root element.
1342    
1343     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1344    
1345 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1346    
1347 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1350    
1351     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1354    
1355     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1356    
1357 wakaba 1.103 * LangTag.pm: New module.
1358    
1359 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1360    
1361     * HTML.pm.src: Some error types were wrong.
1362    
1363 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1364    
1365 wakaba 1.101 * CSS/: New directory.
1366    
1367     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1368    
1369 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1370     for |noscript| in |head|.
1371    
1372 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1373    
1374     * URI/: New directory.
1375    
1376 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1377    
1378 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1379     number.
1380    
1381     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1382    
1383 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1384     by |&|s.
1385    
1386     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1387    
1388 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1389    
1390     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1391    
1392 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1393    
1394     * HTML.pm.src: Insertion modes are now represented in number.
1395    
1396     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1399     999) is implemented.
1400    
1401 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1402    
1403     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1404     983) is implemented.
1405    
1406 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1407    
1408     * ContentChecker.pod: New documentation.
1409    
1410     * Makefile: A rule for |ContentChecker.html| is added.
1411    
1412     * ContentChecker.pm: A pod "LICENSE" section is added.
1413    
1414     * NanoDOM.pm ($VERSION): New variable.
1415    
1416 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1419     list of allowed HTML elements.
1420    
1421     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1422    
1423 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1424     HTML elements.
1425    
1426     * URIChecker.pm (check_iri): New.
1427     (check_iri_reference): Error type for IRI reference
1428     syntax error is changed.
1429    
1430 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1433     is added.
1434     (check_document): Load appropriate module before validation.
1435    
1436     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1437    
1438 wakaba 1.88 * ContentChecker/: New directory.
1439    
1440     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1441    
1442 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1443    
1444     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1445    
1446 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1447    
1448 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1449     even if no document element is found.
1450    
1451     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1452    
1453 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1454    
1455     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1456    
1457 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1458    
1459     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1460    
1461 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1462     is split into "after html body" and "after html frameset"
1463     insertion modes. Their codes are merged with "after body"
1464     and "after frameset" codes. |$previous_insertion_mode|
1465     has been removed. "after frameset" code is
1466     merged with "in frameset" code.
1467    
1468     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1469    
1470 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1471     merged with the "in head" insertion mode.
1472    
1473     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1474    
1475 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1476     reorganized.
1477    
1478     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1481     are merged.
1482    
1483     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1484    
1485 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1486     with "in table" insertion mode.
1487    
1488     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1489    
1490 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1491     modes are merged.
1492    
1493     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1494    
1495 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1496     insertion mode!
1497    
1498     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1501     now directly invoke the handler.
1502    
1503     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1504    
1505 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1506     is merged to the "in body" insertion mode code.
1507    
1508     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1509    
1510 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1511     insertion modes are merged.
1512    
1513     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1514    
1515 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1516    
1517     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1518    
1519 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1520     instead of string constant for |{content_model_flag}|.
1521    
1522     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1523    
1524 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1525     to the "duplicate attribute" error.
1526    
1527 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1528    
1529 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1530    
1531     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1532    
1533 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1534    
1535     * HTML.pm.src: A typo is fixed.
1536    
1537 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1538    
1539 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1540     from "in HTML:xml:lang" error. Character position
1541     is now the last part of the error type in the URI error
1542     description. Report "unsupported" status for language
1543     tags, media queries, script codes, and style sheets.
1544    
1545     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1546    
1547 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1548     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1549     in XML.
1550    
1551     * NanoDOM.pm (Attr.owner_document): New attribute.
1552    
1553     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1554    
1555 wakaba 1.66 * HTML.pm.src: The character immediately following
1556     a bare |hcro| was discarded. Fix handling of
1557     entity references in attribute values.
1558    
1559     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1560    
1561 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1562     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1563     factored out. Error types |in html:#DOCTYPE|
1564     and |after html:#DOCTYPE| are merged
1565     into |DOCTYPE in the middle|. |</frameset>|
1566     in fragment parsing mode changed the
1567     insertion mode.
1568    
1569     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1570    
1571 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1572     phase is now an insertion mode. Treatments for white
1573     space character tokens were incorrect for some
1574     insertion modes. An old |meta| case was not removed.
1575    
1576     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1579     implemented (but changing the encoding is not yet:-).
1580    
1581 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1582    
1583 wakaba 1.62 * Charset/: New directory.
1584    
1585     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.61 * H2H.pm: New Perl module (created from
1588     manakai's H2H.dis).
1589    
1590     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1591    
1592 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1593     manakai's SimpleLS.dis).
1594    
1595 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1596    
1597     * HTML.pm.src: |<!---x-->| was not processed correctly.
1598    
1599 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1600    
1601 wakaba 1.58 * HTML.pm.src: Report correct error message
1602     for |<body></div></body>|.
1603    
1604     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1605    
1606 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1607    
1608     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1609    
1610     * ContentChecker.pm: Return |{term}| list.
1611    
1612 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1613    
1614 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1615     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1616     and <title> outside of head).
1617    
1618     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1619    
1620 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1621     and private types/subtypes.
1622    
1623     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1624     URIChecker.pm, HTMLTable.pm: Error messages are now
1625     consistent; they are all listed in
1626     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1627    
1628 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1629    
1630 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1631     is now erred. |<datalist>| is implemented.
1632     Attribute checker for |<command>| and |<menu>| are
1633     added. Support for |contextmenu| global attribute
1634     is added.
1635    
1636     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1637    
1638 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1639     of Step 3 has been changed.
1640    
1641     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1642    
1643 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1644    
1645 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1646    
1647 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1648     incorrect.
1649    
1650     * ContentChecker.pm: Support for the |footer| element.
1651     Check URI syntax for space-separated URI attributes.
1652     Support for the |tabindex| attribute. Support
1653     for |datetime| attribute.
1654    
1655     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1656    
1657 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1658     character references). HTML5 revision 1.146 (white space
1659     characters before root start tag). HTML5 revision
1660     1.148 (named character references in attribute values).
1661     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1662    
1663     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1664    
1665     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1666 wakaba 1.49 in <head>).
1667    
1668     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1669    
1670 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1671     in body).
1672    
1673     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1674    
1675     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1676    
1677 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1678     or RCDATA in fragment parsing mode). HTML5 revision
1679     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1680    
1681     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1682    
1683     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1684     for <pre> and <textarea>).
1685    
1686 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1687    
1688 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1689    
1690     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1691    
1692 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1693    
1694     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1695    
1696 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1697     HTML5 revision 917 (conforming bare &).
1698    
1699     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1700    
1701 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1702    
1703     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1704     HTML5 revision 915 (<nobr>).
1705    
1706     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1707    
1708 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1709    
1710     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1711     New attributes.
1712    
1713     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1716     in tags).
1717    
1718     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1719    
1720 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1721    
1722     * HTML.pm.src: |$entity_char| is removed and
1723     requires |Whatpm::_NamedEntityList| instead.
1724     HTML5 revision 898 (refc), except that lack of refc
1725     is parse error.
1726    
1727     * mkentitylist.pl: New script.
1728    
1729     * Makefile (all): |_NamedEntityList.pm| is added.
1730     (_NamedEntityList.pm, Entities.html): New rules.
1731    
1732     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1733    
1734 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1735     were ignored and U+000D immediately following another
1736     U+000D was not converted to U+000A.
1737    
1738     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1739    
1740 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1741     nodes before appended). Parser was not ready for NULL
1742     parse error and escape flag.
1743    
1744     * NanoDOM.pm (adopt_node): New.
1745    
1746     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1747    
1748 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1749     CDATA and RCDATA). Note that current implementation
1750     is simply repeating what the spec says and it is maybe not
1751     a best way to do it.
1752    
1753     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1756     the form element if a descendant element without implied end tag has
1757     still been open).
1758    
1759     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1762     space characters non-conforming).
1763    
1764     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1765    
1766 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1767     HTML5 revision 869 (C1 character references).
1768    
1769     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1770    
1771 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1772     a |textarea| is removed).
1773    
1774 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1775    
1776     * NanoDOM.pm (get_attribute_node_ns): New method.
1777    
1778     * ContentChecker.pm: |script| |async| and |defer|
1779     no longer require |src|. |async| MUST NOT be
1780     specified if |defer|. (HTML5 revision 858).
1781    
1782 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1783    
1784     * HTML.pm.src: |<form><form>| went to inifinite loop.
1785    
1786 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1787    
1788 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1789     as a document element) flag on.
1790     (new): Removed.
1791     (check_document): New method.
1792    
1793     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1794    
1795 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1796     were incorrect.
1797    
1798     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1799    
1800 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1801     error position in the |type| option of the error.
1802    
1803     * HTMLTable.pm (form_table): The |$onerror| parameter
1804     is now optional. Some bugs are fixed.
1805    
1806     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1807    
1808 wakaba 1.27 * HTMLTable.pm: New module.
1809    
1810     * ContentChecker.pm (table): Invoke table model error checker.
1811    
1812     * NanoDOM.pm (first_child, get_attribute_ns): New.
1813    
1814 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1815    
1816 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1817     (link/@rel, a/@rel, area/@rel): Use new checker.
1818    
1819     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1820    
1821     * _LinkTypeList.pm: New file.
1822    
1823     * mklinktypelist.pl: New file.
1824    
1825     * .cvsignore: |RelExtensions.html| added.
1826    
1827     * NanoDOM.pm (child_nodes): Returns an empty array
1828     for non-child-containing node types.
1829     (text_content): New attribute.
1830    
1831     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1832    
1833 wakaba 1.25 * IMTChecker.pm: New module.
1834    
1835     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1836     to test parameter value validity.
1837    
1838     * HTML.pm.src ($style_start_tag): Attributes were
1839     discarded.
1840    
1841 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1842    
1843 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1844    
1845     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1846    
1847 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1848     and RFC 3987 are listed (not all testable items are checked yet).
1849    
1850     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1851    
1852 wakaba 1.22 * URIChecker.pm: New module.
1853    
1854 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1855    
1856 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1857     or associated to some placeholder.
1858     ($ElementDefault): Warn unknown attributes
1859     for unknown elements as "attribute not supported".
1860     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1861     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1862     (|a| attribute checker): Reimplemented.
1863    
1864     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1865    
1866 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1867     ($HTMLAttrChecker): Event handler content attributes
1868     are added.
1869     (link, embed): Required attribute is now checked.
1870     (embed): Unknown local attributes are no longer warned.
1871    
1872     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1873    
1874 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1875     ($HTMLIMTAttrChecker): New checker.
1876     (link@rel, link@href, link@type, style@type,
1877     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1878     object@data, object@type, source@src, source@type, area@alt,
1879     area@shape, area@coords, area@href,
1880     area@ping, area@rel, area@type, script@src,
1881     script@defer, script@async, script@type): Checkers added.
1882    
1883     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1884    
1885 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1886    
1887 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1888    
1889 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1890     attributes. Report an error if |Element.prefix|
1891     is |xmlns|.
1892    
1893     * NanoDOM.pm (prefix): New attribute.
1894    
1895     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1896    
1897 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1898     mode, action for |<iframe>| was missing.
1899    
1900     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1901    
1902 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1903     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1904    
1905     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1906    
1907 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1908     ($HTMLURIAttrChecker): Placeholder.
1909     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1910     New.
1911    
1912     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1913    
1914 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1915     for global attributes, |html|, |base|, |style|, and |meta|.
1916    
1917     * NanoDOM.pm (insert_before): Weaken reference
1918     to the parent node.
1919     (Attr::new): Set |owner_element| attribute.
1920     (namespace_uri, manakai_local_name): New attribute implementations.
1921     (owner_element): New attribute.
1922    
1923     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924    
1925 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1926     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1927     $Element->{$HTML_NS}->{''}): New.
1928     (check_element): Invoke attrs_checker for each element.
1929    
1930 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1931    
1932 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1933    
1934     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1935    
1936 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1937     element type testings.
1938    
1939     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1940    
1941 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1942     error when an element type is put in the "minus" list
1943     and the element type is not allowed explicitly in the particular
1944     element content model.
1945     (html:a checker): New checker.
1946     (html:details, html:datagrid): New checkers.
1947     (html:legend): New checker.
1948    
1949     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1950    
1951 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1952    
1953     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1954    
1955 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1956     checker.
1957     (html:dd checker): New checker.
1958     (html:q, html:em, html:strong, html:small,
1959     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1960    
1961     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1962    
1963 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1964     from |$ElementDefault->{checker}|.
1965     ($ElementDefault->{checker}): Throw an error that
1966     the element type is not supported by the checker.
1967     ($HTMLMetadataElement): |html:base| was missing.
1968     ($HTMLEmptyChecker): Don't throw an error
1969     for inter-element whitespace nodes.
1970     (html:html checker): Errors were not
1971     thrown even if |html:head| and/or |html:body|
1972     children were missing.
1973     (html:head checker): An error was not
1974     thrown if <meta charset> appered after other
1975     elements.
1976    
1977 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1978    
1979     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1980     and |noscript| elements are implemented.
1981     (new): New method.
1982    
1983 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1984    
1985 wakaba 1.4 * ContentChecker.pm: New module.
1986    
1987     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1988    
1989 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1990     document_element, manakai_local_name,
1991     manakai_element_type_match): New method.
1992    
1993     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1994    
1995     * HTML.pm.src: Replace decimal and hexadecimal numeric
1996     entities in C1 range using Windows-1252 mapping. Bare LF
1997     did not count as new line for error reporting.
1998    
1999 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2000    
2001     * NanoDOM.pm (DOMImplementation): New class.
2002     (append_child): Weaken the |parent_node| reference.
2003     (create_element_ns, Element new): Set the |owner_document|
2004     reference.
2005     (implementation): New attribute.
2006     (owner_document, local_name, namespace_uri): New attributes.
2007    
2008     * HTML.pm.src (parse_string): Line and column numbers
2009     are now provided to error handler.
2010     (!!!parse-error): Short descriptions are added.
2011     (_construct_tree): Split into three methods; support
2012     for innerHTML mode.
2013     (set_inner_html): New method.
2014    
2015 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2016    
2017     * NanoDOM.html: Documentation is added.
2018    
2019     * HTML.pod, ContentType.html: Documentation is revised.
2020    
2021     * .cvsignore: Pod2html temporary files are added.
2022    
2023     * Makefile: Make |NanoDOM.html|.
2024    
2025     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2026    
2027     * HTML.pm.src (parse_string): New method.
2028     (get_inner_html): Renamed from |inner_html|.
2029    
2030     * Makefile: A rule for |HTML.html| is added.
2031    
2032     * HTML.pod: New documentation.
2033    
2034     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2035    
2036     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2037     (clone_node): Attribute nodes were not completely copied.
2038    
2039     * HTML.pm.src: Many bugs are fixed.
2040    
2041     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2042    
2043     * HTML.pm.src: Some typos are fixed.
2044    
2045     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2046    
2047     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2048     are removed.
2049    
2050     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2051    
2052     * HTML-consume-entity.src: Removed.
2053    
2054     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2055    
2056     * HTML.pm.src: Tokenizer's handling on hexadecimal
2057     numeric entities are rewritten.
2058    
2059     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2060    
2061     * HTML.pm.src: Some tokenizer bugs are fixed.
2062    
2063     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2064    
2065     * HTML.pm.src: The tree construction stage is implemented.
2066    
2067     * mkhtmlparser.pl: New macros are added.
2068    
2069     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2070    
2071     * ContentType.pm: A note on bug in the specification
2072     is removed since it's been now fixed.
2073    
2074     * .cvsignore: New file.
2075    
2076     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2077    
2078     * HTML.pm.src, HTML-consume-entity.src: New files.
2079    
2080     * Makefile (HTML.pm): New rule.
2081    
2082     * mkhtmlparser.pl: New script.
2083    
2084     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2085    
2086     * Makefile: New file.
2087    
2088     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2089    
2090     * ContentType.pm: An error in pod is fixed.
2091    
2092     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2093    
2094     * ContentType.pm: Documentation is added.
2095    
2096     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2097    
2098     * ContentType.pm: New Perl module.
2099    
2100     * ChangeLog: New file.
2101    
2102    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24