/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.332 - (hide annotations) (download)
Sun Sep 21 11:55:49 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.331: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	21 Sep 2008 11:41:39 -0000
	* ContentChecker.pm: |contextmenu| validness is now checked using
	|id| and |id_type| properties, and |menu| property is removed.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ContentChecker/ChangeLog	21 Sep 2008 11:44:45 -0000
	* HTML.pm: |id| attribute checkers for the |menu| and |map|
	elements are removed; the |id| attribute of those elements is now
	checked by the global |id| attribute.

2008-09-21  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
4     |id| and |id_type| properties, and |menu| property is removed.
5    
6     2008-09-21 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
9     duplication checking.
10    
11     2008-09-21 Wakaba <wakaba@suika.fam.cx>
12    
13 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
14     unsupported charset="" parameter value (HTML5 revision 2131).
15    
16 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
17    
18 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
19     character is fixed (cf. HTML5 revision 1738).
20    
21     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
22     space characters (HTML5 revision 1738).
23    
24     2008-09-20 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
27     insertion mode was not updated to swtich to the "in body"
28     insertion mode. U+000B is no longer a space character for the
29     purpose of tree construction phase (HTML5 revision 1738).
30    
31     2008-09-20 Wakaba <wakaba@suika.fam.cx>
32    
33 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
34     revision 1738).
35    
36     2008-09-20 Wakaba <wakaba@suika.fam.cx>
37    
38 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
39     revision 1738).
40    
41     * HTML.pm.src: U+000B is no longer a space character for the
42     algorithm for extracting an encoding from a Content-Type (HTML5
43     revision 1738).
44    
45     2008-09-20 Wakaba <wakaba@suika.fam.cx>
46    
47 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
48    
49 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
50    
51 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
52    
53     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
54     matching.
55    
56     2008-09-18 Wakaba <wakaba@suika.fam.cx>
57    
58 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
59     typos fixed. Support for Suppress-Script field.
60    
61     * mklangreg.pl: Support for dumping of nested structure.
62    
63     2008-09-18 Wakaba <wakaba@suika.fam.cx>
64    
65 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
66     recommended case as per RFC 4646.
67    
68     2008-09-18 Wakaba <wakaba@suika.fam.cx>
69    
70 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
71    
72     2008-09-18 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.320 * mklangreg.pl: New script.
75    
76     * Makefile: Updated for creation of the module for language subtag
77     registry.
78    
79 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
80    
81 wakaba 1.319 * Makefile: WebIDL.html added.
82    
83     * WebIDL.pod: New documentation.
84    
85     2008-09-16 Wakaba <wakaba@suika.fam.cx>
86    
87 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
88    
89     2008-09-16 Wakaba <wakaba@suika.fam.cx>
90    
91 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
92     compatible parsing bugs are fixed. Some unreachable codes are
93     commented out.
94    
95     2008-09-16 Wakaba <wakaba@suika.fam.cx>
96    
97 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
98     added. It does not satisfy the definition that a forward
99     interface declaration has an extended attribute. It seems that
100     unless explicitly allowed multiple extended attributes with the
101     same name is not allowed, though it is not explicitly mentioned in
102     the spec.
103    
104     2008-09-16 Wakaba <wakaba@suika.fam.cx>
105    
106 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
107     attribute identifiers. Preserve whether an extended attribute has
108     an argument list of not. Support for extended attributes:
109     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
110     NameGetter, NameSetter, and Null.
111     (has_argument_list): New attribute.
112     (idl_text): Stringifies argument lists, if any, even if it is
113     empty.
114    
115 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
116    
117 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
118     |{s_kwd}| in DATA_STATE as default.
119    
120     2008-09-15 Wakaba <wakaba@suika.fam.cx>
121    
122 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
123     by |{s_kwd}| in DATA_STATE.
124    
125     2008-09-15 Wakaba <wakaba@suika.fam.cx>
126    
127 wakaba 1.312 * HTML.pm.src: Shorten keys.
128    
129     2008-09-15 Wakaba <wakaba@suika.fam.cx>
130    
131 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
132     pair, or noncharacter code points and non-Unicode code
133     points (they should be handled by Whatpm::Charset::UnicodeChecker).
134     (parse_char_stream): Support for the |$get_wrapper| argument and
135     character stream error handlers.
136    
137     2008-09-15 Wakaba <wakaba@suika.fam.cx>
138    
139 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
140     for null-namespace elements/attributes.
141    
142     * HTML.pm.src: Fact out $disallowed_control_chars
143     as a hash.
144    
145 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
146    
147 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
148     and |{next_char}| initializations are moved to initialization
149     method. |{read_until}| now supports buffering. Sync |set_inner_html|
150     with |parse_char_stream|.
151    
152     2008-09-14 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
155     invoke |manakai_read_until|, not only |read|, where
156     possible, to decrease the number of |read| method calls.
157    
158     * mkhtmlparser.pl: Related changes to the aforementioned
159     modification.
160    
161     2008-09-14 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
164     would report character error from now.
165    
166     2008-09-14 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
169     tokens in "before head insertion mode" was not
170     correctly handled.
171     (set_inner_html): Reimplemented using CharString decodehandle
172     class. Support for $get_wrapper argument. Support
173     for |{read_until}| feature.
174    
175     2008-09-14 Wakaba <wakaba@suika.fam.cx>
176    
177 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
178     entities point the "&" character.
179    
180     2008-09-14 Wakaba <wakaba@suika.fam.cx>
181    
182 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
183     be added to the list of excluded characters.
184    
185     2008-09-14 Wakaba <wakaba@suika.fam.cx>
186    
187 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
188     and |column| a higher priority than the one set by the
189     tokenizer's input handler.
190     ($self->{read_until}): Exclude U+FFFD (but this might
191     not be necessary, since now we do line/column fixup in
192     the character decode handle).
193    
194     2008-09-14 Wakaba <wakaba@suika.fam.cx>
195    
196 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
197    
198     2008-09-14 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
201     and |manakai_getc_until| to |manakai_read_until| to
202     reduce the number of string copies.
203    
204     2008-09-14 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
207     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
208     standard feature to |open| a string as a filehandle,
209     since Perl's string filehandle seems not supporting |ungetc|
210     method correctly.
211     (parse_char_stream): Define |{getc_until}| method.
212     (DATA_STATE): Experimental support for |getc_until| feature.
213    
214 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
215    
216 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
217    
218     2008-09-13 Wakaba <wakaba@suika.fam.cx>
219    
220 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
221     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
222     and replaced by |{prev_state}|.
223    
224     * mkhtmlparser.pl: Remove |{char}| feature.
225     Remove |!!!back-next-input-character;| macro.
226    
227     2008-09-13 Wakaba <wakaba@suika.fam.cx>
228    
229 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
230     entity related tokenizer states in favor of new states
231     implementing the consume character reference algorithm.
232    
233     2008-09-13 Wakaba <wakaba@suika.fam.cx>
234    
235     * HTML.pm.src: "Consume a character reference" algorithm is
236 wakaba 1.296 now implemented as a tokenizer's state, rather than
237     a method, with minimum changes (more changes will
238     be made, in due course). "Bogus comment state"'s inner
239     loop gets removed.
240    
241     2008-09-13 Wakaba <wakaba@suika.fam.cx>
242    
243 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
244 wakaba 1.295 into their own tokenizer states.
245    
246     2008-09-13 Wakaba <wakaba@suika.fam.cx>
247    
248 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
249     is split into three states.
250    
251     2008-09-13 Wakaba <wakaba@suika.fam.cx>
252    
253 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
254     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
255     no longer does the tokenizer have to push back next input
256     characters in those states.
257    
258     2008-09-13 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
261     into four states so that no longer does the tokenizer have to push
262     back next input characters in that state.
263    
264 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
265    
266     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
267     which can be used to insert some wrapper between the character
268     stream handle and the tokenizer. (It is currently not supported
269     for |set_inner_html| for |Element|s).
270    
271 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
272    
273 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
274    
275     2008-09-10 Wakaba <wakaba@suika.fam.cx>
276    
277 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
278    
279     * HTML.pm.src: Don't specify |text| argument for the
280     |chardecode:fallback| error, since it is not the encoding
281     being used alternatively.
282    
283 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
284    
285     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
286    
287 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
288    
289 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
290    
291     2008-08-31 Wakaba <wakaba@suika.fam.cx>
292    
293 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
294     to after after frameset insertion mode processing (HTML5
295     revision 1909). Note that the implementation was wrong
296     per the old spec before the r1909 changes.
297    
298 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
299    
300 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
301     spec (HTML5 revision 2093).
302     ($process_row): Algorithm step numbers synced with the
303     spec (HTML5 revision 2092).
304    
305     2008-08-30 Wakaba <wakaba@suika.fam.cx>
306    
307 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
308     revision 2094).
309    
310     2008-08-30 Wakaba <wakaba@suika.fam.cx>
311    
312     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
313    
314     2008-08-30 Wakaba <wakaba@suika.fam.cx>
315    
316 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
317     name (after another attribute) now raise parse error (HTML5
318     revision 2123). Empty unquoted attribute values are no
319     longer allowed (HTML5 revision 2122).
320    
321     2008-08-30 Wakaba <wakaba@suika.fam.cx>
322    
323 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
324     revision 2130).
325    
326     2008-08-30 Wakaba <wakaba@suika.fam.cx>
327    
328 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
329     as |lang| attribute value for HTML elements (HTML5 revision 2062
330     and so on).
331    
332     2008-08-30 Wakaba <wakaba@suika.fam.cx>
333    
334 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
335     was missing.
336    
337     * URIChecker.pm: The end of the URL should be marked as the
338     error location for an empty path error. The position
339     between the userinfo and the port components should be
340     marked as the error location for an empty host error.
341    
342     2008-08-30 Wakaba <wakaba@suika.fam.cx>
343    
344 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
345     value the error occurs for errors. Report unknown
346     address format error in warning level, since address
347     formats are rarely added. Path segments starting with "/.."
348     were misinterpreted as a dot-segment.
349    
350     2008-08-30 Wakaba <wakaba@suika.fam.cx>
351    
352 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
353     |Message::DOM::DOMImplementation|.
354    
355 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
356    
357     * IMTChecker.pm: Updated for the new error reporting architecture.
358    
359     * ContentChecker.pm: Error levels for IMTs are added.
360    
361 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
362    
363     * H2H.pm (_shift_token): Support for unquoted HTML attribute
364     values.
365    
366 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
367    
368 wakaba 1.275 * CacheManifest.pm: Support for new style of error
369     reports.
370    
371     * HTML.pm.src: Set line=1, column=1 to the document node.
372    
373     2008-08-16 Wakaba <wakaba@suika.fam.cx>
374    
375 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
376     and URL checkers. Support for more error levels for bogus
377     langauge tag and URL "standards".
378    
379     * LangTag.pm, URIChecker.pm: Support for new style error
380     level reporting.
381    
382 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
383    
384 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
385    
386     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
387     specifying. Error types are revised.
388    
389     2008-08-15 Wakaba <wakaba@suika.fam.cx>
390    
391 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
392     renewed.
393    
394     2008-08-15 Wakaba <wakaba@suika.fam.cx>
395    
396 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
397     are revised. Use new style for "level" specification.
398    
399     * mkhtmlparser.pl: Use new style for "level" specification.
400    
401 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
402    
403 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
404     reporting process for broken ignored valuetype definition.
405     (Valuetype idl_text): Support for special "DOMString" name.
406    
407     2008-08-03 Wakaba <wakaba@suika.fam.cx>
408    
409 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
410     terminal of the ScopedName is "DOMString", such that whether
411     the last part of the scoped name is "DOMString" or "_DOMString"
412     later. It is necessary to determine whether a |typedef|
413     definition should be ignored or not.
414     (parse_char_string): Unescape the identifier of
415     exception members.
416     ($resolve): Return undef for builtin types and sequence<T>
417     types (we might not have to do this, however...).
418     (check): Support checking for Exceptions, Valuetypes,
419     and Typedefs.
420     ($serialize_type): Support for "DOMString::::" syntax.
421     (Typedef idl_text): Output Type as "DOMString" if it
422     is really "DOMString" (i.e. its internal representation
423     is "::DOMString::").
424    
425     2008-08-03 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
428     for constant types in the |check| method.
429     (check): Support for checking of attributes, operations, and
430     arguments.
431     (Attribute/Operation idl_text): Exception names in getraises,
432     setraises, and raises clauses is serizlied by |$serialize_type|
433     code.
434    
435 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
438     hexadecimal numbers (the original pattern, taken from the spec,
439     was not work for hexadecimal numbers, because the "0" prefix
440     matches to the [0-7]* part (as an empty string) and therefore
441     it does not match with remaining "x..." part of a "0x..." integer
442     literal.
443     ($get_type): It now returns a string, not an array reference,
444     for regular types and |sequence| types (i.e. it in any case
445     returns a string).
446     ($get_next_token): The second item in the array that represents
447     a integer or float token is now a Perl number value, not the
448     original string representation of the number.
449     (check): Support for const value consistency checking.
450     No extended attribute is defined for constants.
451     (Node subclasses): Use simple strings rather than array references
452     for default data type values.
453     ($serialize_type): Type values are now simple strings.
454     (value): If the new attribute value is a false value, then
455     a FALSE value is set to the attribute.
456    
457     2008-08-02 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
460     in its stringified format ("scoped name" as defined in the
461     spec). Note that future version of this module should not use
462     array references for type values and the |type_text| attribute
463     should be made obsolete.
464     (parse_char_string): Unescape attribute names.
465     (check): Support for checking of whether inherited interfaces
466     are actually defined or not. Support for checking of whether
467     interface member identifiers are duplicated or not.
468     ($serialize_type): Scoped names are returned as is. A future
469     version of this code should escape identifiers other than "DOMString",
470     otherwise the idl_text would be non-conforming.
471    
472     2008-08-02 Wakaba <wakaba@suika.fam.cx>
473    
474 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
475 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
476     for Definition's were ignored.
477     (append_child): Set |parent_node| attribute.
478     (parent_node): New attribute.
479     (check): Support interface/exception members. Support
480     extended attributes. Support definition identifier uniqueness
481     constraint.
482     (qualified_name): New attribute.
483     (Interface/Exception idl_text): Extended attributes were
484     not prepended to the returned text.
485    
486     2008-08-02 Wakaba <wakaba@suika.fam.cx>
487    
488     * WebIDL.pm (parse_char_string): Set line/column numbers
489 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
490     would make things wrong. Support for interface forward
491     declarations was missing. Broken interface declarations
492     with no block were not ignored entirely.
493     (Whatpm::WebIDL::Node): New abstract class. This class
494     makes things easier.
495     (child_nodes): New attribute. Unlike DOM's attribute with
496     same name, this attribute returns a dead list of nodes for
497     simplicity.
498     (get_user_data, set_user_data): New methods.
499     (Module idl_text): A SPACE character should be inserted
500     before the |{| character.
501     (Interface idl_text): Support for interface forward declarations.
502     (is_forward_declaration): New attribute.
503    
504 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
505    
506 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
507    
508     2008-07-19 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
511     it now can handle broken extended attributes and as such.
512    
513     2008-07-19 Wakaba <wakaba@suika.fam.cx>
514    
515 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
516     Support for extended attributes with arguments.
517    
518     2008-07-19 Wakaba <wakaba@suika.fam.cx>
519    
520 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
521     (Interface->idl_text): Tentative support for inheritances.
522    
523     2008-07-19 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
526     Support for raises, setraises, and getraises syntaxes.
527    
528 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
529    
530     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
531     proper support for types, extended attributes, and exceptions yet).
532     WebIDL parser, version 1 (no support for exceptions yet,
533     no proper support for extended attributes yet).
534    
535 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
536    
537     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
538    
539 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
540    
541 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
542    
543     2008-06-29 Wakaba <wakaba@suika.fam.cx>
544    
545 wakaba 1.255 * WebIDL.pm: New module.
546    
547 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
548    
549     * Makefile (Entities.html): URI changed.
550    
551 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
552    
553     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
554    
555 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
556    
557 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
558    
559     2008-06-01 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
562     spec (commented out). (HTML5 revision 1711).
563    
564 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
565    
566 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
567    
568     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
569     as UTF-16 (HTML5 revision 1701).
570    
571     2008-05-25 Wakaba <wakaba@suika.fam.cx>
572    
573 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
574     1692).
575    
576     2008-05-25 Wakaba <wakaba@suika.fam.cx>
577    
578     * HTML.pm.src: The secondary insertion mode used when switching
579     to foreign content is the "in body" insertion mode (HTML5 revision
580     1696).
581    
582     2008-05-25 Wakaba <wakaba@suika.fam.cx>
583    
584 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
585     revision 1697).
586    
587 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
588    
589 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
590     insertion mode (HTML5 revision 1693). Update SVG camelCase
591     attribute list (HTML5 revision 1700). <textarea> closes
592     </select> (HTML5 revision 1699). More start tags close in
593     foreign content insertion mode (HTML5 revision 1698).
594    
595     2008-05-24 Wakaba <wakaba@suika.fam.cx>
596    
597 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
598    
599     2008-05-24 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
602     revision 1674).
603    
604     2008-05-24 Wakaba <wakaba@suika.fam.cx>
605    
606 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
607     revision 1676).
608    
609     2008-05-24 Wakaba <wakaba@suika.fam.cx>
610    
611 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
612     quriks mode detection (HTML5 revision 1679).
613    
614     2008-05-24 Wakaba <wakaba@suika.fam.cx>
615    
616 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
617     revision 1685).
618    
619     2008-05-24 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
622     revision 1684).
623    
624     2008-05-24 Wakaba <wakaba@suika.fam.cx>
625    
626 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
627     work (HTML5 revision 1690).
628    
629     2008-05-24 Wakaba <wakaba@suika.fam.cx>
630    
631 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
632     end tags in head insertion modes (HTML5 revision 1686).
633     (parse_char_string): Bug fix for non-utf8 character string handlings.
634     (parse_char_stream): |ungetc| does not work well for this context.
635    
636 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
637    
638 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
639     |parse_byte_stream|.
640     (parse_byte_stream): New method.
641    
642     2008-05-18 Wakaba <wakaba@suika.fam.cx>
643    
644 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
645     by encoding layer error reporter.
646    
647 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
648    
649 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
650     rather than converting the whole byte string and then parsing.
651     Propagate errors in character encoding layer.
652     (get_next_token): Precise error reporting for |bare stago| error.
653    
654     2008-05-17 Wakaba <wakaba@suika.fam.cx>
655    
656 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
657     (parse_char_string): This method is now defined as an invocation
658     of the |parse_char_stream| method.
659    
660     2008-05-17 Wakaba <wakaba@suika.fam.cx>
661    
662 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
663     of the sniffing as info-level errors. Support for new
664     decoding framework in parser resestting.
665     (new): Various default error levels were not set.
666    
667     2008-05-17 Wakaba <wakaba@suika.fam.cx>
668    
669 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
670     algorithm, except for the actual sniffing, is implemented
671     with new framework with Message::Charset::Info.
672    
673 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
674    
675     * CacheManifest.pm (_parse): Drop fragment identifiers from
676     URIs in fallback section (HTML5 revision 1596).
677    
678 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.231 * Makefile (Entities.html): URI has changed.
681    
682     2008-05-10 Wakaba <wakaba@suika.fam.cx>
683    
684 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
685     1553).
686    
687 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
688    
689 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
690     no longer table elements, but table objects returned
691     by Whatpm::HTMLTable.
692    
693     * HTMLTable.pm (form_table): Return table element node
694     as |$table->{element}|.
695     (assign_header): Support for the |headers=""| attribute.
696    
697     2008-05-06 Wakaba <wakaba@suika.fam.cx>
698    
699 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
700     with no support for headers="".
701     (form_table): Include table width and height to the returned
702     table object for covenience. Indexing in column assignement was wrong.
703     Set whether a data cell is empty or not for convenience.
704    
705 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
706    
707 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
708    
709     2008-05-05 Wakaba <wakaba@suika.fam.cx>
710    
711 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
712     changed (HTML5 revision 1387).
713    
714     2008-05-05 Wakaba <wakaba@suika.fam.cx>
715    
716 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
717     to the end of the table (HTML5 revision 1380).
718    
719     2008-05-05 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
722     of 1-based (HTML5 revision 1376).
723    
724     2008-05-05 Wakaba <wakaba@suika.fam.cx>
725    
726 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
727     and "Content-Encoding" no longer prevent sniffing (HTML5
728     revision 1288).
729    
730     2008-05-05 Wakaba <wakaba@suika.fam.cx>
731    
732 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
733     1282).
734    
735 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
736    
737 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
738     Status of XML specs are added.
739    
740     2008-05-03 Wakaba <wakaba@suika.fam.cx>
741    
742 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
743     and |ref=""| attribute (referent element type checking).
744    
745 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
746    
747 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
748     1500) is implemented (except for IDNA part and URI-scheme-specific
749 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
750     documents.
751    
752 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
755     character (HTML5 revision 1263).
756    
757     2008-04-13 Wakaba <wakaba@suika.fam.cx>
758    
759 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
760     definition for |AElig;| was missing).
761    
762     2008-04-13 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
765     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
766    
767     2008-04-13 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
770     insertion mode is added (HTML5 revisions 1412 and 1418).
771    
772     2008-04-13 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
775     elements (HTML5 revision 1410).
776    
777     2008-04-13 Wakaba <wakaba@suika.fam.cx>
778    
779 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
780     revision 1406).
781    
782     2008-04-13 Wakaba <wakaba@suika.fam.cx>
783    
784 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
785     elements (HTML5 revision 1404 and 1420).
786    
787 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
788    
789 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
790     elements (HTML5 revision 1404). Unused !!!macro definitions
791     are removed.
792    
793     2008-04-12 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
796     are raised is changed (HTML5 revision 1404).
797    
798 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
799    
800 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
801    
802     2008-04-06 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
805     replaced by constants representing element category.
806    
807     2008-04-06 Wakaba <wakaba@suika.fam.cx>
808    
809 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
810     for error reporting. (This is a preparation for using constant
811     value for ->[1].)
812    
813 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
814    
815 wakaba 1.205 * RDFXML.pm: Typo fixed.
816    
817     2008-03-22 Wakaba <wakaba@suika.fam.cx>
818    
819 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
820     as same as |must_level|, i.e. level = |m|.
821     (check_element): Make list of URIs in the DOM.
822    
823 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
826     Local (null-namespace) attribute support.
827    
828     2008-03-21 Wakaba <wakaba@suika.fam.cx>
829    
830 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
831    
832     2008-03-21 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
835     and URI attributes is implemented. Warn if unknown
836     value is used in rdf:parseType="" attribute.
837    
838     * URIChecker.pm (check_rdf_uri_reference): New function.
839    
840     2008-03-21 Wakaba <wakaba@suika.fam.cx>
841    
842 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
843     are now resolved.
844    
845     2008-03-21 Wakaba <wakaba@suika.fam.cx>
846    
847 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
848    
849     * RDFXML.pm: undef vs false bug fixed.
850     Reification implemented.
851    
852     2008-03-21 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
855     The |node| arguments are added for |ontriple| calls.
856     Too many "attribute not allowed" errors were raised.
857    
858     * ContentChecker.pm: Initial experimental support for rdf:RDF
859     element.
860    
861     2008-03-21 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.197 * RDFXML.pm: New module.
864    
865 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
866    
867 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
868     code was old one yet.
869    
870     2008-03-20 Wakaba <wakaba@suika.fam.cx>
871    
872 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
873     errors. Line/column markings for DOCTYPE, comment, and
874     character tokens are reintroduced; otherwise,
875     error location for "not HTML5" error and errors
876     for implied elements are not attached.
877    
878     2008-03-20 Wakaba <wakaba@suika.fam.cx>
879    
880 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
881    
882     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
883     (value): Setter implemented.
884    
885     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
886    
887     2008-03-20 Wakaba <wakaba@suika.fam.cx>
888    
889 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
890    
891     2008-03-20 Wakaba <wakaba@suika.fam.cx>
892    
893 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
894     flag, which is no longer used.
895    
896 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
897    
898 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
899    
900     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
901     and !!!insert-element-t): Set line/column information to
902     element nodes.
903    
904     2008-03-17 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
907     in "<?", "<>", or "</>" should be the error point.
908    
909 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
912    
913     2008-03-16 Wakaba <wakaba@suika.fam.cx>
914    
915 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
916    
917     2008-03-16 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
920     reporting.
921    
922 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
923    
924 wakaba 1.186 * HTML.pm.src: Error type revised.
925    
926     2008-03-11 Wakaba <wakaba@suika.fam.cx>
927    
928 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
929    
930     2008-03-11 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
933    
934 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
935    
936     * mkhtmlparser.pl: Set "level" parameter to parse errors.
937    
938     * HTML.pm.src: Code refined.
939    
940 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
941    
942 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
943     revision 1348). Note that I really don't know this makes
944     any difference in the black-box behavior of the parser.
945    
946     2008-03-09 Wakaba <wakaba@suika.fam.cx>
947    
948 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
949     revision 1348).
950    
951     2008-03-09 Wakaba <wakaba@suika.fam.cx>
952    
953 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
954    
955     2008-03-09 Wakaba <wakaba@suika.fam.cx>
956    
957 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
958    
959     2008-03-09 Wakaba <wakaba@suika.fam.cx>
960    
961 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
962     mode and support for the "in select in table" insertion mode (HTML5
963     revision 1342).
964    
965     2008-03-09 Wakaba <wakaba@suika.fam.cx>
966    
967 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
968     in non-tainted <table>s (HTML5 revision 1336).
969    
970     2008-03-09 Wakaba <wakaba@suika.fam.cx>
971    
972 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
973     and <head> (HTML5 revision 1332).
974    
975     2008-03-09 Wakaba <wakaba@suika.fam.cx>
976    
977 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
978     white space (HTML5 revision 1331).
979    
980 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
983     element (HTML5 revision 1330).
984    
985     2008-03-08 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
988     element (HTML5 revision 1328).
989    
990     2008-03-08 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
993     into foster parents (HTML5 revision 1326).
994    
995     2008-03-08 Wakaba <wakaba@suika.fam.cx>
996    
997 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
998     revision 1321).
999    
1000     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1001    
1002 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1003     the spec (HTML5 revision 1320). Note that we finally complete
1004     all of HTML5 revision 1320 changes.
1005    
1006     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1009     tags (HTML5 revision 1320).
1010    
1011     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1012    
1013 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1014    
1015     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1016    
1017 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1018     could not be reached so that it is now removed (HTML5 revision
1019     1320).
1020    
1021     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1024     to align with new spec text (HTML5 revision 1320).
1025    
1026     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1027    
1028 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1029     have had no harm in theory.
1030    
1031     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1032    
1033 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1034     case is split from other end tags for the preparation
1035     of implementing HTML revision 1320.
1036    
1037 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1038    
1039 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1040     revision 1320).
1041    
1042     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1043    
1044 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1045     and "div" (HTML5 revision 1318). Add comments to where
1046     |form| pointer association codes should be inserted (HTML5 revision
1047     1319).
1048    
1049 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1050    
1051     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1052    
1053 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1054    
1055 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1056     other token type, there are), it is replaced by a |die| statement.
1057    
1058     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1059    
1060 wakaba 1.159 * HTML.pm.src: Typo fixed.
1061    
1062 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1063    
1064     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1065     doctypes were not uppercased for comparison.
1066    
1067 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1068    
1069 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1070     are added.
1071    
1072     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1073    
1074 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1075     are set. Cases that are unlikely reached are noted as so.
1076    
1077     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1078    
1079 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1080    
1081     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1082    
1083     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1084    
1085 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1086     for simplicity.
1087    
1088     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1089    
1090 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1091     revision 1307).
1092    
1093     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1094    
1095 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1096     is set are changed (HTML5 revision 1305).
1097    
1098 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1099    
1100 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1101     1292). Entities are not parsed in comment-like part in RCDATA
1102     elements (HTML5 revision 1294). Allow bare & at the end
1103     of attribute value literals (HTML5 revision 1296). More
1104     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1105     between attributes and ban attribute names or unquoted
1106     attribute values containing single or double quotes (HTML5
1107     revision 1303).
1108    
1109     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1112     and related errors unless it is an HTML document (though the spec
1113     is unclear on whether it is applied to XHTML document).
1114    
1115     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1116     are added.
1117    
1118     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1121    
1122 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1123    
1124     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1125    
1126     * HTML.pm.src: |charset| in |content| attribute is
1127     case-insensitive (HTML5 revision 1270).
1128    
1129 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1130    
1131     * ContentChecker.pm: New status constants are added.
1132     ($ElementDefault): |status| added.
1133     (check_element): Err for non-standard or deprecated elements.
1134     (_attr_status_info): For non-standard or deprecated attributes.
1135    
1136 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1137    
1138 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1139    
1140     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1141    
1142 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1143     status information is now dispatched.
1144    
1145     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1146    
1147 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1148     problem by adding some more arguments.
1149    
1150     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1151    
1152 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1153     container-for-the-purpose-of-content-model element (not transparent
1154     element) for |check_child_element| calling and significant
1155     text flag marking. This reintroduces |<del>|-and-significant
1156     problem again.
1157    
1158     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1159    
1160 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1161     elements ignored for the purpose of phase changes in
1162     content model checking.
1163    
1164 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1167     was not properly managed for transparent cases.
1168    
1169     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1172     and |audio| as semi-transparent elements.
1173    
1174     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1175    
1176 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1177     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1178     Support for |html:object| as a semi-transparent element.
1179    
1180     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1181    
1182 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1183     the tree is entirely revised to make it easier to track
1184     the state of ancestors/descendants. As a result of this
1185     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1186     support for content model checking for HTML elements |figure|,
1187     |object|, |video|, and |audio| and checking for XML elements (and
1188     some XMLNS checkings) are dropped for now. They will be
1189     reimplemented in due cource.
1190    
1191 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1194     token (HTML5 revision 1225).
1195    
1196     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1197    
1198 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1199    
1200     * HTML.pm.src: Save whether |meta| |content| attribute
1201     contains character references or not.
1202    
1203     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1204    
1205 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1206     imlement significant content checking for contents
1207     with |del| elements.
1208    
1209     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1210    
1211 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1212     elements are added.
1213     (_get_children): HTML |object| elements are now semi-transparent.
1214    
1215     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1216    
1217 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1218    
1219     * CacheManifest.pm: HTML5 revision 1211 implemented.
1220    
1221     * CacheManifest.pod: Updated.
1222    
1223 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1224    
1225     * ContentChecker.pm (check_document, check_element): Support
1226     for second argument ($onsubdoc).
1227     (_get_css_parser): Removed (now it is part of WDCC).
1228    
1229 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1230    
1231     * ContentChecker.pm (_get_css_parser): New.
1232    
1233 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1234    
1235 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1236     for checking had been used.
1237    
1238     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1239    
1240 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1241    
1242 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1245     specced in RFC 2046 and RFC 4288.
1246     (application/atom+xml): Definition added.
1247    
1248     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1249    
1250 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1251     warnings (rather than SHOULD-level errors).
1252    
1253     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1254    
1255 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1256    
1257     * HTML.pm.src: A flag for character references in attribute
1258     values are added. Set |manakai_has_reference| user data
1259     to |charset| attribute.
1260    
1261     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1262    
1263 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1264     attributes.
1265    
1266     * ContentChecker.pm (check_document): Warn if charset requirements
1267     cannot be tested.
1268    
1269 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1270    
1271     * HTML.pm.src (parse_byte_string): Detect charset
1272     by universalchardet if charset parameter is not specified.
1273    
1274     * Makefile (Charset-all, Charset-clean): New rules.
1275    
1276 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1279     of character encoding declaration and selection of encoding
1280     for HTML document.
1281    
1282     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1283    
1284 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1285     type in list context.
1286    
1287     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1288    
1289 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1290     ignoring (HTML5 revisions 1013 and 1016).
1291    
1292     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1295     white spaces, are implemented.
1296    
1297 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1298    
1299 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1300     attribute if possible.
1301    
1302     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1303    
1304 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1305     (parse_char_string): New alias for |parse_string|.
1306     (main phase): Invoking "change the encoding" algorithm if desired.
1307    
1308     * HTML.pod: Updated.
1309    
1310     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1311    
1312 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1313    
1314     * Makefile (HTML-all, HTML-clean): New.
1315    
1316     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1317    
1318     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1319    
1320 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1321    
1322     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1323    
1324 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1325    
1326 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1327    
1328     * CacheManifest.pod: New file.
1329    
1330     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1331    
1332 wakaba 1.114 * CacheManifest.pm: New module.
1333    
1334     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1337     callback.
1338    
1339     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1340    
1341 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1342    
1343 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1344    
1345 wakaba 1.111 * Makefile (clean): New rule.
1346    
1347     * NanoDOM.pm (public_id, system_id): New attributes.
1348    
1349     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1350    
1351 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1352    
1353 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1354    
1355     * ContentChecker.pm (check_document): Support for
1356     new |is_xml_root| flag.
1357     (check_element): Support for new |pluses| state.
1358     (_add_pluses): New method.
1359     (_remove_minuses): Support for new |minus| item.
1360    
1361 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1364     root element.
1365    
1366     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1369    
1370 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1371    
1372 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1373    
1374     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1375    
1376 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1377    
1378     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1379    
1380 wakaba 1.103 * LangTag.pm: New module.
1381    
1382 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1383    
1384     * HTML.pm.src: Some error types were wrong.
1385    
1386 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1387    
1388 wakaba 1.101 * CSS/: New directory.
1389    
1390     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1391    
1392 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1393     for |noscript| in |head|.
1394    
1395 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1396    
1397     * URI/: New directory.
1398    
1399 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1400    
1401 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1402     number.
1403    
1404     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1405    
1406 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1407     by |&|s.
1408    
1409     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1410    
1411 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1412    
1413     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1414    
1415 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1416    
1417     * HTML.pm.src: Insertion modes are now represented in number.
1418    
1419     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1420    
1421 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1422     999) is implemented.
1423    
1424 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1425    
1426     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1427     983) is implemented.
1428    
1429 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1430    
1431     * ContentChecker.pod: New documentation.
1432    
1433     * Makefile: A rule for |ContentChecker.html| is added.
1434    
1435     * ContentChecker.pm: A pod "LICENSE" section is added.
1436    
1437     * NanoDOM.pm ($VERSION): New variable.
1438    
1439 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1442     list of allowed HTML elements.
1443    
1444     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1445    
1446 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1447     HTML elements.
1448    
1449     * URIChecker.pm (check_iri): New.
1450     (check_iri_reference): Error type for IRI reference
1451     syntax error is changed.
1452    
1453 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1454    
1455 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1456     is added.
1457     (check_document): Load appropriate module before validation.
1458    
1459     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1460    
1461 wakaba 1.88 * ContentChecker/: New directory.
1462    
1463     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1464    
1465 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1466    
1467     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1468    
1469 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1470    
1471 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1472     even if no document element is found.
1473    
1474     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1475    
1476 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1477    
1478     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1481    
1482     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1483    
1484 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1485     is split into "after html body" and "after html frameset"
1486     insertion modes. Their codes are merged with "after body"
1487     and "after frameset" codes. |$previous_insertion_mode|
1488     has been removed. "after frameset" code is
1489     merged with "in frameset" code.
1490    
1491     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492    
1493 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1494     merged with the "in head" insertion mode.
1495    
1496     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1497    
1498 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1499     reorganized.
1500    
1501     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1502    
1503 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1504     are merged.
1505    
1506     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1507    
1508 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1509     with "in table" insertion mode.
1510    
1511     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1512    
1513 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1514     modes are merged.
1515    
1516     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1517    
1518 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1519     insertion mode!
1520    
1521     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1522    
1523 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1524     now directly invoke the handler.
1525    
1526     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1529     is merged to the "in body" insertion mode code.
1530    
1531     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1532    
1533 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1534     insertion modes are merged.
1535    
1536     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1537    
1538 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1539    
1540     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1541    
1542 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1543     instead of string constant for |{content_model_flag}|.
1544    
1545     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1546    
1547 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1548     to the "duplicate attribute" error.
1549    
1550 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1551    
1552 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1553    
1554     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1555    
1556 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1557    
1558     * HTML.pm.src: A typo is fixed.
1559    
1560 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1561    
1562 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1563     from "in HTML:xml:lang" error. Character position
1564     is now the last part of the error type in the URI error
1565     description. Report "unsupported" status for language
1566     tags, media queries, script codes, and style sheets.
1567    
1568     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1569    
1570 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1571     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1572     in XML.
1573    
1574     * NanoDOM.pm (Attr.owner_document): New attribute.
1575    
1576     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.66 * HTML.pm.src: The character immediately following
1579     a bare |hcro| was discarded. Fix handling of
1580     entity references in attribute values.
1581    
1582     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1583    
1584 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1585     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1586     factored out. Error types |in html:#DOCTYPE|
1587     and |after html:#DOCTYPE| are merged
1588     into |DOCTYPE in the middle|. |</frameset>|
1589     in fragment parsing mode changed the
1590     insertion mode.
1591    
1592     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1593    
1594 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1595     phase is now an insertion mode. Treatments for white
1596     space character tokens were incorrect for some
1597     insertion modes. An old |meta| case was not removed.
1598    
1599     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1600    
1601 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1602     implemented (but changing the encoding is not yet:-).
1603    
1604 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1605    
1606 wakaba 1.62 * Charset/: New directory.
1607    
1608     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.61 * H2H.pm: New Perl module (created from
1611     manakai's H2H.dis).
1612    
1613     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1616     manakai's SimpleLS.dis).
1617    
1618 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1619    
1620     * HTML.pm.src: |<!---x-->| was not processed correctly.
1621    
1622 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1623    
1624 wakaba 1.58 * HTML.pm.src: Report correct error message
1625     for |<body></div></body>|.
1626    
1627     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1628    
1629 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1630    
1631     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1632    
1633     * ContentChecker.pm: Return |{term}| list.
1634    
1635 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1636    
1637 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1638     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1639     and <title> outside of head).
1640    
1641     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1642    
1643 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1644     and private types/subtypes.
1645    
1646     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1647     URIChecker.pm, HTMLTable.pm: Error messages are now
1648     consistent; they are all listed in
1649     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1650    
1651 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1652    
1653 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1654     is now erred. |<datalist>| is implemented.
1655     Attribute checker for |<command>| and |<menu>| are
1656     added. Support for |contextmenu| global attribute
1657     is added.
1658    
1659     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1660    
1661 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1662     of Step 3 has been changed.
1663    
1664     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1665    
1666 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1667    
1668 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1669    
1670 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1671     incorrect.
1672    
1673     * ContentChecker.pm: Support for the |footer| element.
1674     Check URI syntax for space-separated URI attributes.
1675     Support for the |tabindex| attribute. Support
1676     for |datetime| attribute.
1677    
1678     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1679    
1680 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1681     character references). HTML5 revision 1.146 (white space
1682     characters before root start tag). HTML5 revision
1683     1.148 (named character references in attribute values).
1684     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1685    
1686     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1687    
1688     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1689 wakaba 1.49 in <head>).
1690    
1691     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1692    
1693 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1694     in body).
1695    
1696     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1697    
1698     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1699    
1700 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1701     or RCDATA in fragment parsing mode). HTML5 revision
1702     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1703    
1704     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1705    
1706     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1707     for <pre> and <textarea>).
1708    
1709 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1710    
1711 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1712    
1713     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1716    
1717     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1718    
1719 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1720     HTML5 revision 917 (conforming bare &).
1721    
1722     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1723    
1724 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1725    
1726     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1727     HTML5 revision 915 (<nobr>).
1728    
1729     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1730    
1731 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1732    
1733     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1734     New attributes.
1735    
1736     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1737    
1738 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1739     in tags).
1740    
1741     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1742    
1743 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1744    
1745     * HTML.pm.src: |$entity_char| is removed and
1746     requires |Whatpm::_NamedEntityList| instead.
1747     HTML5 revision 898 (refc), except that lack of refc
1748     is parse error.
1749    
1750     * mkentitylist.pl: New script.
1751    
1752     * Makefile (all): |_NamedEntityList.pm| is added.
1753     (_NamedEntityList.pm, Entities.html): New rules.
1754    
1755     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1756    
1757 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1758     were ignored and U+000D immediately following another
1759     U+000D was not converted to U+000A.
1760    
1761     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1762    
1763 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1764     nodes before appended). Parser was not ready for NULL
1765     parse error and escape flag.
1766    
1767     * NanoDOM.pm (adopt_node): New.
1768    
1769     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1770    
1771 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1772     CDATA and RCDATA). Note that current implementation
1773     is simply repeating what the spec says and it is maybe not
1774     a best way to do it.
1775    
1776     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1777    
1778 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1779     the form element if a descendant element without implied end tag has
1780     still been open).
1781    
1782     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1783    
1784 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1785     space characters non-conforming).
1786    
1787     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1788    
1789 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1790     HTML5 revision 869 (C1 character references).
1791    
1792     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1793    
1794 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1795     a |textarea| is removed).
1796    
1797 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1798    
1799     * NanoDOM.pm (get_attribute_node_ns): New method.
1800    
1801     * ContentChecker.pm: |script| |async| and |defer|
1802     no longer require |src|. |async| MUST NOT be
1803     specified if |defer|. (HTML5 revision 858).
1804    
1805 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1806    
1807     * HTML.pm.src: |<form><form>| went to inifinite loop.
1808    
1809 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1810    
1811 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1812     as a document element) flag on.
1813     (new): Removed.
1814     (check_document): New method.
1815    
1816     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1817    
1818 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1819     were incorrect.
1820    
1821     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1822    
1823 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1824     error position in the |type| option of the error.
1825    
1826     * HTMLTable.pm (form_table): The |$onerror| parameter
1827     is now optional. Some bugs are fixed.
1828    
1829     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.27 * HTMLTable.pm: New module.
1832    
1833     * ContentChecker.pm (table): Invoke table model error checker.
1834    
1835     * NanoDOM.pm (first_child, get_attribute_ns): New.
1836    
1837 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1838    
1839 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1840     (link/@rel, a/@rel, area/@rel): Use new checker.
1841    
1842     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1843    
1844     * _LinkTypeList.pm: New file.
1845    
1846     * mklinktypelist.pl: New file.
1847    
1848     * .cvsignore: |RelExtensions.html| added.
1849    
1850     * NanoDOM.pm (child_nodes): Returns an empty array
1851     for non-child-containing node types.
1852     (text_content): New attribute.
1853    
1854     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1855    
1856 wakaba 1.25 * IMTChecker.pm: New module.
1857    
1858     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1859     to test parameter value validity.
1860    
1861     * HTML.pm.src ($style_start_tag): Attributes were
1862     discarded.
1863    
1864 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1865    
1866 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1867    
1868     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1869    
1870 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1871     and RFC 3987 are listed (not all testable items are checked yet).
1872    
1873     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1874    
1875 wakaba 1.22 * URIChecker.pm: New module.
1876    
1877 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1878    
1879 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1880     or associated to some placeholder.
1881     ($ElementDefault): Warn unknown attributes
1882     for unknown elements as "attribute not supported".
1883     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1884     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1885     (|a| attribute checker): Reimplemented.
1886    
1887     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1888    
1889 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1890     ($HTMLAttrChecker): Event handler content attributes
1891     are added.
1892     (link, embed): Required attribute is now checked.
1893     (embed): Unknown local attributes are no longer warned.
1894    
1895     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1896    
1897 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1898     ($HTMLIMTAttrChecker): New checker.
1899     (link@rel, link@href, link@type, style@type,
1900     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1901     object@data, object@type, source@src, source@type, area@alt,
1902     area@shape, area@coords, area@href,
1903     area@ping, area@rel, area@type, script@src,
1904     script@defer, script@async, script@type): Checkers added.
1905    
1906     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1907    
1908 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1909    
1910 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1911    
1912 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1913     attributes. Report an error if |Element.prefix|
1914     is |xmlns|.
1915    
1916     * NanoDOM.pm (prefix): New attribute.
1917    
1918     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1919    
1920 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1921     mode, action for |<iframe>| was missing.
1922    
1923     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1924    
1925 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1926     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1927    
1928     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1929    
1930 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1931     ($HTMLURIAttrChecker): Placeholder.
1932     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1933     New.
1934    
1935     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1936    
1937 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1938     for global attributes, |html|, |base|, |style|, and |meta|.
1939    
1940     * NanoDOM.pm (insert_before): Weaken reference
1941     to the parent node.
1942     (Attr::new): Set |owner_element| attribute.
1943     (namespace_uri, manakai_local_name): New attribute implementations.
1944     (owner_element): New attribute.
1945    
1946     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1947    
1948 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1949     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1950     $Element->{$HTML_NS}->{''}): New.
1951     (check_element): Invoke attrs_checker for each element.
1952    
1953 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1954    
1955 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1956    
1957     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1958    
1959 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1960     element type testings.
1961    
1962     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1963    
1964 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1965     error when an element type is put in the "minus" list
1966     and the element type is not allowed explicitly in the particular
1967     element content model.
1968     (html:a checker): New checker.
1969     (html:details, html:datagrid): New checkers.
1970     (html:legend): New checker.
1971    
1972     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1973    
1974 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1975    
1976     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1977    
1978 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1979     checker.
1980     (html:dd checker): New checker.
1981     (html:q, html:em, html:strong, html:small,
1982     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1983    
1984     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1985    
1986 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1987     from |$ElementDefault->{checker}|.
1988     ($ElementDefault->{checker}): Throw an error that
1989     the element type is not supported by the checker.
1990     ($HTMLMetadataElement): |html:base| was missing.
1991     ($HTMLEmptyChecker): Don't throw an error
1992     for inter-element whitespace nodes.
1993     (html:html checker): Errors were not
1994     thrown even if |html:head| and/or |html:body|
1995     children were missing.
1996     (html:head checker): An error was not
1997     thrown if <meta charset> appered after other
1998     elements.
1999    
2000 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2001    
2002     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2003     and |noscript| elements are implemented.
2004     (new): New method.
2005    
2006 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2007    
2008 wakaba 1.4 * ContentChecker.pm: New module.
2009    
2010     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2011    
2012 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2013     document_element, manakai_local_name,
2014     manakai_element_type_match): New method.
2015    
2016     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2017    
2018     * HTML.pm.src: Replace decimal and hexadecimal numeric
2019     entities in C1 range using Windows-1252 mapping. Bare LF
2020     did not count as new line for error reporting.
2021    
2022 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2023    
2024     * NanoDOM.pm (DOMImplementation): New class.
2025     (append_child): Weaken the |parent_node| reference.
2026     (create_element_ns, Element new): Set the |owner_document|
2027     reference.
2028     (implementation): New attribute.
2029     (owner_document, local_name, namespace_uri): New attributes.
2030    
2031     * HTML.pm.src (parse_string): Line and column numbers
2032     are now provided to error handler.
2033     (!!!parse-error): Short descriptions are added.
2034     (_construct_tree): Split into three methods; support
2035     for innerHTML mode.
2036     (set_inner_html): New method.
2037    
2038 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2039    
2040     * NanoDOM.html: Documentation is added.
2041    
2042     * HTML.pod, ContentType.html: Documentation is revised.
2043    
2044     * .cvsignore: Pod2html temporary files are added.
2045    
2046     * Makefile: Make |NanoDOM.html|.
2047    
2048     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2049    
2050     * HTML.pm.src (parse_string): New method.
2051     (get_inner_html): Renamed from |inner_html|.
2052    
2053     * Makefile: A rule for |HTML.html| is added.
2054    
2055     * HTML.pod: New documentation.
2056    
2057     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2058    
2059     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2060     (clone_node): Attribute nodes were not completely copied.
2061    
2062     * HTML.pm.src: Many bugs are fixed.
2063    
2064     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2065    
2066     * HTML.pm.src: Some typos are fixed.
2067    
2068     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2069    
2070     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2071     are removed.
2072    
2073     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2074    
2075     * HTML-consume-entity.src: Removed.
2076    
2077     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2078    
2079     * HTML.pm.src: Tokenizer's handling on hexadecimal
2080     numeric entities are rewritten.
2081    
2082     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2083    
2084     * HTML.pm.src: Some tokenizer bugs are fixed.
2085    
2086     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2087    
2088     * HTML.pm.src: The tree construction stage is implemented.
2089    
2090     * mkhtmlparser.pl: New macros are added.
2091    
2092     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2093    
2094     * ContentType.pm: A note on bug in the specification
2095     is removed since it's been now fixed.
2096    
2097     * .cvsignore: New file.
2098    
2099     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2100    
2101     * HTML.pm.src, HTML-consume-entity.src: New files.
2102    
2103     * Makefile (HTML.pm): New rule.
2104    
2105     * mkhtmlparser.pl: New script.
2106    
2107     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2108    
2109     * Makefile: New file.
2110    
2111     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2112    
2113     * ContentType.pm: An error in pod is fixed.
2114    
2115     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2116    
2117     * ContentType.pm: Documentation is added.
2118    
2119     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2120    
2121     * ContentType.pm: New Perl module.
2122    
2123     * ChangeLog: New file.
2124    
2125    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24