/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.322 - (hide annotations) (download)
Thu Sep 18 07:42:57 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.321: +5 -0 lines
++ whatpm/t/ChangeLog	18 Sep 2008 07:42:52 -0000
	* LangTag.t: Completely rewritten using |testfiles.pl|.

	* testfiles.pl: New script.

	* langtag-1.dat: Tests for case are added.

2008-09-18  Wakaba  <wakaba@suika.fam.cx>


	* langtag-1.dat: New test data file.
++ whatpm/Whatpm/ChangeLog	18 Sep 2008 07:41:35 -0000
	* LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
	recommended case as per RFC 4646.

2008-09-18  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
4     recommended case as per RFC 4646.
5    
6     2008-09-18 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
9    
10     2008-09-18 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.320 * mklangreg.pl: New script.
13    
14     * Makefile: Updated for creation of the module for language subtag
15     registry.
16    
17 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.319 * Makefile: WebIDL.html added.
20    
21     * WebIDL.pod: New documentation.
22    
23     2008-09-16 Wakaba <wakaba@suika.fam.cx>
24    
25 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
26    
27     2008-09-16 Wakaba <wakaba@suika.fam.cx>
28    
29 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
30     compatible parsing bugs are fixed. Some unreachable codes are
31     commented out.
32    
33     2008-09-16 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
36     added. It does not satisfy the definition that a forward
37     interface declaration has an extended attribute. It seems that
38     unless explicitly allowed multiple extended attributes with the
39     same name is not allowed, though it is not explicitly mentioned in
40     the spec.
41    
42     2008-09-16 Wakaba <wakaba@suika.fam.cx>
43    
44 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
45     attribute identifiers. Preserve whether an extended attribute has
46     an argument list of not. Support for extended attributes:
47     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
48     NameGetter, NameSetter, and Null.
49     (has_argument_list): New attribute.
50     (idl_text): Stringifies argument lists, if any, even if it is
51     empty.
52    
53 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
54    
55 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
56     |{s_kwd}| in DATA_STATE as default.
57    
58     2008-09-15 Wakaba <wakaba@suika.fam.cx>
59    
60 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
61     by |{s_kwd}| in DATA_STATE.
62    
63     2008-09-15 Wakaba <wakaba@suika.fam.cx>
64    
65 wakaba 1.312 * HTML.pm.src: Shorten keys.
66    
67     2008-09-15 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
70     pair, or noncharacter code points and non-Unicode code
71     points (they should be handled by Whatpm::Charset::UnicodeChecker).
72     (parse_char_stream): Support for the |$get_wrapper| argument and
73     character stream error handlers.
74    
75     2008-09-15 Wakaba <wakaba@suika.fam.cx>
76    
77 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
78     for null-namespace elements/attributes.
79    
80     * HTML.pm.src: Fact out $disallowed_control_chars
81     as a hash.
82    
83 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
84    
85 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
86     and |{next_char}| initializations are moved to initialization
87     method. |{read_until}| now supports buffering. Sync |set_inner_html|
88     with |parse_char_stream|.
89    
90     2008-09-14 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
93     invoke |manakai_read_until|, not only |read|, where
94     possible, to decrease the number of |read| method calls.
95    
96     * mkhtmlparser.pl: Related changes to the aforementioned
97     modification.
98    
99     2008-09-14 Wakaba <wakaba@suika.fam.cx>
100    
101 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
102     would report character error from now.
103    
104     2008-09-14 Wakaba <wakaba@suika.fam.cx>
105    
106 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
107     tokens in "before head insertion mode" was not
108     correctly handled.
109     (set_inner_html): Reimplemented using CharString decodehandle
110     class. Support for $get_wrapper argument. Support
111     for |{read_until}| feature.
112    
113     2008-09-14 Wakaba <wakaba@suika.fam.cx>
114    
115 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
116     entities point the "&" character.
117    
118     2008-09-14 Wakaba <wakaba@suika.fam.cx>
119    
120 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
121     be added to the list of excluded characters.
122    
123     2008-09-14 Wakaba <wakaba@suika.fam.cx>
124    
125 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
126     and |column| a higher priority than the one set by the
127     tokenizer's input handler.
128     ($self->{read_until}): Exclude U+FFFD (but this might
129     not be necessary, since now we do line/column fixup in
130     the character decode handle).
131    
132     2008-09-14 Wakaba <wakaba@suika.fam.cx>
133    
134 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
135    
136     2008-09-14 Wakaba <wakaba@suika.fam.cx>
137    
138 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
139     and |manakai_getc_until| to |manakai_read_until| to
140     reduce the number of string copies.
141    
142     2008-09-14 Wakaba <wakaba@suika.fam.cx>
143    
144 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
145     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
146     standard feature to |open| a string as a filehandle,
147     since Perl's string filehandle seems not supporting |ungetc|
148     method correctly.
149     (parse_char_stream): Define |{getc_until}| method.
150     (DATA_STATE): Experimental support for |getc_until| feature.
151    
152 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
155    
156     2008-09-13 Wakaba <wakaba@suika.fam.cx>
157    
158 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
159     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
160     and replaced by |{prev_state}|.
161    
162     * mkhtmlparser.pl: Remove |{char}| feature.
163     Remove |!!!back-next-input-character;| macro.
164    
165     2008-09-13 Wakaba <wakaba@suika.fam.cx>
166    
167 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
168     entity related tokenizer states in favor of new states
169     implementing the consume character reference algorithm.
170    
171     2008-09-13 Wakaba <wakaba@suika.fam.cx>
172    
173     * HTML.pm.src: "Consume a character reference" algorithm is
174 wakaba 1.296 now implemented as a tokenizer's state, rather than
175     a method, with minimum changes (more changes will
176     be made, in due course). "Bogus comment state"'s inner
177     loop gets removed.
178    
179     2008-09-13 Wakaba <wakaba@suika.fam.cx>
180    
181 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
182 wakaba 1.295 into their own tokenizer states.
183    
184     2008-09-13 Wakaba <wakaba@suika.fam.cx>
185    
186 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
187     is split into three states.
188    
189     2008-09-13 Wakaba <wakaba@suika.fam.cx>
190    
191 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
192     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
193     no longer does the tokenizer have to push back next input
194     characters in those states.
195    
196     2008-09-13 Wakaba <wakaba@suika.fam.cx>
197    
198 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
199     into four states so that no longer does the tokenizer have to push
200     back next input characters in that state.
201    
202 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
203    
204     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
205     which can be used to insert some wrapper between the character
206     stream handle and the tokenizer. (It is currently not supported
207     for |set_inner_html| for |Element|s).
208    
209 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
210    
211 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
212    
213     2008-09-10 Wakaba <wakaba@suika.fam.cx>
214    
215 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
216    
217     * HTML.pm.src: Don't specify |text| argument for the
218     |chardecode:fallback| error, since it is not the encoding
219     being used alternatively.
220    
221 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
222    
223     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
224    
225 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
226    
227 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
228    
229     2008-08-31 Wakaba <wakaba@suika.fam.cx>
230    
231 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
232     to after after frameset insertion mode processing (HTML5
233     revision 1909). Note that the implementation was wrong
234     per the old spec before the r1909 changes.
235    
236 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
237    
238 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
239     spec (HTML5 revision 2093).
240     ($process_row): Algorithm step numbers synced with the
241     spec (HTML5 revision 2092).
242    
243     2008-08-30 Wakaba <wakaba@suika.fam.cx>
244    
245 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
246     revision 2094).
247    
248     2008-08-30 Wakaba <wakaba@suika.fam.cx>
249    
250     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
251    
252     2008-08-30 Wakaba <wakaba@suika.fam.cx>
253    
254 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
255     name (after another attribute) now raise parse error (HTML5
256     revision 2123). Empty unquoted attribute values are no
257     longer allowed (HTML5 revision 2122).
258    
259     2008-08-30 Wakaba <wakaba@suika.fam.cx>
260    
261 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
262     revision 2130).
263    
264     2008-08-30 Wakaba <wakaba@suika.fam.cx>
265    
266 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
267     as |lang| attribute value for HTML elements (HTML5 revision 2062
268     and so on).
269    
270     2008-08-30 Wakaba <wakaba@suika.fam.cx>
271    
272 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
273     was missing.
274    
275     * URIChecker.pm: The end of the URL should be marked as the
276     error location for an empty path error. The position
277     between the userinfo and the port components should be
278     marked as the error location for an empty host error.
279    
280     2008-08-30 Wakaba <wakaba@suika.fam.cx>
281    
282 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
283     value the error occurs for errors. Report unknown
284     address format error in warning level, since address
285     formats are rarely added. Path segments starting with "/.."
286     were misinterpreted as a dot-segment.
287    
288     2008-08-30 Wakaba <wakaba@suika.fam.cx>
289    
290 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
291     |Message::DOM::DOMImplementation|.
292    
293 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
294    
295     * IMTChecker.pm: Updated for the new error reporting architecture.
296    
297     * ContentChecker.pm: Error levels for IMTs are added.
298    
299 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
300    
301     * H2H.pm (_shift_token): Support for unquoted HTML attribute
302     values.
303    
304 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
305    
306 wakaba 1.275 * CacheManifest.pm: Support for new style of error
307     reports.
308    
309     * HTML.pm.src: Set line=1, column=1 to the document node.
310    
311     2008-08-16 Wakaba <wakaba@suika.fam.cx>
312    
313 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
314     and URL checkers. Support for more error levels for bogus
315     langauge tag and URL "standards".
316    
317     * LangTag.pm, URIChecker.pm: Support for new style error
318     level reporting.
319    
320 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
321    
322 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
323    
324     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
325     specifying. Error types are revised.
326    
327     2008-08-15 Wakaba <wakaba@suika.fam.cx>
328    
329 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
330     renewed.
331    
332     2008-08-15 Wakaba <wakaba@suika.fam.cx>
333    
334 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
335     are revised. Use new style for "level" specification.
336    
337     * mkhtmlparser.pl: Use new style for "level" specification.
338    
339 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
342     reporting process for broken ignored valuetype definition.
343     (Valuetype idl_text): Support for special "DOMString" name.
344    
345     2008-08-03 Wakaba <wakaba@suika.fam.cx>
346    
347 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
348     terminal of the ScopedName is "DOMString", such that whether
349     the last part of the scoped name is "DOMString" or "_DOMString"
350     later. It is necessary to determine whether a |typedef|
351     definition should be ignored or not.
352     (parse_char_string): Unescape the identifier of
353     exception members.
354     ($resolve): Return undef for builtin types and sequence<T>
355     types (we might not have to do this, however...).
356     (check): Support checking for Exceptions, Valuetypes,
357     and Typedefs.
358     ($serialize_type): Support for "DOMString::::" syntax.
359     (Typedef idl_text): Output Type as "DOMString" if it
360     is really "DOMString" (i.e. its internal representation
361     is "::DOMString::").
362    
363     2008-08-03 Wakaba <wakaba@suika.fam.cx>
364    
365 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
366     for constant types in the |check| method.
367     (check): Support for checking of attributes, operations, and
368     arguments.
369     (Attribute/Operation idl_text): Exception names in getraises,
370     setraises, and raises clauses is serizlied by |$serialize_type|
371     code.
372    
373 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
374    
375 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
376     hexadecimal numbers (the original pattern, taken from the spec,
377     was not work for hexadecimal numbers, because the "0" prefix
378     matches to the [0-7]* part (as an empty string) and therefore
379     it does not match with remaining "x..." part of a "0x..." integer
380     literal.
381     ($get_type): It now returns a string, not an array reference,
382     for regular types and |sequence| types (i.e. it in any case
383     returns a string).
384     ($get_next_token): The second item in the array that represents
385     a integer or float token is now a Perl number value, not the
386     original string representation of the number.
387     (check): Support for const value consistency checking.
388     No extended attribute is defined for constants.
389     (Node subclasses): Use simple strings rather than array references
390     for default data type values.
391     ($serialize_type): Type values are now simple strings.
392     (value): If the new attribute value is a false value, then
393     a FALSE value is set to the attribute.
394    
395     2008-08-02 Wakaba <wakaba@suika.fam.cx>
396    
397 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
398     in its stringified format ("scoped name" as defined in the
399     spec). Note that future version of this module should not use
400     array references for type values and the |type_text| attribute
401     should be made obsolete.
402     (parse_char_string): Unescape attribute names.
403     (check): Support for checking of whether inherited interfaces
404     are actually defined or not. Support for checking of whether
405     interface member identifiers are duplicated or not.
406     ($serialize_type): Scoped names are returned as is. A future
407     version of this code should escape identifiers other than "DOMString",
408     otherwise the idl_text would be non-conforming.
409    
410     2008-08-02 Wakaba <wakaba@suika.fam.cx>
411    
412 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
413 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
414     for Definition's were ignored.
415     (append_child): Set |parent_node| attribute.
416     (parent_node): New attribute.
417     (check): Support interface/exception members. Support
418     extended attributes. Support definition identifier uniqueness
419     constraint.
420     (qualified_name): New attribute.
421     (Interface/Exception idl_text): Extended attributes were
422     not prepended to the returned text.
423    
424     2008-08-02 Wakaba <wakaba@suika.fam.cx>
425    
426     * WebIDL.pm (parse_char_string): Set line/column numbers
427 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
428     would make things wrong. Support for interface forward
429     declarations was missing. Broken interface declarations
430     with no block were not ignored entirely.
431     (Whatpm::WebIDL::Node): New abstract class. This class
432     makes things easier.
433     (child_nodes): New attribute. Unlike DOM's attribute with
434     same name, this attribute returns a dead list of nodes for
435     simplicity.
436     (get_user_data, set_user_data): New methods.
437     (Module idl_text): A SPACE character should be inserted
438     before the |{| character.
439     (Interface idl_text): Support for interface forward declarations.
440     (is_forward_declaration): New attribute.
441    
442 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
443    
444 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
445    
446     2008-07-19 Wakaba <wakaba@suika.fam.cx>
447    
448 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
449     it now can handle broken extended attributes and as such.
450    
451     2008-07-19 Wakaba <wakaba@suika.fam.cx>
452    
453 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
454     Support for extended attributes with arguments.
455    
456     2008-07-19 Wakaba <wakaba@suika.fam.cx>
457    
458 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
459     (Interface->idl_text): Tentative support for inheritances.
460    
461     2008-07-19 Wakaba <wakaba@suika.fam.cx>
462    
463 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
464     Support for raises, setraises, and getraises syntaxes.
465    
466 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
467    
468     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
469     proper support for types, extended attributes, and exceptions yet).
470     WebIDL parser, version 1 (no support for exceptions yet,
471     no proper support for extended attributes yet).
472    
473 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
474    
475     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
476    
477 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
478    
479 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
480    
481     2008-06-29 Wakaba <wakaba@suika.fam.cx>
482    
483 wakaba 1.255 * WebIDL.pm: New module.
484    
485 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
486    
487     * Makefile (Entities.html): URI changed.
488    
489 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
490    
491     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
492    
493 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
494    
495 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
496    
497     2008-06-01 Wakaba <wakaba@suika.fam.cx>
498    
499 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
500     spec (commented out). (HTML5 revision 1711).
501    
502 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
503    
504 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
505    
506     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
507     as UTF-16 (HTML5 revision 1701).
508    
509     2008-05-25 Wakaba <wakaba@suika.fam.cx>
510    
511 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
512     1692).
513    
514     2008-05-25 Wakaba <wakaba@suika.fam.cx>
515    
516     * HTML.pm.src: The secondary insertion mode used when switching
517     to foreign content is the "in body" insertion mode (HTML5 revision
518     1696).
519    
520     2008-05-25 Wakaba <wakaba@suika.fam.cx>
521    
522 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
523     revision 1697).
524    
525 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
526    
527 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
528     insertion mode (HTML5 revision 1693). Update SVG camelCase
529     attribute list (HTML5 revision 1700). <textarea> closes
530     </select> (HTML5 revision 1699). More start tags close in
531     foreign content insertion mode (HTML5 revision 1698).
532    
533     2008-05-24 Wakaba <wakaba@suika.fam.cx>
534    
535 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
536    
537     2008-05-24 Wakaba <wakaba@suika.fam.cx>
538    
539 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
540     revision 1674).
541    
542     2008-05-24 Wakaba <wakaba@suika.fam.cx>
543    
544 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
545     revision 1676).
546    
547     2008-05-24 Wakaba <wakaba@suika.fam.cx>
548    
549 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
550     quriks mode detection (HTML5 revision 1679).
551    
552     2008-05-24 Wakaba <wakaba@suika.fam.cx>
553    
554 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
555     revision 1685).
556    
557     2008-05-24 Wakaba <wakaba@suika.fam.cx>
558    
559 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
560     revision 1684).
561    
562     2008-05-24 Wakaba <wakaba@suika.fam.cx>
563    
564 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
565     work (HTML5 revision 1690).
566    
567     2008-05-24 Wakaba <wakaba@suika.fam.cx>
568    
569 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
570     end tags in head insertion modes (HTML5 revision 1686).
571     (parse_char_string): Bug fix for non-utf8 character string handlings.
572     (parse_char_stream): |ungetc| does not work well for this context.
573    
574 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
575    
576 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
577     |parse_byte_stream|.
578     (parse_byte_stream): New method.
579    
580     2008-05-18 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
583     by encoding layer error reporter.
584    
585 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
586    
587 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
588     rather than converting the whole byte string and then parsing.
589     Propagate errors in character encoding layer.
590     (get_next_token): Precise error reporting for |bare stago| error.
591    
592     2008-05-17 Wakaba <wakaba@suika.fam.cx>
593    
594 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
595     (parse_char_string): This method is now defined as an invocation
596     of the |parse_char_stream| method.
597    
598     2008-05-17 Wakaba <wakaba@suika.fam.cx>
599    
600 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
601     of the sniffing as info-level errors. Support for new
602     decoding framework in parser resestting.
603     (new): Various default error levels were not set.
604    
605     2008-05-17 Wakaba <wakaba@suika.fam.cx>
606    
607 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
608     algorithm, except for the actual sniffing, is implemented
609     with new framework with Message::Charset::Info.
610    
611 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
612    
613     * CacheManifest.pm (_parse): Drop fragment identifiers from
614     URIs in fallback section (HTML5 revision 1596).
615    
616 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
617    
618 wakaba 1.231 * Makefile (Entities.html): URI has changed.
619    
620     2008-05-10 Wakaba <wakaba@suika.fam.cx>
621    
622 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
623     1553).
624    
625 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
626    
627 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
628     no longer table elements, but table objects returned
629     by Whatpm::HTMLTable.
630    
631     * HTMLTable.pm (form_table): Return table element node
632     as |$table->{element}|.
633     (assign_header): Support for the |headers=""| attribute.
634    
635     2008-05-06 Wakaba <wakaba@suika.fam.cx>
636    
637 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
638     with no support for headers="".
639     (form_table): Include table width and height to the returned
640     table object for covenience. Indexing in column assignement was wrong.
641     Set whether a data cell is empty or not for convenience.
642    
643 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
646    
647     2008-05-05 Wakaba <wakaba@suika.fam.cx>
648    
649 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
650     changed (HTML5 revision 1387).
651    
652     2008-05-05 Wakaba <wakaba@suika.fam.cx>
653    
654 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
655     to the end of the table (HTML5 revision 1380).
656    
657     2008-05-05 Wakaba <wakaba@suika.fam.cx>
658    
659 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
660     of 1-based (HTML5 revision 1376).
661    
662     2008-05-05 Wakaba <wakaba@suika.fam.cx>
663    
664 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
665     and "Content-Encoding" no longer prevent sniffing (HTML5
666     revision 1288).
667    
668     2008-05-05 Wakaba <wakaba@suika.fam.cx>
669    
670 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
671     1282).
672    
673 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
676     Status of XML specs are added.
677    
678     2008-05-03 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
681     and |ref=""| attribute (referent element type checking).
682    
683 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
684    
685 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
686     1500) is implemented (except for IDNA part and URI-scheme-specific
687 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
688     documents.
689    
690 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
691    
692 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
693     character (HTML5 revision 1263).
694    
695     2008-04-13 Wakaba <wakaba@suika.fam.cx>
696    
697 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
698     definition for |AElig;| was missing).
699    
700     2008-04-13 Wakaba <wakaba@suika.fam.cx>
701    
702 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
703     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
704    
705     2008-04-13 Wakaba <wakaba@suika.fam.cx>
706    
707 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
708     insertion mode is added (HTML5 revisions 1412 and 1418).
709    
710     2008-04-13 Wakaba <wakaba@suika.fam.cx>
711    
712 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
713     elements (HTML5 revision 1410).
714    
715     2008-04-13 Wakaba <wakaba@suika.fam.cx>
716    
717 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
718     revision 1406).
719    
720     2008-04-13 Wakaba <wakaba@suika.fam.cx>
721    
722 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
723     elements (HTML5 revision 1404 and 1420).
724    
725 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
728     elements (HTML5 revision 1404). Unused !!!macro definitions
729     are removed.
730    
731     2008-04-12 Wakaba <wakaba@suika.fam.cx>
732    
733 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
734     are raised is changed (HTML5 revision 1404).
735    
736 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
737    
738 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
739    
740     2008-04-06 Wakaba <wakaba@suika.fam.cx>
741    
742 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
743     replaced by constants representing element category.
744    
745     2008-04-06 Wakaba <wakaba@suika.fam.cx>
746    
747 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
748     for error reporting. (This is a preparation for using constant
749     value for ->[1].)
750    
751 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
752    
753 wakaba 1.205 * RDFXML.pm: Typo fixed.
754    
755     2008-03-22 Wakaba <wakaba@suika.fam.cx>
756    
757 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
758     as same as |must_level|, i.e. level = |m|.
759     (check_element): Make list of URIs in the DOM.
760    
761 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
762    
763 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
764     Local (null-namespace) attribute support.
765    
766     2008-03-21 Wakaba <wakaba@suika.fam.cx>
767    
768 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
769    
770     2008-03-21 Wakaba <wakaba@suika.fam.cx>
771    
772 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
773     and URI attributes is implemented. Warn if unknown
774     value is used in rdf:parseType="" attribute.
775    
776     * URIChecker.pm (check_rdf_uri_reference): New function.
777    
778     2008-03-21 Wakaba <wakaba@suika.fam.cx>
779    
780 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
781     are now resolved.
782    
783     2008-03-21 Wakaba <wakaba@suika.fam.cx>
784    
785 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
786    
787     * RDFXML.pm: undef vs false bug fixed.
788     Reification implemented.
789    
790     2008-03-21 Wakaba <wakaba@suika.fam.cx>
791    
792 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
793     The |node| arguments are added for |ontriple| calls.
794     Too many "attribute not allowed" errors were raised.
795    
796     * ContentChecker.pm: Initial experimental support for rdf:RDF
797     element.
798    
799     2008-03-21 Wakaba <wakaba@suika.fam.cx>
800    
801 wakaba 1.197 * RDFXML.pm: New module.
802    
803 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
804    
805 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
806     code was old one yet.
807    
808     2008-03-20 Wakaba <wakaba@suika.fam.cx>
809    
810 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
811     errors. Line/column markings for DOCTYPE, comment, and
812     character tokens are reintroduced; otherwise,
813     error location for "not HTML5" error and errors
814     for implied elements are not attached.
815    
816     2008-03-20 Wakaba <wakaba@suika.fam.cx>
817    
818 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
819    
820     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
821     (value): Setter implemented.
822    
823     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
824    
825     2008-03-20 Wakaba <wakaba@suika.fam.cx>
826    
827 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
828    
829     2008-03-20 Wakaba <wakaba@suika.fam.cx>
830    
831 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
832     flag, which is no longer used.
833    
834 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
835    
836 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
837    
838     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
839     and !!!insert-element-t): Set line/column information to
840     element nodes.
841    
842     2008-03-17 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
845     in "<?", "<>", or "</>" should be the error point.
846    
847 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
850    
851     2008-03-16 Wakaba <wakaba@suika.fam.cx>
852    
853 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
854    
855     2008-03-16 Wakaba <wakaba@suika.fam.cx>
856    
857 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
858     reporting.
859    
860 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
861    
862 wakaba 1.186 * HTML.pm.src: Error type revised.
863    
864     2008-03-11 Wakaba <wakaba@suika.fam.cx>
865    
866 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
867    
868     2008-03-11 Wakaba <wakaba@suika.fam.cx>
869    
870 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
871    
872 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
873    
874     * mkhtmlparser.pl: Set "level" parameter to parse errors.
875    
876     * HTML.pm.src: Code refined.
877    
878 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
879    
880 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
881     revision 1348). Note that I really don't know this makes
882     any difference in the black-box behavior of the parser.
883    
884     2008-03-09 Wakaba <wakaba@suika.fam.cx>
885    
886 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
887     revision 1348).
888    
889     2008-03-09 Wakaba <wakaba@suika.fam.cx>
890    
891 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
892    
893     2008-03-09 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
896    
897     2008-03-09 Wakaba <wakaba@suika.fam.cx>
898    
899 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
900     mode and support for the "in select in table" insertion mode (HTML5
901     revision 1342).
902    
903     2008-03-09 Wakaba <wakaba@suika.fam.cx>
904    
905 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
906     in non-tainted <table>s (HTML5 revision 1336).
907    
908     2008-03-09 Wakaba <wakaba@suika.fam.cx>
909    
910 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
911     and <head> (HTML5 revision 1332).
912    
913     2008-03-09 Wakaba <wakaba@suika.fam.cx>
914    
915 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
916     white space (HTML5 revision 1331).
917    
918 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
919    
920 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
921     element (HTML5 revision 1330).
922    
923     2008-03-08 Wakaba <wakaba@suika.fam.cx>
924    
925 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
926     element (HTML5 revision 1328).
927    
928     2008-03-08 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
931     into foster parents (HTML5 revision 1326).
932    
933     2008-03-08 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
936     revision 1321).
937    
938     2008-03-08 Wakaba <wakaba@suika.fam.cx>
939    
940 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
941     the spec (HTML5 revision 1320). Note that we finally complete
942     all of HTML5 revision 1320 changes.
943    
944     2008-03-08 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
947     tags (HTML5 revision 1320).
948    
949     2008-03-08 Wakaba <wakaba@suika.fam.cx>
950    
951 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
952    
953     2008-03-08 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
956     could not be reached so that it is now removed (HTML5 revision
957     1320).
958    
959     2008-03-08 Wakaba <wakaba@suika.fam.cx>
960    
961 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
962     to align with new spec text (HTML5 revision 1320).
963    
964     2008-03-08 Wakaba <wakaba@suika.fam.cx>
965    
966 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
967     have had no harm in theory.
968    
969     2008-03-08 Wakaba <wakaba@suika.fam.cx>
970    
971 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
972     case is split from other end tags for the preparation
973     of implementing HTML revision 1320.
974    
975 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
976    
977 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
978     revision 1320).
979    
980     2008-03-07 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
983     and "div" (HTML5 revision 1318). Add comments to where
984     |form| pointer association codes should be inserted (HTML5 revision
985     1319).
986    
987 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
988    
989     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
990    
991 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
992    
993 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
994     other token type, there are), it is replaced by a |die| statement.
995    
996     2008-03-05 Wakaba <wakaba@suika.fam.cx>
997    
998 wakaba 1.159 * HTML.pm.src: Typo fixed.
999    
1000 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1001    
1002     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1003     doctypes were not uppercased for comparison.
1004    
1005 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1006    
1007 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1008     are added.
1009    
1010     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1011    
1012 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1013     are set. Cases that are unlikely reached are noted as so.
1014    
1015     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1016    
1017 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1018    
1019     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1020    
1021     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1022    
1023 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1024     for simplicity.
1025    
1026     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1027    
1028 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1029     revision 1307).
1030    
1031     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1032    
1033 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1034     is set are changed (HTML5 revision 1305).
1035    
1036 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1037    
1038 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1039     1292). Entities are not parsed in comment-like part in RCDATA
1040     elements (HTML5 revision 1294). Allow bare & at the end
1041     of attribute value literals (HTML5 revision 1296). More
1042     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1043     between attributes and ban attribute names or unquoted
1044     attribute values containing single or double quotes (HTML5
1045     revision 1303).
1046    
1047     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1048    
1049 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1050     and related errors unless it is an HTML document (though the spec
1051     is unclear on whether it is applied to XHTML document).
1052    
1053     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1054     are added.
1055    
1056     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1057    
1058 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1059    
1060 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1061    
1062     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1063    
1064     * HTML.pm.src: |charset| in |content| attribute is
1065     case-insensitive (HTML5 revision 1270).
1066    
1067 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1068    
1069     * ContentChecker.pm: New status constants are added.
1070     ($ElementDefault): |status| added.
1071     (check_element): Err for non-standard or deprecated elements.
1072     (_attr_status_info): For non-standard or deprecated attributes.
1073    
1074 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1075    
1076 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1077    
1078     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1079    
1080 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1081     status information is now dispatched.
1082    
1083     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1084    
1085 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1086     problem by adding some more arguments.
1087    
1088     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1089    
1090 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1091     container-for-the-purpose-of-content-model element (not transparent
1092     element) for |check_child_element| calling and significant
1093     text flag marking. This reintroduces |<del>|-and-significant
1094     problem again.
1095    
1096     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1097    
1098 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1099     elements ignored for the purpose of phase changes in
1100     content model checking.
1101    
1102 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1103    
1104 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1105     was not properly managed for transparent cases.
1106    
1107     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1108    
1109 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1110     and |audio| as semi-transparent elements.
1111    
1112     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1113    
1114 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1115     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1116     Support for |html:object| as a semi-transparent element.
1117    
1118     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1121     the tree is entirely revised to make it easier to track
1122     the state of ancestors/descendants. As a result of this
1123     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1124     support for content model checking for HTML elements |figure|,
1125     |object|, |video|, and |audio| and checking for XML elements (and
1126     some XMLNS checkings) are dropped for now. They will be
1127     reimplemented in due cource.
1128    
1129 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1130    
1131 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1132     token (HTML5 revision 1225).
1133    
1134     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1135    
1136 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1137    
1138     * HTML.pm.src: Save whether |meta| |content| attribute
1139     contains character references or not.
1140    
1141     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1142    
1143 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1144     imlement significant content checking for contents
1145     with |del| elements.
1146    
1147     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1148    
1149 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1150     elements are added.
1151     (_get_children): HTML |object| elements are now semi-transparent.
1152    
1153     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1154    
1155 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1156    
1157     * CacheManifest.pm: HTML5 revision 1211 implemented.
1158    
1159     * CacheManifest.pod: Updated.
1160    
1161 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1162    
1163     * ContentChecker.pm (check_document, check_element): Support
1164     for second argument ($onsubdoc).
1165     (_get_css_parser): Removed (now it is part of WDCC).
1166    
1167 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1168    
1169     * ContentChecker.pm (_get_css_parser): New.
1170    
1171 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1172    
1173 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1174     for checking had been used.
1175    
1176     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1177    
1178 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1179    
1180 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1181    
1182 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1183     specced in RFC 2046 and RFC 4288.
1184     (application/atom+xml): Definition added.
1185    
1186     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1189     warnings (rather than SHOULD-level errors).
1190    
1191     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1194    
1195     * HTML.pm.src: A flag for character references in attribute
1196     values are added. Set |manakai_has_reference| user data
1197     to |charset| attribute.
1198    
1199     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1200    
1201 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1202     attributes.
1203    
1204     * ContentChecker.pm (check_document): Warn if charset requirements
1205     cannot be tested.
1206    
1207 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1208    
1209     * HTML.pm.src (parse_byte_string): Detect charset
1210     by universalchardet if charset parameter is not specified.
1211    
1212     * Makefile (Charset-all, Charset-clean): New rules.
1213    
1214 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1215    
1216 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1217     of character encoding declaration and selection of encoding
1218     for HTML document.
1219    
1220     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1221    
1222 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1223     type in list context.
1224    
1225     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1228     ignoring (HTML5 revisions 1013 and 1016).
1229    
1230     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1233     white spaces, are implemented.
1234    
1235 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1236    
1237 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1238     attribute if possible.
1239    
1240     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1241    
1242 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1243     (parse_char_string): New alias for |parse_string|.
1244     (main phase): Invoking "change the encoding" algorithm if desired.
1245    
1246     * HTML.pod: Updated.
1247    
1248     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1249    
1250 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1251    
1252     * Makefile (HTML-all, HTML-clean): New.
1253    
1254     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1255    
1256     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1257    
1258 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1259    
1260     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1261    
1262 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1265    
1266     * CacheManifest.pod: New file.
1267    
1268     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1269    
1270 wakaba 1.114 * CacheManifest.pm: New module.
1271    
1272     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1275     callback.
1276    
1277     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1278    
1279 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1280    
1281 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1282    
1283 wakaba 1.111 * Makefile (clean): New rule.
1284    
1285     * NanoDOM.pm (public_id, system_id): New attributes.
1286    
1287     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1288    
1289 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1290    
1291 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1292    
1293     * ContentChecker.pm (check_document): Support for
1294     new |is_xml_root| flag.
1295     (check_element): Support for new |pluses| state.
1296     (_add_pluses): New method.
1297     (_remove_minuses): Support for new |minus| item.
1298    
1299 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1300    
1301 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1302     root element.
1303    
1304     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1307    
1308 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1309    
1310 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1311    
1312     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1313    
1314 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1315    
1316     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1317    
1318 wakaba 1.103 * LangTag.pm: New module.
1319    
1320 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1321    
1322     * HTML.pm.src: Some error types were wrong.
1323    
1324 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1325    
1326 wakaba 1.101 * CSS/: New directory.
1327    
1328     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1329    
1330 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1331     for |noscript| in |head|.
1332    
1333 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1334    
1335     * URI/: New directory.
1336    
1337 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1340     number.
1341    
1342     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1345     by |&|s.
1346    
1347     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1350    
1351     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1354    
1355     * HTML.pm.src: Insertion modes are now represented in number.
1356    
1357     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1360     999) is implemented.
1361    
1362 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1363    
1364     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1365     983) is implemented.
1366    
1367 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1368    
1369     * ContentChecker.pod: New documentation.
1370    
1371     * Makefile: A rule for |ContentChecker.html| is added.
1372    
1373     * ContentChecker.pm: A pod "LICENSE" section is added.
1374    
1375     * NanoDOM.pm ($VERSION): New variable.
1376    
1377 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1378    
1379 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1380     list of allowed HTML elements.
1381    
1382     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1383    
1384 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1385     HTML elements.
1386    
1387     * URIChecker.pm (check_iri): New.
1388     (check_iri_reference): Error type for IRI reference
1389     syntax error is changed.
1390    
1391 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1392    
1393 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1394     is added.
1395     (check_document): Load appropriate module before validation.
1396    
1397     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1398    
1399 wakaba 1.88 * ContentChecker/: New directory.
1400    
1401     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1402    
1403 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1404    
1405     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1406    
1407 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1408    
1409 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1410     even if no document element is found.
1411    
1412     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413    
1414 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1415    
1416     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1419    
1420     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1421    
1422 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1423     is split into "after html body" and "after html frameset"
1424     insertion modes. Their codes are merged with "after body"
1425     and "after frameset" codes. |$previous_insertion_mode|
1426     has been removed. "after frameset" code is
1427     merged with "in frameset" code.
1428    
1429     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1430    
1431 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1432     merged with the "in head" insertion mode.
1433    
1434     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1437     reorganized.
1438    
1439     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1440    
1441 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1442     are merged.
1443    
1444     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1445    
1446 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1447     with "in table" insertion mode.
1448    
1449     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1450    
1451 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1452     modes are merged.
1453    
1454     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1455    
1456 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1457     insertion mode!
1458    
1459     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1460    
1461 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1462     now directly invoke the handler.
1463    
1464     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1465    
1466 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1467     is merged to the "in body" insertion mode code.
1468    
1469     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1470    
1471 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1472     insertion modes are merged.
1473    
1474     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1475    
1476 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1477    
1478     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1481     instead of string constant for |{content_model_flag}|.
1482    
1483     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1484    
1485 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1486     to the "duplicate attribute" error.
1487    
1488 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1489    
1490 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1491    
1492     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1493    
1494 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1495    
1496     * HTML.pm.src: A typo is fixed.
1497    
1498 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1501     from "in HTML:xml:lang" error. Character position
1502     is now the last part of the error type in the URI error
1503     description. Report "unsupported" status for language
1504     tags, media queries, script codes, and style sheets.
1505    
1506     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1507    
1508 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1509     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1510     in XML.
1511    
1512     * NanoDOM.pm (Attr.owner_document): New attribute.
1513    
1514     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1515    
1516 wakaba 1.66 * HTML.pm.src: The character immediately following
1517     a bare |hcro| was discarded. Fix handling of
1518     entity references in attribute values.
1519    
1520     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1523     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1524     factored out. Error types |in html:#DOCTYPE|
1525     and |after html:#DOCTYPE| are merged
1526     into |DOCTYPE in the middle|. |</frameset>|
1527     in fragment parsing mode changed the
1528     insertion mode.
1529    
1530     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1531    
1532 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1533     phase is now an insertion mode. Treatments for white
1534     space character tokens were incorrect for some
1535     insertion modes. An old |meta| case was not removed.
1536    
1537     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1538    
1539 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1540     implemented (but changing the encoding is not yet:-).
1541    
1542 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1543    
1544 wakaba 1.62 * Charset/: New directory.
1545    
1546     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1547    
1548 wakaba 1.61 * H2H.pm: New Perl module (created from
1549     manakai's H2H.dis).
1550    
1551     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1552    
1553 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1554     manakai's SimpleLS.dis).
1555    
1556 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1557    
1558     * HTML.pm.src: |<!---x-->| was not processed correctly.
1559    
1560 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1561    
1562 wakaba 1.58 * HTML.pm.src: Report correct error message
1563     for |<body></div></body>|.
1564    
1565     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1568    
1569     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1570    
1571     * ContentChecker.pm: Return |{term}| list.
1572    
1573 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1574    
1575 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1576     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1577     and <title> outside of head).
1578    
1579     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1580    
1581 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1582     and private types/subtypes.
1583    
1584     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1585     URIChecker.pm, HTMLTable.pm: Error messages are now
1586     consistent; they are all listed in
1587     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1588    
1589 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1590    
1591 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1592     is now erred. |<datalist>| is implemented.
1593     Attribute checker for |<command>| and |<menu>| are
1594     added. Support for |contextmenu| global attribute
1595     is added.
1596    
1597     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1598    
1599 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1600     of Step 3 has been changed.
1601    
1602     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1603    
1604 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1605    
1606 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1607    
1608 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1609     incorrect.
1610    
1611     * ContentChecker.pm: Support for the |footer| element.
1612     Check URI syntax for space-separated URI attributes.
1613     Support for the |tabindex| attribute. Support
1614     for |datetime| attribute.
1615    
1616     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1617    
1618 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1619     character references). HTML5 revision 1.146 (white space
1620     characters before root start tag). HTML5 revision
1621     1.148 (named character references in attribute values).
1622     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1623    
1624     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1625    
1626     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1627 wakaba 1.49 in <head>).
1628    
1629     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1630    
1631 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1632     in body).
1633    
1634     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1635    
1636     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1637    
1638 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1639     or RCDATA in fragment parsing mode). HTML5 revision
1640     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1641    
1642     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1643    
1644     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1645     for <pre> and <textarea>).
1646    
1647 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1648    
1649 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1650    
1651     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1652    
1653 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1654    
1655     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1656    
1657 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1658     HTML5 revision 917 (conforming bare &).
1659    
1660     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1661    
1662 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1663    
1664     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1665     HTML5 revision 915 (<nobr>).
1666    
1667     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1668    
1669 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1670    
1671     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1672     New attributes.
1673    
1674     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1675    
1676 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1677     in tags).
1678    
1679     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1680    
1681 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1682    
1683     * HTML.pm.src: |$entity_char| is removed and
1684     requires |Whatpm::_NamedEntityList| instead.
1685     HTML5 revision 898 (refc), except that lack of refc
1686     is parse error.
1687    
1688     * mkentitylist.pl: New script.
1689    
1690     * Makefile (all): |_NamedEntityList.pm| is added.
1691     (_NamedEntityList.pm, Entities.html): New rules.
1692    
1693     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1696     were ignored and U+000D immediately following another
1697     U+000D was not converted to U+000A.
1698    
1699     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1700    
1701 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1702     nodes before appended). Parser was not ready for NULL
1703     parse error and escape flag.
1704    
1705     * NanoDOM.pm (adopt_node): New.
1706    
1707     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1708    
1709 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1710     CDATA and RCDATA). Note that current implementation
1711     is simply repeating what the spec says and it is maybe not
1712     a best way to do it.
1713    
1714     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1715    
1716 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1717     the form element if a descendant element without implied end tag has
1718     still been open).
1719    
1720     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1721    
1722 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1723     space characters non-conforming).
1724    
1725     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1726    
1727 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1728     HTML5 revision 869 (C1 character references).
1729    
1730     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1731    
1732 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1733     a |textarea| is removed).
1734    
1735 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1736    
1737     * NanoDOM.pm (get_attribute_node_ns): New method.
1738    
1739     * ContentChecker.pm: |script| |async| and |defer|
1740     no longer require |src|. |async| MUST NOT be
1741     specified if |defer|. (HTML5 revision 858).
1742    
1743 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1744    
1745     * HTML.pm.src: |<form><form>| went to inifinite loop.
1746    
1747 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1748    
1749 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1750     as a document element) flag on.
1751     (new): Removed.
1752     (check_document): New method.
1753    
1754     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1755    
1756 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1757     were incorrect.
1758    
1759     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1762     error position in the |type| option of the error.
1763    
1764     * HTMLTable.pm (form_table): The |$onerror| parameter
1765     is now optional. Some bugs are fixed.
1766    
1767     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1768    
1769 wakaba 1.27 * HTMLTable.pm: New module.
1770    
1771     * ContentChecker.pm (table): Invoke table model error checker.
1772    
1773     * NanoDOM.pm (first_child, get_attribute_ns): New.
1774    
1775 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1776    
1777 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1778     (link/@rel, a/@rel, area/@rel): Use new checker.
1779    
1780     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1781    
1782     * _LinkTypeList.pm: New file.
1783    
1784     * mklinktypelist.pl: New file.
1785    
1786     * .cvsignore: |RelExtensions.html| added.
1787    
1788     * NanoDOM.pm (child_nodes): Returns an empty array
1789     for non-child-containing node types.
1790     (text_content): New attribute.
1791    
1792     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1793    
1794 wakaba 1.25 * IMTChecker.pm: New module.
1795    
1796     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1797     to test parameter value validity.
1798    
1799     * HTML.pm.src ($style_start_tag): Attributes were
1800     discarded.
1801    
1802 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1803    
1804 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1805    
1806     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1807    
1808 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1809     and RFC 3987 are listed (not all testable items are checked yet).
1810    
1811     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1812    
1813 wakaba 1.22 * URIChecker.pm: New module.
1814    
1815 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1816    
1817 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1818     or associated to some placeholder.
1819     ($ElementDefault): Warn unknown attributes
1820     for unknown elements as "attribute not supported".
1821     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1822     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1823     (|a| attribute checker): Reimplemented.
1824    
1825     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1826    
1827 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1828     ($HTMLAttrChecker): Event handler content attributes
1829     are added.
1830     (link, embed): Required attribute is now checked.
1831     (embed): Unknown local attributes are no longer warned.
1832    
1833     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1834    
1835 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1836     ($HTMLIMTAttrChecker): New checker.
1837     (link@rel, link@href, link@type, style@type,
1838     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1839     object@data, object@type, source@src, source@type, area@alt,
1840     area@shape, area@coords, area@href,
1841     area@ping, area@rel, area@type, script@src,
1842     script@defer, script@async, script@type): Checkers added.
1843    
1844     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1845    
1846 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1847    
1848 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1849    
1850 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1851     attributes. Report an error if |Element.prefix|
1852     is |xmlns|.
1853    
1854     * NanoDOM.pm (prefix): New attribute.
1855    
1856     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1857    
1858 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1859     mode, action for |<iframe>| was missing.
1860    
1861     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1862    
1863 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1864     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1865    
1866     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1867    
1868 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1869     ($HTMLURIAttrChecker): Placeholder.
1870     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1871     New.
1872    
1873     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1874    
1875 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1876     for global attributes, |html|, |base|, |style|, and |meta|.
1877    
1878     * NanoDOM.pm (insert_before): Weaken reference
1879     to the parent node.
1880     (Attr::new): Set |owner_element| attribute.
1881     (namespace_uri, manakai_local_name): New attribute implementations.
1882     (owner_element): New attribute.
1883    
1884     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1885    
1886 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1887     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1888     $Element->{$HTML_NS}->{''}): New.
1889     (check_element): Invoke attrs_checker for each element.
1890    
1891 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1892    
1893 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1894    
1895     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1896    
1897 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1898     element type testings.
1899    
1900     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1901    
1902 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1903     error when an element type is put in the "minus" list
1904     and the element type is not allowed explicitly in the particular
1905     element content model.
1906     (html:a checker): New checker.
1907     (html:details, html:datagrid): New checkers.
1908     (html:legend): New checker.
1909    
1910     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1911    
1912 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1913    
1914     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1915    
1916 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1917     checker.
1918     (html:dd checker): New checker.
1919     (html:q, html:em, html:strong, html:small,
1920     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1921    
1922     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1923    
1924 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1925     from |$ElementDefault->{checker}|.
1926     ($ElementDefault->{checker}): Throw an error that
1927     the element type is not supported by the checker.
1928     ($HTMLMetadataElement): |html:base| was missing.
1929     ($HTMLEmptyChecker): Don't throw an error
1930     for inter-element whitespace nodes.
1931     (html:html checker): Errors were not
1932     thrown even if |html:head| and/or |html:body|
1933     children were missing.
1934     (html:head checker): An error was not
1935     thrown if <meta charset> appered after other
1936     elements.
1937    
1938 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1939    
1940     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1941     and |noscript| elements are implemented.
1942     (new): New method.
1943    
1944 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1945    
1946 wakaba 1.4 * ContentChecker.pm: New module.
1947    
1948     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1949    
1950 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1951     document_element, manakai_local_name,
1952     manakai_element_type_match): New method.
1953    
1954     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1955    
1956     * HTML.pm.src: Replace decimal and hexadecimal numeric
1957     entities in C1 range using Windows-1252 mapping. Bare LF
1958     did not count as new line for error reporting.
1959    
1960 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1961    
1962     * NanoDOM.pm (DOMImplementation): New class.
1963     (append_child): Weaken the |parent_node| reference.
1964     (create_element_ns, Element new): Set the |owner_document|
1965     reference.
1966     (implementation): New attribute.
1967     (owner_document, local_name, namespace_uri): New attributes.
1968    
1969     * HTML.pm.src (parse_string): Line and column numbers
1970     are now provided to error handler.
1971     (!!!parse-error): Short descriptions are added.
1972     (_construct_tree): Split into three methods; support
1973     for innerHTML mode.
1974     (set_inner_html): New method.
1975    
1976 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1977    
1978     * NanoDOM.html: Documentation is added.
1979    
1980     * HTML.pod, ContentType.html: Documentation is revised.
1981    
1982     * .cvsignore: Pod2html temporary files are added.
1983    
1984     * Makefile: Make |NanoDOM.html|.
1985    
1986     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1987    
1988     * HTML.pm.src (parse_string): New method.
1989     (get_inner_html): Renamed from |inner_html|.
1990    
1991     * Makefile: A rule for |HTML.html| is added.
1992    
1993     * HTML.pod: New documentation.
1994    
1995     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1996    
1997     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1998     (clone_node): Attribute nodes were not completely copied.
1999    
2000     * HTML.pm.src: Many bugs are fixed.
2001    
2002     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2003    
2004     * HTML.pm.src: Some typos are fixed.
2005    
2006     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2007    
2008     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2009     are removed.
2010    
2011     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2012    
2013     * HTML-consume-entity.src: Removed.
2014    
2015     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2016    
2017     * HTML.pm.src: Tokenizer's handling on hexadecimal
2018     numeric entities are rewritten.
2019    
2020     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2021    
2022     * HTML.pm.src: Some tokenizer bugs are fixed.
2023    
2024     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2025    
2026     * HTML.pm.src: The tree construction stage is implemented.
2027    
2028     * mkhtmlparser.pl: New macros are added.
2029    
2030     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2031    
2032     * ContentType.pm: A note on bug in the specification
2033     is removed since it's been now fixed.
2034    
2035     * .cvsignore: New file.
2036    
2037     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2038    
2039     * HTML.pm.src, HTML-consume-entity.src: New files.
2040    
2041     * Makefile (HTML.pm): New rule.
2042    
2043     * mkhtmlparser.pl: New script.
2044    
2045     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2046    
2047     * Makefile: New file.
2048    
2049     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2050    
2051     * ContentType.pm: An error in pod is fixed.
2052    
2053     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2054    
2055     * ContentType.pm: Documentation is added.
2056    
2057     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2058    
2059     * ContentType.pm: New Perl module.
2060    
2061     * ChangeLog: New file.
2062    
2063    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24