/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.303 - (hide annotations) (download)
Sun Sep 14 06:32:48 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.302: +9 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 06:32:02 -0000
	* HTML.pm.src ($char_onerror): Have character decoder's |line|
	and |column| a higher priority than the one set by the
	tokenizer's input handler.
	($self->{read_until}): Exclude U+FFFD (but this might
	not be necessary, since now we do line/column fixup in
	the character decode handle).

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 06:32:40 -0000
	* DecodeHandle.pm: EUCJP class reimplemented using |read|-centric
	model.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
4     and |column| a higher priority than the one set by the
5     tokenizer's input handler.
6     ($self->{read_until}): Exclude U+FFFD (but this might
7     not be necessary, since now we do line/column fixup in
8     the character decode handle).
9    
10     2008-09-14 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
13    
14     2008-09-14 Wakaba <wakaba@suika.fam.cx>
15    
16 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
17     and |manakai_getc_until| to |manakai_read_until| to
18     reduce the number of string copies.
19    
20     2008-09-14 Wakaba <wakaba@suika.fam.cx>
21    
22 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
23     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
24     standard feature to |open| a string as a filehandle,
25     since Perl's string filehandle seems not supporting |ungetc|
26     method correctly.
27     (parse_char_stream): Define |{getc_until}| method.
28     (DATA_STATE): Experimental support for |getc_until| feature.
29    
30 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
31    
32 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
33    
34     2008-09-13 Wakaba <wakaba@suika.fam.cx>
35    
36 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
37     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
38     and replaced by |{prev_state}|.
39    
40     * mkhtmlparser.pl: Remove |{char}| feature.
41     Remove |!!!back-next-input-character;| macro.
42    
43     2008-09-13 Wakaba <wakaba@suika.fam.cx>
44    
45 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
46     entity related tokenizer states in favor of new states
47     implementing the consume character reference algorithm.
48    
49     2008-09-13 Wakaba <wakaba@suika.fam.cx>
50    
51     * HTML.pm.src: "Consume a character reference" algorithm is
52 wakaba 1.296 now implemented as a tokenizer's state, rather than
53     a method, with minimum changes (more changes will
54     be made, in due course). "Bogus comment state"'s inner
55     loop gets removed.
56    
57     2008-09-13 Wakaba <wakaba@suika.fam.cx>
58    
59 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
60 wakaba 1.295 into their own tokenizer states.
61    
62     2008-09-13 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
65     is split into three states.
66    
67     2008-09-13 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
70     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
71     no longer does the tokenizer have to push back next input
72     characters in those states.
73    
74     2008-09-13 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
77     into four states so that no longer does the tokenizer have to push
78     back next input characters in that state.
79    
80 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
81    
82     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
83     which can be used to insert some wrapper between the character
84     stream handle and the tokenizer. (It is currently not supported
85     for |set_inner_html| for |Element|s).
86    
87 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
88    
89 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
90    
91     2008-09-10 Wakaba <wakaba@suika.fam.cx>
92    
93 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
94    
95     * HTML.pm.src: Don't specify |text| argument for the
96     |chardecode:fallback| error, since it is not the encoding
97     being used alternatively.
98    
99 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
100    
101     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
102    
103 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
104    
105 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
106    
107     2008-08-31 Wakaba <wakaba@suika.fam.cx>
108    
109 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
110     to after after frameset insertion mode processing (HTML5
111     revision 1909). Note that the implementation was wrong
112     per the old spec before the r1909 changes.
113    
114 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
115    
116 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
117     spec (HTML5 revision 2093).
118     ($process_row): Algorithm step numbers synced with the
119     spec (HTML5 revision 2092).
120    
121     2008-08-30 Wakaba <wakaba@suika.fam.cx>
122    
123 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
124     revision 2094).
125    
126     2008-08-30 Wakaba <wakaba@suika.fam.cx>
127    
128     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
129    
130     2008-08-30 Wakaba <wakaba@suika.fam.cx>
131    
132 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
133     name (after another attribute) now raise parse error (HTML5
134     revision 2123). Empty unquoted attribute values are no
135     longer allowed (HTML5 revision 2122).
136    
137     2008-08-30 Wakaba <wakaba@suika.fam.cx>
138    
139 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
140     revision 2130).
141    
142     2008-08-30 Wakaba <wakaba@suika.fam.cx>
143    
144 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
145     as |lang| attribute value for HTML elements (HTML5 revision 2062
146     and so on).
147    
148     2008-08-30 Wakaba <wakaba@suika.fam.cx>
149    
150 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
151     was missing.
152    
153     * URIChecker.pm: The end of the URL should be marked as the
154     error location for an empty path error. The position
155     between the userinfo and the port components should be
156     marked as the error location for an empty host error.
157    
158     2008-08-30 Wakaba <wakaba@suika.fam.cx>
159    
160 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
161     value the error occurs for errors. Report unknown
162     address format error in warning level, since address
163     formats are rarely added. Path segments starting with "/.."
164     were misinterpreted as a dot-segment.
165    
166     2008-08-30 Wakaba <wakaba@suika.fam.cx>
167    
168 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
169     |Message::DOM::DOMImplementation|.
170    
171 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
172    
173     * IMTChecker.pm: Updated for the new error reporting architecture.
174    
175     * ContentChecker.pm: Error levels for IMTs are added.
176    
177 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
178    
179     * H2H.pm (_shift_token): Support for unquoted HTML attribute
180     values.
181    
182 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
183    
184 wakaba 1.275 * CacheManifest.pm: Support for new style of error
185     reports.
186    
187     * HTML.pm.src: Set line=1, column=1 to the document node.
188    
189     2008-08-16 Wakaba <wakaba@suika.fam.cx>
190    
191 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
192     and URL checkers. Support for more error levels for bogus
193     langauge tag and URL "standards".
194    
195     * LangTag.pm, URIChecker.pm: Support for new style error
196     level reporting.
197    
198 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
201    
202     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
203     specifying. Error types are revised.
204    
205     2008-08-15 Wakaba <wakaba@suika.fam.cx>
206    
207 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
208     renewed.
209    
210     2008-08-15 Wakaba <wakaba@suika.fam.cx>
211    
212 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
213     are revised. Use new style for "level" specification.
214    
215     * mkhtmlparser.pl: Use new style for "level" specification.
216    
217 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
218    
219 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
220     reporting process for broken ignored valuetype definition.
221     (Valuetype idl_text): Support for special "DOMString" name.
222    
223     2008-08-03 Wakaba <wakaba@suika.fam.cx>
224    
225 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
226     terminal of the ScopedName is "DOMString", such that whether
227     the last part of the scoped name is "DOMString" or "_DOMString"
228     later. It is necessary to determine whether a |typedef|
229     definition should be ignored or not.
230     (parse_char_string): Unescape the identifier of
231     exception members.
232     ($resolve): Return undef for builtin types and sequence<T>
233     types (we might not have to do this, however...).
234     (check): Support checking for Exceptions, Valuetypes,
235     and Typedefs.
236     ($serialize_type): Support for "DOMString::::" syntax.
237     (Typedef idl_text): Output Type as "DOMString" if it
238     is really "DOMString" (i.e. its internal representation
239     is "::DOMString::").
240    
241     2008-08-03 Wakaba <wakaba@suika.fam.cx>
242    
243 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
244     for constant types in the |check| method.
245     (check): Support for checking of attributes, operations, and
246     arguments.
247     (Attribute/Operation idl_text): Exception names in getraises,
248     setraises, and raises clauses is serizlied by |$serialize_type|
249     code.
250    
251 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
252    
253 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
254     hexadecimal numbers (the original pattern, taken from the spec,
255     was not work for hexadecimal numbers, because the "0" prefix
256     matches to the [0-7]* part (as an empty string) and therefore
257     it does not match with remaining "x..." part of a "0x..." integer
258     literal.
259     ($get_type): It now returns a string, not an array reference,
260     for regular types and |sequence| types (i.e. it in any case
261     returns a string).
262     ($get_next_token): The second item in the array that represents
263     a integer or float token is now a Perl number value, not the
264     original string representation of the number.
265     (check): Support for const value consistency checking.
266     No extended attribute is defined for constants.
267     (Node subclasses): Use simple strings rather than array references
268     for default data type values.
269     ($serialize_type): Type values are now simple strings.
270     (value): If the new attribute value is a false value, then
271     a FALSE value is set to the attribute.
272    
273     2008-08-02 Wakaba <wakaba@suika.fam.cx>
274    
275 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
276     in its stringified format ("scoped name" as defined in the
277     spec). Note that future version of this module should not use
278     array references for type values and the |type_text| attribute
279     should be made obsolete.
280     (parse_char_string): Unescape attribute names.
281     (check): Support for checking of whether inherited interfaces
282     are actually defined or not. Support for checking of whether
283     interface member identifiers are duplicated or not.
284     ($serialize_type): Scoped names are returned as is. A future
285     version of this code should escape identifiers other than "DOMString",
286     otherwise the idl_text would be non-conforming.
287    
288     2008-08-02 Wakaba <wakaba@suika.fam.cx>
289    
290 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
291 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
292     for Definition's were ignored.
293     (append_child): Set |parent_node| attribute.
294     (parent_node): New attribute.
295     (check): Support interface/exception members. Support
296     extended attributes. Support definition identifier uniqueness
297     constraint.
298     (qualified_name): New attribute.
299     (Interface/Exception idl_text): Extended attributes were
300     not prepended to the returned text.
301    
302     2008-08-02 Wakaba <wakaba@suika.fam.cx>
303    
304     * WebIDL.pm (parse_char_string): Set line/column numbers
305 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
306     would make things wrong. Support for interface forward
307     declarations was missing. Broken interface declarations
308     with no block were not ignored entirely.
309     (Whatpm::WebIDL::Node): New abstract class. This class
310     makes things easier.
311     (child_nodes): New attribute. Unlike DOM's attribute with
312     same name, this attribute returns a dead list of nodes for
313     simplicity.
314     (get_user_data, set_user_data): New methods.
315     (Module idl_text): A SPACE character should be inserted
316     before the |{| character.
317     (Interface idl_text): Support for interface forward declarations.
318     (is_forward_declaration): New attribute.
319    
320 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
321    
322 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
323    
324     2008-07-19 Wakaba <wakaba@suika.fam.cx>
325    
326 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
327     it now can handle broken extended attributes and as such.
328    
329     2008-07-19 Wakaba <wakaba@suika.fam.cx>
330    
331 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
332     Support for extended attributes with arguments.
333    
334     2008-07-19 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
337     (Interface->idl_text): Tentative support for inheritances.
338    
339     2008-07-19 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
342     Support for raises, setraises, and getraises syntaxes.
343    
344 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
345    
346     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
347     proper support for types, extended attributes, and exceptions yet).
348     WebIDL parser, version 1 (no support for exceptions yet,
349     no proper support for extended attributes yet).
350    
351 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
352    
353     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
354    
355 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
356    
357 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
358    
359     2008-06-29 Wakaba <wakaba@suika.fam.cx>
360    
361 wakaba 1.255 * WebIDL.pm: New module.
362    
363 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
364    
365     * Makefile (Entities.html): URI changed.
366    
367 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
368    
369     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
370    
371 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
372    
373 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
374    
375     2008-06-01 Wakaba <wakaba@suika.fam.cx>
376    
377 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
378     spec (commented out). (HTML5 revision 1711).
379    
380 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
381    
382 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
383    
384     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
385     as UTF-16 (HTML5 revision 1701).
386    
387     2008-05-25 Wakaba <wakaba@suika.fam.cx>
388    
389 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
390     1692).
391    
392     2008-05-25 Wakaba <wakaba@suika.fam.cx>
393    
394     * HTML.pm.src: The secondary insertion mode used when switching
395     to foreign content is the "in body" insertion mode (HTML5 revision
396     1696).
397    
398     2008-05-25 Wakaba <wakaba@suika.fam.cx>
399    
400 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
401     revision 1697).
402    
403 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
404    
405 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
406     insertion mode (HTML5 revision 1693). Update SVG camelCase
407     attribute list (HTML5 revision 1700). <textarea> closes
408     </select> (HTML5 revision 1699). More start tags close in
409     foreign content insertion mode (HTML5 revision 1698).
410    
411     2008-05-24 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
414    
415     2008-05-24 Wakaba <wakaba@suika.fam.cx>
416    
417 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
418     revision 1674).
419    
420     2008-05-24 Wakaba <wakaba@suika.fam.cx>
421    
422 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
423     revision 1676).
424    
425     2008-05-24 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
428     quriks mode detection (HTML5 revision 1679).
429    
430     2008-05-24 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
433     revision 1685).
434    
435     2008-05-24 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
438     revision 1684).
439    
440     2008-05-24 Wakaba <wakaba@suika.fam.cx>
441    
442 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
443     work (HTML5 revision 1690).
444    
445     2008-05-24 Wakaba <wakaba@suika.fam.cx>
446    
447 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
448     end tags in head insertion modes (HTML5 revision 1686).
449     (parse_char_string): Bug fix for non-utf8 character string handlings.
450     (parse_char_stream): |ungetc| does not work well for this context.
451    
452 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
453    
454 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
455     |parse_byte_stream|.
456     (parse_byte_stream): New method.
457    
458     2008-05-18 Wakaba <wakaba@suika.fam.cx>
459    
460 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
461     by encoding layer error reporter.
462    
463 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
466     rather than converting the whole byte string and then parsing.
467     Propagate errors in character encoding layer.
468     (get_next_token): Precise error reporting for |bare stago| error.
469    
470     2008-05-17 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
473     (parse_char_string): This method is now defined as an invocation
474     of the |parse_char_stream| method.
475    
476     2008-05-17 Wakaba <wakaba@suika.fam.cx>
477    
478 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
479     of the sniffing as info-level errors. Support for new
480     decoding framework in parser resestting.
481     (new): Various default error levels were not set.
482    
483     2008-05-17 Wakaba <wakaba@suika.fam.cx>
484    
485 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
486     algorithm, except for the actual sniffing, is implemented
487     with new framework with Message::Charset::Info.
488    
489 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
490    
491     * CacheManifest.pm (_parse): Drop fragment identifiers from
492     URIs in fallback section (HTML5 revision 1596).
493    
494 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
495    
496 wakaba 1.231 * Makefile (Entities.html): URI has changed.
497    
498     2008-05-10 Wakaba <wakaba@suika.fam.cx>
499    
500 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
501     1553).
502    
503 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
504    
505 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
506     no longer table elements, but table objects returned
507     by Whatpm::HTMLTable.
508    
509     * HTMLTable.pm (form_table): Return table element node
510     as |$table->{element}|.
511     (assign_header): Support for the |headers=""| attribute.
512    
513     2008-05-06 Wakaba <wakaba@suika.fam.cx>
514    
515 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
516     with no support for headers="".
517     (form_table): Include table width and height to the returned
518     table object for covenience. Indexing in column assignement was wrong.
519     Set whether a data cell is empty or not for convenience.
520    
521 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
522    
523 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
524    
525     2008-05-05 Wakaba <wakaba@suika.fam.cx>
526    
527 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
528     changed (HTML5 revision 1387).
529    
530     2008-05-05 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
533     to the end of the table (HTML5 revision 1380).
534    
535     2008-05-05 Wakaba <wakaba@suika.fam.cx>
536    
537 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
538     of 1-based (HTML5 revision 1376).
539    
540     2008-05-05 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
543     and "Content-Encoding" no longer prevent sniffing (HTML5
544     revision 1288).
545    
546     2008-05-05 Wakaba <wakaba@suika.fam.cx>
547    
548 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
549     1282).
550    
551 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
552    
553 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
554     Status of XML specs are added.
555    
556     2008-05-03 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
559     and |ref=""| attribute (referent element type checking).
560    
561 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
564     1500) is implemented (except for IDNA part and URI-scheme-specific
565 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
566     documents.
567    
568 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
569    
570 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
571     character (HTML5 revision 1263).
572    
573     2008-04-13 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
576     definition for |AElig;| was missing).
577    
578     2008-04-13 Wakaba <wakaba@suika.fam.cx>
579    
580 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
581     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
582    
583     2008-04-13 Wakaba <wakaba@suika.fam.cx>
584    
585 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
586     insertion mode is added (HTML5 revisions 1412 and 1418).
587    
588     2008-04-13 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
591     elements (HTML5 revision 1410).
592    
593     2008-04-13 Wakaba <wakaba@suika.fam.cx>
594    
595 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
596     revision 1406).
597    
598     2008-04-13 Wakaba <wakaba@suika.fam.cx>
599    
600 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
601     elements (HTML5 revision 1404 and 1420).
602    
603 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
606     elements (HTML5 revision 1404). Unused !!!macro definitions
607     are removed.
608    
609     2008-04-12 Wakaba <wakaba@suika.fam.cx>
610    
611 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
612     are raised is changed (HTML5 revision 1404).
613    
614 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
615    
616 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
617    
618     2008-04-06 Wakaba <wakaba@suika.fam.cx>
619    
620 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
621     replaced by constants representing element category.
622    
623     2008-04-06 Wakaba <wakaba@suika.fam.cx>
624    
625 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
626     for error reporting. (This is a preparation for using constant
627     value for ->[1].)
628    
629 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
630    
631 wakaba 1.205 * RDFXML.pm: Typo fixed.
632    
633     2008-03-22 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
636     as same as |must_level|, i.e. level = |m|.
637     (check_element): Make list of URIs in the DOM.
638    
639 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
640    
641 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
642     Local (null-namespace) attribute support.
643    
644     2008-03-21 Wakaba <wakaba@suika.fam.cx>
645    
646 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
647    
648     2008-03-21 Wakaba <wakaba@suika.fam.cx>
649    
650 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
651     and URI attributes is implemented. Warn if unknown
652     value is used in rdf:parseType="" attribute.
653    
654     * URIChecker.pm (check_rdf_uri_reference): New function.
655    
656     2008-03-21 Wakaba <wakaba@suika.fam.cx>
657    
658 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
659     are now resolved.
660    
661     2008-03-21 Wakaba <wakaba@suika.fam.cx>
662    
663 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
664    
665     * RDFXML.pm: undef vs false bug fixed.
666     Reification implemented.
667    
668     2008-03-21 Wakaba <wakaba@suika.fam.cx>
669    
670 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
671     The |node| arguments are added for |ontriple| calls.
672     Too many "attribute not allowed" errors were raised.
673    
674     * ContentChecker.pm: Initial experimental support for rdf:RDF
675     element.
676    
677     2008-03-21 Wakaba <wakaba@suika.fam.cx>
678    
679 wakaba 1.197 * RDFXML.pm: New module.
680    
681 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
682    
683 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
684     code was old one yet.
685    
686     2008-03-20 Wakaba <wakaba@suika.fam.cx>
687    
688 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
689     errors. Line/column markings for DOCTYPE, comment, and
690     character tokens are reintroduced; otherwise,
691     error location for "not HTML5" error and errors
692     for implied elements are not attached.
693    
694     2008-03-20 Wakaba <wakaba@suika.fam.cx>
695    
696 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
697    
698     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
699     (value): Setter implemented.
700    
701     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
702    
703     2008-03-20 Wakaba <wakaba@suika.fam.cx>
704    
705 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
706    
707     2008-03-20 Wakaba <wakaba@suika.fam.cx>
708    
709 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
710     flag, which is no longer used.
711    
712 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
715    
716     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
717     and !!!insert-element-t): Set line/column information to
718     element nodes.
719    
720     2008-03-17 Wakaba <wakaba@suika.fam.cx>
721    
722 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
723     in "<?", "<>", or "</>" should be the error point.
724    
725 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
728    
729     2008-03-16 Wakaba <wakaba@suika.fam.cx>
730    
731 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
732    
733     2008-03-16 Wakaba <wakaba@suika.fam.cx>
734    
735 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
736     reporting.
737    
738 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
739    
740 wakaba 1.186 * HTML.pm.src: Error type revised.
741    
742     2008-03-11 Wakaba <wakaba@suika.fam.cx>
743    
744 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
745    
746     2008-03-11 Wakaba <wakaba@suika.fam.cx>
747    
748 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
749    
750 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
751    
752     * mkhtmlparser.pl: Set "level" parameter to parse errors.
753    
754     * HTML.pm.src: Code refined.
755    
756 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
757    
758 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
759     revision 1348). Note that I really don't know this makes
760     any difference in the black-box behavior of the parser.
761    
762     2008-03-09 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
765     revision 1348).
766    
767     2008-03-09 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
770    
771     2008-03-09 Wakaba <wakaba@suika.fam.cx>
772    
773 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
774    
775     2008-03-09 Wakaba <wakaba@suika.fam.cx>
776    
777 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
778     mode and support for the "in select in table" insertion mode (HTML5
779     revision 1342).
780    
781     2008-03-09 Wakaba <wakaba@suika.fam.cx>
782    
783 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
784     in non-tainted <table>s (HTML5 revision 1336).
785    
786     2008-03-09 Wakaba <wakaba@suika.fam.cx>
787    
788 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
789     and <head> (HTML5 revision 1332).
790    
791     2008-03-09 Wakaba <wakaba@suika.fam.cx>
792    
793 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
794     white space (HTML5 revision 1331).
795    
796 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
797    
798 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
799     element (HTML5 revision 1330).
800    
801     2008-03-08 Wakaba <wakaba@suika.fam.cx>
802    
803 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
804     element (HTML5 revision 1328).
805    
806     2008-03-08 Wakaba <wakaba@suika.fam.cx>
807    
808 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
809     into foster parents (HTML5 revision 1326).
810    
811     2008-03-08 Wakaba <wakaba@suika.fam.cx>
812    
813 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
814     revision 1321).
815    
816     2008-03-08 Wakaba <wakaba@suika.fam.cx>
817    
818 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
819     the spec (HTML5 revision 1320). Note that we finally complete
820     all of HTML5 revision 1320 changes.
821    
822     2008-03-08 Wakaba <wakaba@suika.fam.cx>
823    
824 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
825     tags (HTML5 revision 1320).
826    
827     2008-03-08 Wakaba <wakaba@suika.fam.cx>
828    
829 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
830    
831     2008-03-08 Wakaba <wakaba@suika.fam.cx>
832    
833 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
834     could not be reached so that it is now removed (HTML5 revision
835     1320).
836    
837     2008-03-08 Wakaba <wakaba@suika.fam.cx>
838    
839 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
840     to align with new spec text (HTML5 revision 1320).
841    
842     2008-03-08 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
845     have had no harm in theory.
846    
847     2008-03-08 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
850     case is split from other end tags for the preparation
851     of implementing HTML revision 1320.
852    
853 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
854    
855 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
856     revision 1320).
857    
858     2008-03-07 Wakaba <wakaba@suika.fam.cx>
859    
860 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
861     and "div" (HTML5 revision 1318). Add comments to where
862     |form| pointer association codes should be inserted (HTML5 revision
863     1319).
864    
865 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
866    
867     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
868    
869 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
870    
871 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
872     other token type, there are), it is replaced by a |die| statement.
873    
874     2008-03-05 Wakaba <wakaba@suika.fam.cx>
875    
876 wakaba 1.159 * HTML.pm.src: Typo fixed.
877    
878 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
879    
880     * HTML.pm.src (_tree_construction_initial): Some limited quirks
881     doctypes were not uppercased for comparison.
882    
883 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
884    
885 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
886     are added.
887    
888     2008-03-03 Wakaba <wakaba@suika.fam.cx>
889    
890 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
891     are set. Cases that are unlikely reached are noted as so.
892    
893     2008-03-03 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
896    
897     * mkhtmlparser.pl: Support for |!!!cp| syntax.
898    
899     2008-03-03 Wakaba <wakaba@suika.fam.cx>
900    
901 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
902     for simplicity.
903    
904     2008-03-03 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
907     revision 1307).
908    
909     2008-03-03 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
912     is set are changed (HTML5 revision 1305).
913    
914 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
917     1292). Entities are not parsed in comment-like part in RCDATA
918     elements (HTML5 revision 1294). Allow bare & at the end
919     of attribute value literals (HTML5 revision 1296). More
920     quirks mode doctypes (HTML5 revision 1302). Requires spaces
921     between attributes and ban attribute names or unquoted
922     attribute values containing single or double quotes (HTML5
923     revision 1303).
924    
925     2008-03-02 Wakaba <wakaba@suika.fam.cx>
926    
927 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
928     and related errors unless it is an HTML document (though the spec
929     is unclear on whether it is applied to XHTML document).
930    
931     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
932     are added.
933    
934     2008-03-02 Wakaba <wakaba@suika.fam.cx>
935    
936 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
937    
938 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
939    
940     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
941    
942     * HTML.pm.src: |charset| in |content| attribute is
943     case-insensitive (HTML5 revision 1270).
944    
945 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
946    
947     * ContentChecker.pm: New status constants are added.
948     ($ElementDefault): |status| added.
949     (check_element): Err for non-standard or deprecated elements.
950     (_attr_status_info): For non-standard or deprecated attributes.
951    
952 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
953    
954 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
955    
956     2008-02-24 Wakaba <wakaba@suika.fam.cx>
957    
958 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
959     status information is now dispatched.
960    
961     2008-02-24 Wakaba <wakaba@suika.fam.cx>
962    
963 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
964     problem by adding some more arguments.
965    
966     2008-02-24 Wakaba <wakaba@suika.fam.cx>
967    
968 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
969     container-for-the-purpose-of-content-model element (not transparent
970     element) for |check_child_element| calling and significant
971     text flag marking. This reintroduces |<del>|-and-significant
972     problem again.
973    
974     2008-02-24 Wakaba <wakaba@suika.fam.cx>
975    
976 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
977     elements ignored for the purpose of phase changes in
978     content model checking.
979    
980 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
981    
982 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
983     was not properly managed for transparent cases.
984    
985     2008-02-23 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
988     and |audio| as semi-transparent elements.
989    
990     2008-02-23 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
993     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
994     Support for |html:object| as a semi-transparent element.
995    
996     2008-02-23 Wakaba <wakaba@suika.fam.cx>
997    
998 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
999     the tree is entirely revised to make it easier to track
1000     the state of ancestors/descendants. As a result of this
1001     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1002     support for content model checking for HTML elements |figure|,
1003     |object|, |video|, and |audio| and checking for XML elements (and
1004     some XMLNS checkings) are dropped for now. They will be
1005     reimplemented in due cource.
1006    
1007 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1008    
1009 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1010     token (HTML5 revision 1225).
1011    
1012     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1015    
1016     * HTML.pm.src: Save whether |meta| |content| attribute
1017     contains character references or not.
1018    
1019     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1020    
1021 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1022     imlement significant content checking for contents
1023     with |del| elements.
1024    
1025     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1026    
1027 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1028     elements are added.
1029     (_get_children): HTML |object| elements are now semi-transparent.
1030    
1031     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1032    
1033 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1034    
1035     * CacheManifest.pm: HTML5 revision 1211 implemented.
1036    
1037     * CacheManifest.pod: Updated.
1038    
1039 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1040    
1041     * ContentChecker.pm (check_document, check_element): Support
1042     for second argument ($onsubdoc).
1043     (_get_css_parser): Removed (now it is part of WDCC).
1044    
1045 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1046    
1047     * ContentChecker.pm (_get_css_parser): New.
1048    
1049 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1050    
1051 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1052     for checking had been used.
1053    
1054     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1055    
1056 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1057    
1058 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1059    
1060 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1061     specced in RFC 2046 and RFC 4288.
1062     (application/atom+xml): Definition added.
1063    
1064     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1065    
1066 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1067     warnings (rather than SHOULD-level errors).
1068    
1069     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1070    
1071 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1072    
1073     * HTML.pm.src: A flag for character references in attribute
1074     values are added. Set |manakai_has_reference| user data
1075     to |charset| attribute.
1076    
1077     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1078    
1079 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1080     attributes.
1081    
1082     * ContentChecker.pm (check_document): Warn if charset requirements
1083     cannot be tested.
1084    
1085 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1086    
1087     * HTML.pm.src (parse_byte_string): Detect charset
1088     by universalchardet if charset parameter is not specified.
1089    
1090     * Makefile (Charset-all, Charset-clean): New rules.
1091    
1092 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1093    
1094 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1095     of character encoding declaration and selection of encoding
1096     for HTML document.
1097    
1098     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1099    
1100 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1101     type in list context.
1102    
1103     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1104    
1105 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1106     ignoring (HTML5 revisions 1013 and 1016).
1107    
1108     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1109    
1110 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1111     white spaces, are implemented.
1112    
1113 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1116     attribute if possible.
1117    
1118     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1121     (parse_char_string): New alias for |parse_string|.
1122     (main phase): Invoking "change the encoding" algorithm if desired.
1123    
1124     * HTML.pod: Updated.
1125    
1126     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1127    
1128 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1129    
1130     * Makefile (HTML-all, HTML-clean): New.
1131    
1132     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1133    
1134     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1135    
1136 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1137    
1138     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1139    
1140 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1141    
1142 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1143    
1144     * CacheManifest.pod: New file.
1145    
1146     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1147    
1148 wakaba 1.114 * CacheManifest.pm: New module.
1149    
1150     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1151    
1152 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1153     callback.
1154    
1155     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1156    
1157 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1158    
1159 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.111 * Makefile (clean): New rule.
1162    
1163     * NanoDOM.pm (public_id, system_id): New attributes.
1164    
1165     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1166    
1167 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1168    
1169 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1170    
1171     * ContentChecker.pm (check_document): Support for
1172     new |is_xml_root| flag.
1173     (check_element): Support for new |pluses| state.
1174     (_add_pluses): New method.
1175     (_remove_minuses): Support for new |minus| item.
1176    
1177 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1178    
1179 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1180     root element.
1181    
1182     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1183    
1184 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1185    
1186 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1189    
1190     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1191    
1192 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1193    
1194     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1195    
1196 wakaba 1.103 * LangTag.pm: New module.
1197    
1198 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1199    
1200     * HTML.pm.src: Some error types were wrong.
1201    
1202 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1203    
1204 wakaba 1.101 * CSS/: New directory.
1205    
1206     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1207    
1208 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1209     for |noscript| in |head|.
1210    
1211 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1212    
1213     * URI/: New directory.
1214    
1215 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1216    
1217 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1218     number.
1219    
1220     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1221    
1222 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1223     by |&|s.
1224    
1225     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1228    
1229     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1230    
1231 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1232    
1233     * HTML.pm.src: Insertion modes are now represented in number.
1234    
1235     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1236    
1237 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1238     999) is implemented.
1239    
1240 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1241    
1242     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1243     983) is implemented.
1244    
1245 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1246    
1247     * ContentChecker.pod: New documentation.
1248    
1249     * Makefile: A rule for |ContentChecker.html| is added.
1250    
1251     * ContentChecker.pm: A pod "LICENSE" section is added.
1252    
1253     * NanoDOM.pm ($VERSION): New variable.
1254    
1255 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1258     list of allowed HTML elements.
1259    
1260     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1263     HTML elements.
1264    
1265     * URIChecker.pm (check_iri): New.
1266     (check_iri_reference): Error type for IRI reference
1267     syntax error is changed.
1268    
1269 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1270    
1271 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1272     is added.
1273     (check_document): Load appropriate module before validation.
1274    
1275     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1276    
1277 wakaba 1.88 * ContentChecker/: New directory.
1278    
1279     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1282    
1283     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1284    
1285 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1288     even if no document element is found.
1289    
1290     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1291    
1292 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1293    
1294     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1295    
1296 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1297    
1298     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1299    
1300 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1301     is split into "after html body" and "after html frameset"
1302     insertion modes. Their codes are merged with "after body"
1303     and "after frameset" codes. |$previous_insertion_mode|
1304     has been removed. "after frameset" code is
1305     merged with "in frameset" code.
1306    
1307     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1308    
1309 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1310     merged with the "in head" insertion mode.
1311    
1312     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1313    
1314 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1315     reorganized.
1316    
1317     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1318    
1319 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1320     are merged.
1321    
1322     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1323    
1324 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1325     with "in table" insertion mode.
1326    
1327     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1328    
1329 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1330     modes are merged.
1331    
1332     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1333    
1334 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1335     insertion mode!
1336    
1337     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1340     now directly invoke the handler.
1341    
1342     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1345     is merged to the "in body" insertion mode code.
1346    
1347     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1350     insertion modes are merged.
1351    
1352     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1355    
1356     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1357    
1358 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1359     instead of string constant for |{content_model_flag}|.
1360    
1361     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1364     to the "duplicate attribute" error.
1365    
1366 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1369    
1370     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1371    
1372 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1373    
1374     * HTML.pm.src: A typo is fixed.
1375    
1376 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1377    
1378 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1379     from "in HTML:xml:lang" error. Character position
1380     is now the last part of the error type in the URI error
1381     description. Report "unsupported" status for language
1382     tags, media queries, script codes, and style sheets.
1383    
1384     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1385    
1386 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1387     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1388     in XML.
1389    
1390     * NanoDOM.pm (Attr.owner_document): New attribute.
1391    
1392     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1393    
1394 wakaba 1.66 * HTML.pm.src: The character immediately following
1395     a bare |hcro| was discarded. Fix handling of
1396     entity references in attribute values.
1397    
1398     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1399    
1400 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1401     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1402     factored out. Error types |in html:#DOCTYPE|
1403     and |after html:#DOCTYPE| are merged
1404     into |DOCTYPE in the middle|. |</frameset>|
1405     in fragment parsing mode changed the
1406     insertion mode.
1407    
1408     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1409    
1410 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1411     phase is now an insertion mode. Treatments for white
1412     space character tokens were incorrect for some
1413     insertion modes. An old |meta| case was not removed.
1414    
1415     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1416    
1417 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1418     implemented (but changing the encoding is not yet:-).
1419    
1420 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1421    
1422 wakaba 1.62 * Charset/: New directory.
1423    
1424     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1425    
1426 wakaba 1.61 * H2H.pm: New Perl module (created from
1427     manakai's H2H.dis).
1428    
1429     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1430    
1431 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1432     manakai's SimpleLS.dis).
1433    
1434 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1435    
1436     * HTML.pm.src: |<!---x-->| was not processed correctly.
1437    
1438 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1439    
1440 wakaba 1.58 * HTML.pm.src: Report correct error message
1441     for |<body></div></body>|.
1442    
1443     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1446    
1447     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1448    
1449     * ContentChecker.pm: Return |{term}| list.
1450    
1451 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1452    
1453 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1454     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1455     and <title> outside of head).
1456    
1457     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1458    
1459 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1460     and private types/subtypes.
1461    
1462     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1463     URIChecker.pm, HTMLTable.pm: Error messages are now
1464     consistent; they are all listed in
1465     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1466    
1467 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1468    
1469 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1470     is now erred. |<datalist>| is implemented.
1471     Attribute checker for |<command>| and |<menu>| are
1472     added. Support for |contextmenu| global attribute
1473     is added.
1474    
1475     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1476    
1477 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1478     of Step 3 has been changed.
1479    
1480     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1483    
1484 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1485    
1486 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1487     incorrect.
1488    
1489     * ContentChecker.pm: Support for the |footer| element.
1490     Check URI syntax for space-separated URI attributes.
1491     Support for the |tabindex| attribute. Support
1492     for |datetime| attribute.
1493    
1494     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1497     character references). HTML5 revision 1.146 (white space
1498     characters before root start tag). HTML5 revision
1499     1.148 (named character references in attribute values).
1500     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1501    
1502     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1503    
1504     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1505 wakaba 1.49 in <head>).
1506    
1507     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1508    
1509 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1510     in body).
1511    
1512     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1513    
1514     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1515    
1516 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1517     or RCDATA in fragment parsing mode). HTML5 revision
1518     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1519    
1520     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1521    
1522     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1523     for <pre> and <textarea>).
1524    
1525 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1526    
1527 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1528    
1529     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1530    
1531 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1532    
1533     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1536     HTML5 revision 917 (conforming bare &).
1537    
1538     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1539    
1540 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1541    
1542     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1543     HTML5 revision 915 (<nobr>).
1544    
1545     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1546    
1547 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1548    
1549     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1550     New attributes.
1551    
1552     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1553    
1554 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1555     in tags).
1556    
1557     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1558    
1559 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1560    
1561     * HTML.pm.src: |$entity_char| is removed and
1562     requires |Whatpm::_NamedEntityList| instead.
1563     HTML5 revision 898 (refc), except that lack of refc
1564     is parse error.
1565    
1566     * mkentitylist.pl: New script.
1567    
1568     * Makefile (all): |_NamedEntityList.pm| is added.
1569     (_NamedEntityList.pm, Entities.html): New rules.
1570    
1571     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1574     were ignored and U+000D immediately following another
1575     U+000D was not converted to U+000A.
1576    
1577     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1578    
1579 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1580     nodes before appended). Parser was not ready for NULL
1581     parse error and escape flag.
1582    
1583     * NanoDOM.pm (adopt_node): New.
1584    
1585     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1588     CDATA and RCDATA). Note that current implementation
1589     is simply repeating what the spec says and it is maybe not
1590     a best way to do it.
1591    
1592     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1593    
1594 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1595     the form element if a descendant element without implied end tag has
1596     still been open).
1597    
1598     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1599    
1600 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1601     space characters non-conforming).
1602    
1603     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1604    
1605 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1606     HTML5 revision 869 (C1 character references).
1607    
1608     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1611     a |textarea| is removed).
1612    
1613 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1614    
1615     * NanoDOM.pm (get_attribute_node_ns): New method.
1616    
1617     * ContentChecker.pm: |script| |async| and |defer|
1618     no longer require |src|. |async| MUST NOT be
1619     specified if |defer|. (HTML5 revision 858).
1620    
1621 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1622    
1623     * HTML.pm.src: |<form><form>| went to inifinite loop.
1624    
1625 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1626    
1627 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1628     as a document element) flag on.
1629     (new): Removed.
1630     (check_document): New method.
1631    
1632     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1633    
1634 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1635     were incorrect.
1636    
1637     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1638    
1639 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1640     error position in the |type| option of the error.
1641    
1642     * HTMLTable.pm (form_table): The |$onerror| parameter
1643     is now optional. Some bugs are fixed.
1644    
1645     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1646    
1647 wakaba 1.27 * HTMLTable.pm: New module.
1648    
1649     * ContentChecker.pm (table): Invoke table model error checker.
1650    
1651     * NanoDOM.pm (first_child, get_attribute_ns): New.
1652    
1653 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1654    
1655 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1656     (link/@rel, a/@rel, area/@rel): Use new checker.
1657    
1658     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1659    
1660     * _LinkTypeList.pm: New file.
1661    
1662     * mklinktypelist.pl: New file.
1663    
1664     * .cvsignore: |RelExtensions.html| added.
1665    
1666     * NanoDOM.pm (child_nodes): Returns an empty array
1667     for non-child-containing node types.
1668     (text_content): New attribute.
1669    
1670     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1671    
1672 wakaba 1.25 * IMTChecker.pm: New module.
1673    
1674     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1675     to test parameter value validity.
1676    
1677     * HTML.pm.src ($style_start_tag): Attributes were
1678     discarded.
1679    
1680 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1681    
1682 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1683    
1684     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1685    
1686 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1687     and RFC 3987 are listed (not all testable items are checked yet).
1688    
1689     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1690    
1691 wakaba 1.22 * URIChecker.pm: New module.
1692    
1693 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1696     or associated to some placeholder.
1697     ($ElementDefault): Warn unknown attributes
1698     for unknown elements as "attribute not supported".
1699     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1700     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1701     (|a| attribute checker): Reimplemented.
1702    
1703     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1704    
1705 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1706     ($HTMLAttrChecker): Event handler content attributes
1707     are added.
1708     (link, embed): Required attribute is now checked.
1709     (embed): Unknown local attributes are no longer warned.
1710    
1711     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1712    
1713 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1714     ($HTMLIMTAttrChecker): New checker.
1715     (link@rel, link@href, link@type, style@type,
1716     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1717     object@data, object@type, source@src, source@type, area@alt,
1718     area@shape, area@coords, area@href,
1719     area@ping, area@rel, area@type, script@src,
1720     script@defer, script@async, script@type): Checkers added.
1721    
1722     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1723    
1724 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1725    
1726 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1727    
1728 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1729     attributes. Report an error if |Element.prefix|
1730     is |xmlns|.
1731    
1732     * NanoDOM.pm (prefix): New attribute.
1733    
1734     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1735    
1736 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1737     mode, action for |<iframe>| was missing.
1738    
1739     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1740    
1741 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1742     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1743    
1744     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1745    
1746 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1747     ($HTMLURIAttrChecker): Placeholder.
1748     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1749     New.
1750    
1751     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1752    
1753 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1754     for global attributes, |html|, |base|, |style|, and |meta|.
1755    
1756     * NanoDOM.pm (insert_before): Weaken reference
1757     to the parent node.
1758     (Attr::new): Set |owner_element| attribute.
1759     (namespace_uri, manakai_local_name): New attribute implementations.
1760     (owner_element): New attribute.
1761    
1762     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1763    
1764 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1765     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1766     $Element->{$HTML_NS}->{''}): New.
1767     (check_element): Invoke attrs_checker for each element.
1768    
1769 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1770    
1771 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1772    
1773     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1774    
1775 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1776     element type testings.
1777    
1778     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1779    
1780 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1781     error when an element type is put in the "minus" list
1782     and the element type is not allowed explicitly in the particular
1783     element content model.
1784     (html:a checker): New checker.
1785     (html:details, html:datagrid): New checkers.
1786     (html:legend): New checker.
1787    
1788     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1789    
1790 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1791    
1792     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1793    
1794 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1795     checker.
1796     (html:dd checker): New checker.
1797     (html:q, html:em, html:strong, html:small,
1798     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1799    
1800     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1801    
1802 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1803     from |$ElementDefault->{checker}|.
1804     ($ElementDefault->{checker}): Throw an error that
1805     the element type is not supported by the checker.
1806     ($HTMLMetadataElement): |html:base| was missing.
1807     ($HTMLEmptyChecker): Don't throw an error
1808     for inter-element whitespace nodes.
1809     (html:html checker): Errors were not
1810     thrown even if |html:head| and/or |html:body|
1811     children were missing.
1812     (html:head checker): An error was not
1813     thrown if <meta charset> appered after other
1814     elements.
1815    
1816 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1817    
1818     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1819     and |noscript| elements are implemented.
1820     (new): New method.
1821    
1822 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1823    
1824 wakaba 1.4 * ContentChecker.pm: New module.
1825    
1826     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1827    
1828 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1829     document_element, manakai_local_name,
1830     manakai_element_type_match): New method.
1831    
1832     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1833    
1834     * HTML.pm.src: Replace decimal and hexadecimal numeric
1835     entities in C1 range using Windows-1252 mapping. Bare LF
1836     did not count as new line for error reporting.
1837    
1838 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1839    
1840     * NanoDOM.pm (DOMImplementation): New class.
1841     (append_child): Weaken the |parent_node| reference.
1842     (create_element_ns, Element new): Set the |owner_document|
1843     reference.
1844     (implementation): New attribute.
1845     (owner_document, local_name, namespace_uri): New attributes.
1846    
1847     * HTML.pm.src (parse_string): Line and column numbers
1848     are now provided to error handler.
1849     (!!!parse-error): Short descriptions are added.
1850     (_construct_tree): Split into three methods; support
1851     for innerHTML mode.
1852     (set_inner_html): New method.
1853    
1854 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1855    
1856     * NanoDOM.html: Documentation is added.
1857    
1858     * HTML.pod, ContentType.html: Documentation is revised.
1859    
1860     * .cvsignore: Pod2html temporary files are added.
1861    
1862     * Makefile: Make |NanoDOM.html|.
1863    
1864     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1865    
1866     * HTML.pm.src (parse_string): New method.
1867     (get_inner_html): Renamed from |inner_html|.
1868    
1869     * Makefile: A rule for |HTML.html| is added.
1870    
1871     * HTML.pod: New documentation.
1872    
1873     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1874    
1875     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1876     (clone_node): Attribute nodes were not completely copied.
1877    
1878     * HTML.pm.src: Many bugs are fixed.
1879    
1880     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1881    
1882     * HTML.pm.src: Some typos are fixed.
1883    
1884     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1885    
1886     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1887     are removed.
1888    
1889     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1890    
1891     * HTML-consume-entity.src: Removed.
1892    
1893     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1894    
1895     * HTML.pm.src: Tokenizer's handling on hexadecimal
1896     numeric entities are rewritten.
1897    
1898     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1899    
1900     * HTML.pm.src: Some tokenizer bugs are fixed.
1901    
1902     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1903    
1904     * HTML.pm.src: The tree construction stage is implemented.
1905    
1906     * mkhtmlparser.pl: New macros are added.
1907    
1908     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1909    
1910     * ContentType.pm: A note on bug in the specification
1911     is removed since it's been now fixed.
1912    
1913     * .cvsignore: New file.
1914    
1915     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1916    
1917     * HTML.pm.src, HTML-consume-entity.src: New files.
1918    
1919     * Makefile (HTML.pm): New rule.
1920    
1921     * mkhtmlparser.pl: New script.
1922    
1923     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1924    
1925     * Makefile: New file.
1926    
1927     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1928    
1929     * ContentType.pm: An error in pod is fixed.
1930    
1931     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1932    
1933     * ContentType.pm: Documentation is added.
1934    
1935     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1936    
1937     * ContentType.pm: New Perl module.
1938    
1939     * ChangeLog: New file.
1940    
1941    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24