/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.304 - (hide annotations) (download)
Sun Sep 14 06:58:28 2008 UTC (16 years, 1 month ago) by wakaba
Branch: MAIN
Changes since 1.303: +5 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 06:57:36 -0000
	* HTML.pm.src: It turns out that U+FFFD don't have to
	be added to the list of excluded characters.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 06:58:07 -0000
	* DecodeHandle.pm: Merge the EUCJP class into the Encode class.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
4     be added to the list of excluded characters.
5    
6     2008-09-14 Wakaba <wakaba@suika.fam.cx>
7    
8 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
9     and |column| a higher priority than the one set by the
10     tokenizer's input handler.
11     ($self->{read_until}): Exclude U+FFFD (but this might
12     not be necessary, since now we do line/column fixup in
13     the character decode handle).
14    
15     2008-09-14 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
18    
19     2008-09-14 Wakaba <wakaba@suika.fam.cx>
20    
21 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
22     and |manakai_getc_until| to |manakai_read_until| to
23     reduce the number of string copies.
24    
25     2008-09-14 Wakaba <wakaba@suika.fam.cx>
26    
27 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
28     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
29     standard feature to |open| a string as a filehandle,
30     since Perl's string filehandle seems not supporting |ungetc|
31     method correctly.
32     (parse_char_stream): Define |{getc_until}| method.
33     (DATA_STATE): Experimental support for |getc_until| feature.
34    
35 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
36    
37 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
38    
39     2008-09-13 Wakaba <wakaba@suika.fam.cx>
40    
41 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
42     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
43     and replaced by |{prev_state}|.
44    
45     * mkhtmlparser.pl: Remove |{char}| feature.
46     Remove |!!!back-next-input-character;| macro.
47    
48     2008-09-13 Wakaba <wakaba@suika.fam.cx>
49    
50 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
51     entity related tokenizer states in favor of new states
52     implementing the consume character reference algorithm.
53    
54     2008-09-13 Wakaba <wakaba@suika.fam.cx>
55    
56     * HTML.pm.src: "Consume a character reference" algorithm is
57 wakaba 1.296 now implemented as a tokenizer's state, rather than
58     a method, with minimum changes (more changes will
59     be made, in due course). "Bogus comment state"'s inner
60     loop gets removed.
61    
62     2008-09-13 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
65 wakaba 1.295 into their own tokenizer states.
66    
67     2008-09-13 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
70     is split into three states.
71    
72     2008-09-13 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
75     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
76     no longer does the tokenizer have to push back next input
77     characters in those states.
78    
79     2008-09-13 Wakaba <wakaba@suika.fam.cx>
80    
81 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
82     into four states so that no longer does the tokenizer have to push
83     back next input characters in that state.
84    
85 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
86    
87     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
88     which can be used to insert some wrapper between the character
89     stream handle and the tokenizer. (It is currently not supported
90     for |set_inner_html| for |Element|s).
91    
92 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
93    
94 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
95    
96     2008-09-10 Wakaba <wakaba@suika.fam.cx>
97    
98 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
99    
100     * HTML.pm.src: Don't specify |text| argument for the
101     |chardecode:fallback| error, since it is not the encoding
102     being used alternatively.
103    
104 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
105    
106     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
107    
108 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
109    
110 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
111    
112     2008-08-31 Wakaba <wakaba@suika.fam.cx>
113    
114 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
115     to after after frameset insertion mode processing (HTML5
116     revision 1909). Note that the implementation was wrong
117     per the old spec before the r1909 changes.
118    
119 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
120    
121 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
122     spec (HTML5 revision 2093).
123     ($process_row): Algorithm step numbers synced with the
124     spec (HTML5 revision 2092).
125    
126     2008-08-30 Wakaba <wakaba@suika.fam.cx>
127    
128 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
129     revision 2094).
130    
131     2008-08-30 Wakaba <wakaba@suika.fam.cx>
132    
133     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
134    
135     2008-08-30 Wakaba <wakaba@suika.fam.cx>
136    
137 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
138     name (after another attribute) now raise parse error (HTML5
139     revision 2123). Empty unquoted attribute values are no
140     longer allowed (HTML5 revision 2122).
141    
142     2008-08-30 Wakaba <wakaba@suika.fam.cx>
143    
144 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
145     revision 2130).
146    
147     2008-08-30 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
150     as |lang| attribute value for HTML elements (HTML5 revision 2062
151     and so on).
152    
153     2008-08-30 Wakaba <wakaba@suika.fam.cx>
154    
155 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
156     was missing.
157    
158     * URIChecker.pm: The end of the URL should be marked as the
159     error location for an empty path error. The position
160     between the userinfo and the port components should be
161     marked as the error location for an empty host error.
162    
163     2008-08-30 Wakaba <wakaba@suika.fam.cx>
164    
165 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
166     value the error occurs for errors. Report unknown
167     address format error in warning level, since address
168     formats are rarely added. Path segments starting with "/.."
169     were misinterpreted as a dot-segment.
170    
171     2008-08-30 Wakaba <wakaba@suika.fam.cx>
172    
173 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
174     |Message::DOM::DOMImplementation|.
175    
176 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
177    
178     * IMTChecker.pm: Updated for the new error reporting architecture.
179    
180     * ContentChecker.pm: Error levels for IMTs are added.
181    
182 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
183    
184     * H2H.pm (_shift_token): Support for unquoted HTML attribute
185     values.
186    
187 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
188    
189 wakaba 1.275 * CacheManifest.pm: Support for new style of error
190     reports.
191    
192     * HTML.pm.src: Set line=1, column=1 to the document node.
193    
194     2008-08-16 Wakaba <wakaba@suika.fam.cx>
195    
196 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
197     and URL checkers. Support for more error levels for bogus
198     langauge tag and URL "standards".
199    
200     * LangTag.pm, URIChecker.pm: Support for new style error
201     level reporting.
202    
203 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
204    
205 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
206    
207     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
208     specifying. Error types are revised.
209    
210     2008-08-15 Wakaba <wakaba@suika.fam.cx>
211    
212 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
213     renewed.
214    
215     2008-08-15 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
218     are revised. Use new style for "level" specification.
219    
220     * mkhtmlparser.pl: Use new style for "level" specification.
221    
222 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
223    
224 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
225     reporting process for broken ignored valuetype definition.
226     (Valuetype idl_text): Support for special "DOMString" name.
227    
228     2008-08-03 Wakaba <wakaba@suika.fam.cx>
229    
230 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
231     terminal of the ScopedName is "DOMString", such that whether
232     the last part of the scoped name is "DOMString" or "_DOMString"
233     later. It is necessary to determine whether a |typedef|
234     definition should be ignored or not.
235     (parse_char_string): Unescape the identifier of
236     exception members.
237     ($resolve): Return undef for builtin types and sequence<T>
238     types (we might not have to do this, however...).
239     (check): Support checking for Exceptions, Valuetypes,
240     and Typedefs.
241     ($serialize_type): Support for "DOMString::::" syntax.
242     (Typedef idl_text): Output Type as "DOMString" if it
243     is really "DOMString" (i.e. its internal representation
244     is "::DOMString::").
245    
246     2008-08-03 Wakaba <wakaba@suika.fam.cx>
247    
248 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
249     for constant types in the |check| method.
250     (check): Support for checking of attributes, operations, and
251     arguments.
252     (Attribute/Operation idl_text): Exception names in getraises,
253     setraises, and raises clauses is serizlied by |$serialize_type|
254     code.
255    
256 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
257    
258 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
259     hexadecimal numbers (the original pattern, taken from the spec,
260     was not work for hexadecimal numbers, because the "0" prefix
261     matches to the [0-7]* part (as an empty string) and therefore
262     it does not match with remaining "x..." part of a "0x..." integer
263     literal.
264     ($get_type): It now returns a string, not an array reference,
265     for regular types and |sequence| types (i.e. it in any case
266     returns a string).
267     ($get_next_token): The second item in the array that represents
268     a integer or float token is now a Perl number value, not the
269     original string representation of the number.
270     (check): Support for const value consistency checking.
271     No extended attribute is defined for constants.
272     (Node subclasses): Use simple strings rather than array references
273     for default data type values.
274     ($serialize_type): Type values are now simple strings.
275     (value): If the new attribute value is a false value, then
276     a FALSE value is set to the attribute.
277    
278     2008-08-02 Wakaba <wakaba@suika.fam.cx>
279    
280 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
281     in its stringified format ("scoped name" as defined in the
282     spec). Note that future version of this module should not use
283     array references for type values and the |type_text| attribute
284     should be made obsolete.
285     (parse_char_string): Unescape attribute names.
286     (check): Support for checking of whether inherited interfaces
287     are actually defined or not. Support for checking of whether
288     interface member identifiers are duplicated or not.
289     ($serialize_type): Scoped names are returned as is. A future
290     version of this code should escape identifiers other than "DOMString",
291     otherwise the idl_text would be non-conforming.
292    
293     2008-08-02 Wakaba <wakaba@suika.fam.cx>
294    
295 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
296 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
297     for Definition's were ignored.
298     (append_child): Set |parent_node| attribute.
299     (parent_node): New attribute.
300     (check): Support interface/exception members. Support
301     extended attributes. Support definition identifier uniqueness
302     constraint.
303     (qualified_name): New attribute.
304     (Interface/Exception idl_text): Extended attributes were
305     not prepended to the returned text.
306    
307     2008-08-02 Wakaba <wakaba@suika.fam.cx>
308    
309     * WebIDL.pm (parse_char_string): Set line/column numbers
310 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
311     would make things wrong. Support for interface forward
312     declarations was missing. Broken interface declarations
313     with no block were not ignored entirely.
314     (Whatpm::WebIDL::Node): New abstract class. This class
315     makes things easier.
316     (child_nodes): New attribute. Unlike DOM's attribute with
317     same name, this attribute returns a dead list of nodes for
318     simplicity.
319     (get_user_data, set_user_data): New methods.
320     (Module idl_text): A SPACE character should be inserted
321     before the |{| character.
322     (Interface idl_text): Support for interface forward declarations.
323     (is_forward_declaration): New attribute.
324    
325 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
326    
327 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
328    
329     2008-07-19 Wakaba <wakaba@suika.fam.cx>
330    
331 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
332     it now can handle broken extended attributes and as such.
333    
334     2008-07-19 Wakaba <wakaba@suika.fam.cx>
335    
336 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
337     Support for extended attributes with arguments.
338    
339     2008-07-19 Wakaba <wakaba@suika.fam.cx>
340    
341 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
342     (Interface->idl_text): Tentative support for inheritances.
343    
344     2008-07-19 Wakaba <wakaba@suika.fam.cx>
345    
346 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
347     Support for raises, setraises, and getraises syntaxes.
348    
349 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
350    
351     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
352     proper support for types, extended attributes, and exceptions yet).
353     WebIDL parser, version 1 (no support for exceptions yet,
354     no proper support for extended attributes yet).
355    
356 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
357    
358     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
359    
360 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
361    
362 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
363    
364     2008-06-29 Wakaba <wakaba@suika.fam.cx>
365    
366 wakaba 1.255 * WebIDL.pm: New module.
367    
368 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
369    
370     * Makefile (Entities.html): URI changed.
371    
372 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
373    
374     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
375    
376 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
377    
378 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
379    
380     2008-06-01 Wakaba <wakaba@suika.fam.cx>
381    
382 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
383     spec (commented out). (HTML5 revision 1711).
384    
385 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
386    
387 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
388    
389     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
390     as UTF-16 (HTML5 revision 1701).
391    
392     2008-05-25 Wakaba <wakaba@suika.fam.cx>
393    
394 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
395     1692).
396    
397     2008-05-25 Wakaba <wakaba@suika.fam.cx>
398    
399     * HTML.pm.src: The secondary insertion mode used when switching
400     to foreign content is the "in body" insertion mode (HTML5 revision
401     1696).
402    
403     2008-05-25 Wakaba <wakaba@suika.fam.cx>
404    
405 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
406     revision 1697).
407    
408 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
409    
410 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
411     insertion mode (HTML5 revision 1693). Update SVG camelCase
412     attribute list (HTML5 revision 1700). <textarea> closes
413     </select> (HTML5 revision 1699). More start tags close in
414     foreign content insertion mode (HTML5 revision 1698).
415    
416     2008-05-24 Wakaba <wakaba@suika.fam.cx>
417    
418 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
419    
420     2008-05-24 Wakaba <wakaba@suika.fam.cx>
421    
422 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
423     revision 1674).
424    
425     2008-05-24 Wakaba <wakaba@suika.fam.cx>
426    
427 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
428     revision 1676).
429    
430     2008-05-24 Wakaba <wakaba@suika.fam.cx>
431    
432 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
433     quriks mode detection (HTML5 revision 1679).
434    
435     2008-05-24 Wakaba <wakaba@suika.fam.cx>
436    
437 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
438     revision 1685).
439    
440     2008-05-24 Wakaba <wakaba@suika.fam.cx>
441    
442 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
443     revision 1684).
444    
445     2008-05-24 Wakaba <wakaba@suika.fam.cx>
446    
447 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
448     work (HTML5 revision 1690).
449    
450     2008-05-24 Wakaba <wakaba@suika.fam.cx>
451    
452 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
453     end tags in head insertion modes (HTML5 revision 1686).
454     (parse_char_string): Bug fix for non-utf8 character string handlings.
455     (parse_char_stream): |ungetc| does not work well for this context.
456    
457 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
460     |parse_byte_stream|.
461     (parse_byte_stream): New method.
462    
463     2008-05-18 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
466     by encoding layer error reporter.
467    
468 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
469    
470 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
471     rather than converting the whole byte string and then parsing.
472     Propagate errors in character encoding layer.
473     (get_next_token): Precise error reporting for |bare stago| error.
474    
475     2008-05-17 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
478     (parse_char_string): This method is now defined as an invocation
479     of the |parse_char_stream| method.
480    
481     2008-05-17 Wakaba <wakaba@suika.fam.cx>
482    
483 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
484     of the sniffing as info-level errors. Support for new
485     decoding framework in parser resestting.
486     (new): Various default error levels were not set.
487    
488     2008-05-17 Wakaba <wakaba@suika.fam.cx>
489    
490 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
491     algorithm, except for the actual sniffing, is implemented
492     with new framework with Message::Charset::Info.
493    
494 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
495    
496     * CacheManifest.pm (_parse): Drop fragment identifiers from
497     URIs in fallback section (HTML5 revision 1596).
498    
499 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
500    
501 wakaba 1.231 * Makefile (Entities.html): URI has changed.
502    
503     2008-05-10 Wakaba <wakaba@suika.fam.cx>
504    
505 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
506     1553).
507    
508 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
511     no longer table elements, but table objects returned
512     by Whatpm::HTMLTable.
513    
514     * HTMLTable.pm (form_table): Return table element node
515     as |$table->{element}|.
516     (assign_header): Support for the |headers=""| attribute.
517    
518     2008-05-06 Wakaba <wakaba@suika.fam.cx>
519    
520 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
521     with no support for headers="".
522     (form_table): Include table width and height to the returned
523     table object for covenience. Indexing in column assignement was wrong.
524     Set whether a data cell is empty or not for convenience.
525    
526 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
527    
528 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
529    
530     2008-05-05 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
533     changed (HTML5 revision 1387).
534    
535     2008-05-05 Wakaba <wakaba@suika.fam.cx>
536    
537 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
538     to the end of the table (HTML5 revision 1380).
539    
540     2008-05-05 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
543     of 1-based (HTML5 revision 1376).
544    
545     2008-05-05 Wakaba <wakaba@suika.fam.cx>
546    
547 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
548     and "Content-Encoding" no longer prevent sniffing (HTML5
549     revision 1288).
550    
551     2008-05-05 Wakaba <wakaba@suika.fam.cx>
552    
553 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
554     1282).
555    
556 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
559     Status of XML specs are added.
560    
561     2008-05-03 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
564     and |ref=""| attribute (referent element type checking).
565    
566 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
569     1500) is implemented (except for IDNA part and URI-scheme-specific
570 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
571     documents.
572    
573 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
576     character (HTML5 revision 1263).
577    
578     2008-04-13 Wakaba <wakaba@suika.fam.cx>
579    
580 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
581     definition for |AElig;| was missing).
582    
583     2008-04-13 Wakaba <wakaba@suika.fam.cx>
584    
585 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
586     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
587    
588     2008-04-13 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
591     insertion mode is added (HTML5 revisions 1412 and 1418).
592    
593     2008-04-13 Wakaba <wakaba@suika.fam.cx>
594    
595 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
596     elements (HTML5 revision 1410).
597    
598     2008-04-13 Wakaba <wakaba@suika.fam.cx>
599    
600 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
601     revision 1406).
602    
603     2008-04-13 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
606     elements (HTML5 revision 1404 and 1420).
607    
608 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
609    
610 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
611     elements (HTML5 revision 1404). Unused !!!macro definitions
612     are removed.
613    
614     2008-04-12 Wakaba <wakaba@suika.fam.cx>
615    
616 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
617     are raised is changed (HTML5 revision 1404).
618    
619 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
620    
621 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
622    
623     2008-04-06 Wakaba <wakaba@suika.fam.cx>
624    
625 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
626     replaced by constants representing element category.
627    
628     2008-04-06 Wakaba <wakaba@suika.fam.cx>
629    
630 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
631     for error reporting. (This is a preparation for using constant
632     value for ->[1].)
633    
634 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
635    
636 wakaba 1.205 * RDFXML.pm: Typo fixed.
637    
638     2008-03-22 Wakaba <wakaba@suika.fam.cx>
639    
640 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
641     as same as |must_level|, i.e. level = |m|.
642     (check_element): Make list of URIs in the DOM.
643    
644 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
645    
646 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
647     Local (null-namespace) attribute support.
648    
649     2008-03-21 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
652    
653     2008-03-21 Wakaba <wakaba@suika.fam.cx>
654    
655 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
656     and URI attributes is implemented. Warn if unknown
657     value is used in rdf:parseType="" attribute.
658    
659     * URIChecker.pm (check_rdf_uri_reference): New function.
660    
661     2008-03-21 Wakaba <wakaba@suika.fam.cx>
662    
663 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
664     are now resolved.
665    
666     2008-03-21 Wakaba <wakaba@suika.fam.cx>
667    
668 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
669    
670     * RDFXML.pm: undef vs false bug fixed.
671     Reification implemented.
672    
673     2008-03-21 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
676     The |node| arguments are added for |ontriple| calls.
677     Too many "attribute not allowed" errors were raised.
678    
679     * ContentChecker.pm: Initial experimental support for rdf:RDF
680     element.
681    
682     2008-03-21 Wakaba <wakaba@suika.fam.cx>
683    
684 wakaba 1.197 * RDFXML.pm: New module.
685    
686 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
687    
688 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
689     code was old one yet.
690    
691     2008-03-20 Wakaba <wakaba@suika.fam.cx>
692    
693 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
694     errors. Line/column markings for DOCTYPE, comment, and
695     character tokens are reintroduced; otherwise,
696     error location for "not HTML5" error and errors
697     for implied elements are not attached.
698    
699     2008-03-20 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
702    
703     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
704     (value): Setter implemented.
705    
706     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
707    
708     2008-03-20 Wakaba <wakaba@suika.fam.cx>
709    
710 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
711    
712     2008-03-20 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
715     flag, which is no longer used.
716    
717 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
720    
721     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
722     and !!!insert-element-t): Set line/column information to
723     element nodes.
724    
725     2008-03-17 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
728     in "<?", "<>", or "</>" should be the error point.
729    
730 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
731    
732 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
733    
734     2008-03-16 Wakaba <wakaba@suika.fam.cx>
735    
736 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
737    
738     2008-03-16 Wakaba <wakaba@suika.fam.cx>
739    
740 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
741     reporting.
742    
743 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
744    
745 wakaba 1.186 * HTML.pm.src: Error type revised.
746    
747     2008-03-11 Wakaba <wakaba@suika.fam.cx>
748    
749 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
750    
751     2008-03-11 Wakaba <wakaba@suika.fam.cx>
752    
753 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
754    
755 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
756    
757     * mkhtmlparser.pl: Set "level" parameter to parse errors.
758    
759     * HTML.pm.src: Code refined.
760    
761 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
762    
763 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
764     revision 1348). Note that I really don't know this makes
765     any difference in the black-box behavior of the parser.
766    
767     2008-03-09 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
770     revision 1348).
771    
772     2008-03-09 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
775    
776     2008-03-09 Wakaba <wakaba@suika.fam.cx>
777    
778 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
779    
780     2008-03-09 Wakaba <wakaba@suika.fam.cx>
781    
782 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
783     mode and support for the "in select in table" insertion mode (HTML5
784     revision 1342).
785    
786     2008-03-09 Wakaba <wakaba@suika.fam.cx>
787    
788 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
789     in non-tainted <table>s (HTML5 revision 1336).
790    
791     2008-03-09 Wakaba <wakaba@suika.fam.cx>
792    
793 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
794     and <head> (HTML5 revision 1332).
795    
796     2008-03-09 Wakaba <wakaba@suika.fam.cx>
797    
798 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
799     white space (HTML5 revision 1331).
800    
801 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
802    
803 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
804     element (HTML5 revision 1330).
805    
806     2008-03-08 Wakaba <wakaba@suika.fam.cx>
807    
808 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
809     element (HTML5 revision 1328).
810    
811     2008-03-08 Wakaba <wakaba@suika.fam.cx>
812    
813 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
814     into foster parents (HTML5 revision 1326).
815    
816     2008-03-08 Wakaba <wakaba@suika.fam.cx>
817    
818 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
819     revision 1321).
820    
821     2008-03-08 Wakaba <wakaba@suika.fam.cx>
822    
823 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
824     the spec (HTML5 revision 1320). Note that we finally complete
825     all of HTML5 revision 1320 changes.
826    
827     2008-03-08 Wakaba <wakaba@suika.fam.cx>
828    
829 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
830     tags (HTML5 revision 1320).
831    
832     2008-03-08 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
835    
836     2008-03-08 Wakaba <wakaba@suika.fam.cx>
837    
838 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
839     could not be reached so that it is now removed (HTML5 revision
840     1320).
841    
842     2008-03-08 Wakaba <wakaba@suika.fam.cx>
843    
844 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
845     to align with new spec text (HTML5 revision 1320).
846    
847     2008-03-08 Wakaba <wakaba@suika.fam.cx>
848    
849 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
850     have had no harm in theory.
851    
852     2008-03-08 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
855     case is split from other end tags for the preparation
856     of implementing HTML revision 1320.
857    
858 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
859    
860 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
861     revision 1320).
862    
863     2008-03-07 Wakaba <wakaba@suika.fam.cx>
864    
865 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
866     and "div" (HTML5 revision 1318). Add comments to where
867     |form| pointer association codes should be inserted (HTML5 revision
868     1319).
869    
870 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
871    
872     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
873    
874 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
875    
876 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
877     other token type, there are), it is replaced by a |die| statement.
878    
879     2008-03-05 Wakaba <wakaba@suika.fam.cx>
880    
881 wakaba 1.159 * HTML.pm.src: Typo fixed.
882    
883 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
884    
885     * HTML.pm.src (_tree_construction_initial): Some limited quirks
886     doctypes were not uppercased for comparison.
887    
888 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
889    
890 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
891     are added.
892    
893     2008-03-03 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
896     are set. Cases that are unlikely reached are noted as so.
897    
898     2008-03-03 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
901    
902     * mkhtmlparser.pl: Support for |!!!cp| syntax.
903    
904     2008-03-03 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
907     for simplicity.
908    
909     2008-03-03 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
912     revision 1307).
913    
914     2008-03-03 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
917     is set are changed (HTML5 revision 1305).
918    
919 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
920    
921 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
922     1292). Entities are not parsed in comment-like part in RCDATA
923     elements (HTML5 revision 1294). Allow bare & at the end
924     of attribute value literals (HTML5 revision 1296). More
925     quirks mode doctypes (HTML5 revision 1302). Requires spaces
926     between attributes and ban attribute names or unquoted
927     attribute values containing single or double quotes (HTML5
928     revision 1303).
929    
930     2008-03-02 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
933     and related errors unless it is an HTML document (though the spec
934     is unclear on whether it is applied to XHTML document).
935    
936     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
937     are added.
938    
939     2008-03-02 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
942    
943 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
944    
945     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
946    
947     * HTML.pm.src: |charset| in |content| attribute is
948     case-insensitive (HTML5 revision 1270).
949    
950 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
951    
952     * ContentChecker.pm: New status constants are added.
953     ($ElementDefault): |status| added.
954     (check_element): Err for non-standard or deprecated elements.
955     (_attr_status_info): For non-standard or deprecated attributes.
956    
957 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
958    
959 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
960    
961     2008-02-24 Wakaba <wakaba@suika.fam.cx>
962    
963 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
964     status information is now dispatched.
965    
966     2008-02-24 Wakaba <wakaba@suika.fam.cx>
967    
968 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
969     problem by adding some more arguments.
970    
971     2008-02-24 Wakaba <wakaba@suika.fam.cx>
972    
973 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
974     container-for-the-purpose-of-content-model element (not transparent
975     element) for |check_child_element| calling and significant
976     text flag marking. This reintroduces |<del>|-and-significant
977     problem again.
978    
979     2008-02-24 Wakaba <wakaba@suika.fam.cx>
980    
981 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
982     elements ignored for the purpose of phase changes in
983     content model checking.
984    
985 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
986    
987 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
988     was not properly managed for transparent cases.
989    
990     2008-02-23 Wakaba <wakaba@suika.fam.cx>
991    
992 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
993     and |audio| as semi-transparent elements.
994    
995     2008-02-23 Wakaba <wakaba@suika.fam.cx>
996    
997 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
998     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
999     Support for |html:object| as a semi-transparent element.
1000    
1001     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1002    
1003 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1004     the tree is entirely revised to make it easier to track
1005     the state of ancestors/descendants. As a result of this
1006     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1007     support for content model checking for HTML elements |figure|,
1008     |object|, |video|, and |audio| and checking for XML elements (and
1009     some XMLNS checkings) are dropped for now. They will be
1010     reimplemented in due cource.
1011    
1012 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1015     token (HTML5 revision 1225).
1016    
1017     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1018    
1019 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1020    
1021     * HTML.pm.src: Save whether |meta| |content| attribute
1022     contains character references or not.
1023    
1024     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1025    
1026 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1027     imlement significant content checking for contents
1028     with |del| elements.
1029    
1030     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1031    
1032 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1033     elements are added.
1034     (_get_children): HTML |object| elements are now semi-transparent.
1035    
1036     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1037    
1038 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1039    
1040     * CacheManifest.pm: HTML5 revision 1211 implemented.
1041    
1042     * CacheManifest.pod: Updated.
1043    
1044 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1045    
1046     * ContentChecker.pm (check_document, check_element): Support
1047     for second argument ($onsubdoc).
1048     (_get_css_parser): Removed (now it is part of WDCC).
1049    
1050 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1051    
1052     * ContentChecker.pm (_get_css_parser): New.
1053    
1054 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1055    
1056 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1057     for checking had been used.
1058    
1059     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1060    
1061 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1062    
1063 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1064    
1065 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1066     specced in RFC 2046 and RFC 4288.
1067     (application/atom+xml): Definition added.
1068    
1069     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1070    
1071 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1072     warnings (rather than SHOULD-level errors).
1073    
1074     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1075    
1076 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1077    
1078     * HTML.pm.src: A flag for character references in attribute
1079     values are added. Set |manakai_has_reference| user data
1080     to |charset| attribute.
1081    
1082     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1083    
1084 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1085     attributes.
1086    
1087     * ContentChecker.pm (check_document): Warn if charset requirements
1088     cannot be tested.
1089    
1090 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1091    
1092     * HTML.pm.src (parse_byte_string): Detect charset
1093     by universalchardet if charset parameter is not specified.
1094    
1095     * Makefile (Charset-all, Charset-clean): New rules.
1096    
1097 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1098    
1099 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1100     of character encoding declaration and selection of encoding
1101     for HTML document.
1102    
1103     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1104    
1105 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1106     type in list context.
1107    
1108     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1109    
1110 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1111     ignoring (HTML5 revisions 1013 and 1016).
1112    
1113     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1116     white spaces, are implemented.
1117    
1118 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1119    
1120 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1121     attribute if possible.
1122    
1123     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1124    
1125 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1126     (parse_char_string): New alias for |parse_string|.
1127     (main phase): Invoking "change the encoding" algorithm if desired.
1128    
1129     * HTML.pod: Updated.
1130    
1131     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1132    
1133 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1134    
1135     * Makefile (HTML-all, HTML-clean): New.
1136    
1137     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1138    
1139     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1140    
1141 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1142    
1143     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1144    
1145 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1146    
1147 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1148    
1149     * CacheManifest.pod: New file.
1150    
1151     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1152    
1153 wakaba 1.114 * CacheManifest.pm: New module.
1154    
1155     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1156    
1157 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1158     callback.
1159    
1160     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1161    
1162 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1163    
1164 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.111 * Makefile (clean): New rule.
1167    
1168     * NanoDOM.pm (public_id, system_id): New attributes.
1169    
1170     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1171    
1172 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1173    
1174 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1175    
1176     * ContentChecker.pm (check_document): Support for
1177     new |is_xml_root| flag.
1178     (check_element): Support for new |pluses| state.
1179     (_add_pluses): New method.
1180     (_remove_minuses): Support for new |minus| item.
1181    
1182 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1183    
1184 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1185     root element.
1186    
1187     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1188    
1189 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1190    
1191 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1194    
1195     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1196    
1197 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1198    
1199     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1200    
1201 wakaba 1.103 * LangTag.pm: New module.
1202    
1203 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1204    
1205     * HTML.pm.src: Some error types were wrong.
1206    
1207 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1208    
1209 wakaba 1.101 * CSS/: New directory.
1210    
1211     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1212    
1213 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1214     for |noscript| in |head|.
1215    
1216 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1217    
1218     * URI/: New directory.
1219    
1220 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1221    
1222 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1223     number.
1224    
1225     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1226    
1227 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1228     by |&|s.
1229    
1230     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1233    
1234     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1235    
1236 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1237    
1238     * HTML.pm.src: Insertion modes are now represented in number.
1239    
1240     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1241    
1242 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1243     999) is implemented.
1244    
1245 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1246    
1247     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1248     983) is implemented.
1249    
1250 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1251    
1252     * ContentChecker.pod: New documentation.
1253    
1254     * Makefile: A rule for |ContentChecker.html| is added.
1255    
1256     * ContentChecker.pm: A pod "LICENSE" section is added.
1257    
1258     * NanoDOM.pm ($VERSION): New variable.
1259    
1260 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1263     list of allowed HTML elements.
1264    
1265     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1266    
1267 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1268     HTML elements.
1269    
1270     * URIChecker.pm (check_iri): New.
1271     (check_iri_reference): Error type for IRI reference
1272     syntax error is changed.
1273    
1274 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1275    
1276 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1277     is added.
1278     (check_document): Load appropriate module before validation.
1279    
1280     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1281    
1282 wakaba 1.88 * ContentChecker/: New directory.
1283    
1284     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1285    
1286 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1287    
1288     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1289    
1290 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1291    
1292 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1293     even if no document element is found.
1294    
1295     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1296    
1297 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1298    
1299     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1300    
1301 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1302    
1303     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1304    
1305 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1306     is split into "after html body" and "after html frameset"
1307     insertion modes. Their codes are merged with "after body"
1308     and "after frameset" codes. |$previous_insertion_mode|
1309     has been removed. "after frameset" code is
1310     merged with "in frameset" code.
1311    
1312     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1313    
1314 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1315     merged with the "in head" insertion mode.
1316    
1317     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1318    
1319 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1320     reorganized.
1321    
1322     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1323    
1324 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1325     are merged.
1326    
1327     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1328    
1329 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1330     with "in table" insertion mode.
1331    
1332     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1333    
1334 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1335     modes are merged.
1336    
1337     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1340     insertion mode!
1341    
1342     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1345     now directly invoke the handler.
1346    
1347     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1350     is merged to the "in body" insertion mode code.
1351    
1352     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1355     insertion modes are merged.
1356    
1357     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1360    
1361     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1362    
1363 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1364     instead of string constant for |{content_model_flag}|.
1365    
1366     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1369     to the "duplicate attribute" error.
1370    
1371 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1374    
1375     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1376    
1377 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1378    
1379     * HTML.pm.src: A typo is fixed.
1380    
1381 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1382    
1383 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1384     from "in HTML:xml:lang" error. Character position
1385     is now the last part of the error type in the URI error
1386     description. Report "unsupported" status for language
1387     tags, media queries, script codes, and style sheets.
1388    
1389     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1390    
1391 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1392     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1393     in XML.
1394    
1395     * NanoDOM.pm (Attr.owner_document): New attribute.
1396    
1397     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1398    
1399 wakaba 1.66 * HTML.pm.src: The character immediately following
1400     a bare |hcro| was discarded. Fix handling of
1401     entity references in attribute values.
1402    
1403     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1404    
1405 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1406     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1407     factored out. Error types |in html:#DOCTYPE|
1408     and |after html:#DOCTYPE| are merged
1409     into |DOCTYPE in the middle|. |</frameset>|
1410     in fragment parsing mode changed the
1411     insertion mode.
1412    
1413     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1414    
1415 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1416     phase is now an insertion mode. Treatments for white
1417     space character tokens were incorrect for some
1418     insertion modes. An old |meta| case was not removed.
1419    
1420     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1421    
1422 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1423     implemented (but changing the encoding is not yet:-).
1424    
1425 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1426    
1427 wakaba 1.62 * Charset/: New directory.
1428    
1429     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1430    
1431 wakaba 1.61 * H2H.pm: New Perl module (created from
1432     manakai's H2H.dis).
1433    
1434     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1435    
1436 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1437     manakai's SimpleLS.dis).
1438    
1439 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1440    
1441     * HTML.pm.src: |<!---x-->| was not processed correctly.
1442    
1443 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1444    
1445 wakaba 1.58 * HTML.pm.src: Report correct error message
1446     for |<body></div></body>|.
1447    
1448     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1451    
1452     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1453    
1454     * ContentChecker.pm: Return |{term}| list.
1455    
1456 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1457    
1458 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1459     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1460     and <title> outside of head).
1461    
1462     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1463    
1464 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1465     and private types/subtypes.
1466    
1467     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1468     URIChecker.pm, HTMLTable.pm: Error messages are now
1469     consistent; they are all listed in
1470     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1471    
1472 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1473    
1474 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1475     is now erred. |<datalist>| is implemented.
1476     Attribute checker for |<command>| and |<menu>| are
1477     added. Support for |contextmenu| global attribute
1478     is added.
1479    
1480     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1483     of Step 3 has been changed.
1484    
1485     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1486    
1487 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1488    
1489 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1490    
1491 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1492     incorrect.
1493    
1494     * ContentChecker.pm: Support for the |footer| element.
1495     Check URI syntax for space-separated URI attributes.
1496     Support for the |tabindex| attribute. Support
1497     for |datetime| attribute.
1498    
1499     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1500    
1501 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1502     character references). HTML5 revision 1.146 (white space
1503     characters before root start tag). HTML5 revision
1504     1.148 (named character references in attribute values).
1505     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1506    
1507     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1508    
1509     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1510 wakaba 1.49 in <head>).
1511    
1512     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1513    
1514 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1515     in body).
1516    
1517     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1518    
1519     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1520    
1521 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1522     or RCDATA in fragment parsing mode). HTML5 revision
1523     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1524    
1525     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1526    
1527     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1528     for <pre> and <textarea>).
1529    
1530 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1531    
1532 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1533    
1534     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1535    
1536 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1537    
1538     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1539    
1540 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1541     HTML5 revision 917 (conforming bare &).
1542    
1543     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1544    
1545 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1546    
1547     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1548     HTML5 revision 915 (<nobr>).
1549    
1550     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1551    
1552 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1553    
1554     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1555     New attributes.
1556    
1557     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1558    
1559 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1560     in tags).
1561    
1562     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1563    
1564 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1565    
1566     * HTML.pm.src: |$entity_char| is removed and
1567     requires |Whatpm::_NamedEntityList| instead.
1568     HTML5 revision 898 (refc), except that lack of refc
1569     is parse error.
1570    
1571     * mkentitylist.pl: New script.
1572    
1573     * Makefile (all): |_NamedEntityList.pm| is added.
1574     (_NamedEntityList.pm, Entities.html): New rules.
1575    
1576     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1577    
1578 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1579     were ignored and U+000D immediately following another
1580     U+000D was not converted to U+000A.
1581    
1582     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1583    
1584 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1585     nodes before appended). Parser was not ready for NULL
1586     parse error and escape flag.
1587    
1588     * NanoDOM.pm (adopt_node): New.
1589    
1590     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1591    
1592 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1593     CDATA and RCDATA). Note that current implementation
1594     is simply repeating what the spec says and it is maybe not
1595     a best way to do it.
1596    
1597     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1598    
1599 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1600     the form element if a descendant element without implied end tag has
1601     still been open).
1602    
1603     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1604    
1605 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1606     space characters non-conforming).
1607    
1608     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1609    
1610 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1611     HTML5 revision 869 (C1 character references).
1612    
1613     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1616     a |textarea| is removed).
1617    
1618 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1619    
1620     * NanoDOM.pm (get_attribute_node_ns): New method.
1621    
1622     * ContentChecker.pm: |script| |async| and |defer|
1623     no longer require |src|. |async| MUST NOT be
1624     specified if |defer|. (HTML5 revision 858).
1625    
1626 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1627    
1628     * HTML.pm.src: |<form><form>| went to inifinite loop.
1629    
1630 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1631    
1632 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1633     as a document element) flag on.
1634     (new): Removed.
1635     (check_document): New method.
1636    
1637     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1638    
1639 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1640     were incorrect.
1641    
1642     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1643    
1644 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1645     error position in the |type| option of the error.
1646    
1647     * HTMLTable.pm (form_table): The |$onerror| parameter
1648     is now optional. Some bugs are fixed.
1649    
1650     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1651    
1652 wakaba 1.27 * HTMLTable.pm: New module.
1653    
1654     * ContentChecker.pm (table): Invoke table model error checker.
1655    
1656     * NanoDOM.pm (first_child, get_attribute_ns): New.
1657    
1658 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1659    
1660 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1661     (link/@rel, a/@rel, area/@rel): Use new checker.
1662    
1663     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1664    
1665     * _LinkTypeList.pm: New file.
1666    
1667     * mklinktypelist.pl: New file.
1668    
1669     * .cvsignore: |RelExtensions.html| added.
1670    
1671     * NanoDOM.pm (child_nodes): Returns an empty array
1672     for non-child-containing node types.
1673     (text_content): New attribute.
1674    
1675     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1676    
1677 wakaba 1.25 * IMTChecker.pm: New module.
1678    
1679     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1680     to test parameter value validity.
1681    
1682     * HTML.pm.src ($style_start_tag): Attributes were
1683     discarded.
1684    
1685 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1688    
1689     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1690    
1691 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1692     and RFC 3987 are listed (not all testable items are checked yet).
1693    
1694     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1695    
1696 wakaba 1.22 * URIChecker.pm: New module.
1697    
1698 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1699    
1700 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1701     or associated to some placeholder.
1702     ($ElementDefault): Warn unknown attributes
1703     for unknown elements as "attribute not supported".
1704     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1705     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1706     (|a| attribute checker): Reimplemented.
1707    
1708     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1709    
1710 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1711     ($HTMLAttrChecker): Event handler content attributes
1712     are added.
1713     (link, embed): Required attribute is now checked.
1714     (embed): Unknown local attributes are no longer warned.
1715    
1716     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1717    
1718 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1719     ($HTMLIMTAttrChecker): New checker.
1720     (link@rel, link@href, link@type, style@type,
1721     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1722     object@data, object@type, source@src, source@type, area@alt,
1723     area@shape, area@coords, area@href,
1724     area@ping, area@rel, area@type, script@src,
1725     script@defer, script@async, script@type): Checkers added.
1726    
1727     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1728    
1729 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1730    
1731 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1732    
1733 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1734     attributes. Report an error if |Element.prefix|
1735     is |xmlns|.
1736    
1737     * NanoDOM.pm (prefix): New attribute.
1738    
1739     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1740    
1741 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1742     mode, action for |<iframe>| was missing.
1743    
1744     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1745    
1746 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1747     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1748    
1749     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1752     ($HTMLURIAttrChecker): Placeholder.
1753     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1754     New.
1755    
1756     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1757    
1758 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1759     for global attributes, |html|, |base|, |style|, and |meta|.
1760    
1761     * NanoDOM.pm (insert_before): Weaken reference
1762     to the parent node.
1763     (Attr::new): Set |owner_element| attribute.
1764     (namespace_uri, manakai_local_name): New attribute implementations.
1765     (owner_element): New attribute.
1766    
1767     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1768    
1769 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1770     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1771     $Element->{$HTML_NS}->{''}): New.
1772     (check_element): Invoke attrs_checker for each element.
1773    
1774 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1775    
1776 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1777    
1778     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1779    
1780 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1781     element type testings.
1782    
1783     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1784    
1785 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1786     error when an element type is put in the "minus" list
1787     and the element type is not allowed explicitly in the particular
1788     element content model.
1789     (html:a checker): New checker.
1790     (html:details, html:datagrid): New checkers.
1791     (html:legend): New checker.
1792    
1793     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1794    
1795 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1796    
1797     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1798    
1799 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1800     checker.
1801     (html:dd checker): New checker.
1802     (html:q, html:em, html:strong, html:small,
1803     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1804    
1805     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1806    
1807 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1808     from |$ElementDefault->{checker}|.
1809     ($ElementDefault->{checker}): Throw an error that
1810     the element type is not supported by the checker.
1811     ($HTMLMetadataElement): |html:base| was missing.
1812     ($HTMLEmptyChecker): Don't throw an error
1813     for inter-element whitespace nodes.
1814     (html:html checker): Errors were not
1815     thrown even if |html:head| and/or |html:body|
1816     children were missing.
1817     (html:head checker): An error was not
1818     thrown if <meta charset> appered after other
1819     elements.
1820    
1821 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1822    
1823     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1824     and |noscript| elements are implemented.
1825     (new): New method.
1826    
1827 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1828    
1829 wakaba 1.4 * ContentChecker.pm: New module.
1830    
1831     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1832    
1833 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1834     document_element, manakai_local_name,
1835     manakai_element_type_match): New method.
1836    
1837     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1838    
1839     * HTML.pm.src: Replace decimal and hexadecimal numeric
1840     entities in C1 range using Windows-1252 mapping. Bare LF
1841     did not count as new line for error reporting.
1842    
1843 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1844    
1845     * NanoDOM.pm (DOMImplementation): New class.
1846     (append_child): Weaken the |parent_node| reference.
1847     (create_element_ns, Element new): Set the |owner_document|
1848     reference.
1849     (implementation): New attribute.
1850     (owner_document, local_name, namespace_uri): New attributes.
1851    
1852     * HTML.pm.src (parse_string): Line and column numbers
1853     are now provided to error handler.
1854     (!!!parse-error): Short descriptions are added.
1855     (_construct_tree): Split into three methods; support
1856     for innerHTML mode.
1857     (set_inner_html): New method.
1858    
1859 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1860    
1861     * NanoDOM.html: Documentation is added.
1862    
1863     * HTML.pod, ContentType.html: Documentation is revised.
1864    
1865     * .cvsignore: Pod2html temporary files are added.
1866    
1867     * Makefile: Make |NanoDOM.html|.
1868    
1869     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1870    
1871     * HTML.pm.src (parse_string): New method.
1872     (get_inner_html): Renamed from |inner_html|.
1873    
1874     * Makefile: A rule for |HTML.html| is added.
1875    
1876     * HTML.pod: New documentation.
1877    
1878     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1879    
1880     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1881     (clone_node): Attribute nodes were not completely copied.
1882    
1883     * HTML.pm.src: Many bugs are fixed.
1884    
1885     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1886    
1887     * HTML.pm.src: Some typos are fixed.
1888    
1889     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1890    
1891     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1892     are removed.
1893    
1894     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1895    
1896     * HTML-consume-entity.src: Removed.
1897    
1898     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1899    
1900     * HTML.pm.src: Tokenizer's handling on hexadecimal
1901     numeric entities are rewritten.
1902    
1903     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1904    
1905     * HTML.pm.src: Some tokenizer bugs are fixed.
1906    
1907     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1908    
1909     * HTML.pm.src: The tree construction stage is implemented.
1910    
1911     * mkhtmlparser.pl: New macros are added.
1912    
1913     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1914    
1915     * ContentType.pm: A note on bug in the specification
1916     is removed since it's been now fixed.
1917    
1918     * .cvsignore: New file.
1919    
1920     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1921    
1922     * HTML.pm.src, HTML-consume-entity.src: New files.
1923    
1924     * Makefile (HTML.pm): New rule.
1925    
1926     * mkhtmlparser.pl: New script.
1927    
1928     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1929    
1930     * Makefile: New file.
1931    
1932     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1933    
1934     * ContentType.pm: An error in pod is fixed.
1935    
1936     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1937    
1938     * ContentType.pm: Documentation is added.
1939    
1940     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1941    
1942     * ContentType.pm: New Perl module.
1943    
1944     * ChangeLog: New file.
1945    
1946    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24