/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.316 - (hide annotations) (download)
Tue Sep 16 10:43:18 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.315: +9 -0 lines
++ whatpm/t/webidl/ChangeLog	16 Sep 2008 10:42:14 -0000
	* webidl-exception.dat: Test results updated.

	* webidl-interface.dat: New test cases for the reminding extended
	attributes are added.

2008-09-16  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/ChangeLog	16 Sep 2008 10:41:47 -0000
	* WebIDL.pm: Support for the reminding extended attributes are
	added.  It does not satisfy the definition that a forward
	interface declaration has an extended attribute.  It seems that
	unless explicitly allowed multiple extended attributes with the
	same name is not allowed, though it is not explicitly mentioned in
	the spec.

2008-09-16  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
4     added. It does not satisfy the definition that a forward
5     interface declaration has an extended attribute. It seems that
6     unless explicitly allowed multiple extended attributes with the
7     same name is not allowed, though it is not explicitly mentioned in
8     the spec.
9    
10     2008-09-16 Wakaba <wakaba@suika.fam.cx>
11    
12 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
13     attribute identifiers. Preserve whether an extended attribute has
14     an argument list of not. Support for extended attributes:
15     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
16     NameGetter, NameSetter, and Null.
17     (has_argument_list): New attribute.
18     (idl_text): Stringifies argument lists, if any, even if it is
19     empty.
20    
21 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
22    
23 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
24     |{s_kwd}| in DATA_STATE as default.
25    
26     2008-09-15 Wakaba <wakaba@suika.fam.cx>
27    
28 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
29     by |{s_kwd}| in DATA_STATE.
30    
31     2008-09-15 Wakaba <wakaba@suika.fam.cx>
32    
33 wakaba 1.312 * HTML.pm.src: Shorten keys.
34    
35     2008-09-15 Wakaba <wakaba@suika.fam.cx>
36    
37 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
38     pair, or noncharacter code points and non-Unicode code
39     points (they should be handled by Whatpm::Charset::UnicodeChecker).
40     (parse_char_stream): Support for the |$get_wrapper| argument and
41     character stream error handlers.
42    
43     2008-09-15 Wakaba <wakaba@suika.fam.cx>
44    
45 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
46     for null-namespace elements/attributes.
47    
48     * HTML.pm.src: Fact out $disallowed_control_chars
49     as a hash.
50    
51 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
52    
53 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
54     and |{next_char}| initializations are moved to initialization
55     method. |{read_until}| now supports buffering. Sync |set_inner_html|
56     with |parse_char_stream|.
57    
58     2008-09-14 Wakaba <wakaba@suika.fam.cx>
59    
60 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
61     invoke |manakai_read_until|, not only |read|, where
62     possible, to decrease the number of |read| method calls.
63    
64     * mkhtmlparser.pl: Related changes to the aforementioned
65     modification.
66    
67     2008-09-14 Wakaba <wakaba@suika.fam.cx>
68    
69 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
70     would report character error from now.
71    
72     2008-09-14 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
75     tokens in "before head insertion mode" was not
76     correctly handled.
77     (set_inner_html): Reimplemented using CharString decodehandle
78     class. Support for $get_wrapper argument. Support
79     for |{read_until}| feature.
80    
81     2008-09-14 Wakaba <wakaba@suika.fam.cx>
82    
83 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
84     entities point the "&" character.
85    
86     2008-09-14 Wakaba <wakaba@suika.fam.cx>
87    
88 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
89     be added to the list of excluded characters.
90    
91     2008-09-14 Wakaba <wakaba@suika.fam.cx>
92    
93 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
94     and |column| a higher priority than the one set by the
95     tokenizer's input handler.
96     ($self->{read_until}): Exclude U+FFFD (but this might
97     not be necessary, since now we do line/column fixup in
98     the character decode handle).
99    
100     2008-09-14 Wakaba <wakaba@suika.fam.cx>
101    
102 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
103    
104     2008-09-14 Wakaba <wakaba@suika.fam.cx>
105    
106 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
107     and |manakai_getc_until| to |manakai_read_until| to
108     reduce the number of string copies.
109    
110     2008-09-14 Wakaba <wakaba@suika.fam.cx>
111    
112 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
113     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
114     standard feature to |open| a string as a filehandle,
115     since Perl's string filehandle seems not supporting |ungetc|
116     method correctly.
117     (parse_char_stream): Define |{getc_until}| method.
118     (DATA_STATE): Experimental support for |getc_until| feature.
119    
120 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
121    
122 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
123    
124     2008-09-13 Wakaba <wakaba@suika.fam.cx>
125    
126 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
127     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
128     and replaced by |{prev_state}|.
129    
130     * mkhtmlparser.pl: Remove |{char}| feature.
131     Remove |!!!back-next-input-character;| macro.
132    
133     2008-09-13 Wakaba <wakaba@suika.fam.cx>
134    
135 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
136     entity related tokenizer states in favor of new states
137     implementing the consume character reference algorithm.
138    
139     2008-09-13 Wakaba <wakaba@suika.fam.cx>
140    
141     * HTML.pm.src: "Consume a character reference" algorithm is
142 wakaba 1.296 now implemented as a tokenizer's state, rather than
143     a method, with minimum changes (more changes will
144     be made, in due course). "Bogus comment state"'s inner
145     loop gets removed.
146    
147     2008-09-13 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
150 wakaba 1.295 into their own tokenizer states.
151    
152     2008-09-13 Wakaba <wakaba@suika.fam.cx>
153    
154 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
155     is split into three states.
156    
157     2008-09-13 Wakaba <wakaba@suika.fam.cx>
158    
159 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
160     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
161     no longer does the tokenizer have to push back next input
162     characters in those states.
163    
164     2008-09-13 Wakaba <wakaba@suika.fam.cx>
165    
166 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
167     into four states so that no longer does the tokenizer have to push
168     back next input characters in that state.
169    
170 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
171    
172     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
173     which can be used to insert some wrapper between the character
174     stream handle and the tokenizer. (It is currently not supported
175     for |set_inner_html| for |Element|s).
176    
177 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
180    
181     2008-09-10 Wakaba <wakaba@suika.fam.cx>
182    
183 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
184    
185     * HTML.pm.src: Don't specify |text| argument for the
186     |chardecode:fallback| error, since it is not the encoding
187     being used alternatively.
188    
189 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
190    
191     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
192    
193 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
194    
195 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
196    
197     2008-08-31 Wakaba <wakaba@suika.fam.cx>
198    
199 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
200     to after after frameset insertion mode processing (HTML5
201     revision 1909). Note that the implementation was wrong
202     per the old spec before the r1909 changes.
203    
204 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
207     spec (HTML5 revision 2093).
208     ($process_row): Algorithm step numbers synced with the
209     spec (HTML5 revision 2092).
210    
211     2008-08-30 Wakaba <wakaba@suika.fam.cx>
212    
213 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
214     revision 2094).
215    
216     2008-08-30 Wakaba <wakaba@suika.fam.cx>
217    
218     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
219    
220     2008-08-30 Wakaba <wakaba@suika.fam.cx>
221    
222 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
223     name (after another attribute) now raise parse error (HTML5
224     revision 2123). Empty unquoted attribute values are no
225     longer allowed (HTML5 revision 2122).
226    
227     2008-08-30 Wakaba <wakaba@suika.fam.cx>
228    
229 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
230     revision 2130).
231    
232     2008-08-30 Wakaba <wakaba@suika.fam.cx>
233    
234 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
235     as |lang| attribute value for HTML elements (HTML5 revision 2062
236     and so on).
237    
238     2008-08-30 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
241     was missing.
242    
243     * URIChecker.pm: The end of the URL should be marked as the
244     error location for an empty path error. The position
245     between the userinfo and the port components should be
246     marked as the error location for an empty host error.
247    
248     2008-08-30 Wakaba <wakaba@suika.fam.cx>
249    
250 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
251     value the error occurs for errors. Report unknown
252     address format error in warning level, since address
253     formats are rarely added. Path segments starting with "/.."
254     were misinterpreted as a dot-segment.
255    
256     2008-08-30 Wakaba <wakaba@suika.fam.cx>
257    
258 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
259     |Message::DOM::DOMImplementation|.
260    
261 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
262    
263     * IMTChecker.pm: Updated for the new error reporting architecture.
264    
265     * ContentChecker.pm: Error levels for IMTs are added.
266    
267 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
268    
269     * H2H.pm (_shift_token): Support for unquoted HTML attribute
270     values.
271    
272 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
273    
274 wakaba 1.275 * CacheManifest.pm: Support for new style of error
275     reports.
276    
277     * HTML.pm.src: Set line=1, column=1 to the document node.
278    
279     2008-08-16 Wakaba <wakaba@suika.fam.cx>
280    
281 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
282     and URL checkers. Support for more error levels for bogus
283     langauge tag and URL "standards".
284    
285     * LangTag.pm, URIChecker.pm: Support for new style error
286     level reporting.
287    
288 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
289    
290 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
291    
292     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
293     specifying. Error types are revised.
294    
295     2008-08-15 Wakaba <wakaba@suika.fam.cx>
296    
297 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
298     renewed.
299    
300     2008-08-15 Wakaba <wakaba@suika.fam.cx>
301    
302 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
303     are revised. Use new style for "level" specification.
304    
305     * mkhtmlparser.pl: Use new style for "level" specification.
306    
307 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
310     reporting process for broken ignored valuetype definition.
311     (Valuetype idl_text): Support for special "DOMString" name.
312    
313     2008-08-03 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
316     terminal of the ScopedName is "DOMString", such that whether
317     the last part of the scoped name is "DOMString" or "_DOMString"
318     later. It is necessary to determine whether a |typedef|
319     definition should be ignored or not.
320     (parse_char_string): Unescape the identifier of
321     exception members.
322     ($resolve): Return undef for builtin types and sequence<T>
323     types (we might not have to do this, however...).
324     (check): Support checking for Exceptions, Valuetypes,
325     and Typedefs.
326     ($serialize_type): Support for "DOMString::::" syntax.
327     (Typedef idl_text): Output Type as "DOMString" if it
328     is really "DOMString" (i.e. its internal representation
329     is "::DOMString::").
330    
331     2008-08-03 Wakaba <wakaba@suika.fam.cx>
332    
333 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
334     for constant types in the |check| method.
335     (check): Support for checking of attributes, operations, and
336     arguments.
337     (Attribute/Operation idl_text): Exception names in getraises,
338     setraises, and raises clauses is serizlied by |$serialize_type|
339     code.
340    
341 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
342    
343 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
344     hexadecimal numbers (the original pattern, taken from the spec,
345     was not work for hexadecimal numbers, because the "0" prefix
346     matches to the [0-7]* part (as an empty string) and therefore
347     it does not match with remaining "x..." part of a "0x..." integer
348     literal.
349     ($get_type): It now returns a string, not an array reference,
350     for regular types and |sequence| types (i.e. it in any case
351     returns a string).
352     ($get_next_token): The second item in the array that represents
353     a integer or float token is now a Perl number value, not the
354     original string representation of the number.
355     (check): Support for const value consistency checking.
356     No extended attribute is defined for constants.
357     (Node subclasses): Use simple strings rather than array references
358     for default data type values.
359     ($serialize_type): Type values are now simple strings.
360     (value): If the new attribute value is a false value, then
361     a FALSE value is set to the attribute.
362    
363     2008-08-02 Wakaba <wakaba@suika.fam.cx>
364    
365 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
366     in its stringified format ("scoped name" as defined in the
367     spec). Note that future version of this module should not use
368     array references for type values and the |type_text| attribute
369     should be made obsolete.
370     (parse_char_string): Unescape attribute names.
371     (check): Support for checking of whether inherited interfaces
372     are actually defined or not. Support for checking of whether
373     interface member identifiers are duplicated or not.
374     ($serialize_type): Scoped names are returned as is. A future
375     version of this code should escape identifiers other than "DOMString",
376     otherwise the idl_text would be non-conforming.
377    
378     2008-08-02 Wakaba <wakaba@suika.fam.cx>
379    
380 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
381 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
382     for Definition's were ignored.
383     (append_child): Set |parent_node| attribute.
384     (parent_node): New attribute.
385     (check): Support interface/exception members. Support
386     extended attributes. Support definition identifier uniqueness
387     constraint.
388     (qualified_name): New attribute.
389     (Interface/Exception idl_text): Extended attributes were
390     not prepended to the returned text.
391    
392     2008-08-02 Wakaba <wakaba@suika.fam.cx>
393    
394     * WebIDL.pm (parse_char_string): Set line/column numbers
395 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
396     would make things wrong. Support for interface forward
397     declarations was missing. Broken interface declarations
398     with no block were not ignored entirely.
399     (Whatpm::WebIDL::Node): New abstract class. This class
400     makes things easier.
401     (child_nodes): New attribute. Unlike DOM's attribute with
402     same name, this attribute returns a dead list of nodes for
403     simplicity.
404     (get_user_data, set_user_data): New methods.
405     (Module idl_text): A SPACE character should be inserted
406     before the |{| character.
407     (Interface idl_text): Support for interface forward declarations.
408     (is_forward_declaration): New attribute.
409    
410 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
411    
412 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
413    
414     2008-07-19 Wakaba <wakaba@suika.fam.cx>
415    
416 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
417     it now can handle broken extended attributes and as such.
418    
419     2008-07-19 Wakaba <wakaba@suika.fam.cx>
420    
421 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
422     Support for extended attributes with arguments.
423    
424     2008-07-19 Wakaba <wakaba@suika.fam.cx>
425    
426 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
427     (Interface->idl_text): Tentative support for inheritances.
428    
429     2008-07-19 Wakaba <wakaba@suika.fam.cx>
430    
431 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
432     Support for raises, setraises, and getraises syntaxes.
433    
434 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
435    
436     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
437     proper support for types, extended attributes, and exceptions yet).
438     WebIDL parser, version 1 (no support for exceptions yet,
439     no proper support for extended attributes yet).
440    
441 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
442    
443     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
444    
445 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
446    
447 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
448    
449     2008-06-29 Wakaba <wakaba@suika.fam.cx>
450    
451 wakaba 1.255 * WebIDL.pm: New module.
452    
453 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
454    
455     * Makefile (Entities.html): URI changed.
456    
457 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
458    
459     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
460    
461 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
462    
463 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
464    
465     2008-06-01 Wakaba <wakaba@suika.fam.cx>
466    
467 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
468     spec (commented out). (HTML5 revision 1711).
469    
470 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
473    
474     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
475     as UTF-16 (HTML5 revision 1701).
476    
477     2008-05-25 Wakaba <wakaba@suika.fam.cx>
478    
479 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
480     1692).
481    
482     2008-05-25 Wakaba <wakaba@suika.fam.cx>
483    
484     * HTML.pm.src: The secondary insertion mode used when switching
485     to foreign content is the "in body" insertion mode (HTML5 revision
486     1696).
487    
488     2008-05-25 Wakaba <wakaba@suika.fam.cx>
489    
490 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
491     revision 1697).
492    
493 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
494    
495 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
496     insertion mode (HTML5 revision 1693). Update SVG camelCase
497     attribute list (HTML5 revision 1700). <textarea> closes
498     </select> (HTML5 revision 1699). More start tags close in
499     foreign content insertion mode (HTML5 revision 1698).
500    
501     2008-05-24 Wakaba <wakaba@suika.fam.cx>
502    
503 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
504    
505     2008-05-24 Wakaba <wakaba@suika.fam.cx>
506    
507 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
508     revision 1674).
509    
510     2008-05-24 Wakaba <wakaba@suika.fam.cx>
511    
512 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
513     revision 1676).
514    
515     2008-05-24 Wakaba <wakaba@suika.fam.cx>
516    
517 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
518     quriks mode detection (HTML5 revision 1679).
519    
520     2008-05-24 Wakaba <wakaba@suika.fam.cx>
521    
522 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
523     revision 1685).
524    
525     2008-05-24 Wakaba <wakaba@suika.fam.cx>
526    
527 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
528     revision 1684).
529    
530     2008-05-24 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
533     work (HTML5 revision 1690).
534    
535     2008-05-24 Wakaba <wakaba@suika.fam.cx>
536    
537 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
538     end tags in head insertion modes (HTML5 revision 1686).
539     (parse_char_string): Bug fix for non-utf8 character string handlings.
540     (parse_char_stream): |ungetc| does not work well for this context.
541    
542 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
543    
544 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
545     |parse_byte_stream|.
546     (parse_byte_stream): New method.
547    
548     2008-05-18 Wakaba <wakaba@suika.fam.cx>
549    
550 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
551     by encoding layer error reporter.
552    
553 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
554    
555 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
556     rather than converting the whole byte string and then parsing.
557     Propagate errors in character encoding layer.
558     (get_next_token): Precise error reporting for |bare stago| error.
559    
560     2008-05-17 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
563     (parse_char_string): This method is now defined as an invocation
564     of the |parse_char_stream| method.
565    
566     2008-05-17 Wakaba <wakaba@suika.fam.cx>
567    
568 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
569     of the sniffing as info-level errors. Support for new
570     decoding framework in parser resestting.
571     (new): Various default error levels were not set.
572    
573     2008-05-17 Wakaba <wakaba@suika.fam.cx>
574    
575 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
576     algorithm, except for the actual sniffing, is implemented
577     with new framework with Message::Charset::Info.
578    
579 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
580    
581     * CacheManifest.pm (_parse): Drop fragment identifiers from
582     URIs in fallback section (HTML5 revision 1596).
583    
584 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
585    
586 wakaba 1.231 * Makefile (Entities.html): URI has changed.
587    
588     2008-05-10 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
591     1553).
592    
593 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
594    
595 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
596     no longer table elements, but table objects returned
597     by Whatpm::HTMLTable.
598    
599     * HTMLTable.pm (form_table): Return table element node
600     as |$table->{element}|.
601     (assign_header): Support for the |headers=""| attribute.
602    
603     2008-05-06 Wakaba <wakaba@suika.fam.cx>
604    
605 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
606     with no support for headers="".
607     (form_table): Include table width and height to the returned
608     table object for covenience. Indexing in column assignement was wrong.
609     Set whether a data cell is empty or not for convenience.
610    
611 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
612    
613 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
614    
615     2008-05-05 Wakaba <wakaba@suika.fam.cx>
616    
617 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
618     changed (HTML5 revision 1387).
619    
620     2008-05-05 Wakaba <wakaba@suika.fam.cx>
621    
622 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
623     to the end of the table (HTML5 revision 1380).
624    
625     2008-05-05 Wakaba <wakaba@suika.fam.cx>
626    
627 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
628     of 1-based (HTML5 revision 1376).
629    
630     2008-05-05 Wakaba <wakaba@suika.fam.cx>
631    
632 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
633     and "Content-Encoding" no longer prevent sniffing (HTML5
634     revision 1288).
635    
636     2008-05-05 Wakaba <wakaba@suika.fam.cx>
637    
638 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
639     1282).
640    
641 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
642    
643 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
644     Status of XML specs are added.
645    
646     2008-05-03 Wakaba <wakaba@suika.fam.cx>
647    
648 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
649     and |ref=""| attribute (referent element type checking).
650    
651 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
652    
653 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
654     1500) is implemented (except for IDNA part and URI-scheme-specific
655 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
656     documents.
657    
658 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
661     character (HTML5 revision 1263).
662    
663     2008-04-13 Wakaba <wakaba@suika.fam.cx>
664    
665 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
666     definition for |AElig;| was missing).
667    
668     2008-04-13 Wakaba <wakaba@suika.fam.cx>
669    
670 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
671     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
672    
673     2008-04-13 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
676     insertion mode is added (HTML5 revisions 1412 and 1418).
677    
678     2008-04-13 Wakaba <wakaba@suika.fam.cx>
679    
680 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
681     elements (HTML5 revision 1410).
682    
683     2008-04-13 Wakaba <wakaba@suika.fam.cx>
684    
685 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
686     revision 1406).
687    
688     2008-04-13 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
691     elements (HTML5 revision 1404 and 1420).
692    
693 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
694    
695 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
696     elements (HTML5 revision 1404). Unused !!!macro definitions
697     are removed.
698    
699     2008-04-12 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
702     are raised is changed (HTML5 revision 1404).
703    
704 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
707    
708     2008-04-06 Wakaba <wakaba@suika.fam.cx>
709    
710 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
711     replaced by constants representing element category.
712    
713     2008-04-06 Wakaba <wakaba@suika.fam.cx>
714    
715 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
716     for error reporting. (This is a preparation for using constant
717     value for ->[1].)
718    
719 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.205 * RDFXML.pm: Typo fixed.
722    
723     2008-03-22 Wakaba <wakaba@suika.fam.cx>
724    
725 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
726     as same as |must_level|, i.e. level = |m|.
727     (check_element): Make list of URIs in the DOM.
728    
729 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
730    
731 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
732     Local (null-namespace) attribute support.
733    
734     2008-03-21 Wakaba <wakaba@suika.fam.cx>
735    
736 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
737    
738     2008-03-21 Wakaba <wakaba@suika.fam.cx>
739    
740 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
741     and URI attributes is implemented. Warn if unknown
742     value is used in rdf:parseType="" attribute.
743    
744     * URIChecker.pm (check_rdf_uri_reference): New function.
745    
746     2008-03-21 Wakaba <wakaba@suika.fam.cx>
747    
748 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
749     are now resolved.
750    
751     2008-03-21 Wakaba <wakaba@suika.fam.cx>
752    
753 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
754    
755     * RDFXML.pm: undef vs false bug fixed.
756     Reification implemented.
757    
758     2008-03-21 Wakaba <wakaba@suika.fam.cx>
759    
760 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
761     The |node| arguments are added for |ontriple| calls.
762     Too many "attribute not allowed" errors were raised.
763    
764     * ContentChecker.pm: Initial experimental support for rdf:RDF
765     element.
766    
767     2008-03-21 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.197 * RDFXML.pm: New module.
770    
771 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
772    
773 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
774     code was old one yet.
775    
776     2008-03-20 Wakaba <wakaba@suika.fam.cx>
777    
778 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
779     errors. Line/column markings for DOCTYPE, comment, and
780     character tokens are reintroduced; otherwise,
781     error location for "not HTML5" error and errors
782     for implied elements are not attached.
783    
784     2008-03-20 Wakaba <wakaba@suika.fam.cx>
785    
786 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
787    
788     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
789     (value): Setter implemented.
790    
791     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
792    
793     2008-03-20 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
796    
797     2008-03-20 Wakaba <wakaba@suika.fam.cx>
798    
799 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
800     flag, which is no longer used.
801    
802 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
805    
806     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
807     and !!!insert-element-t): Set line/column information to
808     element nodes.
809    
810     2008-03-17 Wakaba <wakaba@suika.fam.cx>
811    
812 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
813     in "<?", "<>", or "</>" should be the error point.
814    
815 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
816    
817 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
818    
819     2008-03-16 Wakaba <wakaba@suika.fam.cx>
820    
821 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
822    
823     2008-03-16 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
826     reporting.
827    
828 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
829    
830 wakaba 1.186 * HTML.pm.src: Error type revised.
831    
832     2008-03-11 Wakaba <wakaba@suika.fam.cx>
833    
834 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
835    
836     2008-03-11 Wakaba <wakaba@suika.fam.cx>
837    
838 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
839    
840 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
841    
842     * mkhtmlparser.pl: Set "level" parameter to parse errors.
843    
844     * HTML.pm.src: Code refined.
845    
846 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
847    
848 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
849     revision 1348). Note that I really don't know this makes
850     any difference in the black-box behavior of the parser.
851    
852     2008-03-09 Wakaba <wakaba@suika.fam.cx>
853    
854 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
855     revision 1348).
856    
857     2008-03-09 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
860    
861     2008-03-09 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
864    
865     2008-03-09 Wakaba <wakaba@suika.fam.cx>
866    
867 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
868     mode and support for the "in select in table" insertion mode (HTML5
869     revision 1342).
870    
871     2008-03-09 Wakaba <wakaba@suika.fam.cx>
872    
873 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
874     in non-tainted <table>s (HTML5 revision 1336).
875    
876     2008-03-09 Wakaba <wakaba@suika.fam.cx>
877    
878 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
879     and <head> (HTML5 revision 1332).
880    
881     2008-03-09 Wakaba <wakaba@suika.fam.cx>
882    
883 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
884     white space (HTML5 revision 1331).
885    
886 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
887    
888 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
889     element (HTML5 revision 1330).
890    
891     2008-03-08 Wakaba <wakaba@suika.fam.cx>
892    
893 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
894     element (HTML5 revision 1328).
895    
896     2008-03-08 Wakaba <wakaba@suika.fam.cx>
897    
898 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
899     into foster parents (HTML5 revision 1326).
900    
901     2008-03-08 Wakaba <wakaba@suika.fam.cx>
902    
903 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
904     revision 1321).
905    
906     2008-03-08 Wakaba <wakaba@suika.fam.cx>
907    
908 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
909     the spec (HTML5 revision 1320). Note that we finally complete
910     all of HTML5 revision 1320 changes.
911    
912     2008-03-08 Wakaba <wakaba@suika.fam.cx>
913    
914 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
915     tags (HTML5 revision 1320).
916    
917     2008-03-08 Wakaba <wakaba@suika.fam.cx>
918    
919 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
920    
921     2008-03-08 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
924     could not be reached so that it is now removed (HTML5 revision
925     1320).
926    
927     2008-03-08 Wakaba <wakaba@suika.fam.cx>
928    
929 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
930     to align with new spec text (HTML5 revision 1320).
931    
932     2008-03-08 Wakaba <wakaba@suika.fam.cx>
933    
934 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
935     have had no harm in theory.
936    
937     2008-03-08 Wakaba <wakaba@suika.fam.cx>
938    
939 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
940     case is split from other end tags for the preparation
941     of implementing HTML revision 1320.
942    
943 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
944    
945 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
946     revision 1320).
947    
948     2008-03-07 Wakaba <wakaba@suika.fam.cx>
949    
950 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
951     and "div" (HTML5 revision 1318). Add comments to where
952     |form| pointer association codes should be inserted (HTML5 revision
953     1319).
954    
955 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
956    
957     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
958    
959 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
960    
961 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
962     other token type, there are), it is replaced by a |die| statement.
963    
964     2008-03-05 Wakaba <wakaba@suika.fam.cx>
965    
966 wakaba 1.159 * HTML.pm.src: Typo fixed.
967    
968 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
969    
970     * HTML.pm.src (_tree_construction_initial): Some limited quirks
971     doctypes were not uppercased for comparison.
972    
973 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
974    
975 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
976     are added.
977    
978     2008-03-03 Wakaba <wakaba@suika.fam.cx>
979    
980 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
981     are set. Cases that are unlikely reached are noted as so.
982    
983     2008-03-03 Wakaba <wakaba@suika.fam.cx>
984    
985 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
986    
987     * mkhtmlparser.pl: Support for |!!!cp| syntax.
988    
989     2008-03-03 Wakaba <wakaba@suika.fam.cx>
990    
991 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
992     for simplicity.
993    
994     2008-03-03 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
997     revision 1307).
998    
999     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1000    
1001 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1002     is set are changed (HTML5 revision 1305).
1003    
1004 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1005    
1006 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1007     1292). Entities are not parsed in comment-like part in RCDATA
1008     elements (HTML5 revision 1294). Allow bare & at the end
1009     of attribute value literals (HTML5 revision 1296). More
1010     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1011     between attributes and ban attribute names or unquoted
1012     attribute values containing single or double quotes (HTML5
1013     revision 1303).
1014    
1015     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1016    
1017 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1018     and related errors unless it is an HTML document (though the spec
1019     is unclear on whether it is applied to XHTML document).
1020    
1021     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1022     are added.
1023    
1024     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1025    
1026 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1027    
1028 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1029    
1030     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1031    
1032     * HTML.pm.src: |charset| in |content| attribute is
1033     case-insensitive (HTML5 revision 1270).
1034    
1035 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1036    
1037     * ContentChecker.pm: New status constants are added.
1038     ($ElementDefault): |status| added.
1039     (check_element): Err for non-standard or deprecated elements.
1040     (_attr_status_info): For non-standard or deprecated attributes.
1041    
1042 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1043    
1044 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1045    
1046     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1047    
1048 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1049     status information is now dispatched.
1050    
1051     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1052    
1053 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1054     problem by adding some more arguments.
1055    
1056     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1057    
1058 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1059     container-for-the-purpose-of-content-model element (not transparent
1060     element) for |check_child_element| calling and significant
1061     text flag marking. This reintroduces |<del>|-and-significant
1062     problem again.
1063    
1064     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1065    
1066 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1067     elements ignored for the purpose of phase changes in
1068     content model checking.
1069    
1070 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1071    
1072 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1073     was not properly managed for transparent cases.
1074    
1075     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1076    
1077 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1078     and |audio| as semi-transparent elements.
1079    
1080     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1081    
1082 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1083     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1084     Support for |html:object| as a semi-transparent element.
1085    
1086     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1087    
1088 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1089     the tree is entirely revised to make it easier to track
1090     the state of ancestors/descendants. As a result of this
1091     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1092     support for content model checking for HTML elements |figure|,
1093     |object|, |video|, and |audio| and checking for XML elements (and
1094     some XMLNS checkings) are dropped for now. They will be
1095     reimplemented in due cource.
1096    
1097 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1098    
1099 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1100     token (HTML5 revision 1225).
1101    
1102     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1103    
1104 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1105    
1106     * HTML.pm.src: Save whether |meta| |content| attribute
1107     contains character references or not.
1108    
1109     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1112     imlement significant content checking for contents
1113     with |del| elements.
1114    
1115     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1116    
1117 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1118     elements are added.
1119     (_get_children): HTML |object| elements are now semi-transparent.
1120    
1121     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1122    
1123 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1124    
1125     * CacheManifest.pm: HTML5 revision 1211 implemented.
1126    
1127     * CacheManifest.pod: Updated.
1128    
1129 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1130    
1131     * ContentChecker.pm (check_document, check_element): Support
1132     for second argument ($onsubdoc).
1133     (_get_css_parser): Removed (now it is part of WDCC).
1134    
1135 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1136    
1137     * ContentChecker.pm (_get_css_parser): New.
1138    
1139 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1140    
1141 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1142     for checking had been used.
1143    
1144     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1145    
1146 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1147    
1148 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1149    
1150 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1151     specced in RFC 2046 and RFC 4288.
1152     (application/atom+xml): Definition added.
1153    
1154     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1155    
1156 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1157     warnings (rather than SHOULD-level errors).
1158    
1159     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1160    
1161 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1162    
1163     * HTML.pm.src: A flag for character references in attribute
1164     values are added. Set |manakai_has_reference| user data
1165     to |charset| attribute.
1166    
1167     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1168    
1169 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1170     attributes.
1171    
1172     * ContentChecker.pm (check_document): Warn if charset requirements
1173     cannot be tested.
1174    
1175 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1176    
1177     * HTML.pm.src (parse_byte_string): Detect charset
1178     by universalchardet if charset parameter is not specified.
1179    
1180     * Makefile (Charset-all, Charset-clean): New rules.
1181    
1182 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1183    
1184 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1185     of character encoding declaration and selection of encoding
1186     for HTML document.
1187    
1188     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1189    
1190 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1191     type in list context.
1192    
1193     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1194    
1195 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1196     ignoring (HTML5 revisions 1013 and 1016).
1197    
1198     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1199    
1200 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1201     white spaces, are implemented.
1202    
1203 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1204    
1205 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1206     attribute if possible.
1207    
1208     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1209    
1210 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1211     (parse_char_string): New alias for |parse_string|.
1212     (main phase): Invoking "change the encoding" algorithm if desired.
1213    
1214     * HTML.pod: Updated.
1215    
1216     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1217    
1218 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1219    
1220     * Makefile (HTML-all, HTML-clean): New.
1221    
1222     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1223    
1224     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1225    
1226 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1227    
1228     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1229    
1230 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1233    
1234     * CacheManifest.pod: New file.
1235    
1236     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1237    
1238 wakaba 1.114 * CacheManifest.pm: New module.
1239    
1240     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1241    
1242 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1243     callback.
1244    
1245     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1246    
1247 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1248    
1249 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1250    
1251 wakaba 1.111 * Makefile (clean): New rule.
1252    
1253     * NanoDOM.pm (public_id, system_id): New attributes.
1254    
1255     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1258    
1259 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1260    
1261     * ContentChecker.pm (check_document): Support for
1262     new |is_xml_root| flag.
1263     (check_element): Support for new |pluses| state.
1264     (_add_pluses): New method.
1265     (_remove_minuses): Support for new |minus| item.
1266    
1267 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1268    
1269 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1270     root element.
1271    
1272     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1275    
1276 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1277    
1278 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1279    
1280     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1281    
1282 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1283    
1284     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1285    
1286 wakaba 1.103 * LangTag.pm: New module.
1287    
1288 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1289    
1290     * HTML.pm.src: Some error types were wrong.
1291    
1292 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1293    
1294 wakaba 1.101 * CSS/: New directory.
1295    
1296     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1297    
1298 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1299     for |noscript| in |head|.
1300    
1301 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1302    
1303     * URI/: New directory.
1304    
1305 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1306    
1307 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1308     number.
1309    
1310     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1311    
1312 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1313     by |&|s.
1314    
1315     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1316    
1317 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1318    
1319     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1322    
1323     * HTML.pm.src: Insertion modes are now represented in number.
1324    
1325     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1326    
1327 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1328     999) is implemented.
1329    
1330 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1331    
1332     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1333     983) is implemented.
1334    
1335 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1336    
1337     * ContentChecker.pod: New documentation.
1338    
1339     * Makefile: A rule for |ContentChecker.html| is added.
1340    
1341     * ContentChecker.pm: A pod "LICENSE" section is added.
1342    
1343     * NanoDOM.pm ($VERSION): New variable.
1344    
1345 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1346    
1347 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1348     list of allowed HTML elements.
1349    
1350     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1351    
1352 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1353     HTML elements.
1354    
1355     * URIChecker.pm (check_iri): New.
1356     (check_iri_reference): Error type for IRI reference
1357     syntax error is changed.
1358    
1359 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1360    
1361 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1362     is added.
1363     (check_document): Load appropriate module before validation.
1364    
1365     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.88 * ContentChecker/: New directory.
1368    
1369     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1370    
1371 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1372    
1373     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1374    
1375 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1376    
1377 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1378     even if no document element is found.
1379    
1380     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1381    
1382 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1383    
1384     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1385    
1386 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1387    
1388     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1389    
1390 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1391     is split into "after html body" and "after html frameset"
1392     insertion modes. Their codes are merged with "after body"
1393     and "after frameset" codes. |$previous_insertion_mode|
1394     has been removed. "after frameset" code is
1395     merged with "in frameset" code.
1396    
1397     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1398    
1399 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1400     merged with the "in head" insertion mode.
1401    
1402     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1403    
1404 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1405     reorganized.
1406    
1407     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1408    
1409 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1410     are merged.
1411    
1412     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1413    
1414 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1415     with "in table" insertion mode.
1416    
1417     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1418    
1419 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1420     modes are merged.
1421    
1422     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1423    
1424 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1425     insertion mode!
1426    
1427     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1428    
1429 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1430     now directly invoke the handler.
1431    
1432     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1433    
1434 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1435     is merged to the "in body" insertion mode code.
1436    
1437     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1438    
1439 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1440     insertion modes are merged.
1441    
1442     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1443    
1444 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1445    
1446     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1447    
1448 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1449     instead of string constant for |{content_model_flag}|.
1450    
1451     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1452    
1453 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1454     to the "duplicate attribute" error.
1455    
1456 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1457    
1458 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1459    
1460     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1461    
1462 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1463    
1464     * HTML.pm.src: A typo is fixed.
1465    
1466 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1467    
1468 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1469     from "in HTML:xml:lang" error. Character position
1470     is now the last part of the error type in the URI error
1471     description. Report "unsupported" status for language
1472     tags, media queries, script codes, and style sheets.
1473    
1474     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1475    
1476 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1477     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1478     in XML.
1479    
1480     * NanoDOM.pm (Attr.owner_document): New attribute.
1481    
1482     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1483    
1484 wakaba 1.66 * HTML.pm.src: The character immediately following
1485     a bare |hcro| was discarded. Fix handling of
1486     entity references in attribute values.
1487    
1488     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1489    
1490 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1491     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1492     factored out. Error types |in html:#DOCTYPE|
1493     and |after html:#DOCTYPE| are merged
1494     into |DOCTYPE in the middle|. |</frameset>|
1495     in fragment parsing mode changed the
1496     insertion mode.
1497    
1498     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1499    
1500 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1501     phase is now an insertion mode. Treatments for white
1502     space character tokens were incorrect for some
1503     insertion modes. An old |meta| case was not removed.
1504    
1505     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1506    
1507 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1508     implemented (but changing the encoding is not yet:-).
1509    
1510 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1511    
1512 wakaba 1.62 * Charset/: New directory.
1513    
1514     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1515    
1516 wakaba 1.61 * H2H.pm: New Perl module (created from
1517     manakai's H2H.dis).
1518    
1519     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1520    
1521 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1522     manakai's SimpleLS.dis).
1523    
1524 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1525    
1526     * HTML.pm.src: |<!---x-->| was not processed correctly.
1527    
1528 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1529    
1530 wakaba 1.58 * HTML.pm.src: Report correct error message
1531     for |<body></div></body>|.
1532    
1533     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1536    
1537     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1538    
1539     * ContentChecker.pm: Return |{term}| list.
1540    
1541 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1542    
1543 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1544     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1545     and <title> outside of head).
1546    
1547     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1548    
1549 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1550     and private types/subtypes.
1551    
1552     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1553     URIChecker.pm, HTMLTable.pm: Error messages are now
1554     consistent; they are all listed in
1555     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1556    
1557 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1558    
1559 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1560     is now erred. |<datalist>| is implemented.
1561     Attribute checker for |<command>| and |<menu>| are
1562     added. Support for |contextmenu| global attribute
1563     is added.
1564    
1565     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1568     of Step 3 has been changed.
1569    
1570     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1571    
1572 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1573    
1574 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1575    
1576 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1577     incorrect.
1578    
1579     * ContentChecker.pm: Support for the |footer| element.
1580     Check URI syntax for space-separated URI attributes.
1581     Support for the |tabindex| attribute. Support
1582     for |datetime| attribute.
1583    
1584     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1585    
1586 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1587     character references). HTML5 revision 1.146 (white space
1588     characters before root start tag). HTML5 revision
1589     1.148 (named character references in attribute values).
1590     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1591    
1592     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1593    
1594     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1595 wakaba 1.49 in <head>).
1596    
1597     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1598    
1599 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1600     in body).
1601    
1602     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1603    
1604     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1605    
1606 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1607     or RCDATA in fragment parsing mode). HTML5 revision
1608     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1609    
1610     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1611    
1612     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1613     for <pre> and <textarea>).
1614    
1615 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1616    
1617 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1618    
1619     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1620    
1621 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1622    
1623     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1624    
1625 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1626     HTML5 revision 917 (conforming bare &).
1627    
1628     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1629    
1630 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1631    
1632     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1633     HTML5 revision 915 (<nobr>).
1634    
1635     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1636    
1637 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1638    
1639     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1640     New attributes.
1641    
1642     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1643    
1644 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1645     in tags).
1646    
1647     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1648    
1649 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1650    
1651     * HTML.pm.src: |$entity_char| is removed and
1652     requires |Whatpm::_NamedEntityList| instead.
1653     HTML5 revision 898 (refc), except that lack of refc
1654     is parse error.
1655    
1656     * mkentitylist.pl: New script.
1657    
1658     * Makefile (all): |_NamedEntityList.pm| is added.
1659     (_NamedEntityList.pm, Entities.html): New rules.
1660    
1661     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1662    
1663 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1664     were ignored and U+000D immediately following another
1665     U+000D was not converted to U+000A.
1666    
1667     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1668    
1669 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1670     nodes before appended). Parser was not ready for NULL
1671     parse error and escape flag.
1672    
1673     * NanoDOM.pm (adopt_node): New.
1674    
1675     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1676    
1677 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1678     CDATA and RCDATA). Note that current implementation
1679     is simply repeating what the spec says and it is maybe not
1680     a best way to do it.
1681    
1682     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1683    
1684 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1685     the form element if a descendant element without implied end tag has
1686     still been open).
1687    
1688     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1689    
1690 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1691     space characters non-conforming).
1692    
1693     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1696     HTML5 revision 869 (C1 character references).
1697    
1698     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1699    
1700 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1701     a |textarea| is removed).
1702    
1703 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1704    
1705     * NanoDOM.pm (get_attribute_node_ns): New method.
1706    
1707     * ContentChecker.pm: |script| |async| and |defer|
1708     no longer require |src|. |async| MUST NOT be
1709     specified if |defer|. (HTML5 revision 858).
1710    
1711 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1712    
1713     * HTML.pm.src: |<form><form>| went to inifinite loop.
1714    
1715 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1716    
1717 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1718     as a document element) flag on.
1719     (new): Removed.
1720     (check_document): New method.
1721    
1722     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1723    
1724 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1725     were incorrect.
1726    
1727     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1728    
1729 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1730     error position in the |type| option of the error.
1731    
1732     * HTMLTable.pm (form_table): The |$onerror| parameter
1733     is now optional. Some bugs are fixed.
1734    
1735     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1736    
1737 wakaba 1.27 * HTMLTable.pm: New module.
1738    
1739     * ContentChecker.pm (table): Invoke table model error checker.
1740    
1741     * NanoDOM.pm (first_child, get_attribute_ns): New.
1742    
1743 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1744    
1745 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1746     (link/@rel, a/@rel, area/@rel): Use new checker.
1747    
1748     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1749    
1750     * _LinkTypeList.pm: New file.
1751    
1752     * mklinktypelist.pl: New file.
1753    
1754     * .cvsignore: |RelExtensions.html| added.
1755    
1756     * NanoDOM.pm (child_nodes): Returns an empty array
1757     for non-child-containing node types.
1758     (text_content): New attribute.
1759    
1760     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1761    
1762 wakaba 1.25 * IMTChecker.pm: New module.
1763    
1764     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1765     to test parameter value validity.
1766    
1767     * HTML.pm.src ($style_start_tag): Attributes were
1768     discarded.
1769    
1770 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1771    
1772 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1773    
1774     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1775    
1776 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1777     and RFC 3987 are listed (not all testable items are checked yet).
1778    
1779     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1780    
1781 wakaba 1.22 * URIChecker.pm: New module.
1782    
1783 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1784    
1785 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1786     or associated to some placeholder.
1787     ($ElementDefault): Warn unknown attributes
1788     for unknown elements as "attribute not supported".
1789     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1790     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1791     (|a| attribute checker): Reimplemented.
1792    
1793     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1794    
1795 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1796     ($HTMLAttrChecker): Event handler content attributes
1797     are added.
1798     (link, embed): Required attribute is now checked.
1799     (embed): Unknown local attributes are no longer warned.
1800    
1801     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1802    
1803 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1804     ($HTMLIMTAttrChecker): New checker.
1805     (link@rel, link@href, link@type, style@type,
1806     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1807     object@data, object@type, source@src, source@type, area@alt,
1808     area@shape, area@coords, area@href,
1809     area@ping, area@rel, area@type, script@src,
1810     script@defer, script@async, script@type): Checkers added.
1811    
1812     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1813    
1814 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1815    
1816 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1817    
1818 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1819     attributes. Report an error if |Element.prefix|
1820     is |xmlns|.
1821    
1822     * NanoDOM.pm (prefix): New attribute.
1823    
1824     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1825    
1826 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1827     mode, action for |<iframe>| was missing.
1828    
1829     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1830    
1831 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1832     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1833    
1834     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1835    
1836 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1837     ($HTMLURIAttrChecker): Placeholder.
1838     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1839     New.
1840    
1841     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1842    
1843 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1844     for global attributes, |html|, |base|, |style|, and |meta|.
1845    
1846     * NanoDOM.pm (insert_before): Weaken reference
1847     to the parent node.
1848     (Attr::new): Set |owner_element| attribute.
1849     (namespace_uri, manakai_local_name): New attribute implementations.
1850     (owner_element): New attribute.
1851    
1852     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1853    
1854 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1855     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1856     $Element->{$HTML_NS}->{''}): New.
1857     (check_element): Invoke attrs_checker for each element.
1858    
1859 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1860    
1861 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1862    
1863     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1864    
1865 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1866     element type testings.
1867    
1868     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1869    
1870 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1871     error when an element type is put in the "minus" list
1872     and the element type is not allowed explicitly in the particular
1873     element content model.
1874     (html:a checker): New checker.
1875     (html:details, html:datagrid): New checkers.
1876     (html:legend): New checker.
1877    
1878     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1879    
1880 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1881    
1882     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1883    
1884 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1885     checker.
1886     (html:dd checker): New checker.
1887     (html:q, html:em, html:strong, html:small,
1888     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1889    
1890     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1891    
1892 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1893     from |$ElementDefault->{checker}|.
1894     ($ElementDefault->{checker}): Throw an error that
1895     the element type is not supported by the checker.
1896     ($HTMLMetadataElement): |html:base| was missing.
1897     ($HTMLEmptyChecker): Don't throw an error
1898     for inter-element whitespace nodes.
1899     (html:html checker): Errors were not
1900     thrown even if |html:head| and/or |html:body|
1901     children were missing.
1902     (html:head checker): An error was not
1903     thrown if <meta charset> appered after other
1904     elements.
1905    
1906 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1907    
1908     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1909     and |noscript| elements are implemented.
1910     (new): New method.
1911    
1912 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1913    
1914 wakaba 1.4 * ContentChecker.pm: New module.
1915    
1916     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1917    
1918 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1919     document_element, manakai_local_name,
1920     manakai_element_type_match): New method.
1921    
1922     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1923    
1924     * HTML.pm.src: Replace decimal and hexadecimal numeric
1925     entities in C1 range using Windows-1252 mapping. Bare LF
1926     did not count as new line for error reporting.
1927    
1928 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1929    
1930     * NanoDOM.pm (DOMImplementation): New class.
1931     (append_child): Weaken the |parent_node| reference.
1932     (create_element_ns, Element new): Set the |owner_document|
1933     reference.
1934     (implementation): New attribute.
1935     (owner_document, local_name, namespace_uri): New attributes.
1936    
1937     * HTML.pm.src (parse_string): Line and column numbers
1938     are now provided to error handler.
1939     (!!!parse-error): Short descriptions are added.
1940     (_construct_tree): Split into three methods; support
1941     for innerHTML mode.
1942     (set_inner_html): New method.
1943    
1944 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1945    
1946     * NanoDOM.html: Documentation is added.
1947    
1948     * HTML.pod, ContentType.html: Documentation is revised.
1949    
1950     * .cvsignore: Pod2html temporary files are added.
1951    
1952     * Makefile: Make |NanoDOM.html|.
1953    
1954     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1955    
1956     * HTML.pm.src (parse_string): New method.
1957     (get_inner_html): Renamed from |inner_html|.
1958    
1959     * Makefile: A rule for |HTML.html| is added.
1960    
1961     * HTML.pod: New documentation.
1962    
1963     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1964    
1965     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1966     (clone_node): Attribute nodes were not completely copied.
1967    
1968     * HTML.pm.src: Many bugs are fixed.
1969    
1970     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1971    
1972     * HTML.pm.src: Some typos are fixed.
1973    
1974     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1975    
1976     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1977     are removed.
1978    
1979     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1980    
1981     * HTML-consume-entity.src: Removed.
1982    
1983     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1984    
1985     * HTML.pm.src: Tokenizer's handling on hexadecimal
1986     numeric entities are rewritten.
1987    
1988     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1989    
1990     * HTML.pm.src: Some tokenizer bugs are fixed.
1991    
1992     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1993    
1994     * HTML.pm.src: The tree construction stage is implemented.
1995    
1996     * mkhtmlparser.pl: New macros are added.
1997    
1998     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1999    
2000     * ContentType.pm: A note on bug in the specification
2001     is removed since it's been now fixed.
2002    
2003     * .cvsignore: New file.
2004    
2005     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2006    
2007     * HTML.pm.src, HTML-consume-entity.src: New files.
2008    
2009     * Makefile (HTML.pm): New rule.
2010    
2011     * mkhtmlparser.pl: New script.
2012    
2013     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2014    
2015     * Makefile: New file.
2016    
2017     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2018    
2019     * ContentType.pm: An error in pod is fixed.
2020    
2021     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2022    
2023     * ContentType.pm: Documentation is added.
2024    
2025     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2026    
2027     * ContentType.pm: New Perl module.
2028    
2029     * ChangeLog: New file.
2030    
2031    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24