/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.309 - (hide annotations) (download)
Sun Sep 14 14:35:43 2008 UTC (16 years, 10 months ago) by wakaba
Branch: MAIN
Changes since 1.308: +7 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 14:35:32 -0000
	* HTML.pm.src: Regexp typo fixed.  |{prev_char}|
	and |{next_char}| initializations are moved to initialization
	method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
	with |parse_char_stream|.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
4     and |{next_char}| initializations are moved to initialization
5     method. |{read_until}| now supports buffering. Sync |set_inner_html|
6     with |parse_char_stream|.
7    
8     2008-09-14 Wakaba <wakaba@suika.fam.cx>
9    
10 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
11     invoke |manakai_read_until|, not only |read|, where
12     possible, to decrease the number of |read| method calls.
13    
14     * mkhtmlparser.pl: Related changes to the aforementioned
15     modification.
16    
17     2008-09-14 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
20     would report character error from now.
21    
22     2008-09-14 Wakaba <wakaba@suika.fam.cx>
23    
24 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
25     tokens in "before head insertion mode" was not
26     correctly handled.
27     (set_inner_html): Reimplemented using CharString decodehandle
28     class. Support for $get_wrapper argument. Support
29     for |{read_until}| feature.
30    
31     2008-09-14 Wakaba <wakaba@suika.fam.cx>
32    
33 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
34     entities point the "&" character.
35    
36     2008-09-14 Wakaba <wakaba@suika.fam.cx>
37    
38 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
39     be added to the list of excluded characters.
40    
41     2008-09-14 Wakaba <wakaba@suika.fam.cx>
42    
43 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
44     and |column| a higher priority than the one set by the
45     tokenizer's input handler.
46     ($self->{read_until}): Exclude U+FFFD (but this might
47     not be necessary, since now we do line/column fixup in
48     the character decode handle).
49    
50     2008-09-14 Wakaba <wakaba@suika.fam.cx>
51    
52 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
53    
54     2008-09-14 Wakaba <wakaba@suika.fam.cx>
55    
56 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
57     and |manakai_getc_until| to |manakai_read_until| to
58     reduce the number of string copies.
59    
60     2008-09-14 Wakaba <wakaba@suika.fam.cx>
61    
62 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
63     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
64     standard feature to |open| a string as a filehandle,
65     since Perl's string filehandle seems not supporting |ungetc|
66     method correctly.
67     (parse_char_stream): Define |{getc_until}| method.
68     (DATA_STATE): Experimental support for |getc_until| feature.
69    
70 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
71    
72 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
73    
74     2008-09-13 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
77     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
78     and replaced by |{prev_state}|.
79    
80     * mkhtmlparser.pl: Remove |{char}| feature.
81     Remove |!!!back-next-input-character;| macro.
82    
83     2008-09-13 Wakaba <wakaba@suika.fam.cx>
84    
85 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
86     entity related tokenizer states in favor of new states
87     implementing the consume character reference algorithm.
88    
89     2008-09-13 Wakaba <wakaba@suika.fam.cx>
90    
91     * HTML.pm.src: "Consume a character reference" algorithm is
92 wakaba 1.296 now implemented as a tokenizer's state, rather than
93     a method, with minimum changes (more changes will
94     be made, in due course). "Bogus comment state"'s inner
95     loop gets removed.
96    
97     2008-09-13 Wakaba <wakaba@suika.fam.cx>
98    
99 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
100 wakaba 1.295 into their own tokenizer states.
101    
102     2008-09-13 Wakaba <wakaba@suika.fam.cx>
103    
104 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
105     is split into three states.
106    
107     2008-09-13 Wakaba <wakaba@suika.fam.cx>
108    
109 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
110     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
111     no longer does the tokenizer have to push back next input
112     characters in those states.
113    
114     2008-09-13 Wakaba <wakaba@suika.fam.cx>
115    
116 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
117     into four states so that no longer does the tokenizer have to push
118     back next input characters in that state.
119    
120 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
121    
122     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
123     which can be used to insert some wrapper between the character
124     stream handle and the tokenizer. (It is currently not supported
125     for |set_inner_html| for |Element|s).
126    
127 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
128    
129 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
130    
131     2008-09-10 Wakaba <wakaba@suika.fam.cx>
132    
133 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
134    
135     * HTML.pm.src: Don't specify |text| argument for the
136     |chardecode:fallback| error, since it is not the encoding
137     being used alternatively.
138    
139 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
140    
141     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
142    
143 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
144    
145 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
146    
147     2008-08-31 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
150     to after after frameset insertion mode processing (HTML5
151     revision 1909). Note that the implementation was wrong
152     per the old spec before the r1909 changes.
153    
154 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
155    
156 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
157     spec (HTML5 revision 2093).
158     ($process_row): Algorithm step numbers synced with the
159     spec (HTML5 revision 2092).
160    
161     2008-08-30 Wakaba <wakaba@suika.fam.cx>
162    
163 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
164     revision 2094).
165    
166     2008-08-30 Wakaba <wakaba@suika.fam.cx>
167    
168     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
169    
170     2008-08-30 Wakaba <wakaba@suika.fam.cx>
171    
172 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
173     name (after another attribute) now raise parse error (HTML5
174     revision 2123). Empty unquoted attribute values are no
175     longer allowed (HTML5 revision 2122).
176    
177     2008-08-30 Wakaba <wakaba@suika.fam.cx>
178    
179 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
180     revision 2130).
181    
182     2008-08-30 Wakaba <wakaba@suika.fam.cx>
183    
184 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
185     as |lang| attribute value for HTML elements (HTML5 revision 2062
186     and so on).
187    
188     2008-08-30 Wakaba <wakaba@suika.fam.cx>
189    
190 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
191     was missing.
192    
193     * URIChecker.pm: The end of the URL should be marked as the
194     error location for an empty path error. The position
195     between the userinfo and the port components should be
196     marked as the error location for an empty host error.
197    
198     2008-08-30 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
201     value the error occurs for errors. Report unknown
202     address format error in warning level, since address
203     formats are rarely added. Path segments starting with "/.."
204     were misinterpreted as a dot-segment.
205    
206     2008-08-30 Wakaba <wakaba@suika.fam.cx>
207    
208 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
209     |Message::DOM::DOMImplementation|.
210    
211 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
212    
213     * IMTChecker.pm: Updated for the new error reporting architecture.
214    
215     * ContentChecker.pm: Error levels for IMTs are added.
216    
217 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
218    
219     * H2H.pm (_shift_token): Support for unquoted HTML attribute
220     values.
221    
222 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
223    
224 wakaba 1.275 * CacheManifest.pm: Support for new style of error
225     reports.
226    
227     * HTML.pm.src: Set line=1, column=1 to the document node.
228    
229     2008-08-16 Wakaba <wakaba@suika.fam.cx>
230    
231 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
232     and URL checkers. Support for more error levels for bogus
233     langauge tag and URL "standards".
234    
235     * LangTag.pm, URIChecker.pm: Support for new style error
236     level reporting.
237    
238 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
241    
242     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
243     specifying. Error types are revised.
244    
245     2008-08-15 Wakaba <wakaba@suika.fam.cx>
246    
247 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
248     renewed.
249    
250     2008-08-15 Wakaba <wakaba@suika.fam.cx>
251    
252 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
253     are revised. Use new style for "level" specification.
254    
255     * mkhtmlparser.pl: Use new style for "level" specification.
256    
257 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
258    
259 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
260     reporting process for broken ignored valuetype definition.
261     (Valuetype idl_text): Support for special "DOMString" name.
262    
263     2008-08-03 Wakaba <wakaba@suika.fam.cx>
264    
265 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
266     terminal of the ScopedName is "DOMString", such that whether
267     the last part of the scoped name is "DOMString" or "_DOMString"
268     later. It is necessary to determine whether a |typedef|
269     definition should be ignored or not.
270     (parse_char_string): Unescape the identifier of
271     exception members.
272     ($resolve): Return undef for builtin types and sequence<T>
273     types (we might not have to do this, however...).
274     (check): Support checking for Exceptions, Valuetypes,
275     and Typedefs.
276     ($serialize_type): Support for "DOMString::::" syntax.
277     (Typedef idl_text): Output Type as "DOMString" if it
278     is really "DOMString" (i.e. its internal representation
279     is "::DOMString::").
280    
281     2008-08-03 Wakaba <wakaba@suika.fam.cx>
282    
283 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
284     for constant types in the |check| method.
285     (check): Support for checking of attributes, operations, and
286     arguments.
287     (Attribute/Operation idl_text): Exception names in getraises,
288     setraises, and raises clauses is serizlied by |$serialize_type|
289     code.
290    
291 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
292    
293 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
294     hexadecimal numbers (the original pattern, taken from the spec,
295     was not work for hexadecimal numbers, because the "0" prefix
296     matches to the [0-7]* part (as an empty string) and therefore
297     it does not match with remaining "x..." part of a "0x..." integer
298     literal.
299     ($get_type): It now returns a string, not an array reference,
300     for regular types and |sequence| types (i.e. it in any case
301     returns a string).
302     ($get_next_token): The second item in the array that represents
303     a integer or float token is now a Perl number value, not the
304     original string representation of the number.
305     (check): Support for const value consistency checking.
306     No extended attribute is defined for constants.
307     (Node subclasses): Use simple strings rather than array references
308     for default data type values.
309     ($serialize_type): Type values are now simple strings.
310     (value): If the new attribute value is a false value, then
311     a FALSE value is set to the attribute.
312    
313     2008-08-02 Wakaba <wakaba@suika.fam.cx>
314    
315 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
316     in its stringified format ("scoped name" as defined in the
317     spec). Note that future version of this module should not use
318     array references for type values and the |type_text| attribute
319     should be made obsolete.
320     (parse_char_string): Unescape attribute names.
321     (check): Support for checking of whether inherited interfaces
322     are actually defined or not. Support for checking of whether
323     interface member identifiers are duplicated or not.
324     ($serialize_type): Scoped names are returned as is. A future
325     version of this code should escape identifiers other than "DOMString",
326     otherwise the idl_text would be non-conforming.
327    
328     2008-08-02 Wakaba <wakaba@suika.fam.cx>
329    
330 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
331 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
332     for Definition's were ignored.
333     (append_child): Set |parent_node| attribute.
334     (parent_node): New attribute.
335     (check): Support interface/exception members. Support
336     extended attributes. Support definition identifier uniqueness
337     constraint.
338     (qualified_name): New attribute.
339     (Interface/Exception idl_text): Extended attributes were
340     not prepended to the returned text.
341    
342     2008-08-02 Wakaba <wakaba@suika.fam.cx>
343    
344     * WebIDL.pm (parse_char_string): Set line/column numbers
345 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
346     would make things wrong. Support for interface forward
347     declarations was missing. Broken interface declarations
348     with no block were not ignored entirely.
349     (Whatpm::WebIDL::Node): New abstract class. This class
350     makes things easier.
351     (child_nodes): New attribute. Unlike DOM's attribute with
352     same name, this attribute returns a dead list of nodes for
353     simplicity.
354     (get_user_data, set_user_data): New methods.
355     (Module idl_text): A SPACE character should be inserted
356     before the |{| character.
357     (Interface idl_text): Support for interface forward declarations.
358     (is_forward_declaration): New attribute.
359    
360 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
361    
362 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
363    
364     2008-07-19 Wakaba <wakaba@suika.fam.cx>
365    
366 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
367     it now can handle broken extended attributes and as such.
368    
369     2008-07-19 Wakaba <wakaba@suika.fam.cx>
370    
371 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
372     Support for extended attributes with arguments.
373    
374     2008-07-19 Wakaba <wakaba@suika.fam.cx>
375    
376 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
377     (Interface->idl_text): Tentative support for inheritances.
378    
379     2008-07-19 Wakaba <wakaba@suika.fam.cx>
380    
381 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
382     Support for raises, setraises, and getraises syntaxes.
383    
384 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
385    
386     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
387     proper support for types, extended attributes, and exceptions yet).
388     WebIDL parser, version 1 (no support for exceptions yet,
389     no proper support for extended attributes yet).
390    
391 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
392    
393     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
394    
395 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
396    
397 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
398    
399     2008-06-29 Wakaba <wakaba@suika.fam.cx>
400    
401 wakaba 1.255 * WebIDL.pm: New module.
402    
403 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
404    
405     * Makefile (Entities.html): URI changed.
406    
407 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
408    
409     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
410    
411 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
414    
415     2008-06-01 Wakaba <wakaba@suika.fam.cx>
416    
417 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
418     spec (commented out). (HTML5 revision 1711).
419    
420 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
421    
422 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
423    
424     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
425     as UTF-16 (HTML5 revision 1701).
426    
427     2008-05-25 Wakaba <wakaba@suika.fam.cx>
428    
429 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
430     1692).
431    
432     2008-05-25 Wakaba <wakaba@suika.fam.cx>
433    
434     * HTML.pm.src: The secondary insertion mode used when switching
435     to foreign content is the "in body" insertion mode (HTML5 revision
436     1696).
437    
438     2008-05-25 Wakaba <wakaba@suika.fam.cx>
439    
440 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
441     revision 1697).
442    
443 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
444    
445 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
446     insertion mode (HTML5 revision 1693). Update SVG camelCase
447     attribute list (HTML5 revision 1700). <textarea> closes
448     </select> (HTML5 revision 1699). More start tags close in
449     foreign content insertion mode (HTML5 revision 1698).
450    
451     2008-05-24 Wakaba <wakaba@suika.fam.cx>
452    
453 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
454    
455     2008-05-24 Wakaba <wakaba@suika.fam.cx>
456    
457 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
458     revision 1674).
459    
460     2008-05-24 Wakaba <wakaba@suika.fam.cx>
461    
462 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
463     revision 1676).
464    
465     2008-05-24 Wakaba <wakaba@suika.fam.cx>
466    
467 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
468     quriks mode detection (HTML5 revision 1679).
469    
470     2008-05-24 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
473     revision 1685).
474    
475     2008-05-24 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
478     revision 1684).
479    
480     2008-05-24 Wakaba <wakaba@suika.fam.cx>
481    
482 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
483     work (HTML5 revision 1690).
484    
485     2008-05-24 Wakaba <wakaba@suika.fam.cx>
486    
487 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
488     end tags in head insertion modes (HTML5 revision 1686).
489     (parse_char_string): Bug fix for non-utf8 character string handlings.
490     (parse_char_stream): |ungetc| does not work well for this context.
491    
492 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
493    
494 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
495     |parse_byte_stream|.
496     (parse_byte_stream): New method.
497    
498     2008-05-18 Wakaba <wakaba@suika.fam.cx>
499    
500 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
501     by encoding layer error reporter.
502    
503 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
504    
505 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
506     rather than converting the whole byte string and then parsing.
507     Propagate errors in character encoding layer.
508     (get_next_token): Precise error reporting for |bare stago| error.
509    
510     2008-05-17 Wakaba <wakaba@suika.fam.cx>
511    
512 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
513     (parse_char_string): This method is now defined as an invocation
514     of the |parse_char_stream| method.
515    
516     2008-05-17 Wakaba <wakaba@suika.fam.cx>
517    
518 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
519     of the sniffing as info-level errors. Support for new
520     decoding framework in parser resestting.
521     (new): Various default error levels were not set.
522    
523     2008-05-17 Wakaba <wakaba@suika.fam.cx>
524    
525 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
526     algorithm, except for the actual sniffing, is implemented
527     with new framework with Message::Charset::Info.
528    
529 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
530    
531     * CacheManifest.pm (_parse): Drop fragment identifiers from
532     URIs in fallback section (HTML5 revision 1596).
533    
534 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
535    
536 wakaba 1.231 * Makefile (Entities.html): URI has changed.
537    
538     2008-05-10 Wakaba <wakaba@suika.fam.cx>
539    
540 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
541     1553).
542    
543 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
544    
545 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
546     no longer table elements, but table objects returned
547     by Whatpm::HTMLTable.
548    
549     * HTMLTable.pm (form_table): Return table element node
550     as |$table->{element}|.
551     (assign_header): Support for the |headers=""| attribute.
552    
553     2008-05-06 Wakaba <wakaba@suika.fam.cx>
554    
555 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
556     with no support for headers="".
557     (form_table): Include table width and height to the returned
558     table object for covenience. Indexing in column assignement was wrong.
559     Set whether a data cell is empty or not for convenience.
560    
561 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
562    
563 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
564    
565     2008-05-05 Wakaba <wakaba@suika.fam.cx>
566    
567 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
568     changed (HTML5 revision 1387).
569    
570     2008-05-05 Wakaba <wakaba@suika.fam.cx>
571    
572 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
573     to the end of the table (HTML5 revision 1380).
574    
575     2008-05-05 Wakaba <wakaba@suika.fam.cx>
576    
577 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
578     of 1-based (HTML5 revision 1376).
579    
580     2008-05-05 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
583     and "Content-Encoding" no longer prevent sniffing (HTML5
584     revision 1288).
585    
586     2008-05-05 Wakaba <wakaba@suika.fam.cx>
587    
588 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
589     1282).
590    
591 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
592    
593 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
594     Status of XML specs are added.
595    
596     2008-05-03 Wakaba <wakaba@suika.fam.cx>
597    
598 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
599     and |ref=""| attribute (referent element type checking).
600    
601 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
602    
603 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
604     1500) is implemented (except for IDNA part and URI-scheme-specific
605 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
606     documents.
607    
608 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
609    
610 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
611     character (HTML5 revision 1263).
612    
613     2008-04-13 Wakaba <wakaba@suika.fam.cx>
614    
615 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
616     definition for |AElig;| was missing).
617    
618     2008-04-13 Wakaba <wakaba@suika.fam.cx>
619    
620 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
621     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
622    
623     2008-04-13 Wakaba <wakaba@suika.fam.cx>
624    
625 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
626     insertion mode is added (HTML5 revisions 1412 and 1418).
627    
628     2008-04-13 Wakaba <wakaba@suika.fam.cx>
629    
630 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
631     elements (HTML5 revision 1410).
632    
633     2008-04-13 Wakaba <wakaba@suika.fam.cx>
634    
635 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
636     revision 1406).
637    
638     2008-04-13 Wakaba <wakaba@suika.fam.cx>
639    
640 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
641     elements (HTML5 revision 1404 and 1420).
642    
643 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
646     elements (HTML5 revision 1404). Unused !!!macro definitions
647     are removed.
648    
649     2008-04-12 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
652     are raised is changed (HTML5 revision 1404).
653    
654 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
655    
656 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
657    
658     2008-04-06 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
661     replaced by constants representing element category.
662    
663     2008-04-06 Wakaba <wakaba@suika.fam.cx>
664    
665 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
666     for error reporting. (This is a preparation for using constant
667     value for ->[1].)
668    
669 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
670    
671 wakaba 1.205 * RDFXML.pm: Typo fixed.
672    
673     2008-03-22 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
676     as same as |must_level|, i.e. level = |m|.
677     (check_element): Make list of URIs in the DOM.
678    
679 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
680    
681 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
682     Local (null-namespace) attribute support.
683    
684     2008-03-21 Wakaba <wakaba@suika.fam.cx>
685    
686 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
687    
688     2008-03-21 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
691     and URI attributes is implemented. Warn if unknown
692     value is used in rdf:parseType="" attribute.
693    
694     * URIChecker.pm (check_rdf_uri_reference): New function.
695    
696     2008-03-21 Wakaba <wakaba@suika.fam.cx>
697    
698 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
699     are now resolved.
700    
701     2008-03-21 Wakaba <wakaba@suika.fam.cx>
702    
703 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
704    
705     * RDFXML.pm: undef vs false bug fixed.
706     Reification implemented.
707    
708     2008-03-21 Wakaba <wakaba@suika.fam.cx>
709    
710 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
711     The |node| arguments are added for |ontriple| calls.
712     Too many "attribute not allowed" errors were raised.
713    
714     * ContentChecker.pm: Initial experimental support for rdf:RDF
715     element.
716    
717     2008-03-21 Wakaba <wakaba@suika.fam.cx>
718    
719 wakaba 1.197 * RDFXML.pm: New module.
720    
721 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
722    
723 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
724     code was old one yet.
725    
726     2008-03-20 Wakaba <wakaba@suika.fam.cx>
727    
728 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
729     errors. Line/column markings for DOCTYPE, comment, and
730     character tokens are reintroduced; otherwise,
731     error location for "not HTML5" error and errors
732     for implied elements are not attached.
733    
734     2008-03-20 Wakaba <wakaba@suika.fam.cx>
735    
736 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
737    
738     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
739     (value): Setter implemented.
740    
741     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
742    
743     2008-03-20 Wakaba <wakaba@suika.fam.cx>
744    
745 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
746    
747     2008-03-20 Wakaba <wakaba@suika.fam.cx>
748    
749 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
750     flag, which is no longer used.
751    
752 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
755    
756     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
757     and !!!insert-element-t): Set line/column information to
758     element nodes.
759    
760     2008-03-17 Wakaba <wakaba@suika.fam.cx>
761    
762 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
763     in "<?", "<>", or "</>" should be the error point.
764    
765 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
766    
767 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
768    
769     2008-03-16 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
772    
773     2008-03-16 Wakaba <wakaba@suika.fam.cx>
774    
775 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
776     reporting.
777    
778 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
779    
780 wakaba 1.186 * HTML.pm.src: Error type revised.
781    
782     2008-03-11 Wakaba <wakaba@suika.fam.cx>
783    
784 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
785    
786     2008-03-11 Wakaba <wakaba@suika.fam.cx>
787    
788 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
789    
790 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
791    
792     * mkhtmlparser.pl: Set "level" parameter to parse errors.
793    
794     * HTML.pm.src: Code refined.
795    
796 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
797    
798 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
799     revision 1348). Note that I really don't know this makes
800     any difference in the black-box behavior of the parser.
801    
802     2008-03-09 Wakaba <wakaba@suika.fam.cx>
803    
804 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
805     revision 1348).
806    
807     2008-03-09 Wakaba <wakaba@suika.fam.cx>
808    
809 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
810    
811     2008-03-09 Wakaba <wakaba@suika.fam.cx>
812    
813 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
814    
815     2008-03-09 Wakaba <wakaba@suika.fam.cx>
816    
817 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
818     mode and support for the "in select in table" insertion mode (HTML5
819     revision 1342).
820    
821     2008-03-09 Wakaba <wakaba@suika.fam.cx>
822    
823 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
824     in non-tainted <table>s (HTML5 revision 1336).
825    
826     2008-03-09 Wakaba <wakaba@suika.fam.cx>
827    
828 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
829     and <head> (HTML5 revision 1332).
830    
831     2008-03-09 Wakaba <wakaba@suika.fam.cx>
832    
833 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
834     white space (HTML5 revision 1331).
835    
836 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
837    
838 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
839     element (HTML5 revision 1330).
840    
841     2008-03-08 Wakaba <wakaba@suika.fam.cx>
842    
843 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
844     element (HTML5 revision 1328).
845    
846     2008-03-08 Wakaba <wakaba@suika.fam.cx>
847    
848 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
849     into foster parents (HTML5 revision 1326).
850    
851     2008-03-08 Wakaba <wakaba@suika.fam.cx>
852    
853 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
854     revision 1321).
855    
856     2008-03-08 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
859     the spec (HTML5 revision 1320). Note that we finally complete
860     all of HTML5 revision 1320 changes.
861    
862     2008-03-08 Wakaba <wakaba@suika.fam.cx>
863    
864 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
865     tags (HTML5 revision 1320).
866    
867     2008-03-08 Wakaba <wakaba@suika.fam.cx>
868    
869 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
870    
871     2008-03-08 Wakaba <wakaba@suika.fam.cx>
872    
873 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
874     could not be reached so that it is now removed (HTML5 revision
875     1320).
876    
877     2008-03-08 Wakaba <wakaba@suika.fam.cx>
878    
879 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
880     to align with new spec text (HTML5 revision 1320).
881    
882     2008-03-08 Wakaba <wakaba@suika.fam.cx>
883    
884 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
885     have had no harm in theory.
886    
887     2008-03-08 Wakaba <wakaba@suika.fam.cx>
888    
889 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
890     case is split from other end tags for the preparation
891     of implementing HTML revision 1320.
892    
893 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
894    
895 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
896     revision 1320).
897    
898     2008-03-07 Wakaba <wakaba@suika.fam.cx>
899    
900 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
901     and "div" (HTML5 revision 1318). Add comments to where
902     |form| pointer association codes should be inserted (HTML5 revision
903     1319).
904    
905 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
906    
907     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
908    
909 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
910    
911 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
912     other token type, there are), it is replaced by a |die| statement.
913    
914     2008-03-05 Wakaba <wakaba@suika.fam.cx>
915    
916 wakaba 1.159 * HTML.pm.src: Typo fixed.
917    
918 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
919    
920     * HTML.pm.src (_tree_construction_initial): Some limited quirks
921     doctypes were not uppercased for comparison.
922    
923 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
924    
925 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
926     are added.
927    
928     2008-03-03 Wakaba <wakaba@suika.fam.cx>
929    
930 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
931     are set. Cases that are unlikely reached are noted as so.
932    
933     2008-03-03 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
936    
937     * mkhtmlparser.pl: Support for |!!!cp| syntax.
938    
939     2008-03-03 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
942     for simplicity.
943    
944     2008-03-03 Wakaba <wakaba@suika.fam.cx>
945    
946 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
947     revision 1307).
948    
949     2008-03-03 Wakaba <wakaba@suika.fam.cx>
950    
951 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
952     is set are changed (HTML5 revision 1305).
953    
954 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
955    
956 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
957     1292). Entities are not parsed in comment-like part in RCDATA
958     elements (HTML5 revision 1294). Allow bare & at the end
959     of attribute value literals (HTML5 revision 1296). More
960     quirks mode doctypes (HTML5 revision 1302). Requires spaces
961     between attributes and ban attribute names or unquoted
962     attribute values containing single or double quotes (HTML5
963     revision 1303).
964    
965     2008-03-02 Wakaba <wakaba@suika.fam.cx>
966    
967 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
968     and related errors unless it is an HTML document (though the spec
969     is unclear on whether it is applied to XHTML document).
970    
971     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
972     are added.
973    
974     2008-03-02 Wakaba <wakaba@suika.fam.cx>
975    
976 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
977    
978 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
979    
980     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
981    
982     * HTML.pm.src: |charset| in |content| attribute is
983     case-insensitive (HTML5 revision 1270).
984    
985 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
986    
987     * ContentChecker.pm: New status constants are added.
988     ($ElementDefault): |status| added.
989     (check_element): Err for non-standard or deprecated elements.
990     (_attr_status_info): For non-standard or deprecated attributes.
991    
992 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
993    
994 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
995    
996     2008-02-24 Wakaba <wakaba@suika.fam.cx>
997    
998 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
999     status information is now dispatched.
1000    
1001     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1002    
1003 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1004     problem by adding some more arguments.
1005    
1006     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1009     container-for-the-purpose-of-content-model element (not transparent
1010     element) for |check_child_element| calling and significant
1011     text flag marking. This reintroduces |<del>|-and-significant
1012     problem again.
1013    
1014     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1015    
1016 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1017     elements ignored for the purpose of phase changes in
1018     content model checking.
1019    
1020 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1021    
1022 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1023     was not properly managed for transparent cases.
1024    
1025     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1026    
1027 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1028     and |audio| as semi-transparent elements.
1029    
1030     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1031    
1032 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1033     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1034     Support for |html:object| as a semi-transparent element.
1035    
1036     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1037    
1038 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1039     the tree is entirely revised to make it easier to track
1040     the state of ancestors/descendants. As a result of this
1041     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1042     support for content model checking for HTML elements |figure|,
1043     |object|, |video|, and |audio| and checking for XML elements (and
1044     some XMLNS checkings) are dropped for now. They will be
1045     reimplemented in due cource.
1046    
1047 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1048    
1049 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1050     token (HTML5 revision 1225).
1051    
1052     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1053    
1054 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1055    
1056     * HTML.pm.src: Save whether |meta| |content| attribute
1057     contains character references or not.
1058    
1059     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1060    
1061 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1062     imlement significant content checking for contents
1063     with |del| elements.
1064    
1065     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1066    
1067 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1068     elements are added.
1069     (_get_children): HTML |object| elements are now semi-transparent.
1070    
1071     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1072    
1073 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1074    
1075     * CacheManifest.pm: HTML5 revision 1211 implemented.
1076    
1077     * CacheManifest.pod: Updated.
1078    
1079 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1080    
1081     * ContentChecker.pm (check_document, check_element): Support
1082     for second argument ($onsubdoc).
1083     (_get_css_parser): Removed (now it is part of WDCC).
1084    
1085 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1086    
1087     * ContentChecker.pm (_get_css_parser): New.
1088    
1089 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1090    
1091 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1092     for checking had been used.
1093    
1094     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1095    
1096 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1097    
1098 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1099    
1100 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1101     specced in RFC 2046 and RFC 4288.
1102     (application/atom+xml): Definition added.
1103    
1104     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1105    
1106 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1107     warnings (rather than SHOULD-level errors).
1108    
1109     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1110    
1111 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1112    
1113     * HTML.pm.src: A flag for character references in attribute
1114     values are added. Set |manakai_has_reference| user data
1115     to |charset| attribute.
1116    
1117     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1118    
1119 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1120     attributes.
1121    
1122     * ContentChecker.pm (check_document): Warn if charset requirements
1123     cannot be tested.
1124    
1125 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1126    
1127     * HTML.pm.src (parse_byte_string): Detect charset
1128     by universalchardet if charset parameter is not specified.
1129    
1130     * Makefile (Charset-all, Charset-clean): New rules.
1131    
1132 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1133    
1134 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1135     of character encoding declaration and selection of encoding
1136     for HTML document.
1137    
1138     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1139    
1140 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1141     type in list context.
1142    
1143     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1144    
1145 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1146     ignoring (HTML5 revisions 1013 and 1016).
1147    
1148     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1149    
1150 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1151     white spaces, are implemented.
1152    
1153 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1154    
1155 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1156     attribute if possible.
1157    
1158     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1159    
1160 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1161     (parse_char_string): New alias for |parse_string|.
1162     (main phase): Invoking "change the encoding" algorithm if desired.
1163    
1164     * HTML.pod: Updated.
1165    
1166     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1167    
1168 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1169    
1170     * Makefile (HTML-all, HTML-clean): New.
1171    
1172     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1173    
1174     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1175    
1176 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1177    
1178     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1179    
1180 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1181    
1182 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1183    
1184     * CacheManifest.pod: New file.
1185    
1186     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.114 * CacheManifest.pm: New module.
1189    
1190     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1191    
1192 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1193     callback.
1194    
1195     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1196    
1197 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1198    
1199 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1200    
1201 wakaba 1.111 * Makefile (clean): New rule.
1202    
1203     * NanoDOM.pm (public_id, system_id): New attributes.
1204    
1205     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1206    
1207 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1208    
1209 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1210    
1211     * ContentChecker.pm (check_document): Support for
1212     new |is_xml_root| flag.
1213     (check_element): Support for new |pluses| state.
1214     (_add_pluses): New method.
1215     (_remove_minuses): Support for new |minus| item.
1216    
1217 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1218    
1219 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1220     root element.
1221    
1222     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1223    
1224 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1225    
1226 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1227    
1228 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1229    
1230     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1231    
1232 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1233    
1234     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1235    
1236 wakaba 1.103 * LangTag.pm: New module.
1237    
1238 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1239    
1240     * HTML.pm.src: Some error types were wrong.
1241    
1242 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.101 * CSS/: New directory.
1245    
1246     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1247    
1248 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1249     for |noscript| in |head|.
1250    
1251 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1252    
1253     * URI/: New directory.
1254    
1255 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1258     number.
1259    
1260     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1263     by |&|s.
1264    
1265     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1266    
1267 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1268    
1269     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1270    
1271 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1272    
1273     * HTML.pm.src: Insertion modes are now represented in number.
1274    
1275     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1276    
1277 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1278     999) is implemented.
1279    
1280 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1281    
1282     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1283     983) is implemented.
1284    
1285 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1286    
1287     * ContentChecker.pod: New documentation.
1288    
1289     * Makefile: A rule for |ContentChecker.html| is added.
1290    
1291     * ContentChecker.pm: A pod "LICENSE" section is added.
1292    
1293     * NanoDOM.pm ($VERSION): New variable.
1294    
1295 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1296    
1297 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1298     list of allowed HTML elements.
1299    
1300     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1301    
1302 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1303     HTML elements.
1304    
1305     * URIChecker.pm (check_iri): New.
1306     (check_iri_reference): Error type for IRI reference
1307     syntax error is changed.
1308    
1309 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1310    
1311 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1312     is added.
1313     (check_document): Load appropriate module before validation.
1314    
1315     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1316    
1317 wakaba 1.88 * ContentChecker/: New directory.
1318    
1319     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1322    
1323     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1324    
1325 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1326    
1327 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1328     even if no document element is found.
1329    
1330     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1331    
1332 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1333    
1334     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1337    
1338     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1339    
1340 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1341     is split into "after html body" and "after html frameset"
1342     insertion modes. Their codes are merged with "after body"
1343     and "after frameset" codes. |$previous_insertion_mode|
1344     has been removed. "after frameset" code is
1345     merged with "in frameset" code.
1346    
1347     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1350     merged with the "in head" insertion mode.
1351    
1352     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1353    
1354 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1355     reorganized.
1356    
1357     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1360     are merged.
1361    
1362     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1363    
1364 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1365     with "in table" insertion mode.
1366    
1367     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1368    
1369 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1370     modes are merged.
1371    
1372     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1373    
1374 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1375     insertion mode!
1376    
1377     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1378    
1379 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1380     now directly invoke the handler.
1381    
1382     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1383    
1384 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1385     is merged to the "in body" insertion mode code.
1386    
1387     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1388    
1389 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1390     insertion modes are merged.
1391    
1392     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1393    
1394 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1395    
1396     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1399     instead of string constant for |{content_model_flag}|.
1400    
1401     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1402    
1403 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1404     to the "duplicate attribute" error.
1405    
1406 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1407    
1408 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1409    
1410     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1411    
1412 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1413    
1414     * HTML.pm.src: A typo is fixed.
1415    
1416 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1419     from "in HTML:xml:lang" error. Character position
1420     is now the last part of the error type in the URI error
1421     description. Report "unsupported" status for language
1422     tags, media queries, script codes, and style sheets.
1423    
1424     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1425    
1426 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1427     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1428     in XML.
1429    
1430     * NanoDOM.pm (Attr.owner_document): New attribute.
1431    
1432     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1433    
1434 wakaba 1.66 * HTML.pm.src: The character immediately following
1435     a bare |hcro| was discarded. Fix handling of
1436     entity references in attribute values.
1437    
1438     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1439    
1440 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1441     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1442     factored out. Error types |in html:#DOCTYPE|
1443     and |after html:#DOCTYPE| are merged
1444     into |DOCTYPE in the middle|. |</frameset>|
1445     in fragment parsing mode changed the
1446     insertion mode.
1447    
1448     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1451     phase is now an insertion mode. Treatments for white
1452     space character tokens were incorrect for some
1453     insertion modes. An old |meta| case was not removed.
1454    
1455     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1456    
1457 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1458     implemented (but changing the encoding is not yet:-).
1459    
1460 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1461    
1462 wakaba 1.62 * Charset/: New directory.
1463    
1464     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1465    
1466 wakaba 1.61 * H2H.pm: New Perl module (created from
1467     manakai's H2H.dis).
1468    
1469     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1470    
1471 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1472     manakai's SimpleLS.dis).
1473    
1474 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1475    
1476     * HTML.pm.src: |<!---x-->| was not processed correctly.
1477    
1478 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1479    
1480 wakaba 1.58 * HTML.pm.src: Report correct error message
1481     for |<body></div></body>|.
1482    
1483     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1484    
1485 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1486    
1487     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1488    
1489     * ContentChecker.pm: Return |{term}| list.
1490    
1491 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1492    
1493 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1494     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1495     and <title> outside of head).
1496    
1497     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1498    
1499 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1500     and private types/subtypes.
1501    
1502     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1503     URIChecker.pm, HTMLTable.pm: Error messages are now
1504     consistent; they are all listed in
1505     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1506    
1507 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1508    
1509 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1510     is now erred. |<datalist>| is implemented.
1511     Attribute checker for |<command>| and |<menu>| are
1512     added. Support for |contextmenu| global attribute
1513     is added.
1514    
1515     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1516    
1517 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1518     of Step 3 has been changed.
1519    
1520     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1523    
1524 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1525    
1526 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1527     incorrect.
1528    
1529     * ContentChecker.pm: Support for the |footer| element.
1530     Check URI syntax for space-separated URI attributes.
1531     Support for the |tabindex| attribute. Support
1532     for |datetime| attribute.
1533    
1534     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1535    
1536 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1537     character references). HTML5 revision 1.146 (white space
1538     characters before root start tag). HTML5 revision
1539     1.148 (named character references in attribute values).
1540     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1541    
1542     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1543    
1544     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1545 wakaba 1.49 in <head>).
1546    
1547     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1548    
1549 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1550     in body).
1551    
1552     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1553    
1554     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1555    
1556 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1557     or RCDATA in fragment parsing mode). HTML5 revision
1558     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1559    
1560     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1561    
1562     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1563     for <pre> and <textarea>).
1564    
1565 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1566    
1567 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1568    
1569     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1570    
1571 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1572    
1573     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1574    
1575 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1576     HTML5 revision 917 (conforming bare &).
1577    
1578     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1579    
1580 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1581    
1582     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1583     HTML5 revision 915 (<nobr>).
1584    
1585     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1588    
1589     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1590     New attributes.
1591    
1592     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1593    
1594 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1595     in tags).
1596    
1597     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1598    
1599 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1600    
1601     * HTML.pm.src: |$entity_char| is removed and
1602     requires |Whatpm::_NamedEntityList| instead.
1603     HTML5 revision 898 (refc), except that lack of refc
1604     is parse error.
1605    
1606     * mkentitylist.pl: New script.
1607    
1608     * Makefile (all): |_NamedEntityList.pm| is added.
1609     (_NamedEntityList.pm, Entities.html): New rules.
1610    
1611     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1612    
1613 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1614     were ignored and U+000D immediately following another
1615     U+000D was not converted to U+000A.
1616    
1617     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1618    
1619 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1620     nodes before appended). Parser was not ready for NULL
1621     parse error and escape flag.
1622    
1623     * NanoDOM.pm (adopt_node): New.
1624    
1625     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1626    
1627 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1628     CDATA and RCDATA). Note that current implementation
1629     is simply repeating what the spec says and it is maybe not
1630     a best way to do it.
1631    
1632     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1633    
1634 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1635     the form element if a descendant element without implied end tag has
1636     still been open).
1637    
1638     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1639    
1640 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1641     space characters non-conforming).
1642    
1643     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1644    
1645 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1646     HTML5 revision 869 (C1 character references).
1647    
1648     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1649    
1650 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1651     a |textarea| is removed).
1652    
1653 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1654    
1655     * NanoDOM.pm (get_attribute_node_ns): New method.
1656    
1657     * ContentChecker.pm: |script| |async| and |defer|
1658     no longer require |src|. |async| MUST NOT be
1659     specified if |defer|. (HTML5 revision 858).
1660    
1661 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1662    
1663     * HTML.pm.src: |<form><form>| went to inifinite loop.
1664    
1665 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1666    
1667 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1668     as a document element) flag on.
1669     (new): Removed.
1670     (check_document): New method.
1671    
1672     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1673    
1674 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1675     were incorrect.
1676    
1677     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1678    
1679 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1680     error position in the |type| option of the error.
1681    
1682     * HTMLTable.pm (form_table): The |$onerror| parameter
1683     is now optional. Some bugs are fixed.
1684    
1685     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.27 * HTMLTable.pm: New module.
1688    
1689     * ContentChecker.pm (table): Invoke table model error checker.
1690    
1691     * NanoDOM.pm (first_child, get_attribute_ns): New.
1692    
1693 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1694    
1695 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1696     (link/@rel, a/@rel, area/@rel): Use new checker.
1697    
1698     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1699    
1700     * _LinkTypeList.pm: New file.
1701    
1702     * mklinktypelist.pl: New file.
1703    
1704     * .cvsignore: |RelExtensions.html| added.
1705    
1706     * NanoDOM.pm (child_nodes): Returns an empty array
1707     for non-child-containing node types.
1708     (text_content): New attribute.
1709    
1710     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1711    
1712 wakaba 1.25 * IMTChecker.pm: New module.
1713    
1714     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1715     to test parameter value validity.
1716    
1717     * HTML.pm.src ($style_start_tag): Attributes were
1718     discarded.
1719    
1720 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1721    
1722 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1723    
1724     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1725    
1726 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1727     and RFC 3987 are listed (not all testable items are checked yet).
1728    
1729     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1730    
1731 wakaba 1.22 * URIChecker.pm: New module.
1732    
1733 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1734    
1735 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1736     or associated to some placeholder.
1737     ($ElementDefault): Warn unknown attributes
1738     for unknown elements as "attribute not supported".
1739     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1740     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1741     (|a| attribute checker): Reimplemented.
1742    
1743     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1744    
1745 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1746     ($HTMLAttrChecker): Event handler content attributes
1747     are added.
1748     (link, embed): Required attribute is now checked.
1749     (embed): Unknown local attributes are no longer warned.
1750    
1751     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1752    
1753 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1754     ($HTMLIMTAttrChecker): New checker.
1755     (link@rel, link@href, link@type, style@type,
1756     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1757     object@data, object@type, source@src, source@type, area@alt,
1758     area@shape, area@coords, area@href,
1759     area@ping, area@rel, area@type, script@src,
1760     script@defer, script@async, script@type): Checkers added.
1761    
1762     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1763    
1764 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1765    
1766 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1767    
1768 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1769     attributes. Report an error if |Element.prefix|
1770     is |xmlns|.
1771    
1772     * NanoDOM.pm (prefix): New attribute.
1773    
1774     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1775    
1776 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1777     mode, action for |<iframe>| was missing.
1778    
1779     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1780    
1781 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1782     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1783    
1784     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1785    
1786 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1787     ($HTMLURIAttrChecker): Placeholder.
1788     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1789     New.
1790    
1791     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1792    
1793 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1794     for global attributes, |html|, |base|, |style|, and |meta|.
1795    
1796     * NanoDOM.pm (insert_before): Weaken reference
1797     to the parent node.
1798     (Attr::new): Set |owner_element| attribute.
1799     (namespace_uri, manakai_local_name): New attribute implementations.
1800     (owner_element): New attribute.
1801    
1802     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1803    
1804 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1805     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1806     $Element->{$HTML_NS}->{''}): New.
1807     (check_element): Invoke attrs_checker for each element.
1808    
1809 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1810    
1811 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1812    
1813     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1814    
1815 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1816     element type testings.
1817    
1818     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1819    
1820 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1821     error when an element type is put in the "minus" list
1822     and the element type is not allowed explicitly in the particular
1823     element content model.
1824     (html:a checker): New checker.
1825     (html:details, html:datagrid): New checkers.
1826     (html:legend): New checker.
1827    
1828     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1829    
1830 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1831    
1832     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1833    
1834 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1835     checker.
1836     (html:dd checker): New checker.
1837     (html:q, html:em, html:strong, html:small,
1838     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1839    
1840     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1841    
1842 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1843     from |$ElementDefault->{checker}|.
1844     ($ElementDefault->{checker}): Throw an error that
1845     the element type is not supported by the checker.
1846     ($HTMLMetadataElement): |html:base| was missing.
1847     ($HTMLEmptyChecker): Don't throw an error
1848     for inter-element whitespace nodes.
1849     (html:html checker): Errors were not
1850     thrown even if |html:head| and/or |html:body|
1851     children were missing.
1852     (html:head checker): An error was not
1853     thrown if <meta charset> appered after other
1854     elements.
1855    
1856 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1857    
1858     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1859     and |noscript| elements are implemented.
1860     (new): New method.
1861    
1862 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1863    
1864 wakaba 1.4 * ContentChecker.pm: New module.
1865    
1866     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1867    
1868 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1869     document_element, manakai_local_name,
1870     manakai_element_type_match): New method.
1871    
1872     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1873    
1874     * HTML.pm.src: Replace decimal and hexadecimal numeric
1875     entities in C1 range using Windows-1252 mapping. Bare LF
1876     did not count as new line for error reporting.
1877    
1878 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1879    
1880     * NanoDOM.pm (DOMImplementation): New class.
1881     (append_child): Weaken the |parent_node| reference.
1882     (create_element_ns, Element new): Set the |owner_document|
1883     reference.
1884     (implementation): New attribute.
1885     (owner_document, local_name, namespace_uri): New attributes.
1886    
1887     * HTML.pm.src (parse_string): Line and column numbers
1888     are now provided to error handler.
1889     (!!!parse-error): Short descriptions are added.
1890     (_construct_tree): Split into three methods; support
1891     for innerHTML mode.
1892     (set_inner_html): New method.
1893    
1894 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1895    
1896     * NanoDOM.html: Documentation is added.
1897    
1898     * HTML.pod, ContentType.html: Documentation is revised.
1899    
1900     * .cvsignore: Pod2html temporary files are added.
1901    
1902     * Makefile: Make |NanoDOM.html|.
1903    
1904     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1905    
1906     * HTML.pm.src (parse_string): New method.
1907     (get_inner_html): Renamed from |inner_html|.
1908    
1909     * Makefile: A rule for |HTML.html| is added.
1910    
1911     * HTML.pod: New documentation.
1912    
1913     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1914    
1915     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1916     (clone_node): Attribute nodes were not completely copied.
1917    
1918     * HTML.pm.src: Many bugs are fixed.
1919    
1920     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1921    
1922     * HTML.pm.src: Some typos are fixed.
1923    
1924     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1925    
1926     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1927     are removed.
1928    
1929     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1930    
1931     * HTML-consume-entity.src: Removed.
1932    
1933     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1934    
1935     * HTML.pm.src: Tokenizer's handling on hexadecimal
1936     numeric entities are rewritten.
1937    
1938     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1939    
1940     * HTML.pm.src: Some tokenizer bugs are fixed.
1941    
1942     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1943    
1944     * HTML.pm.src: The tree construction stage is implemented.
1945    
1946     * mkhtmlparser.pl: New macros are added.
1947    
1948     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1949    
1950     * ContentType.pm: A note on bug in the specification
1951     is removed since it's been now fixed.
1952    
1953     * .cvsignore: New file.
1954    
1955     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1956    
1957     * HTML.pm.src, HTML-consume-entity.src: New files.
1958    
1959     * Makefile (HTML.pm): New rule.
1960    
1961     * mkhtmlparser.pl: New script.
1962    
1963     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1964    
1965     * Makefile: New file.
1966    
1967     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1968    
1969     * ContentType.pm: An error in pod is fixed.
1970    
1971     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1972    
1973     * ContentType.pm: Documentation is added.
1974    
1975     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1976    
1977     * ContentType.pm: New Perl module.
1978    
1979     * ChangeLog: New file.
1980    
1981    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24