/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.301 - (hide annotations) (download)
Sun Sep 14 03:07:57 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.300: +6 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 03:06:56 -0000
	* HTML.pm.src: Change |{getc_until}| to |{read_until}|
	and |manakai_getc_until| to |manakai_read_until| to
	reduce the number of string copies.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 03:07:37 -0000
	* DecodeHandle.pm, UnicodeChecker.pm: Change |manakai_getc_until|
	to |manakai_read_until| to reduce the number of string copies.

2008-09-14  Wakaba  <wakaba@suika.fam.cx>

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
4     and |manakai_getc_until| to |manakai_read_until| to
5     reduce the number of string copies.
6    
7     2008-09-14 Wakaba <wakaba@suika.fam.cx>
8    
9 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
10     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
11     standard feature to |open| a string as a filehandle,
12     since Perl's string filehandle seems not supporting |ungetc|
13     method correctly.
14     (parse_char_stream): Define |{getc_until}| method.
15     (DATA_STATE): Experimental support for |getc_until| feature.
16    
17 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
18    
19 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
20    
21     2008-09-13 Wakaba <wakaba@suika.fam.cx>
22    
23 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
24     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
25     and replaced by |{prev_state}|.
26    
27     * mkhtmlparser.pl: Remove |{char}| feature.
28     Remove |!!!back-next-input-character;| macro.
29    
30     2008-09-13 Wakaba <wakaba@suika.fam.cx>
31    
32 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
33     entity related tokenizer states in favor of new states
34     implementing the consume character reference algorithm.
35    
36     2008-09-13 Wakaba <wakaba@suika.fam.cx>
37    
38     * HTML.pm.src: "Consume a character reference" algorithm is
39 wakaba 1.296 now implemented as a tokenizer's state, rather than
40     a method, with minimum changes (more changes will
41     be made, in due course). "Bogus comment state"'s inner
42     loop gets removed.
43    
44     2008-09-13 Wakaba <wakaba@suika.fam.cx>
45    
46 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
47 wakaba 1.295 into their own tokenizer states.
48    
49     2008-09-13 Wakaba <wakaba@suika.fam.cx>
50    
51 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
52     is split into three states.
53    
54     2008-09-13 Wakaba <wakaba@suika.fam.cx>
55    
56 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
57     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
58     no longer does the tokenizer have to push back next input
59     characters in those states.
60    
61     2008-09-13 Wakaba <wakaba@suika.fam.cx>
62    
63 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
64     into four states so that no longer does the tokenizer have to push
65     back next input characters in that state.
66    
67 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
68    
69     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
70     which can be used to insert some wrapper between the character
71     stream handle and the tokenizer. (It is currently not supported
72     for |set_inner_html| for |Element|s).
73    
74 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
75    
76 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
77    
78     2008-09-10 Wakaba <wakaba@suika.fam.cx>
79    
80 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
81    
82     * HTML.pm.src: Don't specify |text| argument for the
83     |chardecode:fallback| error, since it is not the encoding
84     being used alternatively.
85    
86 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
87    
88     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
89    
90 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
91    
92 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
93    
94     2008-08-31 Wakaba <wakaba@suika.fam.cx>
95    
96 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
97     to after after frameset insertion mode processing (HTML5
98     revision 1909). Note that the implementation was wrong
99     per the old spec before the r1909 changes.
100    
101 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
102    
103 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
104     spec (HTML5 revision 2093).
105     ($process_row): Algorithm step numbers synced with the
106     spec (HTML5 revision 2092).
107    
108     2008-08-30 Wakaba <wakaba@suika.fam.cx>
109    
110 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
111     revision 2094).
112    
113     2008-08-30 Wakaba <wakaba@suika.fam.cx>
114    
115     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
116    
117     2008-08-30 Wakaba <wakaba@suika.fam.cx>
118    
119 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
120     name (after another attribute) now raise parse error (HTML5
121     revision 2123). Empty unquoted attribute values are no
122     longer allowed (HTML5 revision 2122).
123    
124     2008-08-30 Wakaba <wakaba@suika.fam.cx>
125    
126 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
127     revision 2130).
128    
129     2008-08-30 Wakaba <wakaba@suika.fam.cx>
130    
131 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
132     as |lang| attribute value for HTML elements (HTML5 revision 2062
133     and so on).
134    
135     2008-08-30 Wakaba <wakaba@suika.fam.cx>
136    
137 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
138     was missing.
139    
140     * URIChecker.pm: The end of the URL should be marked as the
141     error location for an empty path error. The position
142     between the userinfo and the port components should be
143     marked as the error location for an empty host error.
144    
145     2008-08-30 Wakaba <wakaba@suika.fam.cx>
146    
147 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
148     value the error occurs for errors. Report unknown
149     address format error in warning level, since address
150     formats are rarely added. Path segments starting with "/.."
151     were misinterpreted as a dot-segment.
152    
153     2008-08-30 Wakaba <wakaba@suika.fam.cx>
154    
155 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
156     |Message::DOM::DOMImplementation|.
157    
158 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
159    
160     * IMTChecker.pm: Updated for the new error reporting architecture.
161    
162     * ContentChecker.pm: Error levels for IMTs are added.
163    
164 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
165    
166     * H2H.pm (_shift_token): Support for unquoted HTML attribute
167     values.
168    
169 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
170    
171 wakaba 1.275 * CacheManifest.pm: Support for new style of error
172     reports.
173    
174     * HTML.pm.src: Set line=1, column=1 to the document node.
175    
176     2008-08-16 Wakaba <wakaba@suika.fam.cx>
177    
178 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
179     and URL checkers. Support for more error levels for bogus
180     langauge tag and URL "standards".
181    
182     * LangTag.pm, URIChecker.pm: Support for new style error
183     level reporting.
184    
185 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
186    
187 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
188    
189     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
190     specifying. Error types are revised.
191    
192     2008-08-15 Wakaba <wakaba@suika.fam.cx>
193    
194 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
195     renewed.
196    
197     2008-08-15 Wakaba <wakaba@suika.fam.cx>
198    
199 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
200     are revised. Use new style for "level" specification.
201    
202     * mkhtmlparser.pl: Use new style for "level" specification.
203    
204 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
207     reporting process for broken ignored valuetype definition.
208     (Valuetype idl_text): Support for special "DOMString" name.
209    
210     2008-08-03 Wakaba <wakaba@suika.fam.cx>
211    
212 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
213     terminal of the ScopedName is "DOMString", such that whether
214     the last part of the scoped name is "DOMString" or "_DOMString"
215     later. It is necessary to determine whether a |typedef|
216     definition should be ignored or not.
217     (parse_char_string): Unescape the identifier of
218     exception members.
219     ($resolve): Return undef for builtin types and sequence<T>
220     types (we might not have to do this, however...).
221     (check): Support checking for Exceptions, Valuetypes,
222     and Typedefs.
223     ($serialize_type): Support for "DOMString::::" syntax.
224     (Typedef idl_text): Output Type as "DOMString" if it
225     is really "DOMString" (i.e. its internal representation
226     is "::DOMString::").
227    
228     2008-08-03 Wakaba <wakaba@suika.fam.cx>
229    
230 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
231     for constant types in the |check| method.
232     (check): Support for checking of attributes, operations, and
233     arguments.
234     (Attribute/Operation idl_text): Exception names in getraises,
235     setraises, and raises clauses is serizlied by |$serialize_type|
236     code.
237    
238 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
239    
240 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
241     hexadecimal numbers (the original pattern, taken from the spec,
242     was not work for hexadecimal numbers, because the "0" prefix
243     matches to the [0-7]* part (as an empty string) and therefore
244     it does not match with remaining "x..." part of a "0x..." integer
245     literal.
246     ($get_type): It now returns a string, not an array reference,
247     for regular types and |sequence| types (i.e. it in any case
248     returns a string).
249     ($get_next_token): The second item in the array that represents
250     a integer or float token is now a Perl number value, not the
251     original string representation of the number.
252     (check): Support for const value consistency checking.
253     No extended attribute is defined for constants.
254     (Node subclasses): Use simple strings rather than array references
255     for default data type values.
256     ($serialize_type): Type values are now simple strings.
257     (value): If the new attribute value is a false value, then
258     a FALSE value is set to the attribute.
259    
260     2008-08-02 Wakaba <wakaba@suika.fam.cx>
261    
262 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
263     in its stringified format ("scoped name" as defined in the
264     spec). Note that future version of this module should not use
265     array references for type values and the |type_text| attribute
266     should be made obsolete.
267     (parse_char_string): Unescape attribute names.
268     (check): Support for checking of whether inherited interfaces
269     are actually defined or not. Support for checking of whether
270     interface member identifiers are duplicated or not.
271     ($serialize_type): Scoped names are returned as is. A future
272     version of this code should escape identifiers other than "DOMString",
273     otherwise the idl_text would be non-conforming.
274    
275     2008-08-02 Wakaba <wakaba@suika.fam.cx>
276    
277 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
278 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
279     for Definition's were ignored.
280     (append_child): Set |parent_node| attribute.
281     (parent_node): New attribute.
282     (check): Support interface/exception members. Support
283     extended attributes. Support definition identifier uniqueness
284     constraint.
285     (qualified_name): New attribute.
286     (Interface/Exception idl_text): Extended attributes were
287     not prepended to the returned text.
288    
289     2008-08-02 Wakaba <wakaba@suika.fam.cx>
290    
291     * WebIDL.pm (parse_char_string): Set line/column numbers
292 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
293     would make things wrong. Support for interface forward
294     declarations was missing. Broken interface declarations
295     with no block were not ignored entirely.
296     (Whatpm::WebIDL::Node): New abstract class. This class
297     makes things easier.
298     (child_nodes): New attribute. Unlike DOM's attribute with
299     same name, this attribute returns a dead list of nodes for
300     simplicity.
301     (get_user_data, set_user_data): New methods.
302     (Module idl_text): A SPACE character should be inserted
303     before the |{| character.
304     (Interface idl_text): Support for interface forward declarations.
305     (is_forward_declaration): New attribute.
306    
307 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
308    
309 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
310    
311     2008-07-19 Wakaba <wakaba@suika.fam.cx>
312    
313 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
314     it now can handle broken extended attributes and as such.
315    
316     2008-07-19 Wakaba <wakaba@suika.fam.cx>
317    
318 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
319     Support for extended attributes with arguments.
320    
321     2008-07-19 Wakaba <wakaba@suika.fam.cx>
322    
323 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
324     (Interface->idl_text): Tentative support for inheritances.
325    
326     2008-07-19 Wakaba <wakaba@suika.fam.cx>
327    
328 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
329     Support for raises, setraises, and getraises syntaxes.
330    
331 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
332    
333     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
334     proper support for types, extended attributes, and exceptions yet).
335     WebIDL parser, version 1 (no support for exceptions yet,
336     no proper support for extended attributes yet).
337    
338 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
339    
340     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
341    
342 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
343    
344 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
345    
346     2008-06-29 Wakaba <wakaba@suika.fam.cx>
347    
348 wakaba 1.255 * WebIDL.pm: New module.
349    
350 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
351    
352     * Makefile (Entities.html): URI changed.
353    
354 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
355    
356     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
357    
358 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
359    
360 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
361    
362     2008-06-01 Wakaba <wakaba@suika.fam.cx>
363    
364 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
365     spec (commented out). (HTML5 revision 1711).
366    
367 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
368    
369 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
370    
371     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
372     as UTF-16 (HTML5 revision 1701).
373    
374     2008-05-25 Wakaba <wakaba@suika.fam.cx>
375    
376 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
377     1692).
378    
379     2008-05-25 Wakaba <wakaba@suika.fam.cx>
380    
381     * HTML.pm.src: The secondary insertion mode used when switching
382     to foreign content is the "in body" insertion mode (HTML5 revision
383     1696).
384    
385     2008-05-25 Wakaba <wakaba@suika.fam.cx>
386    
387 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
388     revision 1697).
389    
390 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
393     insertion mode (HTML5 revision 1693). Update SVG camelCase
394     attribute list (HTML5 revision 1700). <textarea> closes
395     </select> (HTML5 revision 1699). More start tags close in
396     foreign content insertion mode (HTML5 revision 1698).
397    
398     2008-05-24 Wakaba <wakaba@suika.fam.cx>
399    
400 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
401    
402     2008-05-24 Wakaba <wakaba@suika.fam.cx>
403    
404 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
405     revision 1674).
406    
407     2008-05-24 Wakaba <wakaba@suika.fam.cx>
408    
409 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
410     revision 1676).
411    
412     2008-05-24 Wakaba <wakaba@suika.fam.cx>
413    
414 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
415     quriks mode detection (HTML5 revision 1679).
416    
417     2008-05-24 Wakaba <wakaba@suika.fam.cx>
418    
419 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
420     revision 1685).
421    
422     2008-05-24 Wakaba <wakaba@suika.fam.cx>
423    
424 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
425     revision 1684).
426    
427     2008-05-24 Wakaba <wakaba@suika.fam.cx>
428    
429 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
430     work (HTML5 revision 1690).
431    
432     2008-05-24 Wakaba <wakaba@suika.fam.cx>
433    
434 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
435     end tags in head insertion modes (HTML5 revision 1686).
436     (parse_char_string): Bug fix for non-utf8 character string handlings.
437     (parse_char_stream): |ungetc| does not work well for this context.
438    
439 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
440    
441 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
442     |parse_byte_stream|.
443     (parse_byte_stream): New method.
444    
445     2008-05-18 Wakaba <wakaba@suika.fam.cx>
446    
447 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
448     by encoding layer error reporter.
449    
450 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
451    
452 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
453     rather than converting the whole byte string and then parsing.
454     Propagate errors in character encoding layer.
455     (get_next_token): Precise error reporting for |bare stago| error.
456    
457     2008-05-17 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
460     (parse_char_string): This method is now defined as an invocation
461     of the |parse_char_stream| method.
462    
463     2008-05-17 Wakaba <wakaba@suika.fam.cx>
464    
465 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
466     of the sniffing as info-level errors. Support for new
467     decoding framework in parser resestting.
468     (new): Various default error levels were not set.
469    
470     2008-05-17 Wakaba <wakaba@suika.fam.cx>
471    
472 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
473     algorithm, except for the actual sniffing, is implemented
474     with new framework with Message::Charset::Info.
475    
476 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
477    
478     * CacheManifest.pm (_parse): Drop fragment identifiers from
479     URIs in fallback section (HTML5 revision 1596).
480    
481 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
482    
483 wakaba 1.231 * Makefile (Entities.html): URI has changed.
484    
485     2008-05-10 Wakaba <wakaba@suika.fam.cx>
486    
487 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
488     1553).
489    
490 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
491    
492 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
493     no longer table elements, but table objects returned
494     by Whatpm::HTMLTable.
495    
496     * HTMLTable.pm (form_table): Return table element node
497     as |$table->{element}|.
498     (assign_header): Support for the |headers=""| attribute.
499    
500     2008-05-06 Wakaba <wakaba@suika.fam.cx>
501    
502 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
503     with no support for headers="".
504     (form_table): Include table width and height to the returned
505     table object for covenience. Indexing in column assignement was wrong.
506     Set whether a data cell is empty or not for convenience.
507    
508 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
509    
510 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
511    
512     2008-05-05 Wakaba <wakaba@suika.fam.cx>
513    
514 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
515     changed (HTML5 revision 1387).
516    
517     2008-05-05 Wakaba <wakaba@suika.fam.cx>
518    
519 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
520     to the end of the table (HTML5 revision 1380).
521    
522     2008-05-05 Wakaba <wakaba@suika.fam.cx>
523    
524 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
525     of 1-based (HTML5 revision 1376).
526    
527     2008-05-05 Wakaba <wakaba@suika.fam.cx>
528    
529 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
530     and "Content-Encoding" no longer prevent sniffing (HTML5
531     revision 1288).
532    
533     2008-05-05 Wakaba <wakaba@suika.fam.cx>
534    
535 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
536     1282).
537    
538 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
539    
540 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
541     Status of XML specs are added.
542    
543     2008-05-03 Wakaba <wakaba@suika.fam.cx>
544    
545 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
546     and |ref=""| attribute (referent element type checking).
547    
548 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
549    
550 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
551     1500) is implemented (except for IDNA part and URI-scheme-specific
552 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
553     documents.
554    
555 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
556    
557 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
558     character (HTML5 revision 1263).
559    
560     2008-04-13 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
563     definition for |AElig;| was missing).
564    
565     2008-04-13 Wakaba <wakaba@suika.fam.cx>
566    
567 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
568     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
569    
570     2008-04-13 Wakaba <wakaba@suika.fam.cx>
571    
572 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
573     insertion mode is added (HTML5 revisions 1412 and 1418).
574    
575     2008-04-13 Wakaba <wakaba@suika.fam.cx>
576    
577 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
578     elements (HTML5 revision 1410).
579    
580     2008-04-13 Wakaba <wakaba@suika.fam.cx>
581    
582 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
583     revision 1406).
584    
585     2008-04-13 Wakaba <wakaba@suika.fam.cx>
586    
587 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
588     elements (HTML5 revision 1404 and 1420).
589    
590 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
591    
592 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
593     elements (HTML5 revision 1404). Unused !!!macro definitions
594     are removed.
595    
596     2008-04-12 Wakaba <wakaba@suika.fam.cx>
597    
598 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
599     are raised is changed (HTML5 revision 1404).
600    
601 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
602    
603 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
604    
605     2008-04-06 Wakaba <wakaba@suika.fam.cx>
606    
607 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
608     replaced by constants representing element category.
609    
610     2008-04-06 Wakaba <wakaba@suika.fam.cx>
611    
612 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
613     for error reporting. (This is a preparation for using constant
614     value for ->[1].)
615    
616 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
617    
618 wakaba 1.205 * RDFXML.pm: Typo fixed.
619    
620     2008-03-22 Wakaba <wakaba@suika.fam.cx>
621    
622 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
623     as same as |must_level|, i.e. level = |m|.
624     (check_element): Make list of URIs in the DOM.
625    
626 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
627    
628 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
629     Local (null-namespace) attribute support.
630    
631     2008-03-21 Wakaba <wakaba@suika.fam.cx>
632    
633 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
634    
635     2008-03-21 Wakaba <wakaba@suika.fam.cx>
636    
637 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
638     and URI attributes is implemented. Warn if unknown
639     value is used in rdf:parseType="" attribute.
640    
641     * URIChecker.pm (check_rdf_uri_reference): New function.
642    
643     2008-03-21 Wakaba <wakaba@suika.fam.cx>
644    
645 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
646     are now resolved.
647    
648     2008-03-21 Wakaba <wakaba@suika.fam.cx>
649    
650 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
651    
652     * RDFXML.pm: undef vs false bug fixed.
653     Reification implemented.
654    
655     2008-03-21 Wakaba <wakaba@suika.fam.cx>
656    
657 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
658     The |node| arguments are added for |ontriple| calls.
659     Too many "attribute not allowed" errors were raised.
660    
661     * ContentChecker.pm: Initial experimental support for rdf:RDF
662     element.
663    
664     2008-03-21 Wakaba <wakaba@suika.fam.cx>
665    
666 wakaba 1.197 * RDFXML.pm: New module.
667    
668 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
669    
670 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
671     code was old one yet.
672    
673     2008-03-20 Wakaba <wakaba@suika.fam.cx>
674    
675 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
676     errors. Line/column markings for DOCTYPE, comment, and
677     character tokens are reintroduced; otherwise,
678     error location for "not HTML5" error and errors
679     for implied elements are not attached.
680    
681     2008-03-20 Wakaba <wakaba@suika.fam.cx>
682    
683 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
684    
685     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
686     (value): Setter implemented.
687    
688     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
689    
690     2008-03-20 Wakaba <wakaba@suika.fam.cx>
691    
692 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
693    
694     2008-03-20 Wakaba <wakaba@suika.fam.cx>
695    
696 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
697     flag, which is no longer used.
698    
699 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
702    
703     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
704     and !!!insert-element-t): Set line/column information to
705     element nodes.
706    
707     2008-03-17 Wakaba <wakaba@suika.fam.cx>
708    
709 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
710     in "<?", "<>", or "</>" should be the error point.
711    
712 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
713    
714 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
715    
716     2008-03-16 Wakaba <wakaba@suika.fam.cx>
717    
718 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
719    
720     2008-03-16 Wakaba <wakaba@suika.fam.cx>
721    
722 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
723     reporting.
724    
725 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
726    
727 wakaba 1.186 * HTML.pm.src: Error type revised.
728    
729     2008-03-11 Wakaba <wakaba@suika.fam.cx>
730    
731 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
732    
733     2008-03-11 Wakaba <wakaba@suika.fam.cx>
734    
735 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
736    
737 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
738    
739     * mkhtmlparser.pl: Set "level" parameter to parse errors.
740    
741     * HTML.pm.src: Code refined.
742    
743 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
744    
745 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
746     revision 1348). Note that I really don't know this makes
747     any difference in the black-box behavior of the parser.
748    
749     2008-03-09 Wakaba <wakaba@suika.fam.cx>
750    
751 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
752     revision 1348).
753    
754     2008-03-09 Wakaba <wakaba@suika.fam.cx>
755    
756 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
757    
758     2008-03-09 Wakaba <wakaba@suika.fam.cx>
759    
760 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
761    
762     2008-03-09 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
765     mode and support for the "in select in table" insertion mode (HTML5
766     revision 1342).
767    
768     2008-03-09 Wakaba <wakaba@suika.fam.cx>
769    
770 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
771     in non-tainted <table>s (HTML5 revision 1336).
772    
773     2008-03-09 Wakaba <wakaba@suika.fam.cx>
774    
775 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
776     and <head> (HTML5 revision 1332).
777    
778     2008-03-09 Wakaba <wakaba@suika.fam.cx>
779    
780 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
781     white space (HTML5 revision 1331).
782    
783 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
784    
785 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
786     element (HTML5 revision 1330).
787    
788     2008-03-08 Wakaba <wakaba@suika.fam.cx>
789    
790 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
791     element (HTML5 revision 1328).
792    
793     2008-03-08 Wakaba <wakaba@suika.fam.cx>
794    
795 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
796     into foster parents (HTML5 revision 1326).
797    
798     2008-03-08 Wakaba <wakaba@suika.fam.cx>
799    
800 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
801     revision 1321).
802    
803     2008-03-08 Wakaba <wakaba@suika.fam.cx>
804    
805 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
806     the spec (HTML5 revision 1320). Note that we finally complete
807     all of HTML5 revision 1320 changes.
808    
809     2008-03-08 Wakaba <wakaba@suika.fam.cx>
810    
811 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
812     tags (HTML5 revision 1320).
813    
814     2008-03-08 Wakaba <wakaba@suika.fam.cx>
815    
816 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
817    
818     2008-03-08 Wakaba <wakaba@suika.fam.cx>
819    
820 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
821     could not be reached so that it is now removed (HTML5 revision
822     1320).
823    
824     2008-03-08 Wakaba <wakaba@suika.fam.cx>
825    
826 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
827     to align with new spec text (HTML5 revision 1320).
828    
829     2008-03-08 Wakaba <wakaba@suika.fam.cx>
830    
831 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
832     have had no harm in theory.
833    
834     2008-03-08 Wakaba <wakaba@suika.fam.cx>
835    
836 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
837     case is split from other end tags for the preparation
838     of implementing HTML revision 1320.
839    
840 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
841    
842 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
843     revision 1320).
844    
845     2008-03-07 Wakaba <wakaba@suika.fam.cx>
846    
847 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
848     and "div" (HTML5 revision 1318). Add comments to where
849     |form| pointer association codes should be inserted (HTML5 revision
850     1319).
851    
852 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
853    
854     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
855    
856 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
857    
858 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
859     other token type, there are), it is replaced by a |die| statement.
860    
861     2008-03-05 Wakaba <wakaba@suika.fam.cx>
862    
863 wakaba 1.159 * HTML.pm.src: Typo fixed.
864    
865 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
866    
867     * HTML.pm.src (_tree_construction_initial): Some limited quirks
868     doctypes were not uppercased for comparison.
869    
870 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
871    
872 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
873     are added.
874    
875     2008-03-03 Wakaba <wakaba@suika.fam.cx>
876    
877 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
878     are set. Cases that are unlikely reached are noted as so.
879    
880     2008-03-03 Wakaba <wakaba@suika.fam.cx>
881    
882 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
883    
884     * mkhtmlparser.pl: Support for |!!!cp| syntax.
885    
886     2008-03-03 Wakaba <wakaba@suika.fam.cx>
887    
888 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
889     for simplicity.
890    
891     2008-03-03 Wakaba <wakaba@suika.fam.cx>
892    
893 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
894     revision 1307).
895    
896     2008-03-03 Wakaba <wakaba@suika.fam.cx>
897    
898 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
899     is set are changed (HTML5 revision 1305).
900    
901 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
902    
903 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
904     1292). Entities are not parsed in comment-like part in RCDATA
905     elements (HTML5 revision 1294). Allow bare & at the end
906     of attribute value literals (HTML5 revision 1296). More
907     quirks mode doctypes (HTML5 revision 1302). Requires spaces
908     between attributes and ban attribute names or unquoted
909     attribute values containing single or double quotes (HTML5
910     revision 1303).
911    
912     2008-03-02 Wakaba <wakaba@suika.fam.cx>
913    
914 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
915     and related errors unless it is an HTML document (though the spec
916     is unclear on whether it is applied to XHTML document).
917    
918     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
919     are added.
920    
921     2008-03-02 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
924    
925 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
926    
927     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
928    
929     * HTML.pm.src: |charset| in |content| attribute is
930     case-insensitive (HTML5 revision 1270).
931    
932 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
933    
934     * ContentChecker.pm: New status constants are added.
935     ($ElementDefault): |status| added.
936     (check_element): Err for non-standard or deprecated elements.
937     (_attr_status_info): For non-standard or deprecated attributes.
938    
939 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
942    
943     2008-02-24 Wakaba <wakaba@suika.fam.cx>
944    
945 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
946     status information is now dispatched.
947    
948     2008-02-24 Wakaba <wakaba@suika.fam.cx>
949    
950 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
951     problem by adding some more arguments.
952    
953     2008-02-24 Wakaba <wakaba@suika.fam.cx>
954    
955 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
956     container-for-the-purpose-of-content-model element (not transparent
957     element) for |check_child_element| calling and significant
958     text flag marking. This reintroduces |<del>|-and-significant
959     problem again.
960    
961     2008-02-24 Wakaba <wakaba@suika.fam.cx>
962    
963 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
964     elements ignored for the purpose of phase changes in
965     content model checking.
966    
967 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
968    
969 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
970     was not properly managed for transparent cases.
971    
972     2008-02-23 Wakaba <wakaba@suika.fam.cx>
973    
974 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
975     and |audio| as semi-transparent elements.
976    
977     2008-02-23 Wakaba <wakaba@suika.fam.cx>
978    
979 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
980     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
981     Support for |html:object| as a semi-transparent element.
982    
983     2008-02-23 Wakaba <wakaba@suika.fam.cx>
984    
985 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
986     the tree is entirely revised to make it easier to track
987     the state of ancestors/descendants. As a result of this
988     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
989     support for content model checking for HTML elements |figure|,
990     |object|, |video|, and |audio| and checking for XML elements (and
991     some XMLNS checkings) are dropped for now. They will be
992     reimplemented in due cource.
993    
994 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
995    
996 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
997     token (HTML5 revision 1225).
998    
999     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1000    
1001 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1002    
1003     * HTML.pm.src: Save whether |meta| |content| attribute
1004     contains character references or not.
1005    
1006     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1009     imlement significant content checking for contents
1010     with |del| elements.
1011    
1012     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1015     elements are added.
1016     (_get_children): HTML |object| elements are now semi-transparent.
1017    
1018     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1019    
1020 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1021    
1022     * CacheManifest.pm: HTML5 revision 1211 implemented.
1023    
1024     * CacheManifest.pod: Updated.
1025    
1026 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1027    
1028     * ContentChecker.pm (check_document, check_element): Support
1029     for second argument ($onsubdoc).
1030     (_get_css_parser): Removed (now it is part of WDCC).
1031    
1032 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1033    
1034     * ContentChecker.pm (_get_css_parser): New.
1035    
1036 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1037    
1038 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1039     for checking had been used.
1040    
1041     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1042    
1043 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1044    
1045 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1048     specced in RFC 2046 and RFC 4288.
1049     (application/atom+xml): Definition added.
1050    
1051     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1052    
1053 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1054     warnings (rather than SHOULD-level errors).
1055    
1056     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1057    
1058 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1059    
1060     * HTML.pm.src: A flag for character references in attribute
1061     values are added. Set |manakai_has_reference| user data
1062     to |charset| attribute.
1063    
1064     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1065    
1066 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1067     attributes.
1068    
1069     * ContentChecker.pm (check_document): Warn if charset requirements
1070     cannot be tested.
1071    
1072 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1073    
1074     * HTML.pm.src (parse_byte_string): Detect charset
1075     by universalchardet if charset parameter is not specified.
1076    
1077     * Makefile (Charset-all, Charset-clean): New rules.
1078    
1079 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1080    
1081 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1082     of character encoding declaration and selection of encoding
1083     for HTML document.
1084    
1085     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1086    
1087 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1088     type in list context.
1089    
1090     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1091    
1092 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1093     ignoring (HTML5 revisions 1013 and 1016).
1094    
1095     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1096    
1097 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1098     white spaces, are implemented.
1099    
1100 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1101    
1102 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1103     attribute if possible.
1104    
1105     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1106    
1107 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1108     (parse_char_string): New alias for |parse_string|.
1109     (main phase): Invoking "change the encoding" algorithm if desired.
1110    
1111     * HTML.pod: Updated.
1112    
1113     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1114    
1115 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1116    
1117     * Makefile (HTML-all, HTML-clean): New.
1118    
1119     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1120    
1121     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1122    
1123 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1124    
1125     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1126    
1127 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1130    
1131     * CacheManifest.pod: New file.
1132    
1133     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1134    
1135 wakaba 1.114 * CacheManifest.pm: New module.
1136    
1137     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1138    
1139 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1140     callback.
1141    
1142     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1143    
1144 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1145    
1146 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1147    
1148 wakaba 1.111 * Makefile (clean): New rule.
1149    
1150     * NanoDOM.pm (public_id, system_id): New attributes.
1151    
1152     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1153    
1154 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1155    
1156 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1157    
1158     * ContentChecker.pm (check_document): Support for
1159     new |is_xml_root| flag.
1160     (check_element): Support for new |pluses| state.
1161     (_add_pluses): New method.
1162     (_remove_minuses): Support for new |minus| item.
1163    
1164 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1165    
1166 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1167     root element.
1168    
1169     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1170    
1171 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1172    
1173 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1174    
1175 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1176    
1177     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1178    
1179 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1180    
1181     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1182    
1183 wakaba 1.103 * LangTag.pm: New module.
1184    
1185 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1186    
1187     * HTML.pm.src: Some error types were wrong.
1188    
1189 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1190    
1191 wakaba 1.101 * CSS/: New directory.
1192    
1193     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1194    
1195 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1196     for |noscript| in |head|.
1197    
1198 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1199    
1200     * URI/: New directory.
1201    
1202 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1203    
1204 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1205     number.
1206    
1207     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1208    
1209 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1210     by |&|s.
1211    
1212     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1213    
1214 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1215    
1216     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1217    
1218 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1219    
1220     * HTML.pm.src: Insertion modes are now represented in number.
1221    
1222     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1223    
1224 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1225     999) is implemented.
1226    
1227 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1228    
1229     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1230     983) is implemented.
1231    
1232 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1233    
1234     * ContentChecker.pod: New documentation.
1235    
1236     * Makefile: A rule for |ContentChecker.html| is added.
1237    
1238     * ContentChecker.pm: A pod "LICENSE" section is added.
1239    
1240     * NanoDOM.pm ($VERSION): New variable.
1241    
1242 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1243    
1244 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1245     list of allowed HTML elements.
1246    
1247     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1248    
1249 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1250     HTML elements.
1251    
1252     * URIChecker.pm (check_iri): New.
1253     (check_iri_reference): Error type for IRI reference
1254     syntax error is changed.
1255    
1256 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1257    
1258 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1259     is added.
1260     (check_document): Load appropriate module before validation.
1261    
1262     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1263    
1264 wakaba 1.88 * ContentChecker/: New directory.
1265    
1266     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1267    
1268 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1269    
1270     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1271    
1272 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1273    
1274 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1275     even if no document element is found.
1276    
1277     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1278    
1279 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1280    
1281     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1282    
1283 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1284    
1285     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1286    
1287 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1288     is split into "after html body" and "after html frameset"
1289     insertion modes. Their codes are merged with "after body"
1290     and "after frameset" codes. |$previous_insertion_mode|
1291     has been removed. "after frameset" code is
1292     merged with "in frameset" code.
1293    
1294     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1295    
1296 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1297     merged with the "in head" insertion mode.
1298    
1299     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1300    
1301 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1302     reorganized.
1303    
1304     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1307     are merged.
1308    
1309     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1310    
1311 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1312     with "in table" insertion mode.
1313    
1314     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1315    
1316 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1317     modes are merged.
1318    
1319     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1322     insertion mode!
1323    
1324     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1325    
1326 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1327     now directly invoke the handler.
1328    
1329     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1330    
1331 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1332     is merged to the "in body" insertion mode code.
1333    
1334     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1335    
1336 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1337     insertion modes are merged.
1338    
1339     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1340    
1341 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1342    
1343     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1344    
1345 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1346     instead of string constant for |{content_model_flag}|.
1347    
1348     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1349    
1350 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1351     to the "duplicate attribute" error.
1352    
1353 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1354    
1355 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1356    
1357     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1360    
1361     * HTML.pm.src: A typo is fixed.
1362    
1363 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1364    
1365 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1366     from "in HTML:xml:lang" error. Character position
1367     is now the last part of the error type in the URI error
1368     description. Report "unsupported" status for language
1369     tags, media queries, script codes, and style sheets.
1370    
1371     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1372    
1373 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1374     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1375     in XML.
1376    
1377     * NanoDOM.pm (Attr.owner_document): New attribute.
1378    
1379     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1380    
1381 wakaba 1.66 * HTML.pm.src: The character immediately following
1382     a bare |hcro| was discarded. Fix handling of
1383     entity references in attribute values.
1384    
1385     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1386    
1387 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1388     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1389     factored out. Error types |in html:#DOCTYPE|
1390     and |after html:#DOCTYPE| are merged
1391     into |DOCTYPE in the middle|. |</frameset>|
1392     in fragment parsing mode changed the
1393     insertion mode.
1394    
1395     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1396    
1397 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1398     phase is now an insertion mode. Treatments for white
1399     space character tokens were incorrect for some
1400     insertion modes. An old |meta| case was not removed.
1401    
1402     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1403    
1404 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1405     implemented (but changing the encoding is not yet:-).
1406    
1407 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1408    
1409 wakaba 1.62 * Charset/: New directory.
1410    
1411     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1412    
1413 wakaba 1.61 * H2H.pm: New Perl module (created from
1414     manakai's H2H.dis).
1415    
1416     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1419     manakai's SimpleLS.dis).
1420    
1421 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1422    
1423     * HTML.pm.src: |<!---x-->| was not processed correctly.
1424    
1425 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1426    
1427 wakaba 1.58 * HTML.pm.src: Report correct error message
1428     for |<body></div></body>|.
1429    
1430     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1433    
1434     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1435    
1436     * ContentChecker.pm: Return |{term}| list.
1437    
1438 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1439    
1440 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1441     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1442     and <title> outside of head).
1443    
1444     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1445    
1446 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1447     and private types/subtypes.
1448    
1449     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1450     URIChecker.pm, HTMLTable.pm: Error messages are now
1451     consistent; they are all listed in
1452     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1453    
1454 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1455    
1456 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1457     is now erred. |<datalist>| is implemented.
1458     Attribute checker for |<command>| and |<menu>| are
1459     added. Support for |contextmenu| global attribute
1460     is added.
1461    
1462     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1463    
1464 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1465     of Step 3 has been changed.
1466    
1467     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1468    
1469 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1470    
1471 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1472    
1473 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1474     incorrect.
1475    
1476     * ContentChecker.pm: Support for the |footer| element.
1477     Check URI syntax for space-separated URI attributes.
1478     Support for the |tabindex| attribute. Support
1479     for |datetime| attribute.
1480    
1481     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1482    
1483 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1484     character references). HTML5 revision 1.146 (white space
1485     characters before root start tag). HTML5 revision
1486     1.148 (named character references in attribute values).
1487     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1488    
1489     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1490    
1491     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1492 wakaba 1.49 in <head>).
1493    
1494     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1495    
1496 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1497     in body).
1498    
1499     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1500    
1501     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1502    
1503 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1504     or RCDATA in fragment parsing mode). HTML5 revision
1505     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1506    
1507     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1508    
1509     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1510     for <pre> and <textarea>).
1511    
1512 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1513    
1514 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1515    
1516     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1517    
1518 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1519    
1520     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1521    
1522 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1523     HTML5 revision 917 (conforming bare &).
1524    
1525     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1526    
1527 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1528    
1529     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1530     HTML5 revision 915 (<nobr>).
1531    
1532     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1533    
1534 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1535    
1536     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1537     New attributes.
1538    
1539     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1540    
1541 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1542     in tags).
1543    
1544     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1545    
1546 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1547    
1548     * HTML.pm.src: |$entity_char| is removed and
1549     requires |Whatpm::_NamedEntityList| instead.
1550     HTML5 revision 898 (refc), except that lack of refc
1551     is parse error.
1552    
1553     * mkentitylist.pl: New script.
1554    
1555     * Makefile (all): |_NamedEntityList.pm| is added.
1556     (_NamedEntityList.pm, Entities.html): New rules.
1557    
1558     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1559    
1560 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1561     were ignored and U+000D immediately following another
1562     U+000D was not converted to U+000A.
1563    
1564     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1565    
1566 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1567     nodes before appended). Parser was not ready for NULL
1568     parse error and escape flag.
1569    
1570     * NanoDOM.pm (adopt_node): New.
1571    
1572     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1573    
1574 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1575     CDATA and RCDATA). Note that current implementation
1576     is simply repeating what the spec says and it is maybe not
1577     a best way to do it.
1578    
1579     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1580    
1581 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1582     the form element if a descendant element without implied end tag has
1583     still been open).
1584    
1585     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1588     space characters non-conforming).
1589    
1590     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1591    
1592 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1593     HTML5 revision 869 (C1 character references).
1594    
1595     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1596    
1597 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1598     a |textarea| is removed).
1599    
1600 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1601    
1602     * NanoDOM.pm (get_attribute_node_ns): New method.
1603    
1604     * ContentChecker.pm: |script| |async| and |defer|
1605     no longer require |src|. |async| MUST NOT be
1606     specified if |defer|. (HTML5 revision 858).
1607    
1608 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1609    
1610     * HTML.pm.src: |<form><form>| went to inifinite loop.
1611    
1612 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1613    
1614 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1615     as a document element) flag on.
1616     (new): Removed.
1617     (check_document): New method.
1618    
1619     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1620    
1621 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1622     were incorrect.
1623    
1624     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1625    
1626 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1627     error position in the |type| option of the error.
1628    
1629     * HTMLTable.pm (form_table): The |$onerror| parameter
1630     is now optional. Some bugs are fixed.
1631    
1632     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1633    
1634 wakaba 1.27 * HTMLTable.pm: New module.
1635    
1636     * ContentChecker.pm (table): Invoke table model error checker.
1637    
1638     * NanoDOM.pm (first_child, get_attribute_ns): New.
1639    
1640 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1641    
1642 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1643     (link/@rel, a/@rel, area/@rel): Use new checker.
1644    
1645     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1646    
1647     * _LinkTypeList.pm: New file.
1648    
1649     * mklinktypelist.pl: New file.
1650    
1651     * .cvsignore: |RelExtensions.html| added.
1652    
1653     * NanoDOM.pm (child_nodes): Returns an empty array
1654     for non-child-containing node types.
1655     (text_content): New attribute.
1656    
1657     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1658    
1659 wakaba 1.25 * IMTChecker.pm: New module.
1660    
1661     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1662     to test parameter value validity.
1663    
1664     * HTML.pm.src ($style_start_tag): Attributes were
1665     discarded.
1666    
1667 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1668    
1669 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1670    
1671     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1672    
1673 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1674     and RFC 3987 are listed (not all testable items are checked yet).
1675    
1676     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1677    
1678 wakaba 1.22 * URIChecker.pm: New module.
1679    
1680 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1681    
1682 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1683     or associated to some placeholder.
1684     ($ElementDefault): Warn unknown attributes
1685     for unknown elements as "attribute not supported".
1686     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1687     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1688     (|a| attribute checker): Reimplemented.
1689    
1690     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1691    
1692 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1693     ($HTMLAttrChecker): Event handler content attributes
1694     are added.
1695     (link, embed): Required attribute is now checked.
1696     (embed): Unknown local attributes are no longer warned.
1697    
1698     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1699    
1700 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1701     ($HTMLIMTAttrChecker): New checker.
1702     (link@rel, link@href, link@type, style@type,
1703     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1704     object@data, object@type, source@src, source@type, area@alt,
1705     area@shape, area@coords, area@href,
1706     area@ping, area@rel, area@type, script@src,
1707     script@defer, script@async, script@type): Checkers added.
1708    
1709     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1710    
1711 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1712    
1713 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1714    
1715 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1716     attributes. Report an error if |Element.prefix|
1717     is |xmlns|.
1718    
1719     * NanoDOM.pm (prefix): New attribute.
1720    
1721     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1722    
1723 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1724     mode, action for |<iframe>| was missing.
1725    
1726     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1727    
1728 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1729     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1730    
1731     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1732    
1733 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1734     ($HTMLURIAttrChecker): Placeholder.
1735     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1736     New.
1737    
1738     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1739    
1740 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1741     for global attributes, |html|, |base|, |style|, and |meta|.
1742    
1743     * NanoDOM.pm (insert_before): Weaken reference
1744     to the parent node.
1745     (Attr::new): Set |owner_element| attribute.
1746     (namespace_uri, manakai_local_name): New attribute implementations.
1747     (owner_element): New attribute.
1748    
1749     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1750    
1751 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1752     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1753     $Element->{$HTML_NS}->{''}): New.
1754     (check_element): Invoke attrs_checker for each element.
1755    
1756 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1757    
1758 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1759    
1760     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1761    
1762 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1763     element type testings.
1764    
1765     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1766    
1767 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1768     error when an element type is put in the "minus" list
1769     and the element type is not allowed explicitly in the particular
1770     element content model.
1771     (html:a checker): New checker.
1772     (html:details, html:datagrid): New checkers.
1773     (html:legend): New checker.
1774    
1775     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1776    
1777 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1778    
1779     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1780    
1781 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1782     checker.
1783     (html:dd checker): New checker.
1784     (html:q, html:em, html:strong, html:small,
1785     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1786    
1787     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1788    
1789 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1790     from |$ElementDefault->{checker}|.
1791     ($ElementDefault->{checker}): Throw an error that
1792     the element type is not supported by the checker.
1793     ($HTMLMetadataElement): |html:base| was missing.
1794     ($HTMLEmptyChecker): Don't throw an error
1795     for inter-element whitespace nodes.
1796     (html:html checker): Errors were not
1797     thrown even if |html:head| and/or |html:body|
1798     children were missing.
1799     (html:head checker): An error was not
1800     thrown if <meta charset> appered after other
1801     elements.
1802    
1803 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1804    
1805     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1806     and |noscript| elements are implemented.
1807     (new): New method.
1808    
1809 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1810    
1811 wakaba 1.4 * ContentChecker.pm: New module.
1812    
1813     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1814    
1815 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1816     document_element, manakai_local_name,
1817     manakai_element_type_match): New method.
1818    
1819     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1820    
1821     * HTML.pm.src: Replace decimal and hexadecimal numeric
1822     entities in C1 range using Windows-1252 mapping. Bare LF
1823     did not count as new line for error reporting.
1824    
1825 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1826    
1827     * NanoDOM.pm (DOMImplementation): New class.
1828     (append_child): Weaken the |parent_node| reference.
1829     (create_element_ns, Element new): Set the |owner_document|
1830     reference.
1831     (implementation): New attribute.
1832     (owner_document, local_name, namespace_uri): New attributes.
1833    
1834     * HTML.pm.src (parse_string): Line and column numbers
1835     are now provided to error handler.
1836     (!!!parse-error): Short descriptions are added.
1837     (_construct_tree): Split into three methods; support
1838     for innerHTML mode.
1839     (set_inner_html): New method.
1840    
1841 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1842    
1843     * NanoDOM.html: Documentation is added.
1844    
1845     * HTML.pod, ContentType.html: Documentation is revised.
1846    
1847     * .cvsignore: Pod2html temporary files are added.
1848    
1849     * Makefile: Make |NanoDOM.html|.
1850    
1851     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1852    
1853     * HTML.pm.src (parse_string): New method.
1854     (get_inner_html): Renamed from |inner_html|.
1855    
1856     * Makefile: A rule for |HTML.html| is added.
1857    
1858     * HTML.pod: New documentation.
1859    
1860     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1861    
1862     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1863     (clone_node): Attribute nodes were not completely copied.
1864    
1865     * HTML.pm.src: Many bugs are fixed.
1866    
1867     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1868    
1869     * HTML.pm.src: Some typos are fixed.
1870    
1871     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1872    
1873     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1874     are removed.
1875    
1876     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1877    
1878     * HTML-consume-entity.src: Removed.
1879    
1880     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1881    
1882     * HTML.pm.src: Tokenizer's handling on hexadecimal
1883     numeric entities are rewritten.
1884    
1885     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1886    
1887     * HTML.pm.src: Some tokenizer bugs are fixed.
1888    
1889     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1890    
1891     * HTML.pm.src: The tree construction stage is implemented.
1892    
1893     * mkhtmlparser.pl: New macros are added.
1894    
1895     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1896    
1897     * ContentType.pm: A note on bug in the specification
1898     is removed since it's been now fixed.
1899    
1900     * .cvsignore: New file.
1901    
1902     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1903    
1904     * HTML.pm.src, HTML-consume-entity.src: New files.
1905    
1906     * Makefile (HTML.pm): New rule.
1907    
1908     * mkhtmlparser.pl: New script.
1909    
1910     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1911    
1912     * Makefile: New file.
1913    
1914     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1915    
1916     * ContentType.pm: An error in pod is fixed.
1917    
1918     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1919    
1920     * ContentType.pm: Documentation is added.
1921    
1922     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1923    
1924     * ContentType.pm: New Perl module.
1925    
1926     * ChangeLog: New file.
1927    
1928    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24