/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.300 - (hide annotations) (download)
Sun Sep 14 01:51:08 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.299: +10 -0 lines
++ whatpm/Whatpm/ChangeLog	14 Sep 2008 01:47:27 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* HTML.pm.src (parse_char_string): Use newly created
	|Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
	standard feature to |open| a string as a filehandle,
	since Perl's string filehandle seems not supporting |ungetc|
	method correctly.
	(parse_char_stream): Define |{getc_until}| method.
	(DATA_STATE): Experimental support for |getc_until| feature.

++ whatpm/Whatpm/Charset/ChangeLog	14 Sep 2008 01:50:52 -0000
2008-09-14  Wakaba  <wakaba@suika.fam.cx>

	* DecodeHandle.pm (CharString): New class.
	(Encode read): Don't remove read string from |{char_buffer}|,
	to decease the number of string operations and to enable
	|manakai_getc_until| ungetc'ing without any string operation.
	(manakai_getc_until): New method.

	* UnicodeChecker.pm (getc): Don't |read| more than one
	character, to prevent characters being bufferred
	such that mixture of |getc| and |manakai_getc_until|
	calls does not make the result broken.

1 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
2    
3     * HTML.pm.src (parse_char_string): Use newly created
4     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
5     standard feature to |open| a string as a filehandle,
6     since Perl's string filehandle seems not supporting |ungetc|
7     method correctly.
8     (parse_char_stream): Define |{getc_until}| method.
9     (DATA_STATE): Experimental support for |getc_until| feature.
10    
11 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
12    
13 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
14    
15     2008-09-13 Wakaba <wakaba@suika.fam.cx>
16    
17 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
18     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
19     and replaced by |{prev_state}|.
20    
21     * mkhtmlparser.pl: Remove |{char}| feature.
22     Remove |!!!back-next-input-character;| macro.
23    
24     2008-09-13 Wakaba <wakaba@suika.fam.cx>
25    
26 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
27     entity related tokenizer states in favor of new states
28     implementing the consume character reference algorithm.
29    
30     2008-09-13 Wakaba <wakaba@suika.fam.cx>
31    
32     * HTML.pm.src: "Consume a character reference" algorithm is
33 wakaba 1.296 now implemented as a tokenizer's state, rather than
34     a method, with minimum changes (more changes will
35     be made, in due course). "Bogus comment state"'s inner
36     loop gets removed.
37    
38     2008-09-13 Wakaba <wakaba@suika.fam.cx>
39    
40 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
41 wakaba 1.295 into their own tokenizer states.
42    
43     2008-09-13 Wakaba <wakaba@suika.fam.cx>
44    
45 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
46     is split into three states.
47    
48     2008-09-13 Wakaba <wakaba@suika.fam.cx>
49    
50 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
51     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
52     no longer does the tokenizer have to push back next input
53     characters in those states.
54    
55     2008-09-13 Wakaba <wakaba@suika.fam.cx>
56    
57 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
58     into four states so that no longer does the tokenizer have to push
59     back next input characters in that state.
60    
61 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
62    
63     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
64     which can be used to insert some wrapper between the character
65     stream handle and the tokenizer. (It is currently not supported
66     for |set_inner_html| for |Element|s).
67    
68 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
69    
70 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
71    
72     2008-09-10 Wakaba <wakaba@suika.fam.cx>
73    
74 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
75    
76     * HTML.pm.src: Don't specify |text| argument for the
77     |chardecode:fallback| error, since it is not the encoding
78     being used alternatively.
79    
80 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
81    
82     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
83    
84 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
85    
86 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
87    
88     2008-08-31 Wakaba <wakaba@suika.fam.cx>
89    
90 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
91     to after after frameset insertion mode processing (HTML5
92     revision 1909). Note that the implementation was wrong
93     per the old spec before the r1909 changes.
94    
95 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
96    
97 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
98     spec (HTML5 revision 2093).
99     ($process_row): Algorithm step numbers synced with the
100     spec (HTML5 revision 2092).
101    
102     2008-08-30 Wakaba <wakaba@suika.fam.cx>
103    
104 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
105     revision 2094).
106    
107     2008-08-30 Wakaba <wakaba@suika.fam.cx>
108    
109     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
110    
111     2008-08-30 Wakaba <wakaba@suika.fam.cx>
112    
113 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
114     name (after another attribute) now raise parse error (HTML5
115     revision 2123). Empty unquoted attribute values are no
116     longer allowed (HTML5 revision 2122).
117    
118     2008-08-30 Wakaba <wakaba@suika.fam.cx>
119    
120 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
121     revision 2130).
122    
123     2008-08-30 Wakaba <wakaba@suika.fam.cx>
124    
125 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
126     as |lang| attribute value for HTML elements (HTML5 revision 2062
127     and so on).
128    
129     2008-08-30 Wakaba <wakaba@suika.fam.cx>
130    
131 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
132     was missing.
133    
134     * URIChecker.pm: The end of the URL should be marked as the
135     error location for an empty path error. The position
136     between the userinfo and the port components should be
137     marked as the error location for an empty host error.
138    
139     2008-08-30 Wakaba <wakaba@suika.fam.cx>
140    
141 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
142     value the error occurs for errors. Report unknown
143     address format error in warning level, since address
144     formats are rarely added. Path segments starting with "/.."
145     were misinterpreted as a dot-segment.
146    
147     2008-08-30 Wakaba <wakaba@suika.fam.cx>
148    
149 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
150     |Message::DOM::DOMImplementation|.
151    
152 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
153    
154     * IMTChecker.pm: Updated for the new error reporting architecture.
155    
156     * ContentChecker.pm: Error levels for IMTs are added.
157    
158 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
159    
160     * H2H.pm (_shift_token): Support for unquoted HTML attribute
161     values.
162    
163 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
164    
165 wakaba 1.275 * CacheManifest.pm: Support for new style of error
166     reports.
167    
168     * HTML.pm.src: Set line=1, column=1 to the document node.
169    
170     2008-08-16 Wakaba <wakaba@suika.fam.cx>
171    
172 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
173     and URL checkers. Support for more error levels for bogus
174     langauge tag and URL "standards".
175    
176     * LangTag.pm, URIChecker.pm: Support for new style error
177     level reporting.
178    
179 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
180    
181 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
182    
183     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
184     specifying. Error types are revised.
185    
186     2008-08-15 Wakaba <wakaba@suika.fam.cx>
187    
188 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
189     renewed.
190    
191     2008-08-15 Wakaba <wakaba@suika.fam.cx>
192    
193 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
194     are revised. Use new style for "level" specification.
195    
196     * mkhtmlparser.pl: Use new style for "level" specification.
197    
198 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
199    
200 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
201     reporting process for broken ignored valuetype definition.
202     (Valuetype idl_text): Support for special "DOMString" name.
203    
204     2008-08-03 Wakaba <wakaba@suika.fam.cx>
205    
206 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
207     terminal of the ScopedName is "DOMString", such that whether
208     the last part of the scoped name is "DOMString" or "_DOMString"
209     later. It is necessary to determine whether a |typedef|
210     definition should be ignored or not.
211     (parse_char_string): Unescape the identifier of
212     exception members.
213     ($resolve): Return undef for builtin types and sequence<T>
214     types (we might not have to do this, however...).
215     (check): Support checking for Exceptions, Valuetypes,
216     and Typedefs.
217     ($serialize_type): Support for "DOMString::::" syntax.
218     (Typedef idl_text): Output Type as "DOMString" if it
219     is really "DOMString" (i.e. its internal representation
220     is "::DOMString::").
221    
222     2008-08-03 Wakaba <wakaba@suika.fam.cx>
223    
224 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
225     for constant types in the |check| method.
226     (check): Support for checking of attributes, operations, and
227     arguments.
228     (Attribute/Operation idl_text): Exception names in getraises,
229     setraises, and raises clauses is serizlied by |$serialize_type|
230     code.
231    
232 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
233    
234 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
235     hexadecimal numbers (the original pattern, taken from the spec,
236     was not work for hexadecimal numbers, because the "0" prefix
237     matches to the [0-7]* part (as an empty string) and therefore
238     it does not match with remaining "x..." part of a "0x..." integer
239     literal.
240     ($get_type): It now returns a string, not an array reference,
241     for regular types and |sequence| types (i.e. it in any case
242     returns a string).
243     ($get_next_token): The second item in the array that represents
244     a integer or float token is now a Perl number value, not the
245     original string representation of the number.
246     (check): Support for const value consistency checking.
247     No extended attribute is defined for constants.
248     (Node subclasses): Use simple strings rather than array references
249     for default data type values.
250     ($serialize_type): Type values are now simple strings.
251     (value): If the new attribute value is a false value, then
252     a FALSE value is set to the attribute.
253    
254     2008-08-02 Wakaba <wakaba@suika.fam.cx>
255    
256 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
257     in its stringified format ("scoped name" as defined in the
258     spec). Note that future version of this module should not use
259     array references for type values and the |type_text| attribute
260     should be made obsolete.
261     (parse_char_string): Unescape attribute names.
262     (check): Support for checking of whether inherited interfaces
263     are actually defined or not. Support for checking of whether
264     interface member identifiers are duplicated or not.
265     ($serialize_type): Scoped names are returned as is. A future
266     version of this code should escape identifiers other than "DOMString",
267     otherwise the idl_text would be non-conforming.
268    
269     2008-08-02 Wakaba <wakaba@suika.fam.cx>
270    
271 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
272 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
273     for Definition's were ignored.
274     (append_child): Set |parent_node| attribute.
275     (parent_node): New attribute.
276     (check): Support interface/exception members. Support
277     extended attributes. Support definition identifier uniqueness
278     constraint.
279     (qualified_name): New attribute.
280     (Interface/Exception idl_text): Extended attributes were
281     not prepended to the returned text.
282    
283     2008-08-02 Wakaba <wakaba@suika.fam.cx>
284    
285     * WebIDL.pm (parse_char_string): Set line/column numbers
286 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
287     would make things wrong. Support for interface forward
288     declarations was missing. Broken interface declarations
289     with no block were not ignored entirely.
290     (Whatpm::WebIDL::Node): New abstract class. This class
291     makes things easier.
292     (child_nodes): New attribute. Unlike DOM's attribute with
293     same name, this attribute returns a dead list of nodes for
294     simplicity.
295     (get_user_data, set_user_data): New methods.
296     (Module idl_text): A SPACE character should be inserted
297     before the |{| character.
298     (Interface idl_text): Support for interface forward declarations.
299     (is_forward_declaration): New attribute.
300    
301 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
302    
303 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
304    
305     2008-07-19 Wakaba <wakaba@suika.fam.cx>
306    
307 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
308     it now can handle broken extended attributes and as such.
309    
310     2008-07-19 Wakaba <wakaba@suika.fam.cx>
311    
312 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
313     Support for extended attributes with arguments.
314    
315     2008-07-19 Wakaba <wakaba@suika.fam.cx>
316    
317 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
318     (Interface->idl_text): Tentative support for inheritances.
319    
320     2008-07-19 Wakaba <wakaba@suika.fam.cx>
321    
322 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
323     Support for raises, setraises, and getraises syntaxes.
324    
325 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
326    
327     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
328     proper support for types, extended attributes, and exceptions yet).
329     WebIDL parser, version 1 (no support for exceptions yet,
330     no proper support for extended attributes yet).
331    
332 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
333    
334     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
335    
336 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
337    
338 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
339    
340     2008-06-29 Wakaba <wakaba@suika.fam.cx>
341    
342 wakaba 1.255 * WebIDL.pm: New module.
343    
344 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
345    
346     * Makefile (Entities.html): URI changed.
347    
348 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
349    
350     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
351    
352 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
353    
354 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
355    
356     2008-06-01 Wakaba <wakaba@suika.fam.cx>
357    
358 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
359     spec (commented out). (HTML5 revision 1711).
360    
361 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
362    
363 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
364    
365     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
366     as UTF-16 (HTML5 revision 1701).
367    
368     2008-05-25 Wakaba <wakaba@suika.fam.cx>
369    
370 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
371     1692).
372    
373     2008-05-25 Wakaba <wakaba@suika.fam.cx>
374    
375     * HTML.pm.src: The secondary insertion mode used when switching
376     to foreign content is the "in body" insertion mode (HTML5 revision
377     1696).
378    
379     2008-05-25 Wakaba <wakaba@suika.fam.cx>
380    
381 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
382     revision 1697).
383    
384 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
385    
386 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
387     insertion mode (HTML5 revision 1693). Update SVG camelCase
388     attribute list (HTML5 revision 1700). <textarea> closes
389     </select> (HTML5 revision 1699). More start tags close in
390     foreign content insertion mode (HTML5 revision 1698).
391    
392     2008-05-24 Wakaba <wakaba@suika.fam.cx>
393    
394 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
395    
396     2008-05-24 Wakaba <wakaba@suika.fam.cx>
397    
398 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
399     revision 1674).
400    
401     2008-05-24 Wakaba <wakaba@suika.fam.cx>
402    
403 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
404     revision 1676).
405    
406     2008-05-24 Wakaba <wakaba@suika.fam.cx>
407    
408 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
409     quriks mode detection (HTML5 revision 1679).
410    
411     2008-05-24 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
414     revision 1685).
415    
416     2008-05-24 Wakaba <wakaba@suika.fam.cx>
417    
418 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
419     revision 1684).
420    
421     2008-05-24 Wakaba <wakaba@suika.fam.cx>
422    
423 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
424     work (HTML5 revision 1690).
425    
426     2008-05-24 Wakaba <wakaba@suika.fam.cx>
427    
428 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
429     end tags in head insertion modes (HTML5 revision 1686).
430     (parse_char_string): Bug fix for non-utf8 character string handlings.
431     (parse_char_stream): |ungetc| does not work well for this context.
432    
433 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
434    
435 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
436     |parse_byte_stream|.
437     (parse_byte_stream): New method.
438    
439     2008-05-18 Wakaba <wakaba@suika.fam.cx>
440    
441 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
442     by encoding layer error reporter.
443    
444 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
445    
446 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
447     rather than converting the whole byte string and then parsing.
448     Propagate errors in character encoding layer.
449     (get_next_token): Precise error reporting for |bare stago| error.
450    
451     2008-05-17 Wakaba <wakaba@suika.fam.cx>
452    
453 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
454     (parse_char_string): This method is now defined as an invocation
455     of the |parse_char_stream| method.
456    
457     2008-05-17 Wakaba <wakaba@suika.fam.cx>
458    
459 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
460     of the sniffing as info-level errors. Support for new
461     decoding framework in parser resestting.
462     (new): Various default error levels were not set.
463    
464     2008-05-17 Wakaba <wakaba@suika.fam.cx>
465    
466 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
467     algorithm, except for the actual sniffing, is implemented
468     with new framework with Message::Charset::Info.
469    
470 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
471    
472     * CacheManifest.pm (_parse): Drop fragment identifiers from
473     URIs in fallback section (HTML5 revision 1596).
474    
475 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
476    
477 wakaba 1.231 * Makefile (Entities.html): URI has changed.
478    
479     2008-05-10 Wakaba <wakaba@suika.fam.cx>
480    
481 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
482     1553).
483    
484 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
485    
486 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
487     no longer table elements, but table objects returned
488     by Whatpm::HTMLTable.
489    
490     * HTMLTable.pm (form_table): Return table element node
491     as |$table->{element}|.
492     (assign_header): Support for the |headers=""| attribute.
493    
494     2008-05-06 Wakaba <wakaba@suika.fam.cx>
495    
496 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
497     with no support for headers="".
498     (form_table): Include table width and height to the returned
499     table object for covenience. Indexing in column assignement was wrong.
500     Set whether a data cell is empty or not for convenience.
501    
502 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
503    
504 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
505    
506     2008-05-05 Wakaba <wakaba@suika.fam.cx>
507    
508 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
509     changed (HTML5 revision 1387).
510    
511     2008-05-05 Wakaba <wakaba@suika.fam.cx>
512    
513 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
514     to the end of the table (HTML5 revision 1380).
515    
516     2008-05-05 Wakaba <wakaba@suika.fam.cx>
517    
518 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
519     of 1-based (HTML5 revision 1376).
520    
521     2008-05-05 Wakaba <wakaba@suika.fam.cx>
522    
523 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
524     and "Content-Encoding" no longer prevent sniffing (HTML5
525     revision 1288).
526    
527     2008-05-05 Wakaba <wakaba@suika.fam.cx>
528    
529 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
530     1282).
531    
532 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
533    
534 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
535     Status of XML specs are added.
536    
537     2008-05-03 Wakaba <wakaba@suika.fam.cx>
538    
539 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
540     and |ref=""| attribute (referent element type checking).
541    
542 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
543    
544 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
545     1500) is implemented (except for IDNA part and URI-scheme-specific
546 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
547     documents.
548    
549 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
550    
551 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
552     character (HTML5 revision 1263).
553    
554     2008-04-13 Wakaba <wakaba@suika.fam.cx>
555    
556 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
557     definition for |AElig;| was missing).
558    
559     2008-04-13 Wakaba <wakaba@suika.fam.cx>
560    
561 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
562     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
563    
564     2008-04-13 Wakaba <wakaba@suika.fam.cx>
565    
566 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
567     insertion mode is added (HTML5 revisions 1412 and 1418).
568    
569     2008-04-13 Wakaba <wakaba@suika.fam.cx>
570    
571 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
572     elements (HTML5 revision 1410).
573    
574     2008-04-13 Wakaba <wakaba@suika.fam.cx>
575    
576 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
577     revision 1406).
578    
579     2008-04-13 Wakaba <wakaba@suika.fam.cx>
580    
581 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
582     elements (HTML5 revision 1404 and 1420).
583    
584 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
585    
586 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
587     elements (HTML5 revision 1404). Unused !!!macro definitions
588     are removed.
589    
590     2008-04-12 Wakaba <wakaba@suika.fam.cx>
591    
592 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
593     are raised is changed (HTML5 revision 1404).
594    
595 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
596    
597 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
598    
599     2008-04-06 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
602     replaced by constants representing element category.
603    
604     2008-04-06 Wakaba <wakaba@suika.fam.cx>
605    
606 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
607     for error reporting. (This is a preparation for using constant
608     value for ->[1].)
609    
610 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
611    
612 wakaba 1.205 * RDFXML.pm: Typo fixed.
613    
614     2008-03-22 Wakaba <wakaba@suika.fam.cx>
615    
616 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
617     as same as |must_level|, i.e. level = |m|.
618     (check_element): Make list of URIs in the DOM.
619    
620 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
621    
622 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
623     Local (null-namespace) attribute support.
624    
625     2008-03-21 Wakaba <wakaba@suika.fam.cx>
626    
627 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
628    
629     2008-03-21 Wakaba <wakaba@suika.fam.cx>
630    
631 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
632     and URI attributes is implemented. Warn if unknown
633     value is used in rdf:parseType="" attribute.
634    
635     * URIChecker.pm (check_rdf_uri_reference): New function.
636    
637     2008-03-21 Wakaba <wakaba@suika.fam.cx>
638    
639 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
640     are now resolved.
641    
642     2008-03-21 Wakaba <wakaba@suika.fam.cx>
643    
644 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
645    
646     * RDFXML.pm: undef vs false bug fixed.
647     Reification implemented.
648    
649     2008-03-21 Wakaba <wakaba@suika.fam.cx>
650    
651 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
652     The |node| arguments are added for |ontriple| calls.
653     Too many "attribute not allowed" errors were raised.
654    
655     * ContentChecker.pm: Initial experimental support for rdf:RDF
656     element.
657    
658     2008-03-21 Wakaba <wakaba@suika.fam.cx>
659    
660 wakaba 1.197 * RDFXML.pm: New module.
661    
662 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
663    
664 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
665     code was old one yet.
666    
667     2008-03-20 Wakaba <wakaba@suika.fam.cx>
668    
669 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
670     errors. Line/column markings for DOCTYPE, comment, and
671     character tokens are reintroduced; otherwise,
672     error location for "not HTML5" error and errors
673     for implied elements are not attached.
674    
675     2008-03-20 Wakaba <wakaba@suika.fam.cx>
676    
677 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
678    
679     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
680     (value): Setter implemented.
681    
682     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
683    
684     2008-03-20 Wakaba <wakaba@suika.fam.cx>
685    
686 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
687    
688     2008-03-20 Wakaba <wakaba@suika.fam.cx>
689    
690 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
691     flag, which is no longer used.
692    
693 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
694    
695 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
696    
697     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
698     and !!!insert-element-t): Set line/column information to
699     element nodes.
700    
701     2008-03-17 Wakaba <wakaba@suika.fam.cx>
702    
703 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
704     in "<?", "<>", or "</>" should be the error point.
705    
706 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
707    
708 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
709    
710     2008-03-16 Wakaba <wakaba@suika.fam.cx>
711    
712 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
713    
714     2008-03-16 Wakaba <wakaba@suika.fam.cx>
715    
716 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
717     reporting.
718    
719 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
720    
721 wakaba 1.186 * HTML.pm.src: Error type revised.
722    
723     2008-03-11 Wakaba <wakaba@suika.fam.cx>
724    
725 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
726    
727     2008-03-11 Wakaba <wakaba@suika.fam.cx>
728    
729 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
730    
731 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
732    
733     * mkhtmlparser.pl: Set "level" parameter to parse errors.
734    
735     * HTML.pm.src: Code refined.
736    
737 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
738    
739 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
740     revision 1348). Note that I really don't know this makes
741     any difference in the black-box behavior of the parser.
742    
743     2008-03-09 Wakaba <wakaba@suika.fam.cx>
744    
745 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
746     revision 1348).
747    
748     2008-03-09 Wakaba <wakaba@suika.fam.cx>
749    
750 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
751    
752     2008-03-09 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
755    
756     2008-03-09 Wakaba <wakaba@suika.fam.cx>
757    
758 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
759     mode and support for the "in select in table" insertion mode (HTML5
760     revision 1342).
761    
762     2008-03-09 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
765     in non-tainted <table>s (HTML5 revision 1336).
766    
767     2008-03-09 Wakaba <wakaba@suika.fam.cx>
768    
769 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
770     and <head> (HTML5 revision 1332).
771    
772     2008-03-09 Wakaba <wakaba@suika.fam.cx>
773    
774 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
775     white space (HTML5 revision 1331).
776    
777 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
778    
779 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
780     element (HTML5 revision 1330).
781    
782     2008-03-08 Wakaba <wakaba@suika.fam.cx>
783    
784 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
785     element (HTML5 revision 1328).
786    
787     2008-03-08 Wakaba <wakaba@suika.fam.cx>
788    
789 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
790     into foster parents (HTML5 revision 1326).
791    
792     2008-03-08 Wakaba <wakaba@suika.fam.cx>
793    
794 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
795     revision 1321).
796    
797     2008-03-08 Wakaba <wakaba@suika.fam.cx>
798    
799 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
800     the spec (HTML5 revision 1320). Note that we finally complete
801     all of HTML5 revision 1320 changes.
802    
803     2008-03-08 Wakaba <wakaba@suika.fam.cx>
804    
805 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
806     tags (HTML5 revision 1320).
807    
808     2008-03-08 Wakaba <wakaba@suika.fam.cx>
809    
810 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
811    
812     2008-03-08 Wakaba <wakaba@suika.fam.cx>
813    
814 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
815     could not be reached so that it is now removed (HTML5 revision
816     1320).
817    
818     2008-03-08 Wakaba <wakaba@suika.fam.cx>
819    
820 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
821     to align with new spec text (HTML5 revision 1320).
822    
823     2008-03-08 Wakaba <wakaba@suika.fam.cx>
824    
825 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
826     have had no harm in theory.
827    
828     2008-03-08 Wakaba <wakaba@suika.fam.cx>
829    
830 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
831     case is split from other end tags for the preparation
832     of implementing HTML revision 1320.
833    
834 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
835    
836 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
837     revision 1320).
838    
839     2008-03-07 Wakaba <wakaba@suika.fam.cx>
840    
841 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
842     and "div" (HTML5 revision 1318). Add comments to where
843     |form| pointer association codes should be inserted (HTML5 revision
844     1319).
845    
846 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
847    
848     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
849    
850 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
851    
852 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
853     other token type, there are), it is replaced by a |die| statement.
854    
855     2008-03-05 Wakaba <wakaba@suika.fam.cx>
856    
857 wakaba 1.159 * HTML.pm.src: Typo fixed.
858    
859 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
860    
861     * HTML.pm.src (_tree_construction_initial): Some limited quirks
862     doctypes were not uppercased for comparison.
863    
864 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
865    
866 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
867     are added.
868    
869     2008-03-03 Wakaba <wakaba@suika.fam.cx>
870    
871 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
872     are set. Cases that are unlikely reached are noted as so.
873    
874     2008-03-03 Wakaba <wakaba@suika.fam.cx>
875    
876 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
877    
878     * mkhtmlparser.pl: Support for |!!!cp| syntax.
879    
880     2008-03-03 Wakaba <wakaba@suika.fam.cx>
881    
882 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
883     for simplicity.
884    
885     2008-03-03 Wakaba <wakaba@suika.fam.cx>
886    
887 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
888     revision 1307).
889    
890     2008-03-03 Wakaba <wakaba@suika.fam.cx>
891    
892 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
893     is set are changed (HTML5 revision 1305).
894    
895 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
896    
897 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
898     1292). Entities are not parsed in comment-like part in RCDATA
899     elements (HTML5 revision 1294). Allow bare & at the end
900     of attribute value literals (HTML5 revision 1296). More
901     quirks mode doctypes (HTML5 revision 1302). Requires spaces
902     between attributes and ban attribute names or unquoted
903     attribute values containing single or double quotes (HTML5
904     revision 1303).
905    
906     2008-03-02 Wakaba <wakaba@suika.fam.cx>
907    
908 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
909     and related errors unless it is an HTML document (though the spec
910     is unclear on whether it is applied to XHTML document).
911    
912     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
913     are added.
914    
915     2008-03-02 Wakaba <wakaba@suika.fam.cx>
916    
917 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
918    
919 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
920    
921     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
922    
923     * HTML.pm.src: |charset| in |content| attribute is
924     case-insensitive (HTML5 revision 1270).
925    
926 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
927    
928     * ContentChecker.pm: New status constants are added.
929     ($ElementDefault): |status| added.
930     (check_element): Err for non-standard or deprecated elements.
931     (_attr_status_info): For non-standard or deprecated attributes.
932    
933 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
934    
935 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
936    
937     2008-02-24 Wakaba <wakaba@suika.fam.cx>
938    
939 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
940     status information is now dispatched.
941    
942     2008-02-24 Wakaba <wakaba@suika.fam.cx>
943    
944 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
945     problem by adding some more arguments.
946    
947     2008-02-24 Wakaba <wakaba@suika.fam.cx>
948    
949 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
950     container-for-the-purpose-of-content-model element (not transparent
951     element) for |check_child_element| calling and significant
952     text flag marking. This reintroduces |<del>|-and-significant
953     problem again.
954    
955     2008-02-24 Wakaba <wakaba@suika.fam.cx>
956    
957 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
958     elements ignored for the purpose of phase changes in
959     content model checking.
960    
961 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
962    
963 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
964     was not properly managed for transparent cases.
965    
966     2008-02-23 Wakaba <wakaba@suika.fam.cx>
967    
968 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
969     and |audio| as semi-transparent elements.
970    
971     2008-02-23 Wakaba <wakaba@suika.fam.cx>
972    
973 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
974     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
975     Support for |html:object| as a semi-transparent element.
976    
977     2008-02-23 Wakaba <wakaba@suika.fam.cx>
978    
979 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
980     the tree is entirely revised to make it easier to track
981     the state of ancestors/descendants. As a result of this
982     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
983     support for content model checking for HTML elements |figure|,
984     |object|, |video|, and |audio| and checking for XML elements (and
985     some XMLNS checkings) are dropped for now. They will be
986     reimplemented in due cource.
987    
988 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
989    
990 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
991     token (HTML5 revision 1225).
992    
993     2008-02-17 Wakaba <wakaba@suika.fam.cx>
994    
995 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
996    
997     * HTML.pm.src: Save whether |meta| |content| attribute
998     contains character references or not.
999    
1000     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1001    
1002 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1003     imlement significant content checking for contents
1004     with |del| elements.
1005    
1006     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1007    
1008 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1009     elements are added.
1010     (_get_children): HTML |object| elements are now semi-transparent.
1011    
1012     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1013    
1014 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1015    
1016     * CacheManifest.pm: HTML5 revision 1211 implemented.
1017    
1018     * CacheManifest.pod: Updated.
1019    
1020 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1021    
1022     * ContentChecker.pm (check_document, check_element): Support
1023     for second argument ($onsubdoc).
1024     (_get_css_parser): Removed (now it is part of WDCC).
1025    
1026 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1027    
1028     * ContentChecker.pm (_get_css_parser): New.
1029    
1030 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1031    
1032 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1033     for checking had been used.
1034    
1035     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1036    
1037 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1038    
1039 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1040    
1041 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1042     specced in RFC 2046 and RFC 4288.
1043     (application/atom+xml): Definition added.
1044    
1045     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1046    
1047 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1048     warnings (rather than SHOULD-level errors).
1049    
1050     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1051    
1052 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1053    
1054     * HTML.pm.src: A flag for character references in attribute
1055     values are added. Set |manakai_has_reference| user data
1056     to |charset| attribute.
1057    
1058     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1059    
1060 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1061     attributes.
1062    
1063     * ContentChecker.pm (check_document): Warn if charset requirements
1064     cannot be tested.
1065    
1066 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1067    
1068     * HTML.pm.src (parse_byte_string): Detect charset
1069     by universalchardet if charset parameter is not specified.
1070    
1071     * Makefile (Charset-all, Charset-clean): New rules.
1072    
1073 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1074    
1075 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1076     of character encoding declaration and selection of encoding
1077     for HTML document.
1078    
1079     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1080    
1081 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1082     type in list context.
1083    
1084     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1085    
1086 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1087     ignoring (HTML5 revisions 1013 and 1016).
1088    
1089     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1090    
1091 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1092     white spaces, are implemented.
1093    
1094 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1095    
1096 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1097     attribute if possible.
1098    
1099     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1100    
1101 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1102     (parse_char_string): New alias for |parse_string|.
1103     (main phase): Invoking "change the encoding" algorithm if desired.
1104    
1105     * HTML.pod: Updated.
1106    
1107     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1108    
1109 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1110    
1111     * Makefile (HTML-all, HTML-clean): New.
1112    
1113     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1114    
1115     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1116    
1117 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1118    
1119     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1120    
1121 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1122    
1123 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1124    
1125     * CacheManifest.pod: New file.
1126    
1127     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1128    
1129 wakaba 1.114 * CacheManifest.pm: New module.
1130    
1131     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1132    
1133 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1134     callback.
1135    
1136     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1137    
1138 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1139    
1140 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1141    
1142 wakaba 1.111 * Makefile (clean): New rule.
1143    
1144     * NanoDOM.pm (public_id, system_id): New attributes.
1145    
1146     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1147    
1148 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1149    
1150 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1151    
1152     * ContentChecker.pm (check_document): Support for
1153     new |is_xml_root| flag.
1154     (check_element): Support for new |pluses| state.
1155     (_add_pluses): New method.
1156     (_remove_minuses): Support for new |minus| item.
1157    
1158 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1159    
1160 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1161     root element.
1162    
1163     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1164    
1165 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1166    
1167 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1168    
1169 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1170    
1171     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1172    
1173 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1174    
1175     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.103 * LangTag.pm: New module.
1178    
1179 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1180    
1181     * HTML.pm.src: Some error types were wrong.
1182    
1183 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1184    
1185 wakaba 1.101 * CSS/: New directory.
1186    
1187     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1188    
1189 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1190     for |noscript| in |head|.
1191    
1192 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1193    
1194     * URI/: New directory.
1195    
1196 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1197    
1198 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1199     number.
1200    
1201     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1202    
1203 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1204     by |&|s.
1205    
1206     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1207    
1208 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1209    
1210     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1211    
1212 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1213    
1214     * HTML.pm.src: Insertion modes are now represented in number.
1215    
1216     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1217    
1218 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1219     999) is implemented.
1220    
1221 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1222    
1223     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1224     983) is implemented.
1225    
1226 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1227    
1228     * ContentChecker.pod: New documentation.
1229    
1230     * Makefile: A rule for |ContentChecker.html| is added.
1231    
1232     * ContentChecker.pm: A pod "LICENSE" section is added.
1233    
1234     * NanoDOM.pm ($VERSION): New variable.
1235    
1236 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1237    
1238 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1239     list of allowed HTML elements.
1240    
1241     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1242    
1243 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1244     HTML elements.
1245    
1246     * URIChecker.pm (check_iri): New.
1247     (check_iri_reference): Error type for IRI reference
1248     syntax error is changed.
1249    
1250 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1253     is added.
1254     (check_document): Load appropriate module before validation.
1255    
1256     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1257    
1258 wakaba 1.88 * ContentChecker/: New directory.
1259    
1260     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1261    
1262 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1263    
1264     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1265    
1266 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1267    
1268 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1269     even if no document element is found.
1270    
1271     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1272    
1273 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1274    
1275     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1276    
1277 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1278    
1279     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1280    
1281 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1282     is split into "after html body" and "after html frameset"
1283     insertion modes. Their codes are merged with "after body"
1284     and "after frameset" codes. |$previous_insertion_mode|
1285     has been removed. "after frameset" code is
1286     merged with "in frameset" code.
1287    
1288     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1289    
1290 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1291     merged with the "in head" insertion mode.
1292    
1293     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1294    
1295 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1296     reorganized.
1297    
1298     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1299    
1300 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1301     are merged.
1302    
1303     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1304    
1305 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1306     with "in table" insertion mode.
1307    
1308     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1309    
1310 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1311     modes are merged.
1312    
1313     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1314    
1315 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1316     insertion mode!
1317    
1318     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1319    
1320 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1321     now directly invoke the handler.
1322    
1323     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1324    
1325 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1326     is merged to the "in body" insertion mode code.
1327    
1328     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1329    
1330 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1331     insertion modes are merged.
1332    
1333     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1334    
1335 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1336    
1337     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1338    
1339 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1340     instead of string constant for |{content_model_flag}|.
1341    
1342     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1343    
1344 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1345     to the "duplicate attribute" error.
1346    
1347 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1350    
1351     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1354    
1355     * HTML.pm.src: A typo is fixed.
1356    
1357 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1358    
1359 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1360     from "in HTML:xml:lang" error. Character position
1361     is now the last part of the error type in the URI error
1362     description. Report "unsupported" status for language
1363     tags, media queries, script codes, and style sheets.
1364    
1365     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1366    
1367 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1368     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1369     in XML.
1370    
1371     * NanoDOM.pm (Attr.owner_document): New attribute.
1372    
1373     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1374    
1375 wakaba 1.66 * HTML.pm.src: The character immediately following
1376     a bare |hcro| was discarded. Fix handling of
1377     entity references in attribute values.
1378    
1379     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1380    
1381 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1382     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1383     factored out. Error types |in html:#DOCTYPE|
1384     and |after html:#DOCTYPE| are merged
1385     into |DOCTYPE in the middle|. |</frameset>|
1386     in fragment parsing mode changed the
1387     insertion mode.
1388    
1389     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1390    
1391 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1392     phase is now an insertion mode. Treatments for white
1393     space character tokens were incorrect for some
1394     insertion modes. An old |meta| case was not removed.
1395    
1396     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1397    
1398 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1399     implemented (but changing the encoding is not yet:-).
1400    
1401 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1402    
1403 wakaba 1.62 * Charset/: New directory.
1404    
1405     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1406    
1407 wakaba 1.61 * H2H.pm: New Perl module (created from
1408     manakai's H2H.dis).
1409    
1410     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1411    
1412 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1413     manakai's SimpleLS.dis).
1414    
1415 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1416    
1417     * HTML.pm.src: |<!---x-->| was not processed correctly.
1418    
1419 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1420    
1421 wakaba 1.58 * HTML.pm.src: Report correct error message
1422     for |<body></div></body>|.
1423    
1424     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1425    
1426 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1427    
1428     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1429    
1430     * ContentChecker.pm: Return |{term}| list.
1431    
1432 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1433    
1434 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1435     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1436     and <title> outside of head).
1437    
1438     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1439    
1440 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1441     and private types/subtypes.
1442    
1443     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1444     URIChecker.pm, HTMLTable.pm: Error messages are now
1445     consistent; they are all listed in
1446     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1447    
1448 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1449    
1450 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1451     is now erred. |<datalist>| is implemented.
1452     Attribute checker for |<command>| and |<menu>| are
1453     added. Support for |contextmenu| global attribute
1454     is added.
1455    
1456     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1457    
1458 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1459     of Step 3 has been changed.
1460    
1461     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1462    
1463 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1464    
1465 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1466    
1467 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1468     incorrect.
1469    
1470     * ContentChecker.pm: Support for the |footer| element.
1471     Check URI syntax for space-separated URI attributes.
1472     Support for the |tabindex| attribute. Support
1473     for |datetime| attribute.
1474    
1475     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1476    
1477 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1478     character references). HTML5 revision 1.146 (white space
1479     characters before root start tag). HTML5 revision
1480     1.148 (named character references in attribute values).
1481     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1482    
1483     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1484    
1485     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1486 wakaba 1.49 in <head>).
1487    
1488     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1489    
1490 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1491     in body).
1492    
1493     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1494    
1495     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1496    
1497 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1498     or RCDATA in fragment parsing mode). HTML5 revision
1499     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1500    
1501     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1502    
1503     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1504     for <pre> and <textarea>).
1505    
1506 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1507    
1508 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1509    
1510     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1511    
1512 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1513    
1514     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1515    
1516 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1517     HTML5 revision 917 (conforming bare &).
1518    
1519     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1520    
1521 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1522    
1523     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1524     HTML5 revision 915 (<nobr>).
1525    
1526     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1527    
1528 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1529    
1530     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1531     New attributes.
1532    
1533     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1536     in tags).
1537    
1538     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1539    
1540 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1541    
1542     * HTML.pm.src: |$entity_char| is removed and
1543     requires |Whatpm::_NamedEntityList| instead.
1544     HTML5 revision 898 (refc), except that lack of refc
1545     is parse error.
1546    
1547     * mkentitylist.pl: New script.
1548    
1549     * Makefile (all): |_NamedEntityList.pm| is added.
1550     (_NamedEntityList.pm, Entities.html): New rules.
1551    
1552     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1553    
1554 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1555     were ignored and U+000D immediately following another
1556     U+000D was not converted to U+000A.
1557    
1558     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1559    
1560 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1561     nodes before appended). Parser was not ready for NULL
1562     parse error and escape flag.
1563    
1564     * NanoDOM.pm (adopt_node): New.
1565    
1566     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1567    
1568 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1569     CDATA and RCDATA). Note that current implementation
1570     is simply repeating what the spec says and it is maybe not
1571     a best way to do it.
1572    
1573     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1574    
1575 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1576     the form element if a descendant element without implied end tag has
1577     still been open).
1578    
1579     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1580    
1581 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1582     space characters non-conforming).
1583    
1584     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1585    
1586 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1587     HTML5 revision 869 (C1 character references).
1588    
1589     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1590    
1591 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1592     a |textarea| is removed).
1593    
1594 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1595    
1596     * NanoDOM.pm (get_attribute_node_ns): New method.
1597    
1598     * ContentChecker.pm: |script| |async| and |defer|
1599     no longer require |src|. |async| MUST NOT be
1600     specified if |defer|. (HTML5 revision 858).
1601    
1602 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1603    
1604     * HTML.pm.src: |<form><form>| went to inifinite loop.
1605    
1606 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1607    
1608 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1609     as a document element) flag on.
1610     (new): Removed.
1611     (check_document): New method.
1612    
1613     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1614    
1615 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1616     were incorrect.
1617    
1618     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1619    
1620 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1621     error position in the |type| option of the error.
1622    
1623     * HTMLTable.pm (form_table): The |$onerror| parameter
1624     is now optional. Some bugs are fixed.
1625    
1626     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1627    
1628 wakaba 1.27 * HTMLTable.pm: New module.
1629    
1630     * ContentChecker.pm (table): Invoke table model error checker.
1631    
1632     * NanoDOM.pm (first_child, get_attribute_ns): New.
1633    
1634 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1635    
1636 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1637     (link/@rel, a/@rel, area/@rel): Use new checker.
1638    
1639     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1640    
1641     * _LinkTypeList.pm: New file.
1642    
1643     * mklinktypelist.pl: New file.
1644    
1645     * .cvsignore: |RelExtensions.html| added.
1646    
1647     * NanoDOM.pm (child_nodes): Returns an empty array
1648     for non-child-containing node types.
1649     (text_content): New attribute.
1650    
1651     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1652    
1653 wakaba 1.25 * IMTChecker.pm: New module.
1654    
1655     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1656     to test parameter value validity.
1657    
1658     * HTML.pm.src ($style_start_tag): Attributes were
1659     discarded.
1660    
1661 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1662    
1663 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1664    
1665     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1666    
1667 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1668     and RFC 3987 are listed (not all testable items are checked yet).
1669    
1670     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1671    
1672 wakaba 1.22 * URIChecker.pm: New module.
1673    
1674 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1675    
1676 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1677     or associated to some placeholder.
1678     ($ElementDefault): Warn unknown attributes
1679     for unknown elements as "attribute not supported".
1680     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1681     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1682     (|a| attribute checker): Reimplemented.
1683    
1684     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1685    
1686 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1687     ($HTMLAttrChecker): Event handler content attributes
1688     are added.
1689     (link, embed): Required attribute is now checked.
1690     (embed): Unknown local attributes are no longer warned.
1691    
1692     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1693    
1694 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1695     ($HTMLIMTAttrChecker): New checker.
1696     (link@rel, link@href, link@type, style@type,
1697     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1698     object@data, object@type, source@src, source@type, area@alt,
1699     area@shape, area@coords, area@href,
1700     area@ping, area@rel, area@type, script@src,
1701     script@defer, script@async, script@type): Checkers added.
1702    
1703     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1704    
1705 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1706    
1707 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1708    
1709 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1710     attributes. Report an error if |Element.prefix|
1711     is |xmlns|.
1712    
1713     * NanoDOM.pm (prefix): New attribute.
1714    
1715     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1716    
1717 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1718     mode, action for |<iframe>| was missing.
1719    
1720     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1721    
1722 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1723     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1724    
1725     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1726    
1727 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1728     ($HTMLURIAttrChecker): Placeholder.
1729     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1730     New.
1731    
1732     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1733    
1734 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1735     for global attributes, |html|, |base|, |style|, and |meta|.
1736    
1737     * NanoDOM.pm (insert_before): Weaken reference
1738     to the parent node.
1739     (Attr::new): Set |owner_element| attribute.
1740     (namespace_uri, manakai_local_name): New attribute implementations.
1741     (owner_element): New attribute.
1742    
1743     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1744    
1745 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1746     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1747     $Element->{$HTML_NS}->{''}): New.
1748     (check_element): Invoke attrs_checker for each element.
1749    
1750 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1751    
1752 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1753    
1754     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1755    
1756 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1757     element type testings.
1758    
1759     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1760    
1761 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1762     error when an element type is put in the "minus" list
1763     and the element type is not allowed explicitly in the particular
1764     element content model.
1765     (html:a checker): New checker.
1766     (html:details, html:datagrid): New checkers.
1767     (html:legend): New checker.
1768    
1769     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1770    
1771 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1772    
1773     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1774    
1775 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1776     checker.
1777     (html:dd checker): New checker.
1778     (html:q, html:em, html:strong, html:small,
1779     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
1780    
1781     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1782    
1783 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
1784     from |$ElementDefault->{checker}|.
1785     ($ElementDefault->{checker}): Throw an error that
1786     the element type is not supported by the checker.
1787     ($HTMLMetadataElement): |html:base| was missing.
1788     ($HTMLEmptyChecker): Don't throw an error
1789     for inter-element whitespace nodes.
1790     (html:html checker): Errors were not
1791     thrown even if |html:head| and/or |html:body|
1792     children were missing.
1793     (html:head checker): An error was not
1794     thrown if <meta charset> appered after other
1795     elements.
1796    
1797 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
1798    
1799     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
1800     and |noscript| elements are implemented.
1801     (new): New method.
1802    
1803 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
1804    
1805 wakaba 1.4 * ContentChecker.pm: New module.
1806    
1807     2007-05-04 Wakaba <wakaba@suika.fam.cx>
1808    
1809 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
1810     document_element, manakai_local_name,
1811     manakai_element_type_match): New method.
1812    
1813     2007-05-03 Wakaba <wakaba@suika.fam.cx>
1814    
1815     * HTML.pm.src: Replace decimal and hexadecimal numeric
1816     entities in C1 range using Windows-1252 mapping. Bare LF
1817     did not count as new line for error reporting.
1818    
1819 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
1820    
1821     * NanoDOM.pm (DOMImplementation): New class.
1822     (append_child): Weaken the |parent_node| reference.
1823     (create_element_ns, Element new): Set the |owner_document|
1824     reference.
1825     (implementation): New attribute.
1826     (owner_document, local_name, namespace_uri): New attributes.
1827    
1828     * HTML.pm.src (parse_string): Line and column numbers
1829     are now provided to error handler.
1830     (!!!parse-error): Short descriptions are added.
1831     (_construct_tree): Split into three methods; support
1832     for innerHTML mode.
1833     (set_inner_html): New method.
1834    
1835 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
1836    
1837     * NanoDOM.html: Documentation is added.
1838    
1839     * HTML.pod, ContentType.html: Documentation is revised.
1840    
1841     * .cvsignore: Pod2html temporary files are added.
1842    
1843     * Makefile: Make |NanoDOM.html|.
1844    
1845     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1846    
1847     * HTML.pm.src (parse_string): New method.
1848     (get_inner_html): Renamed from |inner_html|.
1849    
1850     * Makefile: A rule for |HTML.html| is added.
1851    
1852     * HTML.pod: New documentation.
1853    
1854     2007-05-01 Wakaba <wakaba@suika.fam.cx>
1855    
1856     * NanoDOM.pm (last_child, previous_sibling): New attributes.
1857     (clone_node): Attribute nodes were not completely copied.
1858    
1859     * HTML.pm.src: Many bugs are fixed.
1860    
1861     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1862    
1863     * HTML.pm.src: Some typos are fixed.
1864    
1865     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1866    
1867     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
1868     are removed.
1869    
1870     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
1871    
1872     * HTML-consume-entity.src: Removed.
1873    
1874     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1875    
1876     * HTML.pm.src: Tokenizer's handling on hexadecimal
1877     numeric entities are rewritten.
1878    
1879     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1880    
1881     * HTML.pm.src: Some tokenizer bugs are fixed.
1882    
1883     2007-04-30 Wakaba <wakaba@suika.fam.cx>
1884    
1885     * HTML.pm.src: The tree construction stage is implemented.
1886    
1887     * mkhtmlparser.pl: New macros are added.
1888    
1889     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1890    
1891     * ContentType.pm: A note on bug in the specification
1892     is removed since it's been now fixed.
1893    
1894     * .cvsignore: New file.
1895    
1896     2007-04-28 Wakaba <wakaba@suika.fam.cx>
1897    
1898     * HTML.pm.src, HTML-consume-entity.src: New files.
1899    
1900     * Makefile (HTML.pm): New rule.
1901    
1902     * mkhtmlparser.pl: New script.
1903    
1904     2007-04-25 Wakaba <wakaba@suika.fam.cx>
1905    
1906     * Makefile: New file.
1907    
1908     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1909    
1910     * ContentType.pm: An error in pod is fixed.
1911    
1912     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1913    
1914     * ContentType.pm: Documentation is added.
1915    
1916     2007-04-24 Wakaba <wakaba@suika.fam.cx>
1917    
1918     * ContentType.pm: New Perl module.
1919    
1920     * ChangeLog: New file.
1921    
1922    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24