/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Contents of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.335 - (hide annotations) (download)
Thu Oct 2 10:59:04 2008 UTC (16 years, 9 months ago) by wakaba
Branch: MAIN
Changes since 1.334: +7 -0 lines
++ whatpm/t/ChangeLog	2 Oct 2008 10:58:49 -0000
2008-10-02  Wakaba  <wakaba@suika.fam.cx>

	* ContentType.t: More test cases on BOM are added.  Some test
	results updated according to HTML5 revision 1927.  Tests on the
	returned values in the list context are added.

++ whatpm/Whatpm/ChangeLog	2 Oct 2008 10:57:47 -0000
2008-10-02  Wakaba  <wakaba@suika.fam.cx>

	* ContentType.pm (get_sniffed_type): Support for the "better"
	content sniffing (HTML5 revision 1927).  In a case the official
	type was not returned when the method is invoked in the list
	context.

1 wakaba 1.335 2008-10-02 Wakaba <wakaba@suika.fam.cx>
2    
3     * ContentType.pm (get_sniffed_type): Support for the "better"
4     content sniffing (HTML5 revision 1927). In a case the official
5     type was not returned when the method is invoked in the list
6     context.
7    
8 wakaba 1.334 2008-09-22 Wakaba <wakaba@suika.fam.cx>
9    
10     * HTML.pm.src: Character references for non-space C0 characters,
11     including U+000B VT, DEL character, noncharacter code points, are
12     now converted to the U+FFFD character (cf. HTML5 revision 2138).
13    
14 wakaba 1.330 2008-09-21 Wakaba <wakaba@suika.fam.cx>
15    
16 wakaba 1.333 * ContentChecker.pm: |form=""| check support added.
17    
18     2008-09-21 Wakaba <wakaba@suika.fam.cx>
19    
20 wakaba 1.332 * ContentChecker.pm: |contextmenu| validness is now checked using
21     |id| and |id_type| properties, and |menu| property is removed.
22    
23     2008-09-21 Wakaba <wakaba@suika.fam.cx>
24    
25 wakaba 1.331 * ContentChecker.pm: Prepare for |form| |name| attribute's
26     duplication checking.
27    
28     2008-09-21 Wakaba <wakaba@suika.fam.cx>
29    
30 wakaba 1.330 * HTML.pm.src (parse_byte_stream): Support (or non-support) for
31     unsupported charset="" parameter value (HTML5 revision 2131).
32    
33 wakaba 1.325 2008-09-20 Wakaba <wakaba@suika.fam.cx>
34    
35 wakaba 1.329 * HTML.pm.src: Reminding places where U+000B is allowed as a space
36     character is fixed (cf. HTML5 revision 1738).
37    
38     * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
39     space characters (HTML5 revision 1738).
40    
41     2008-09-20 Wakaba <wakaba@suika.fam.cx>
42    
43 wakaba 1.328 * HTML.pm.src: The "anything else" case for the "after after body"
44     insertion mode was not updated to swtich to the "in body"
45     insertion mode. U+000B is no longer a space character for the
46     purpose of tree construction phase (HTML5 revision 1738).
47    
48     2008-09-20 Wakaba <wakaba@suika.fam.cx>
49    
50 wakaba 1.327 * HTML.pm.src: U+000B is no longer a space character (HTML5
51     revision 1738).
52    
53     2008-09-20 Wakaba <wakaba@suika.fam.cx>
54    
55 wakaba 1.326 * ContentType.pm: 0x0B is no longer a space character (HTML5
56     revision 1738).
57    
58     * HTML.pm.src: U+000B is no longer a space character for the
59     algorithm for extracting an encoding from a Content-Type (HTML5
60     revision 1738).
61    
62     2008-09-20 Wakaba <wakaba@suika.fam.cx>
63    
64 wakaba 1.325 * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
65    
66 wakaba 1.320 2008-09-18 Wakaba <wakaba@suika.fam.cx>
67    
68 wakaba 1.324 * LangTag.pm: Add checks for remaining requirements from RFC 4646.
69    
70     * mklangreg.pl: Sort 'Prefix' values by their length, to ease
71     matching.
72    
73     2008-09-18 Wakaba <wakaba@suika.fam.cx>
74    
75 wakaba 1.323 * LangTag.pm: Warn for private use language subtags. Error level
76     typos fixed. Support for Suppress-Script field.
77    
78     * mklangreg.pl: Support for dumping of nested structure.
79    
80     2008-09-18 Wakaba <wakaba@suika.fam.cx>
81    
82 wakaba 1.322 * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
83     recommended case as per RFC 4646.
84    
85     2008-09-18 Wakaba <wakaba@suika.fam.cx>
86    
87 wakaba 1.321 * LangTag.pm (check_rfc4646_langtag): New method.
88    
89     2008-09-18 Wakaba <wakaba@suika.fam.cx>
90    
91 wakaba 1.320 * mklangreg.pl: New script.
92    
93     * Makefile: Updated for creation of the module for language subtag
94     registry.
95    
96 wakaba 1.315 2008-09-16 Wakaba <wakaba@suika.fam.cx>
97    
98 wakaba 1.319 * Makefile: WebIDL.html added.
99    
100     * WebIDL.pod: New documentation.
101    
102     2008-09-16 Wakaba <wakaba@suika.fam.cx>
103    
104 wakaba 1.318 * WebIDL.pm: Checker's error types are redefined.
105    
106     2008-09-16 Wakaba <wakaba@suika.fam.cx>
107    
108 wakaba 1.317 * WebIDL.pm: Parser's error types are redefined. Some forward
109     compatible parsing bugs are fixed. Some unreachable codes are
110     commented out.
111    
112     2008-09-16 Wakaba <wakaba@suika.fam.cx>
113    
114 wakaba 1.316 * WebIDL.pm: Support for the reminding extended attributes are
115     added. It does not satisfy the definition that a forward
116     interface declaration has an extended attribute. It seems that
117     unless explicitly allowed multiple extended attributes with the
118     same name is not allowed, though it is not explicitly mentioned in
119     the spec.
120    
121     2008-09-16 Wakaba <wakaba@suika.fam.cx>
122    
123 wakaba 1.315 * WebIDL.pm: Unescapes extended attribute names and extended
124     attribute identifiers. Preserve whether an extended attribute has
125     an argument list of not. Support for extended attributes:
126     Constructor, ExceptionConsts, IndexGetter, IndexSetter,
127     NameGetter, NameSetter, and Null.
128     (has_argument_list): New attribute.
129     (idl_text): Stringifies argument lists, if any, even if it is
130     empty.
131    
132 wakaba 1.310 2008-09-15 Wakaba <wakaba@suika.fam.cx>
133    
134 wakaba 1.314 * HTML.pm.src: New state |PCDATA_STATE|. Use an empty string for
135     |{s_kwd}| in DATA_STATE as default.
136    
137     2008-09-15 Wakaba <wakaba@suika.fam.cx>
138    
139 wakaba 1.313 * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
140     by |{s_kwd}| in DATA_STATE.
141    
142     2008-09-15 Wakaba <wakaba@suika.fam.cx>
143    
144 wakaba 1.312 * HTML.pm.src: Shorten keys.
145    
146     2008-09-15 Wakaba <wakaba@suika.fam.cx>
147    
148 wakaba 1.311 * HTML.pm.src: Remove checking for control character, surrogate
149     pair, or noncharacter code points and non-Unicode code
150     points (they should be handled by Whatpm::Charset::UnicodeChecker).
151     (parse_char_stream): Support for the |$get_wrapper| argument and
152     character stream error handlers.
153    
154     2008-09-15 Wakaba <wakaba@suika.fam.cx>
155    
156 wakaba 1.310 * ContentChecker.pm: Don't call |loda_ns_module|
157     for null-namespace elements/attributes.
158    
159     * HTML.pm.src: Fact out $disallowed_control_chars
160     as a hash.
161    
162 wakaba 1.300 2008-09-14 Wakaba <wakaba@suika.fam.cx>
163    
164 wakaba 1.309 * HTML.pm.src: Regexp typo fixed. |{prev_char}|
165     and |{next_char}| initializations are moved to initialization
166     method. |{read_until}| now supports buffering. Sync |set_inner_html|
167     with |parse_char_stream|.
168    
169     2008-09-14 Wakaba <wakaba@suika.fam.cx>
170    
171 wakaba 1.308 * HTML.pm.src (parse_char_stream): Make |set_next_char|
172     invoke |manakai_read_until|, not only |read|, where
173     possible, to decrease the number of |read| method calls.
174    
175     * mkhtmlparser.pl: Related changes to the aforementioned
176     modification.
177    
178     2008-09-14 Wakaba <wakaba@suika.fam.cx>
179    
180 wakaba 1.307 * HTML.pm.src: Use |read| instead of |getc|. |set_inner_html|
181     would report character error from now.
182    
183     2008-09-14 Wakaba <wakaba@suika.fam.cx>
184    
185 wakaba 1.306 * HTML.pm.src: White-space-leaded non-white-space character
186     tokens in "before head insertion mode" was not
187     correctly handled.
188     (set_inner_html): Reimplemented using CharString decodehandle
189     class. Support for $get_wrapper argument. Support
190     for |{read_until}| feature.
191    
192     2008-09-14 Wakaba <wakaba@suika.fam.cx>
193    
194 wakaba 1.305 * HTML.pm.src: Make a "bare ero" error for unknown
195     entities point the "&" character.
196    
197     2008-09-14 Wakaba <wakaba@suika.fam.cx>
198    
199 wakaba 1.304 * HTML.pm.src: It turns out that U+FFFD don't have to
200     be added to the list of excluded characters.
201    
202     2008-09-14 Wakaba <wakaba@suika.fam.cx>
203    
204 wakaba 1.303 * HTML.pm.src ($char_onerror): Have character decoder's |line|
205     and |column| a higher priority than the one set by the
206     tokenizer's input handler.
207     ($self->{read_until}): Exclude U+FFFD (but this might
208     not be necessary, since now we do line/column fixup in
209     the character decode handle).
210    
211     2008-09-14 Wakaba <wakaba@suika.fam.cx>
212    
213 wakaba 1.302 * HTML.pm.src: Use |{read_until}| where possible.
214    
215     2008-09-14 Wakaba <wakaba@suika.fam.cx>
216    
217 wakaba 1.301 * HTML.pm.src: Change |{getc_until}| to |{read_until}|
218     and |manakai_getc_until| to |manakai_read_until| to
219     reduce the number of string copies.
220    
221     2008-09-14 Wakaba <wakaba@suika.fam.cx>
222    
223 wakaba 1.300 * HTML.pm.src (parse_char_string): Use newly created
224     |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
225     standard feature to |open| a string as a filehandle,
226     since Perl's string filehandle seems not supporting |ungetc|
227     method correctly.
228     (parse_char_stream): Define |{getc_until}| method.
229     (DATA_STATE): Experimental support for |getc_until| feature.
230    
231 wakaba 1.292 2008-09-13 Wakaba <wakaba@suika.fam.cx>
232    
233 wakaba 1.299 * HTML.pm.src: Check points added to newly added branches.
234    
235     2008-09-13 Wakaba <wakaba@suika.fam.cx>
236    
237 wakaba 1.298 * HTML.pm.src: Remove |{char}|, which is no longer used.
238     Remove |{entity_in_attr}| and |{last_attribute_value_state}|
239     and replaced by |{prev_state}|.
240    
241     * mkhtmlparser.pl: Remove |{char}| feature.
242     Remove |!!!back-next-input-character;| macro.
243    
244     2008-09-13 Wakaba <wakaba@suika.fam.cx>
245    
246 wakaba 1.297 * HTML.pm.src: Finally we get rid of all the inner loops. Remove
247     entity related tokenizer states in favor of new states
248     implementing the consume character reference algorithm.
249    
250     2008-09-13 Wakaba <wakaba@suika.fam.cx>
251    
252     * HTML.pm.src: "Consume a character reference" algorithm is
253 wakaba 1.296 now implemented as a tokenizer's state, rather than
254     a method, with minimum changes (more changes will
255     be made, in due course). "Bogus comment state"'s inner
256     loop gets removed.
257    
258     2008-09-13 Wakaba <wakaba@suika.fam.cx>
259    
260 wakaba 1.297 * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
261 wakaba 1.295 into their own tokenizer states.
262    
263     2008-09-13 Wakaba <wakaba@suika.fam.cx>
264    
265 wakaba 1.294 * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
266     is split into three states.
267    
268     2008-09-13 Wakaba <wakaba@suika.fam.cx>
269    
270 wakaba 1.293 * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
271     itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
272     no longer does the tokenizer have to push back next input
273     characters in those states.
274    
275     2008-09-13 Wakaba <wakaba@suika.fam.cx>
276    
277 wakaba 1.292 * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
278     into four states so that no longer does the tokenizer have to push
279     back next input characters in that state.
280    
281 wakaba 1.291 2008-09-11 Wakaba <wakaba@suika.fam.cx>
282    
283     * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
284     which can be used to insert some wrapper between the character
285     stream handle and the tokenizer. (It is currently not supported
286     for |set_inner_html| for |Element|s).
287    
288 wakaba 1.289 2008-09-10 Wakaba <wakaba@suika.fam.cx>
289    
290 wakaba 1.290 * HTML.pm.src: Ignore punctuations in charset names.
291    
292     2008-09-10 Wakaba <wakaba@suika.fam.cx>
293    
294 wakaba 1.289 * ContentChecker.pm: Support for charset-layer error levels.
295    
296     * HTML.pm.src: Don't specify |text| argument for the
297     |chardecode:fallback| error, since it is not the encoding
298     being used alternatively.
299    
300 wakaba 1.288 2008-09-06 Wakaba <wakaba@suika.fam.cx>
301    
302     * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
303    
304 wakaba 1.286 2008-08-31 Wakaba <wakaba@suika.fam.cx>
305    
306 wakaba 1.287 * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
307    
308     2008-08-31 Wakaba <wakaba@suika.fam.cx>
309    
310 wakaba 1.286 * HTML.pm.src: Bug fix and sync with the spec with regard
311     to after after frameset insertion mode processing (HTML5
312     revision 1909). Note that the implementation was wrong
313     per the old spec before the r1909 changes.
314    
315 wakaba 1.278 2008-08-30 Wakaba <wakaba@suika.fam.cx>
316    
317 wakaba 1.285 * HTMLTable.pm: scope=auto algorithm fix synced with the
318     spec (HTML5 revision 2093).
319     ($process_row): Algorithm step numbers synced with the
320     spec (HTML5 revision 2092).
321    
322     2008-08-30 Wakaba <wakaba@suika.fam.cx>
323    
324 wakaba 1.284 * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
325     revision 2094).
326    
327     2008-08-30 Wakaba <wakaba@suika.fam.cx>
328    
329     * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
330    
331     2008-08-30 Wakaba <wakaba@suika.fam.cx>
332    
333 wakaba 1.283 * HTML.pm.src: '"' and "'" at the end of attribute
334     name (after another attribute) now raise parse error (HTML5
335     revision 2123). Empty unquoted attribute values are no
336     longer allowed (HTML5 revision 2122).
337    
338     2008-08-30 Wakaba <wakaba@suika.fam.cx>
339    
340 wakaba 1.282 * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
341     revision 2130).
342    
343     2008-08-30 Wakaba <wakaba@suika.fam.cx>
344    
345 wakaba 1.281 * ContentChecker.pm: |xml:lang| attribute value must be same
346     as |lang| attribute value for HTML elements (HTML5 revision 2062
347     and so on).
348    
349     2008-08-30 Wakaba <wakaba@suika.fam.cx>
350    
351 wakaba 1.280 * ContentChecker.pm: Error level definition for |xml_id_error|
352     was missing.
353    
354     * URIChecker.pm: The end of the URL should be marked as the
355     error location for an empty path error. The position
356     between the userinfo and the port components should be
357     marked as the error location for an empty host error.
358    
359     2008-08-30 Wakaba <wakaba@suika.fam.cx>
360    
361 wakaba 1.279 * URIChecker.pm: Set parameters representing where in the
362     value the error occurs for errors. Report unknown
363     address format error in warning level, since address
364     formats are rarely added. Path segments starting with "/.."
365     were misinterpreted as a dot-segment.
366    
367     2008-08-30 Wakaba <wakaba@suika.fam.cx>
368    
369 wakaba 1.278 * URIChecker.pm (check_iri_reference): Requires
370     |Message::DOM::DOMImplementation|.
371    
372 wakaba 1.277 2008-08-29 Wakaba <wakaba@suika.fam.cx>
373    
374     * IMTChecker.pm: Updated for the new error reporting architecture.
375    
376     * ContentChecker.pm: Error levels for IMTs are added.
377    
378 wakaba 1.276 2008-08-17 Wakaba <wakaba@suika.fam.cx>
379    
380     * H2H.pm (_shift_token): Support for unquoted HTML attribute
381     values.
382    
383 wakaba 1.274 2008-08-16 Wakaba <wakaba@suika.fam.cx>
384    
385 wakaba 1.275 * CacheManifest.pm: Support for new style of error
386     reports.
387    
388     * HTML.pm.src: Set line=1, column=1 to the document node.
389    
390     2008-08-16 Wakaba <wakaba@suika.fam.cx>
391    
392 wakaba 1.274 * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
393     and URL checkers. Support for more error levels for bogus
394     langauge tag and URL "standards".
395    
396     * LangTag.pm, URIChecker.pm: Support for new style error
397     level reporting.
398    
399 wakaba 1.271 2008-08-15 Wakaba <wakaba@suika.fam.cx>
400    
401 wakaba 1.273 * ContentChecker.pm: Support for RDF/XML error levels.
402    
403     * HTMLTable.pm, RDFXML.pm: Support for new style of error level
404     specifying. Error types are revised.
405    
406     2008-08-15 Wakaba <wakaba@suika.fam.cx>
407    
408 wakaba 1.272 * ContentChecker.pm: All error reporting method calls are
409     renewed.
410    
411     2008-08-15 Wakaba <wakaba@suika.fam.cx>
412    
413 wakaba 1.271 * HTML.pm.src: All error type names and "text" parameters
414     are revised. Use new style for "level" specification.
415    
416     * mkhtmlparser.pl: Use new style for "level" specification.
417    
418 wakaba 1.268 2008-08-03 Wakaba <wakaba@suika.fam.cx>
419    
420 wakaba 1.270 * WebIDL.pm (parse_char_string): Simplified error
421     reporting process for broken ignored valuetype definition.
422     (Valuetype idl_text): Support for special "DOMString" name.
423    
424     2008-08-03 Wakaba <wakaba@suika.fam.cx>
425    
426 wakaba 1.269 * WebIDL.pm ($get_scoped_name): Append "::::" if the last
427     terminal of the ScopedName is "DOMString", such that whether
428     the last part of the scoped name is "DOMString" or "_DOMString"
429     later. It is necessary to determine whether a |typedef|
430     definition should be ignored or not.
431     (parse_char_string): Unescape the identifier of
432     exception members.
433     ($resolve): Return undef for builtin types and sequence<T>
434     types (we might not have to do this, however...).
435     (check): Support checking for Exceptions, Valuetypes,
436     and Typedefs.
437     ($serialize_type): Support for "DOMString::::" syntax.
438     (Typedef idl_text): Output Type as "DOMString" if it
439     is really "DOMString" (i.e. its internal representation
440     is "::DOMString::").
441    
442     2008-08-03 Wakaba <wakaba@suika.fam.cx>
443    
444 wakaba 1.268 * WebIDL.pm ($resolve): New code, based on resolve code
445     for constant types in the |check| method.
446     (check): Support for checking of attributes, operations, and
447     arguments.
448     (Attribute/Operation idl_text): Exception names in getraises,
449     setraises, and raises clauses is serizlied by |$serialize_type|
450     code.
451    
452 wakaba 1.264 2008-08-02 Wakaba <wakaba@suika.fam.cx>
453    
454 wakaba 1.267 * WebIDL.pm ($integer): Order of selections are changed to match
455     hexadecimal numbers (the original pattern, taken from the spec,
456     was not work for hexadecimal numbers, because the "0" prefix
457     matches to the [0-7]* part (as an empty string) and therefore
458     it does not match with remaining "x..." part of a "0x..." integer
459     literal.
460     ($get_type): It now returns a string, not an array reference,
461     for regular types and |sequence| types (i.e. it in any case
462     returns a string).
463     ($get_next_token): The second item in the array that represents
464     a integer or float token is now a Perl number value, not the
465     original string representation of the number.
466     (check): Support for const value consistency checking.
467     No extended attribute is defined for constants.
468     (Node subclasses): Use simple strings rather than array references
469     for default data type values.
470     ($serialize_type): Type values are now simple strings.
471     (value): If the new attribute value is a false value, then
472     a FALSE value is set to the attribute.
473    
474     2008-08-02 Wakaba <wakaba@suika.fam.cx>
475    
476 wakaba 1.266 * WebIDL.pm ($get_scoped_name): Now scoped names are stored
477     in its stringified format ("scoped name" as defined in the
478     spec). Note that future version of this module should not use
479     array references for type values and the |type_text| attribute
480     should be made obsolete.
481     (parse_char_string): Unescape attribute names.
482     (check): Support for checking of whether inherited interfaces
483     are actually defined or not. Support for checking of whether
484     interface member identifiers are duplicated or not.
485     ($serialize_type): Scoped names are returned as is. A future
486     version of this code should escape identifiers other than "DOMString",
487     otherwise the idl_text would be non-conforming.
488    
489     2008-08-02 Wakaba <wakaba@suika.fam.cx>
490    
491 wakaba 1.264 * WebIDL.pm (parse_char_string): Set line/column numbers
492 wakaba 1.265 to generated nodes. Unescape identifiers. Extended attributes
493     for Definition's were ignored.
494     (append_child): Set |parent_node| attribute.
495     (parent_node): New attribute.
496     (check): Support interface/exception members. Support
497     extended attributes. Support definition identifier uniqueness
498     constraint.
499     (qualified_name): New attribute.
500     (Interface/Exception idl_text): Extended attributes were
501     not prepended to the returned text.
502    
503     2008-08-02 Wakaba <wakaba@suika.fam.cx>
504    
505     * WebIDL.pm (parse_char_string): Set line/column numbers
506 wakaba 1.264 to interface object experimentally. s/shift/pop/g, shift
507     would make things wrong. Support for interface forward
508     declarations was missing. Broken interface declarations
509     with no block were not ignored entirely.
510     (Whatpm::WebIDL::Node): New abstract class. This class
511     makes things easier.
512     (child_nodes): New attribute. Unlike DOM's attribute with
513     same name, this attribute returns a dead list of nodes for
514     simplicity.
515     (get_user_data, set_user_data): New methods.
516     (Module idl_text): A SPACE character should be inserted
517     before the |{| character.
518     (Interface idl_text): Support for interface forward declarations.
519     (is_forward_declaration): New attribute.
520    
521 wakaba 1.259 2008-07-19 Wakaba <wakaba@suika.fam.cx>
522    
523 wakaba 1.263 * WebIDL.pm (type_text): Better serializer.
524    
525     2008-07-19 Wakaba <wakaba@suika.fam.cx>
526    
527 wakaba 1.262 * WebIDL.pm: Revise forward-compatible parsing so that
528     it now can handle broken extended attributes and as such.
529    
530     2008-07-19 Wakaba <wakaba@suika.fam.cx>
531    
532 wakaba 1.261 * WebIDL.pm: Real support for extended attributes.
533     Support for extended attributes with arguments.
534    
535     2008-07-19 Wakaba <wakaba@suika.fam.cx>
536    
537 wakaba 1.260 * WebIDL.pm: Support for |exception| syntax.
538     (Interface->idl_text): Tentative support for inheritances.
539    
540     2008-07-19 Wakaba <wakaba@suika.fam.cx>
541    
542 wakaba 1.259 * WebIDL.pm: Hierarchical scoped name support was broken.
543     Support for raises, setraises, and getraises syntaxes.
544    
545 wakaba 1.258 2008-07-18 Wakaba <wakaba@suika.fam.cx>
546    
547     * WebIDL.pm: Support for |idl_text| attribute, version 1 (no
548     proper support for types, extended attributes, and exceptions yet).
549     WebIDL parser, version 1 (no support for exceptions yet,
550     no proper support for extended attributes yet).
551    
552 wakaba 1.257 2008-07-09 Wakaba <wakaba@suika.fam.cx>
553    
554     * WebIDL.pm (parse_char_string): Support for basic attribute syntax.
555    
556 wakaba 1.255 2008-06-29 Wakaba <wakaba@suika.fam.cx>
557    
558 wakaba 1.256 * WebIDL.pm: Support for valuetype and const.
559    
560     2008-06-29 Wakaba <wakaba@suika.fam.cx>
561    
562 wakaba 1.255 * WebIDL.pm: New module.
563    
564 wakaba 1.254 2008-06-15 Wakaba <wakaba@suika.fam.cx>
565    
566     * Makefile (Entities.html): URI changed.
567    
568 wakaba 1.253 2008-06-08 Wakaba <wakaba@suika.fam.cx>
569    
570     * HTML.pm.src: Support for ruby parsing (HTML5 revision 1704).
571    
572 wakaba 1.251 2008-06-01 Wakaba <wakaba@suika.fam.cx>
573    
574 wakaba 1.252 * HTML.pm.src (_get_next_token): A parse error was missing.
575    
576     2008-06-01 Wakaba <wakaba@suika.fam.cx>
577    
578 wakaba 1.251 * mklinktypelist.pl: rel=contact is no longer part of the HTML5
579     spec (commented out). (HTML5 revision 1711).
580    
581 wakaba 1.248 2008-05-25 Wakaba <wakaba@suika.fam.cx>
582    
583 wakaba 1.250 * ContentType.pm: Drop support for UTF-32 (HTML5 revision 1701).
584    
585     * HTML.pm.src: UTF-16BE and UTF-16LE should be considered
586     as UTF-16 (HTML5 revision 1701).
587    
588     2008-05-25 Wakaba <wakaba@suika.fam.cx>
589    
590 wakaba 1.249 * HTML.pm.src: Support for <noframes> in <head> (HTML5 revision
591     1692).
592    
593     2008-05-25 Wakaba <wakaba@suika.fam.cx>
594    
595     * HTML.pm.src: The secondary insertion mode used when switching
596     to foreign content is the "in body" insertion mode (HTML5 revision
597     1696).
598    
599     2008-05-25 Wakaba <wakaba@suika.fam.cx>
600    
601 wakaba 1.248 * HTML.pm.src: Don't raise parse error for <isindex/> (HTML5
602     revision 1697).
603    
604 wakaba 1.239 2008-05-24 Wakaba <wakaba@suika.fam.cx>
605    
606 wakaba 1.247 * HTML.pm.src: Support for end-of-file token in foreign content
607     insertion mode (HTML5 revision 1693). Update SVG camelCase
608     attribute list (HTML5 revision 1700). <textarea> closes
609     </select> (HTML5 revision 1699). More start tags close in
610     foreign content insertion mode (HTML5 revision 1698).
611    
612     2008-05-24 Wakaba <wakaba@suika.fam.cx>
613    
614 wakaba 1.246 * HTML.pm.src: ";" is not part of charset name (HTML5 revision 1665).
615    
616     2008-05-24 Wakaba <wakaba@suika.fam.cx>
617    
618 wakaba 1.245 * HTML.pm.src: More robust charset parameter detection (HTML5
619     revision 1674).
620    
621     2008-05-24 Wakaba <wakaba@suika.fam.cx>
622    
623 wakaba 1.244 * ContentType.pm: Support for image/vnd.microsoft.icon (HTML5
624     revision 1676).
625    
626     2008-05-24 Wakaba <wakaba@suika.fam.cx>
627    
628 wakaba 1.243 * HTML.pm.src: Ignore language part of public identifiers for
629     quriks mode detection (HTML5 revision 1679).
630    
631     2008-05-24 Wakaba <wakaba@suika.fam.cx>
632    
633 wakaba 1.242 * HTML.pm.src: Reduce the number of errors in truncated doctypes (HTML5
634     revision 1685).
635    
636     2008-05-24 Wakaba <wakaba@suika.fam.cx>
637    
638 wakaba 1.241 * HTML.pm.src: Support for EOF in new states for tags (HTML5
639     revision 1684).
640    
641     2008-05-24 Wakaba <wakaba@suika.fam.cx>
642    
643 wakaba 1.240 * HTML.pm.src (_reset_insertion_mode): Make <td>.innerHTML
644     work (HTML5 revision 1690).
645    
646     2008-05-24 Wakaba <wakaba@suika.fam.cx>
647    
648 wakaba 1.239 * HTML.pm.src (_tree_construction_main): Change handling of
649     end tags in head insertion modes (HTML5 revision 1686).
650     (parse_char_string): Bug fix for non-utf8 character string handlings.
651     (parse_char_stream): |ungetc| does not work well for this context.
652    
653 wakaba 1.237 2008-05-18 Wakaba <wakaba@suika.fam.cx>
654    
655 wakaba 1.238 * HTML.pm.src (parse_byte_string): Redefined to invoke
656     |parse_byte_stream|.
657     (parse_byte_stream): New method.
658    
659     2008-05-18 Wakaba <wakaba@suika.fam.cx>
660    
661 wakaba 1.237 * HTML.pm.src (parse_byte_string): Fix the column number reported
662     by encoding layer error reporter.
663    
664 wakaba 1.233 2008-05-17 Wakaba <wakaba@suika.fam.cx>
665    
666 wakaba 1.236 * HTML.pm.src (parse_byte_string): Use streaming decoder
667     rather than converting the whole byte string and then parsing.
668     Propagate errors in character encoding layer.
669     (get_next_token): Precise error reporting for |bare stago| error.
670    
671     2008-05-17 Wakaba <wakaba@suika.fam.cx>
672    
673 wakaba 1.235 * HTML.pm.src (parse_char_stream): New method.
674     (parse_char_string): This method is now defined as an invocation
675     of the |parse_char_stream| method.
676    
677     2008-05-17 Wakaba <wakaba@suika.fam.cx>
678    
679 wakaba 1.234 * HTML.pm.src (parse_byte_string): Report various status
680     of the sniffing as info-level errors. Support for new
681     decoding framework in parser resestting.
682     (new): Various default error levels were not set.
683    
684     2008-05-17 Wakaba <wakaba@suika.fam.cx>
685    
686 wakaba 1.233 * HTML.pm.src (parse_byte_string): HTML5 encoding siniffing
687     algorithm, except for the actual sniffing, is implemented
688     with new framework with Message::Charset::Info.
689    
690 wakaba 1.232 2008-05-16 Wakaba <wakaba@suika.fam.cx>
691    
692     * CacheManifest.pm (_parse): Drop fragment identifiers from
693     URIs in fallback section (HTML5 revision 1596).
694    
695 wakaba 1.230 2008-05-10 Wakaba <wakaba@suika.fam.cx>
696    
697 wakaba 1.231 * Makefile (Entities.html): URI has changed.
698    
699     2008-05-10 Wakaba <wakaba@suika.fam.cx>
700    
701 wakaba 1.230 * CacheManifest.pm: Don't replace U+0000 NULL (HTML5 revision
702     1553).
703    
704 wakaba 1.228 2008-05-06 Wakaba <wakaba@suika.fam.cx>
705    
706 wakaba 1.229 * ContentChecker.pm: Noted that those returned in |table| are
707     no longer table elements, but table objects returned
708     by Whatpm::HTMLTable.
709    
710     * HTMLTable.pm (form_table): Return table element node
711     as |$table->{element}|.
712     (assign_header): Support for the |headers=""| attribute.
713    
714     2008-05-06 Wakaba <wakaba@suika.fam.cx>
715    
716 wakaba 1.228 * HTMLTable.pm (assign_header): New function; first version
717     with no support for headers="".
718     (form_table): Include table width and height to the returned
719     table object for covenience. Indexing in column assignement was wrong.
720     Set whether a data cell is empty or not for convenience.
721    
722 wakaba 1.222 2008-05-05 Wakaba <wakaba@suika.fam.cx>
723    
724 wakaba 1.227 * HTMLTable.pm: Robuster caption support (HTML5 revision 1393).
725    
726     2008-05-05 Wakaba <wakaba@suika.fam.cx>
727    
728 wakaba 1.226 * HTMLTable.pm: How table model errors are detected is
729     changed (HTML5 revision 1387).
730    
731     2008-05-05 Wakaba <wakaba@suika.fam.cx>
732    
733 wakaba 1.225 * HTMLTable.pm: The algorithm now moves |tfoot| elements
734     to the end of the table (HTML5 revision 1380).
735    
736     2008-05-05 Wakaba <wakaba@suika.fam.cx>
737    
738 wakaba 1.224 * HTMLTable.pm: The algorithm is now 0-based indexing, instead
739     of 1-based (HTML5 revision 1376).
740    
741     2008-05-05 Wakaba <wakaba@suika.fam.cx>
742    
743 wakaba 1.223 * ContentType.pm: "Content-Type: text/plain; charset=UTF-8"
744     and "Content-Encoding" no longer prevent sniffing (HTML5
745     revision 1288).
746    
747     2008-05-05 Wakaba <wakaba@suika.fam.cx>
748    
749 wakaba 1.222 * ContentType.pm: Skip BOMs n feed or HTML algorithm (HTML5 revision
750     1282).
751    
752 wakaba 1.220 2008-05-03 Wakaba <wakaba@suika.fam.cx>
753    
754 wakaba 1.221 * ContentChecker.pm: Support for global attributes.
755     Status of XML specs are added.
756    
757     2008-05-03 Wakaba <wakaba@suika.fam.cx>
758    
759 wakaba 1.220 * ContentChecker.pm (check_element): Support for |template=""|
760     and |ref=""| attribute (referent element type checking).
761    
762 wakaba 1.218 2008-04-29 Wakaba <wakaba@suika.fam.cx>
763    
764 wakaba 1.219 * CacheManifest.pm (_parse): New same origin definition (HTML5 revision
765     1500) is implemented (except for IDNA part and URI-scheme-specific
766 wakaba 1.218 knowledge). Line number counting was wrong for LF-only
767     documents.
768    
769 wakaba 1.211 2008-04-13 Wakaba <wakaba@suika.fam.cx>
770    
771 wakaba 1.217 * HTML.pm.src: Raise an parse error for any disallowed
772     character (HTML5 revision 1263).
773    
774     2008-04-13 Wakaba <wakaba@suika.fam.cx>
775    
776 wakaba 1.216 * mkentitylist.pl: Support for new HTML5 entity table format (the
777     definition for |AElig;| was missing).
778    
779     2008-04-13 Wakaba <wakaba@suika.fam.cx>
780    
781 wakaba 1.215 * HTML.pm.src, mkhtmlparser.pl: Support for element/attribute
782     name/namespace fixup (HTML5 revisions 1413, 1415, 1416, and 1417).
783    
784     2008-04-13 Wakaba <wakaba@suika.fam.cx>
785    
786 wakaba 1.214 * HTML.pm.src: List of element names that close foreign content
787     insertion mode is added (HTML5 revisions 1412 and 1418).
788    
789     2008-04-13 Wakaba <wakaba@suika.fam.cx>
790    
791 wakaba 1.213 * HTML.pm.src: Support for |mglyph| and |malignmark|
792     elements (HTML5 revision 1410).
793    
794     2008-04-13 Wakaba <wakaba@suika.fam.cx>
795    
796 wakaba 1.212 * HTML.pm.src: Support for new long MathML entities (HTML5
797     revision 1406).
798    
799     2008-04-13 Wakaba <wakaba@suika.fam.cx>
800    
801 wakaba 1.211 * HTML.pm.src: CDATA section support for MathML and SVG
802     elements (HTML5 revision 1404 and 1420).
803    
804 wakaba 1.209 2008-04-12 Wakaba <wakaba@suika.fam.cx>
805    
806 wakaba 1.210 * HTML.pm.src, mkhtmlparser.pl: Support for MathML and SVG
807     elements (HTML5 revision 1404). Unused !!!macro definitions
808     are removed.
809    
810     2008-04-12 Wakaba <wakaba@suika.fam.cx>
811    
812 wakaba 1.209 * HTML.pm.src, mkhtmlparser.pl: The way permitted slash errors
813     are raised is changed (HTML5 revision 1404).
814    
815 wakaba 1.206 2008-04-06 Wakaba <wakaba@suika.fam.cx>
816    
817 wakaba 1.208 * HTML.pm.src: Category sets that are no longer used are removed.
818    
819     2008-04-06 Wakaba <wakaba@suika.fam.cx>
820    
821 wakaba 1.207 * HTML.pm.src: The ->[1] property of stack entries are now
822     replaced by constants representing element category.
823    
824     2008-04-06 Wakaba <wakaba@suika.fam.cx>
825    
826 wakaba 1.206 * HTML.pm.src: Don't use local name stored in stack (i.e. ->[1])
827     for error reporting. (This is a preparation for using constant
828     value for ->[1].)
829    
830 wakaba 1.204 2008-03-22 Wakaba <wakaba@suika.fam.cx>
831    
832 wakaba 1.205 * RDFXML.pm: Typo fixed.
833    
834     2008-03-22 Wakaba <wakaba@suika.fam.cx>
835    
836 wakaba 1.204 * ContentChecker.pm: |fact_level| is now treated
837     as same as |must_level|, i.e. level = |m|.
838     (check_element): Make list of URIs in the DOM.
839    
840 wakaba 1.197 2008-03-21 Wakaba <wakaba@suika.fam.cx>
841    
842 wakaba 1.203 * RDFXML.pm: Language accessor implemented.
843     Local (null-namespace) attribute support.
844    
845     2008-03-21 Wakaba <wakaba@suika.fam.cx>
846    
847 wakaba 1.202 * RDFXML.pm: Factored out ID checking code.
848    
849     2008-03-21 Wakaba <wakaba@suika.fam.cx>
850    
851 wakaba 1.201 * RDFXML.pm: TODO items noted. Validation of ID
852     and URI attributes is implemented. Warn if unknown
853     value is used in rdf:parseType="" attribute.
854    
855     * URIChecker.pm (check_rdf_uri_reference): New function.
856    
857     2008-03-21 Wakaba <wakaba@suika.fam.cx>
858    
859 wakaba 1.200 * RDFXML.pm: bnodeid implemented. Relative references
860     are now resolved.
861    
862     2008-03-21 Wakaba <wakaba@suika.fam.cx>
863    
864 wakaba 1.199 * ContentChecker.pm: RDF reification implemented.
865    
866     * RDFXML.pm: undef vs false bug fixed.
867     Reification implemented.
868    
869     2008-03-21 Wakaba <wakaba@suika.fam.cx>
870    
871 wakaba 1.198 * RDFXML.pm: s/id/ID/ for attribute name.
872     The |node| arguments are added for |ontriple| calls.
873     Too many "attribute not allowed" errors were raised.
874    
875     * ContentChecker.pm: Initial experimental support for rdf:RDF
876     element.
877    
878     2008-03-21 Wakaba <wakaba@suika.fam.cx>
879    
880 wakaba 1.197 * RDFXML.pm: New module.
881    
882 wakaba 1.192 2008-03-20 Wakaba <wakaba@suika.fam.cx>
883    
884 wakaba 1.196 * HTML.pm.src (set_inner_html): Line/column number
885     code was old one yet.
886    
887     2008-03-20 Wakaba <wakaba@suika.fam.cx>
888    
889 wakaba 1.195 * HTML.pm.src: Better line/column reporting for "duplicate attribute"
890     errors. Line/column markings for DOCTYPE, comment, and
891     character tokens are reintroduced; otherwise,
892     error location for "not HTML5" error and errors
893     for implied elements are not attached.
894    
895     2008-03-20 Wakaba <wakaba@suika.fam.cx>
896    
897 wakaba 1.194 * HTML.pm.src: Set line/column numbers to attributes.
898    
899     * NanoDOM.pm (create_attribute_ns, set_attribute_node_ns): Added.
900     (value): Setter implemented.
901    
902     * mkhtmlparser.pl: Set line/column numbers to Attr nodes.
903    
904     2008-03-20 Wakaba <wakaba@suika.fam.cx>
905    
906 wakaba 1.193 * HTML.pm.src: Unused line/column markings are removed.
907    
908     2008-03-20 Wakaba <wakaba@suika.fam.cx>
909    
910 wakaba 1.192 * HTML.pm.src (_get_next_token): Remove |first_start_tag|
911     flag, which is no longer used.
912    
913 wakaba 1.190 2008-03-17 Wakaba <wakaba@suika.fam.cx>
914    
915 wakaba 1.191 * HTML.pm.src: Set line/column information to element nodes.
916    
917     * mkhtmlparser.pl (!!!create-element, !!!insert-element,
918     and !!!insert-element-t): Set line/column information to
919     element nodes.
920    
921     2008-03-17 Wakaba <wakaba@suika.fam.cx>
922    
923 wakaba 1.190 * HTML.pm.src (_get_next_token): The first "<" character
924     in "<?", "<>", or "</>" should be the error point.
925    
926 wakaba 1.187 2008-03-16 Wakaba <wakaba@suika.fam.cx>
927    
928 wakaba 1.189 * HTML.pm.src: Some more fixes on error position reporting.
929    
930     2008-03-16 Wakaba <wakaba@suika.fam.cx>
931    
932 wakaba 1.188 * HTML.pm.src: Token-level precious error reporting.
933    
934     2008-03-16 Wakaba <wakaba@suika.fam.cx>
935    
936 wakaba 1.187 * HTML.pm.src: Preparation for more precious error point
937     reporting.
938    
939 wakaba 1.184 2008-03-11 Wakaba <wakaba@suika.fam.cx>
940    
941 wakaba 1.186 * HTML.pm.src: Error type revised.
942    
943     2008-03-11 Wakaba <wakaba@suika.fam.cx>
944    
945 wakaba 1.185 * HTML.pm.src: Similar codes are merged together, again.
946    
947     2008-03-11 Wakaba <wakaba@suika.fam.cx>
948    
949 wakaba 1.184 * HTML.pm.src: Similar codes are merged together.
950    
951 wakaba 1.183 2008-03-10 Wakaba <wakaba@suika.fam.cx>
952    
953     * mkhtmlparser.pl: Set "level" parameter to parse errors.
954    
955     * HTML.pm.src: Code refined.
956    
957 wakaba 1.175 2008-03-09 Wakaba <wakaba@suika.fam.cx>
958    
959 wakaba 1.182 * HTML.pm.src: |</body>| treatement has been changed (HTML5
960     revision 1348). Note that I really don't know this makes
961     any difference in the black-box behavior of the parser.
962    
963     2008-03-09 Wakaba <wakaba@suika.fam.cx>
964    
965 wakaba 1.181 * HTML.pm.src: New end-of-file token implementation (HTML5
966     revision 1348).
967    
968     2008-03-09 Wakaba <wakaba@suika.fam.cx>
969    
970 wakaba 1.180 * HTML.pm.src: |applet| support (HTML5 revision 1347).
971    
972     2008-03-09 Wakaba <wakaba@suika.fam.cx>
973    
974 wakaba 1.179 * HTML.pm.src: Foster parenting in AAA (HTML5 revision 1343).
975    
976     2008-03-09 Wakaba <wakaba@suika.fam.cx>
977    
978 wakaba 1.178 * HTML.pm.src: Support for |<input>| in the "in select" insertion
979     mode and support for the "in select in table" insertion mode (HTML5
980     revision 1342).
981    
982     2008-03-09 Wakaba <wakaba@suika.fam.cx>
983    
984 wakaba 1.177 * HTML.pm.src: No foster parenting for <script> and <script>
985     in non-tainted <table>s (HTML5 revision 1336).
986    
987     2008-03-09 Wakaba <wakaba@suika.fam.cx>
988    
989 wakaba 1.176 * HTML.pm.src: Ignore white space characters between <html>
990     and <head> (HTML5 revision 1332).
991    
992     2008-03-09 Wakaba <wakaba@suika.fam.cx>
993    
994 wakaba 1.175 * HTML.pm.src: Treat <input type=hidden> as if it were a
995     white space (HTML5 revision 1331).
996    
997 wakaba 1.164 2008-03-08 Wakaba <wakaba@suika.fam.cx>
998    
999 wakaba 1.174 * HTML.pm.src: Ignore U+000A at the beginning of a |listing|
1000     element (HTML5 revision 1330).
1001    
1002     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1003    
1004 wakaba 1.173 * HTML.pm.src: <title> is always appended to the current
1005     element (HTML5 revision 1328).
1006    
1007     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1008    
1009 wakaba 1.172 * HTML.pm.src: White space in tainted tables are moved
1010     into foster parents (HTML5 revision 1326).
1011    
1012     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1013    
1014 wakaba 1.171 * HTML.pm.src: Reduce errors from foster parenting cases (HTML5
1015     revision 1321).
1016    
1017     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1018    
1019 wakaba 1.170 * HTML.pm.src: |</h/n/>| case code rearranged to align with
1020     the spec (HTML5 revision 1320). Note that we finally complete
1021     all of HTML5 revision 1320 changes.
1022    
1023     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1024    
1025 wakaba 1.169 * HTML.pm.src: |</form>| now works similar to |</div>| for unclosed
1026     tags (HTML5 revision 1320).
1027    
1028     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1029    
1030 wakaba 1.168 * HTML.pm.src: |</p>| case rearranged with no actual change in fact.
1031    
1032     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1033    
1034 wakaba 1.167 * HTML.pm.src: A "generate implied end tags" code (t409.1)
1035     could not be reached so that it is now removed (HTML5 revision
1036     1320).
1037    
1038     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1039    
1040 wakaba 1.166 * HTML.pm.src: Code for the case of |</div>| and so on are revised
1041     to align with new spec text (HTML5 revision 1320).
1042    
1043     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1044    
1045 wakaba 1.165 * HTML.pm.src: Remove strange |if| condition; however, it should
1046     have had no harm in theory.
1047    
1048     2008-03-08 Wakaba <wakaba@suika.fam.cx>
1049    
1050 wakaba 1.164 * HTML.pm.src (_tree_construction_main): '</p>' in body
1051     case is split from other end tags for the preparation
1052     of implementing HTML revision 1320.
1053    
1054 wakaba 1.162 2008-03-07 Wakaba <wakaba@suika.fam.cx>
1055    
1056 wakaba 1.163 * HTML.pm.src: Simplified "generate implied end tag" (HTML5
1057     revision 1320).
1058    
1059     2008-03-07 Wakaba <wakaba@suika.fam.cx>
1060    
1061 wakaba 1.162 * HTML.pm.src (_tree_construction_main): Merge rules for "h1"
1062     and "div" (HTML5 revision 1318). Add comments to where
1063     |form| pointer association codes should be inserted (HTML5 revision
1064     1319).
1065    
1066 wakaba 1.161 2008-03-06 Wakaba <wakaba@suika.fam.cx>
1067    
1068     * HTML.pm.src: <html> treatement refined (HTML5 revision 1314).
1069    
1070 wakaba 1.159 2008-03-05 Wakaba <wakaba@suika.fam.cx>
1071    
1072 wakaba 1.160 * HTML.pm.src: Since the case t268 should never be reached (no
1073     other token type, there are), it is replaced by a |die| statement.
1074    
1075     2008-03-05 Wakaba <wakaba@suika.fam.cx>
1076    
1077 wakaba 1.159 * HTML.pm.src: Typo fixed.
1078    
1079 wakaba 1.158 2008-03-04 Wakaba <wakaba@suika.fam.cx>
1080    
1081     * HTML.pm.src (_tree_construction_initial): Some limited quirks
1082     doctypes were not uppercased for comparison.
1083    
1084 wakaba 1.152 2008-03-03 Wakaba <wakaba@suika.fam.cx>
1085    
1086 wakaba 1.157 * HTML.pm.src (tree construction and set_inner_html): Checkpoints
1087     are added.
1088    
1089     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1090    
1091 wakaba 1.156 * HTML.pm.src (_tokenize_attempt_to_consume_an_entity): Checkpoints
1092     are set. Cases that are unlikely reached are noted as so.
1093    
1094     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1095    
1096 wakaba 1.155 * HTML.pm.src: Checkpoints for debugging are added.
1097    
1098     * mkhtmlparser.pl: Support for |!!!cp| syntax.
1099    
1100     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1101    
1102 wakaba 1.154 * HTML.pm.src, mkhtmlparser.pl: s/_input_character/_char/g
1103     for simplicity.
1104    
1105     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1106    
1107 wakaba 1.153 * HTML.pm.src: Flag name changed: s/correct/force-quirks/g (HTML5
1108     revision 1307).
1109    
1110     2008-03-03 Wakaba <wakaba@suika.fam.cx>
1111    
1112 wakaba 1.152 * HTML.pm.src (_get_next_token): Where the /incorrect/ flag
1113     is set are changed (HTML5 revision 1305).
1114    
1115 wakaba 1.149 2008-03-02 Wakaba <wakaba@suika.fam.cx>
1116    
1117 wakaba 1.151 * HTML.pm.src: Raise a parse error for |<span ===>| (HTML5 revision
1118     1292). Entities are not parsed in comment-like part in RCDATA
1119     elements (HTML5 revision 1294). Allow bare & at the end
1120     of attribute value literals (HTML5 revision 1296). More
1121     quirks mode doctypes (HTML5 revision 1302). Requires spaces
1122     between attributes and ban attribute names or unquoted
1123     attribute values containing single or double quotes (HTML5
1124     revision 1303).
1125    
1126     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1127    
1128 wakaba 1.150 * ContentChecker.pm: Typo fixed. Don't raise "character encoding"
1129     and related errors unless it is an HTML document (though the spec
1130     is unclear on whether it is applied to XHTML document).
1131    
1132     * HTML.pm (%HTMLAttrStatus): WF2 repetition model attributes
1133     are added.
1134    
1135     2008-03-02 Wakaba <wakaba@suika.fam.cx>
1136    
1137 wakaba 1.149 * HTML.pm.src: s/local_name/manakai_local_name/g.
1138    
1139 wakaba 1.148 2008-03-01 Wakaba <wakaba@suika.fam.cx>
1140    
1141     * _NamedEntityList.pm: Updated (HTML5 revision 1286).
1142    
1143     * HTML.pm.src: |charset| in |content| attribute is
1144     case-insensitive (HTML5 revision 1270).
1145    
1146 wakaba 1.147 2008-02-26 Wakaba <wakaba@suika.fam.cx>
1147    
1148     * ContentChecker.pm: New status constants are added.
1149     ($ElementDefault): |status| added.
1150     (check_element): Err for non-standard or deprecated elements.
1151     (_attr_status_info): For non-standard or deprecated attributes.
1152    
1153 wakaba 1.142 2008-02-24 Wakaba <wakaba@suika.fam.cx>
1154    
1155 wakaba 1.146 * ContentChecker.pm (_attr_status_info): New internal method.
1156    
1157     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1158    
1159 wakaba 1.145 * ContentChecker.pm (check_element): Element standardized
1160     status information is now dispatched.
1161    
1162     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1163    
1164 wakaba 1.144 * ContentChecker.pm (check_element): Fix |del|-and-significant
1165     problem by adding some more arguments.
1166    
1167     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1168    
1169 wakaba 1.143 * ContentChecker.pm (check_element): Use context of
1170     container-for-the-purpose-of-content-model element (not transparent
1171     element) for |check_child_element| calling and significant
1172     text flag marking. This reintroduces |<del>|-and-significant
1173     problem again.
1174    
1175     2008-02-24 Wakaba <wakaba@suika.fam.cx>
1176    
1177 wakaba 1.142 * ContentChecker.pm (check_element): Make semi-transparent
1178     elements ignored for the purpose of phase changes in
1179     content model checking.
1180    
1181 wakaba 1.138 2008-02-23 Wakaba <wakaba@suika.fam.cx>
1182    
1183 wakaba 1.141 * ContentChecker.pm (check_element): In-element state
1184     was not properly managed for transparent cases.
1185    
1186     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1187    
1188 wakaba 1.140 * ContentChecker.pm (check_element): Support for |video|
1189     and |audio| as semi-transparent elements.
1190    
1191     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1192    
1193 wakaba 1.139 * ContentChecker.pm ($HTMLSemiTransparentElements): New.
1194     (check_element): s/minuses/minus_elements/, s/pluses/plus_elements/.
1195     Support for |html:object| as a semi-transparent element.
1196    
1197     2008-02-23 Wakaba <wakaba@suika.fam.cx>
1198    
1199 wakaba 1.138 * ContentChecker.pm (check_element): The way to traverse
1200     the tree is entirely revised to make it easier to track
1201     the state of ancestors/descendants. As a result of this
1202     revision (which rewrites almost all of Whatpm::ContentChecker::HTML),
1203     support for content model checking for HTML elements |figure|,
1204     |object|, |video|, and |audio| and checking for XML elements (and
1205     some XMLNS checkings) are dropped for now. They will be
1206     reimplemented in due cource.
1207    
1208 wakaba 1.134 2008-02-17 Wakaba <wakaba@suika.fam.cx>
1209    
1210 wakaba 1.137 * HTML.pm.src: |>| in public or system literal closes the DOCTYPE
1211     token (HTML5 revision 1225).
1212    
1213     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1214    
1215 wakaba 1.136 * ContentChecker.pm ({unsupported_level}): New value.
1216    
1217     * HTML.pm.src: Save whether |meta| |content| attribute
1218     contains character references or not.
1219    
1220     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1221    
1222 wakaba 1.135 * ContentChecker.pm (_get_children): (Incompleted) attempt to
1223     imlement significant content checking for contents
1224     with |del| elements.
1225    
1226     2008-02-17 Wakaba <wakaba@suika.fam.cx>
1227    
1228 wakaba 1.134 * ContenteChecker.pm ($HTMLTransparentElements): More
1229     elements are added.
1230     (_get_children): HTML |object| elements are now semi-transparent.
1231    
1232     * NanoDOM.pm (manakai_html, manakai_head): New methods.
1233    
1234 wakaba 1.133 2008-02-16 Wakaba <wakaba@suika.fam.cx>
1235    
1236     * CacheManifest.pm: HTML5 revision 1211 implemented.
1237    
1238     * CacheManifest.pod: Updated.
1239    
1240 wakaba 1.132 2008-02-10 Wakaba <wakaba@suika.fam.cx>
1241    
1242     * ContentChecker.pm (check_document, check_element): Support
1243     for second argument ($onsubdoc).
1244     (_get_css_parser): Removed (now it is part of WDCC).
1245    
1246 wakaba 1.131 2008-02-09 Wakaba <wakaba@suika.fam.cx>
1247    
1248     * ContentChecker.pm (_get_css_parser): New.
1249    
1250 wakaba 1.129 2007-11-25 Wakaba <wakaba@suika.fam.cx>
1251    
1252 wakaba 1.130 * ContentChecker.pm ($AnyChecker): Old way to add child elements
1253     for checking had been used.
1254    
1255     2007-11-25 Wakaba <wakaba@suika.fam.cx>
1256    
1257 wakaba 1.129 * ContentChecker.pm (check_element): New todo item type |descendant|.
1258    
1259 wakaba 1.125 2007-11-23 Wakaba <wakaba@suika.fam.cx>
1260    
1261 wakaba 1.128 * IMTChecker.pm: Revised to raise errors and warnings as (poorly)
1262     specced in RFC 2046 and RFC 4288.
1263     (application/atom+xml): Definition added.
1264    
1265     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1266    
1267 wakaba 1.127 * URIChecker.pm: Make RFC 3986 should-level errors
1268     warnings (rather than SHOULD-level errors).
1269    
1270     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1271    
1272 wakaba 1.126 * NanoDOM.pm (get_user_data, set_user_data): New methods.
1273    
1274     * HTML.pm.src: A flag for character references in attribute
1275     values are added. Set |manakai_has_reference| user data
1276     to |charset| attribute.
1277    
1278     2007-11-23 Wakaba <wakaba@suika.fam.cx>
1279    
1280 wakaba 1.125 * NanoDOM.pm (input_encoding, manakai_charset, manakai_has_bom): New
1281     attributes.
1282    
1283     * ContentChecker.pm (check_document): Warn if charset requirements
1284     cannot be tested.
1285    
1286 wakaba 1.124 2007-11-19 Wakaba <wakaba@suika.fam.cx>
1287    
1288     * HTML.pm.src (parse_byte_string): Detect charset
1289     by universalchardet if charset parameter is not specified.
1290    
1291     * Makefile (Charset-all, Charset-clean): New rules.
1292    
1293 wakaba 1.120 2007-11-18 Wakaba <wakaba@suika.fam.cx>
1294    
1295 wakaba 1.123 * ContentChecker.pm (check_document): Check the existence
1296     of character encoding declaration and selection of encoding
1297     for HTML document.
1298    
1299     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1300    
1301 wakaba 1.122 * ContentType.pm (get_sniffed_type): Return also the official
1302     type in list context.
1303    
1304     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1305    
1306 wakaba 1.121 * ContentType.pm: Sniffing with leading white space
1307     ignoring (HTML5 revisions 1013 and 1016).
1308    
1309     2007-11-18 Wakaba <wakaba@suika.fam.cx>
1310    
1311 wakaba 1.120 * ContentType.pm: HTML5 revision 1013 changes, except for leading
1312     white spaces, are implemented.
1313    
1314 wakaba 1.117 2007-11-11 Wakaba <wakaba@suika.fam.cx>
1315    
1316 wakaba 1.119 * HTML.pm.src (parse_char_string): Set |inner_encoding|
1317     attribute if possible.
1318    
1319     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1320    
1321 wakaba 1.118 * HTML.pm.src (parse_byte_string): New method.
1322     (parse_char_string): New alias for |parse_string|.
1323     (main phase): Invoking "change the encoding" algorithm if desired.
1324    
1325     * HTML.pod: Updated.
1326    
1327     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1328    
1329 wakaba 1.117 * HTML.pod (get_inner_html): Removed.
1330    
1331     * Makefile (HTML-all, HTML-clean): New.
1332    
1333     2007-11-11 Wakaba <wakaba@suika.fam.cx>
1334    
1335     * HTML.pm.src (get_inner_html): Removed (moved to HTML/Serializer.pm).
1336    
1337 wakaba 1.116 2007-11-08 Wakaba <wakaba@suika.fam.cx>
1338    
1339     * mklinktypelist.pl: s/noreferer/noreferrer/ (HTML5 revision 1132).
1340    
1341 wakaba 1.112 2007-11-04 Wakaba <wakaba@suika.fam.cx>
1342    
1343 wakaba 1.115 * Makefile: |CacheManifest.html| is added.
1344    
1345     * CacheManifest.pod: New file.
1346    
1347     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1348    
1349 wakaba 1.114 * CacheManifest.pm: New module.
1350    
1351     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1352    
1353 wakaba 1.113 * HTML.pm.src: Support for application cache selection algorithm
1354     callback.
1355    
1356     2007-11-04 Wakaba <wakaba@suika.fam.cx>
1357    
1358 wakaba 1.112 * mklinktypelist.pl: Support for rel=noreferer (HTML5 revision 1118).
1359    
1360 wakaba 1.110 2007-10-17 Wakaba <wakaba@suika.fam.cx>
1361    
1362 wakaba 1.111 * Makefile (clean): New rule.
1363    
1364     * NanoDOM.pm (public_id, system_id): New attributes.
1365    
1366     2007-10-17 Wakaba <wakaba@suika.fam.cx>
1367    
1368 wakaba 1.110 * Makefile (CSS-all, CSS-clean, clean): New rules.
1369    
1370 wakaba 1.109 2007-10-14 Wakaba <wakaba@suika.fam.cx>
1371    
1372     * ContentChecker.pm (check_document): Support for
1373     new |is_xml_root| flag.
1374     (check_element): Support for new |pluses| state.
1375     (_add_pluses): New method.
1376     (_remove_minuses): Support for new |minus| item.
1377    
1378 wakaba 1.106 2007-09-24 Wakaba <wakaba@suika.fam.cx>
1379    
1380 wakaba 1.107 * ContentChecker.pm: Raise specific error for invalid
1381     root element.
1382    
1383     2007-09-24 Wakaba <wakaba@suika.fam.cx>
1384    
1385 wakaba 1.106 * ContentChecker.pm: Set level values for later uses.
1386    
1387 wakaba 1.103 2007-09-09 Wakaba <wakaba@suika.fam.cx>
1388    
1389 wakaba 1.105 * ContentChecker.pm: Support for language tag validation.
1390    
1391     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1392    
1393 wakaba 1.104 * LangTag.pm (check_rfc3066_language_tag): New method.
1394    
1395     2007-09-09 Wakaba <wakaba@suika.fam.cx>
1396    
1397 wakaba 1.103 * LangTag.pm: New module.
1398    
1399 wakaba 1.102 2007-09-04 Wakaba <wakaba@suika.fam.cx>
1400    
1401     * HTML.pm.src: Some error types were wrong.
1402    
1403 wakaba 1.100 2007-08-17 Wakaba <wakaba@suika.fam.cx>
1404    
1405 wakaba 1.101 * CSS/: New directory.
1406    
1407     2007-08-17 Wakaba <wakaba@suika.fam.cx>
1408    
1409 wakaba 1.100 * ContentChecker.pm (_check_get_children): Support
1410     for |noscript| in |head|.
1411    
1412 wakaba 1.99 2007-08-12 Wakaba <wakaba@suika.fam.cx>
1413    
1414     * URI/: New directory.
1415    
1416 wakaba 1.94 2007-08-11 Wakaba <wakaba@suika.fam.cx>
1417    
1418 wakaba 1.98 * HTML.pm.src: Tokenizer's states are now represented in
1419     number.
1420    
1421     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1422    
1423 wakaba 1.97 * HTML.pm.src: |or|s for insertion modes are replaced
1424     by |&|s.
1425    
1426     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1427    
1428 wakaba 1.96 * HTML.pm.src: Token types are now represented in number.
1429    
1430     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1431    
1432 wakaba 1.95 * ContentType.pm (SEE ALSO): Updated.
1433    
1434     * HTML.pm.src: Insertion modes are now represented in number.
1435    
1436     2007-08-11 Wakaba <wakaba@suika.fam.cx>
1437    
1438 wakaba 1.94 * ContentType.pm: Sniffing for bitmap images (HTML5 revision
1439     999) is implemented.
1440    
1441 wakaba 1.93 2007-08-08 Wakaba <wakaba@suika.fam.cx>
1442    
1443     * ContentType.pm: Sniffing for |<script| (HTML5 revision
1444     983) is implemented.
1445    
1446 wakaba 1.92 2007-08-06 Wakaba <wakaba@suika.fam.cx>
1447    
1448     * ContentChecker.pod: New documentation.
1449    
1450     * Makefile: A rule for |ContentChecker.html| is added.
1451    
1452     * ContentChecker.pm: A pod "LICENSE" section is added.
1453    
1454     * NanoDOM.pm ($VERSION): New variable.
1455    
1456 wakaba 1.90 2007-08-05 Wakaba <wakaba@suika.fam.cx>
1457    
1458 wakaba 1.91 * H2H.pm: |b|, |i|, and |sub| are added to the
1459     list of allowed HTML elements.
1460    
1461     2007-08-05 Wakaba <wakaba@suika.fam.cx>
1462    
1463 wakaba 1.90 * H2H.pm: |samp| is added to the list of allowed
1464     HTML elements.
1465    
1466     * URIChecker.pm (check_iri): New.
1467     (check_iri_reference): Error type for IRI reference
1468     syntax error is changed.
1469    
1470 wakaba 1.87 2007-08-04 Wakaba <wakaba@suika.fam.cx>
1471    
1472 wakaba 1.89 * ContentChecker.pm: Reference to the |Whatpm::ContentChecker::Atom|
1473     is added.
1474     (check_document): Load appropriate module before validation.
1475    
1476     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1477    
1478 wakaba 1.88 * ContentChecker/: New directory.
1479    
1480     2007-08-04 Wakaba <wakaba@suika.fam.cx>
1481    
1482 wakaba 1.87 * ContentChecker.pm: HTML |time| element is implemented.
1483    
1484     * HTMLTable.pm: Comments are updated as HTML5 is revised.
1485    
1486 wakaba 1.71 2007-07-21 Wakaba <wakaba@suika.fam.cx>
1487    
1488 wakaba 1.86 * ContentChecker.pm (check_document): Return value
1489     even if no document element is found.
1490    
1491     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1492    
1493 wakaba 1.85 * HTML.pm.src: |$in_body| is no longer a function.
1494    
1495     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1496    
1497 wakaba 1.84 * HTML.pm.src: The |$in_body| code has been moved down.
1498    
1499     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1500    
1501 wakaba 1.83 * HTML.pm.src: The "trailing end" insertion mode
1502     is split into "after html body" and "after html frameset"
1503     insertion modes. Their codes are merged with "after body"
1504     and "after frameset" codes. |$previous_insertion_mode|
1505     has been removed. "after frameset" code is
1506     merged with "in frameset" code.
1507    
1508     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1509    
1510 wakaba 1.82 * HTML.pm.src: The "before head" insertion mode is
1511     merged with the "in head" insertion mode.
1512    
1513     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1514    
1515 wakaba 1.81 * HTML.pm.src: Cases in "in head" insertion mode are
1516     reorganized.
1517    
1518     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1519    
1520 wakaba 1.80 * HTML.pm.src: Some cases in "in table" insertion mode
1521     are merged.
1522    
1523     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1524    
1525 wakaba 1.79 * HTML.pm.src: The "in row" insertion mode is merged
1526     with "in table" insertion mode.
1527    
1528     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1529    
1530 wakaba 1.78 * HTML.pm.src: The "in table" and "in table body" insertion
1531     modes are merged.
1532    
1533     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1534    
1535 wakaba 1.77 * HTML.pm.src: There is no "in table head" or "in table foot"
1536     insertion mode!
1537    
1538     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1539    
1540 wakaba 1.76 * HTML.pm.src: |<noframes>| "in frameset" and "in noframes"
1541     now directly invoke the handler.
1542    
1543     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1544    
1545 wakaba 1.75 * HTML.pm.src: Codes for "in cell" insertion mode
1546     is merged to the "in body" insertion mode code.
1547    
1548     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1549    
1550 wakaba 1.74 * HTML.pm.src: Codes for "in body" and "in caption"
1551     insertion modes are merged.
1552    
1553     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1554    
1555 wakaba 1.73 * HTML.pm.src: Two |!!!next-token|s were missing.
1556    
1557     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1558    
1559 wakaba 1.72 * HTML.pm.src: Use numeric constant for |{content_mode}|
1560     instead of string constant for |{content_model_flag}|.
1561    
1562     2007-07-21 Wakaba <wakaba@suika.fam.cx>
1563    
1564 wakaba 1.71 * HTML.pm.src: Add the name of the attribute
1565     to the "duplicate attribute" error.
1566    
1567 wakaba 1.69 2007-07-17 Wakaba <wakaba@suika.fam.cx>
1568    
1569 wakaba 1.70 * ContentChecker.pm: Return the |class| node list.
1570    
1571     2007-07-17 Wakaba <wakaba@suika.fam.cx>
1572    
1573 wakaba 1.69 * ContentChecker.pm: Return the |id| node list.
1574    
1575     * HTML.pm.src: A typo is fixed.
1576    
1577 wakaba 1.63 2007-07-16 Wakaba <wakaba@suika.fam.cx>
1578    
1579 wakaba 1.68 * ContentChecker.pm: Drop wrong |level => 'error'| specification
1580     from "in HTML:xml:lang" error. Character position
1581     is now the last part of the error type in the URI error
1582     description. Report "unsupported" status for language
1583     tags, media queries, script codes, and style sheets.
1584    
1585     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1586    
1587 wakaba 1.67 * ContentChecker.pm: Report error if |xml:lang|
1588     in HTML, |lang| in XML, |xmlns| in XML, and |meta| |charset|
1589     in XML.
1590    
1591     * NanoDOM.pm (Attr.owner_document): New attribute.
1592    
1593     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1594    
1595 wakaba 1.66 * HTML.pm.src: The character immediately following
1596     a bare |hcro| was discarded. Fix handling of
1597     entity references in attribute values.
1598    
1599     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1600    
1601 wakaba 1.65 * HTML.pm.src (main and trailin end phases): Token
1602     types |DOCTYPE|, |comment|, |end-of-file|, and |<html>| are
1603     factored out. Error types |in html:#DOCTYPE|
1604     and |after html:#DOCTYPE| are merged
1605     into |DOCTYPE in the middle|. |</frameset>|
1606     in fragment parsing mode changed the
1607     insertion mode.
1608    
1609     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1610    
1611 wakaba 1.64 * HTML.pm.src: |$phase| has been removed; The |trailing end|
1612     phase is now an insertion mode. Treatments for white
1613     space character tokens were incorrect for some
1614     insertion modes. An old |meta| case was not removed.
1615    
1616     2007-07-16 Wakaba <wakaba@suika.fam.cx>
1617    
1618 wakaba 1.63 * HTML.pm.src: |meta| charset declaration extraction
1619     implemented (but changing the encoding is not yet:-).
1620    
1621 wakaba 1.60 2007-07-15 Wakaba <wakaba@suika.fam.cx>
1622    
1623 wakaba 1.62 * Charset/: New directory.
1624    
1625     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1626    
1627 wakaba 1.61 * H2H.pm: New Perl module (created from
1628     manakai's H2H.dis).
1629    
1630     2007-07-15 Wakaba <wakaba@suika.fam.cx>
1631    
1632 wakaba 1.60 * XMLSerializer.pm: New Perl module (created from
1633     manakai's SimpleLS.dis).
1634    
1635 wakaba 1.59 2007-07-07 Wakaba <wakaba@suika.fam.cx>
1636    
1637     * HTML.pm.src: |<!---x-->| was not processed correctly.
1638    
1639 wakaba 1.57 2007-07-01 Wakaba <wakaba@suika.fam.cx>
1640    
1641 wakaba 1.58 * HTML.pm.src: Report correct error message
1642     for |<body></div></body>|.
1643    
1644     2007-07-01 Wakaba <wakaba@suika.fam.cx>
1645    
1646 wakaba 1.57 * HTMLTable.pm: An error description was incorrect.
1647    
1648     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1649    
1650     * ContentChecker.pm: Return |{term}| list.
1651    
1652 wakaba 1.55 2007-06-30 Wakaba <wakaba@suika.fam.cx>
1653    
1654 wakaba 1.56 * HTML.pm.src: HTML5 revisions 961-966 (</p>, </br>,
1655     nested <nobr>, implied </tbody>, </tfoot>, and </thead>,
1656     and <title> outside of head).
1657    
1658     2007-06-30 Wakaba <wakaba@suika.fam.cx>
1659    
1660 wakaba 1.55 * IMTChecker.pm: Report warning for unregistered
1661     and private types/subtypes.
1662    
1663     * ContentChecker.pm, HTML.pm.src, IMTChecker.pm,
1664     URIChecker.pm, HTMLTable.pm: Error messages are now
1665     consistent; they are all listed in
1666     <http://suika.fam.cx/gate/2005/sw/Whatpm%20Error%20Types>.
1667    
1668 wakaba 1.52 2007-06-25 Wakaba <wakaba@suika.fam.cx>
1669    
1670 wakaba 1.54 * ContentChecker.pm: |<img ismap>| not in |<a></a>|
1671     is now erred. |<datalist>| is implemented.
1672     Attribute checker for |<command>| and |<menu>| are
1673     added. Support for |contextmenu| global attribute
1674     is added.
1675    
1676     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1677    
1678 wakaba 1.53 * HTML.pm.src (_reset_insertion_mode): Interpretation
1679     of Step 3 has been changed.
1680    
1681     2007-06-25 Wakaba <wakaba@suika.fam.cx>
1682    
1683 wakaba 1.52 * HTML.pm.src: Late |<html>| parse error is implemented.
1684    
1685 wakaba 1.47 2007-06-24 Wakaba <wakaba@suika.fam.cx>
1686    
1687 wakaba 1.51 * URIChecker.pm (check_iri_reference): A |decode| method name was
1688     incorrect.
1689    
1690     * ContentChecker.pm: Support for the |footer| element.
1691     Check URI syntax for space-separated URI attributes.
1692     Support for the |tabindex| attribute. Support
1693     for |datetime| attribute.
1694    
1695     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1696    
1697 wakaba 1.50 * HTML.pm.src: HTML5 revision 1.144 (&#x0D;) and 1.145 (invalid
1698     character references). HTML5 revision 1.146 (white space
1699     characters before root start tag). HTML5 revision
1700     1.148 (named character references in attribute values).
1701     HTML5 revision 1.152 (<plaintext>.innerHTML get).
1702    
1703     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1704    
1705     * HTML.pm.src: HTML5 revisions 1.142 and 1.143 (<noscript>
1706 wakaba 1.49 in <head>).
1707    
1708     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1709    
1710 wakaba 1.48 * HTML.pm.src: HTML5 revision 935 (<base>, <link>, <meta>
1711     in body).
1712    
1713     * ContentChecker.pm: HTML5 revision 938 (scoped="").
1714    
1715     2007-06-24 Wakaba <wakaba@suika.fam.cx>
1716    
1717 wakaba 1.47 * HTML.pm.src: HTML5 revision 923 (matching end tag in CDATA
1718     or RCDATA in fragment parsing mode). HTML5 revision
1719     924 (<!--> and <!--->). HTML5 revision 926 (hn in hn).
1720    
1721     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1722    
1723     * HTML.pm.src (get_inner_html): HTML5 revision 922 (inner_html
1724     for <pre> and <textarea>).
1725    
1726 wakaba 1.33 2007-06-23 Wakaba <wakaba@suika.fam.cx>
1727    
1728 wakaba 1.46 * HTML.pm.src: HTML5 revision 920 (<isindex>).
1729    
1730     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1731    
1732 wakaba 1.45 * HTML.pm.src: HTML5 revision 918 (</head>, </body>, </html>).
1733    
1734     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1735    
1736 wakaba 1.44 * HTML.pm.src: HTML5 revision 916 (</body>).
1737     HTML5 revision 917 (conforming bare &).
1738    
1739     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1740    
1741 wakaba 1.43 * NanoDOM.pm (manakai_is_html): Setting to false did not work.
1742    
1743     * HTML.pm.src: HTML5 revision 914 (</ in CDATA, RCDATA).
1744     HTML5 revision 915 (<nobr>).
1745    
1746     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1747    
1748 wakaba 1.42 * HTML.pm.src: HTML5 revisions 908, 909, 912, and 913 (quirks mode).
1749    
1750     * NanoDOM.pm (manakai_is_html, manakai_compat_mode, compat_mode):
1751     New attributes.
1752    
1753     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1754    
1755 wakaba 1.41 * HTML.pm.src: HTML5 revisions 900, 901, 902, and 911 (<
1756     in tags).
1757    
1758     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1759    
1760 wakaba 1.40 * .cvsignore: |Entities.html| is added.
1761    
1762     * HTML.pm.src: |$entity_char| is removed and
1763     requires |Whatpm::_NamedEntityList| instead.
1764     HTML5 revision 898 (refc), except that lack of refc
1765     is parse error.
1766    
1767     * mkentitylist.pl: New script.
1768    
1769     * Makefile (all): |_NamedEntityList.pm| is added.
1770     (_NamedEntityList.pm, Entities.html): New rules.
1771    
1772     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1773    
1774 wakaba 1.39 * HTML.pm.src: Parse errors immediately after U+000D
1775     were ignored and U+000D immediately following another
1776     U+000D was not converted to U+000A.
1777    
1778     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1779    
1780 wakaba 1.38 * HTML.pm.src (set_inner_html): HTML5 revision 892 (adopt
1781     nodes before appended). Parser was not ready for NULL
1782     parse error and escape flag.
1783    
1784     * NanoDOM.pm (adopt_node): New.
1785    
1786     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1787    
1788 wakaba 1.37 * HTML.pm.src: HTML5 revision 886 (insane comment in
1789     CDATA and RCDATA). Note that current implementation
1790     is simply repeating what the spec says and it is maybe not
1791     a best way to do it.
1792    
1793     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1794    
1795 wakaba 1.36 * HTML.pm.src: HTML5 revision 884 (</form> don't close
1796     the form element if a descendant element without implied end tag has
1797     still been open).
1798    
1799     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1800    
1801 wakaba 1.35 * ContentChecker.pm: HTML5 revision 881 (Make |id| attribute with
1802     space characters non-conforming).
1803    
1804     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1805    
1806 wakaba 1.34 * HTML.pm.src: An error message was incorrect.
1807     HTML5 revision 869 (C1 character references).
1808    
1809     2007-06-23 Wakaba <wakaba@suika.fam.cx>
1810    
1811 wakaba 1.33 * HTML.pm.src: HTML5 revision 867 (a LF at the beginning of
1812     a |textarea| is removed).
1813    
1814 wakaba 1.32 2007-06-05 Wakaba <wakaba@suika.fam.cx>
1815    
1816     * NanoDOM.pm (get_attribute_node_ns): New method.
1817    
1818     * ContentChecker.pm: |script| |async| and |defer|
1819     no longer require |src|. |async| MUST NOT be
1820     specified if |defer|. (HTML5 revision 858).
1821    
1822 wakaba 1.31 2007-05-30 Wakaba <wakaba@suika.fam.cx>
1823    
1824     * HTML.pm.src: |<form><form>| went to inifinite loop.
1825    
1826 wakaba 1.27 2007-05-27 Wakaba <wakaba@suika.fam.cx>
1827    
1828 wakaba 1.30 * ContentChecker.pm (html): Set |is_root| (allowed
1829     as a document element) flag on.
1830     (new): Removed.
1831     (check_document): New method.
1832    
1833     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1834    
1835 wakaba 1.29 * ContentChecker.pm (thead, tfoot): Checker specifications
1836     were incorrect.
1837    
1838     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1839    
1840 wakaba 1.28 * ContentChecker.pm ($HTMLURIAttrChecker): Include
1841     error position in the |type| option of the error.
1842    
1843     * HTMLTable.pm (form_table): The |$onerror| parameter
1844     is now optional. Some bugs are fixed.
1845    
1846     2007-05-27 Wakaba <wakaba@suika.fam.cx>
1847    
1848 wakaba 1.27 * HTMLTable.pm: New module.
1849    
1850     * ContentChecker.pm (table): Invoke table model error checker.
1851    
1852     * NanoDOM.pm (first_child, get_attribute_ns): New.
1853    
1854 wakaba 1.25 2007-05-26 Wakaba <wakaba@suika.fam.cx>
1855    
1856 wakaba 1.26 * ContentChecker.pm ($HTMLLinkTypesAttrChecker): New checker.
1857     (link/@rel, a/@rel, area/@rel): Use new checker.
1858    
1859     * Makefile (_LinkTypeList.pm, RelExtensions.html): New rules.
1860    
1861     * _LinkTypeList.pm: New file.
1862    
1863     * mklinktypelist.pl: New file.
1864    
1865     * .cvsignore: |RelExtensions.html| added.
1866    
1867     * NanoDOM.pm (child_nodes): Returns an empty array
1868     for non-child-containing node types.
1869     (text_content): New attribute.
1870    
1871     2007-05-26 Wakaba <wakaba@suika.fam.cx>
1872    
1873 wakaba 1.25 * IMTChecker.pm: New module.
1874    
1875     * ContentChecker.pm ($HTMLIMTAttrChecker): Call IMTChecker
1876     to test parameter value validity.
1877    
1878     * HTML.pm.src ($style_start_tag): Attributes were
1879     discarded.
1880    
1881 wakaba 1.22 2007-05-25 Wakaba <wakaba@suika.fam.cx>
1882    
1883 wakaba 1.24 * ContentChecker.pm ($HTMLURIAttrChecker): Implemented.
1884    
1885     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1886    
1887 wakaba 1.23 * URIChecker.pm: All recommendations from RFC 3986
1888     and RFC 3987 are listed (not all testable items are checked yet).
1889    
1890     2007-05-25 Wakaba <wakaba@suika.fam.cx>
1891    
1892 wakaba 1.22 * URIChecker.pm: New module.
1893    
1894 wakaba 1.18 2007-05-20 Wakaba <wakaba@suika.fam.cx>
1895    
1896 wakaba 1.21 * ContentChecker.pm: Now most attributes are implemented
1897     or associated to some placeholder.
1898     ($ElementDefault): Warn unknown attributes
1899     for unknown elements as "attribute not supported".
1900     ($HTMLLanguageTagAttrChecker, $HTMLMQAttrChecker): New placeholders.
1901     ($HTMLUsemapAttrChecker, $HTMLTargetAttrChecker): New checkers.
1902     (|a| attribute checker): Reimplemented.
1903    
1904     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1905    
1906 wakaba 1.20 * ContentChecker.pm ($HTMLEventHandlerAttrChecker): New placeholder.
1907     ($HTMLAttrChecker): Event handler content attributes
1908     are added.
1909     (link, embed): Required attribute is now checked.
1910     (embed): Unknown local attributes are no longer warned.
1911    
1912     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1913    
1914 wakaba 1.19 * ContentChecker.pm ($HTMLSpaceURIsAttrChecker): New placeholder.
1915     ($HTMLIMTAttrChecker): New checker.
1916     (link@rel, link@href, link@type, style@type,
1917     a@href, a@ping, a@ping, a@type, embed@src, embed@type,
1918     object@data, object@type, source@src, source@type, area@alt,
1919     area@shape, area@coords, area@href,
1920     area@ping, area@rel, area@type, script@src,
1921     script@defer, script@async, script@type): Checkers added.
1922    
1923     2007-05-20 Wakaba <wakaba@suika.fam.cx>
1924    
1925 wakaba 1.18 * ContentChecker.pm: Descendant checking was incorrect.
1926    
1927 wakaba 1.12 2007-05-19 Wakaba <wakaba@suika.fam.cx>
1928    
1929 wakaba 1.17 * ContentChecker.pm: Support |xml:*| and |xmlns:*|
1930     attributes. Report an error if |Element.prefix|
1931     is |xmlns|.
1932    
1933     * NanoDOM.pm (prefix): New attribute.
1934    
1935     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1936    
1937 wakaba 1.16 * HTML.pm.src: In |main| phase, |in body| insertion
1938     mode, action for |<iframe>| was missing.
1939    
1940     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1941    
1942 wakaba 1.15 * ContentChecker.pm: Support for many of HTML5 elements.
1943     ($GetHTMLNonNegativeIntegerAttrChecker): New.
1944    
1945     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1946    
1947 wakaba 1.14 * ContentChecker.pm: Support for most elements up to |progress|.
1948     ($HTMLURIAttrChecker): Placeholder.
1949     ($HTMLIntegerAttrChecker, $GetHTMLFloatingPointNumberAttrChecker):
1950     New.
1951    
1952     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1953    
1954 wakaba 1.13 * ContentChecker.pm: Attribute checkers
1955     for global attributes, |html|, |base|, |style|, and |meta|.
1956    
1957     * NanoDOM.pm (insert_before): Weaken reference
1958     to the parent node.
1959     (Attr::new): Set |owner_element| attribute.
1960     (namespace_uri, manakai_local_name): New attribute implementations.
1961     (owner_element): New attribute.
1962    
1963     2007-05-19 Wakaba <wakaba@suika.fam.cx>
1964    
1965 wakaba 1.12 * ContentChecker.pm ($AttrChecker, $HTMLAttrChecker,
1966     $AnyChecker->{attr_checker}, $HTMLAttrsChecker,
1967     $Element->{$HTML_NS}->{''}): New.
1968     (check_element): Invoke attrs_checker for each element.
1969    
1970 wakaba 1.6 2007-05-13 Wakaba <wakaba@suika.fam.cx>
1971    
1972 wakaba 1.11 * ContentChecker.pm: Don't use |manakai_element_type_match|.
1973    
1974     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1975    
1976 wakaba 1.10 * ContentChecker.pm: Use hashs rather than lists for
1977     element type testings.
1978    
1979     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1980    
1981 wakaba 1.9 * ContentChecker.pm: Don't generate duplicate
1982     error when an element type is put in the "minus" list
1983     and the element type is not allowed explicitly in the particular
1984     element content model.
1985     (html:a checker): New checker.
1986     (html:details, html:datagrid): New checkers.
1987     (html:legend): New checker.
1988    
1989     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1990    
1991 wakaba 1.8 * ContentChecker.pm (html:li checker): Implemented.
1992    
1993     2007-05-13 Wakaba <wakaba@suika.fam.cx>
1994    
1995 wakaba 1.7 * ContentChecker.pm ($HTMLInlineOrStriclyInlineChecker): New
1996     checker.
1997     (html:dd checker): New checker.
1998     (html:q, html:em, html:strong, html:small,
1999     html:m, html:dfn, html:code, html:samp, html:span): New checkers.
2000    
2001     2007-05-13 Wakaba <wakaba@suika.fam.cx>
2002    
2003 wakaba 1.6 * ContentChecker.pm ($AnyChecker): Renamed
2004     from |$ElementDefault->{checker}|.
2005     ($ElementDefault->{checker}): Throw an error that
2006     the element type is not supported by the checker.
2007     ($HTMLMetadataElement): |html:base| was missing.
2008     ($HTMLEmptyChecker): Don't throw an error
2009     for inter-element whitespace nodes.
2010     (html:html checker): Errors were not
2011     thrown even if |html:head| and/or |html:body|
2012     children were missing.
2013     (html:head checker): An error was not
2014     thrown if <meta charset> appered after other
2015     elements.
2016    
2017 wakaba 1.5 2007-05-05 Wakaba <wakaba@suika.fam.cx>
2018    
2019     * ContentChecker.pm: |footer|, |video|, |audio|, |script|,
2020     and |noscript| elements are implemented.
2021     (new): New method.
2022    
2023 wakaba 1.3 2007-05-04 Wakaba <wakaba@suika.fam.cx>
2024    
2025 wakaba 1.4 * ContentChecker.pm: New module.
2026    
2027     2007-05-04 Wakaba <wakaba@suika.fam.cx>
2028    
2029 wakaba 1.3 * NanoDOM.pm (manakai_parent_element,
2030     document_element, manakai_local_name,
2031     manakai_element_type_match): New method.
2032    
2033     2007-05-03 Wakaba <wakaba@suika.fam.cx>
2034    
2035     * HTML.pm.src: Replace decimal and hexadecimal numeric
2036     entities in C1 range using Windows-1252 mapping. Bare LF
2037     did not count as new line for error reporting.
2038    
2039 wakaba 1.2 2007-05-02 Wakaba <wakaba@suika.fam.cx>
2040    
2041     * NanoDOM.pm (DOMImplementation): New class.
2042     (append_child): Weaken the |parent_node| reference.
2043     (create_element_ns, Element new): Set the |owner_document|
2044     reference.
2045     (implementation): New attribute.
2046     (owner_document, local_name, namespace_uri): New attributes.
2047    
2048     * HTML.pm.src (parse_string): Line and column numbers
2049     are now provided to error handler.
2050     (!!!parse-error): Short descriptions are added.
2051     (_construct_tree): Split into three methods; support
2052     for innerHTML mode.
2053     (set_inner_html): New method.
2054    
2055 wakaba 1.1 2007-05-01 Wakaba <wakaba@suika.fam.cx>
2056    
2057     * NanoDOM.html: Documentation is added.
2058    
2059     * HTML.pod, ContentType.html: Documentation is revised.
2060    
2061     * .cvsignore: Pod2html temporary files are added.
2062    
2063     * Makefile: Make |NanoDOM.html|.
2064    
2065     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2066    
2067     * HTML.pm.src (parse_string): New method.
2068     (get_inner_html): Renamed from |inner_html|.
2069    
2070     * Makefile: A rule for |HTML.html| is added.
2071    
2072     * HTML.pod: New documentation.
2073    
2074     2007-05-01 Wakaba <wakaba@suika.fam.cx>
2075    
2076     * NanoDOM.pm (last_child, previous_sibling): New attributes.
2077     (clone_node): Attribute nodes were not completely copied.
2078    
2079     * HTML.pm.src: Many bugs are fixed.
2080    
2081     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2082    
2083     * HTML.pm.src: Some typos are fixed.
2084    
2085     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2086    
2087     * mkhtmlparser.pl, Makefile: References to the |HTML-consume-entity.src|
2088     are removed.
2089    
2090     * HTML.pm.src: Tokenizer's handling on named entities are rewritten.
2091    
2092     * HTML-consume-entity.src: Removed.
2093    
2094     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2095    
2096     * HTML.pm.src: Tokenizer's handling on hexadecimal
2097     numeric entities are rewritten.
2098    
2099     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2100    
2101     * HTML.pm.src: Some tokenizer bugs are fixed.
2102    
2103     2007-04-30 Wakaba <wakaba@suika.fam.cx>
2104    
2105     * HTML.pm.src: The tree construction stage is implemented.
2106    
2107     * mkhtmlparser.pl: New macros are added.
2108    
2109     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2110    
2111     * ContentType.pm: A note on bug in the specification
2112     is removed since it's been now fixed.
2113    
2114     * .cvsignore: New file.
2115    
2116     2007-04-28 Wakaba <wakaba@suika.fam.cx>
2117    
2118     * HTML.pm.src, HTML-consume-entity.src: New files.
2119    
2120     * Makefile (HTML.pm): New rule.
2121    
2122     * mkhtmlparser.pl: New script.
2123    
2124     2007-04-25 Wakaba <wakaba@suika.fam.cx>
2125    
2126     * Makefile: New file.
2127    
2128     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2129    
2130     * ContentType.pm: An error in pod is fixed.
2131    
2132     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2133    
2134     * ContentType.pm: Documentation is added.
2135    
2136     2007-04-24 Wakaba <wakaba@suika.fam.cx>
2137    
2138     * ContentType.pm: New Perl module.
2139    
2140     * ChangeLog: New file.
2141    
2142    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24