/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.281 by wakaba, Sat Aug 30 12:33:36 2008 UTC revision 1.365 by wakaba, Sat Dec 6 10:05:23 2008 UTC
# Line 1  Line 1 
1    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * URIChecker.pm: Bug fix: It did not work unless
4            Message::DOM::DOMImplementation has been |require|d.
5    
6    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
7    
8            * NanoDOM.pm (document_uri): New attribute.
9    
10            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
11    
12    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
13    
14            * NanoDOM.pm (text_content): Don't create a Text node if the new
15            value is empty.
16    
17    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
18    
19            * SWML/: New directory.
20    
21    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
22    
23            * NanoDOM.pm (specified, all_declarations_processed,
24            manakai_attribute_type): New attributes.
25    
26    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
27    
28            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
29            array.
30    
31    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
32    
33            * NanoDOM.pm (notation_name): New attribute.
34    
35    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
36    
37            * NanoDOM.pm (public_id, system_id): New attributes.a
38    
39    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
40    
41            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
42            implemented.
43            (allowed_tokens, default_type, declared_type): Implemented.
44    
45    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
46    
47            * NanoDOM.pm (node_name): New attribute.
48            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
49            constants.
50            (create_element_type_definition_node, create_attribute_definition,
51            create_notation, create_general_entity,
52            get_element_type_definition_node,
53            set_element_type_definition_node, get_general_entity_node,
54            set_general_entity_node, get_notation_node, set_notation_node,
55            get_attribute_definition_node, set_attribute_definition_node): New
56            methods.
57            (element_types, entities, notations, attribute_definitions): New
58            attributes.
59            (DocumentType): Support for child nodes, entities, notations, and
60            element types.
61            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
62            New classes.
63    
64            * Dumper.pm: Support for general entities, notations, element type
65            definitions, and attribute definitions.
66    
67    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
68    
69            * NanoDOM.pm (create_processing_instruction): New method.
70            (xml_version, xml_encoding, xml_standalone): New attributes.
71            (ProcessingInstruction): New class.
72    
73    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Handling of end tags in the foreign content
76            insertion mode was partially wrong, because of wrong bit
77            operations.
78    
79    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
80    
81            * NanoDOM.pm (dom_config): New attribute (do nothing), for
82            Whatpm::XML::Parser support.
83    
84    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
85    
86            * Makefile: New rule to make HTML/Tokenizer.pm is added.
87    
88            * HTML.pm.src: Tokenizer part moved to another file.
89    
90    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
93    
94    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Element category constants redefined.
97            
98    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
99    
100            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
101            construction stage synced with the spec (HTML5 revisions 2139 and
102            2302).
103    
104    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
105    
106            * ContentChecker.pm: New error level "html5_fact" added, which
107            should be tentatively used until all of requirements are properly
108            specced as RFC 2119 "MUST" in HTML5.
109    
110    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
111    
112            * ContentChecker.pod: Note on internal flags is added.
113    
114    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: An AAA bug fixed.
117    
118    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: If another node is inserted by the parser, don't
121            reuse existing Text node to append a character (HTML5 revision
122            2124).
123    
124    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
127            revisions 1731 and 2128).
128    
129    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
132            Support for end tags of camelCase SVG elements were broken.  A
133            wrong error type text fixed.
134    
135    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
138    
139    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Support for new definition of |param| and |source|
142            start tag parsing (HTML5 revision 1731).
143    
144    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
145    
146            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
147    
148    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
151            revisions 1731 and 1831).
152    
153    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
154    
155            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
156            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
157            revision 1731).
158    
159    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
160    
161            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
162            revision 1731).  End tags of |option| and |optgroup| elements are
163            now optional (HTML5 revision 1731).
164    
165    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: New "special" elements added to the list (HTML5
168            revision 1778).  "strile" -> "strike".
169    
170    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
171    
172            * ContentType.pm (get_sniffed_type): Support for the "better"
173            content sniffing (HTML5 revision 1927).  In a case the official
174            type was not returned when the method is invoked in the list
175            context.
176    
177    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Character references for non-space C0 characters,
180            including U+000B VT, DEL character, noncharacter code points, are
181            now converted to the U+FFFD character (cf. HTML5 revision 2138).
182    
183    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
184    
185            * ContentChecker.pm: |form=""| check support added.
186    
187    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
188    
189            * ContentChecker.pm: |contextmenu| validness is now checked using
190            |id| and |id_type| properties, and |menu| property is removed.
191    
192    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
193    
194            * ContentChecker.pm: Prepare for |form| |name| attribute's
195            duplication checking.
196    
197    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
200            unsupported charset="" parameter value (HTML5 revision 2131).
201    
202    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: Reminding places where U+000B is allowed as a space
205            character is fixed (cf. HTML5 revision 1738).
206    
207            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
208            space characters (HTML5 revision 1738).
209    
210    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: The "anything else" case for the "after after body"
213            insertion mode was not updated to swtich to the "in body"
214            insertion mode.  U+000B is no longer a space character for the
215            purpose of tree construction phase (HTML5 revision 1738).
216    
217    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
218    
219            * HTML.pm.src: U+000B is no longer a space character (HTML5
220            revision 1738).
221    
222    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
223    
224            * ContentType.pm: 0x0B is no longer a space character (HTML5
225            revision 1738).
226    
227            * HTML.pm.src: U+000B is no longer a space character for the
228            algorithm for extracting an encoding from a Content-Type (HTML5
229            revision 1738).
230    
231    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
232    
233            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
234    
235    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
236    
237            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
238    
239            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
240            matching.
241    
242    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
243    
244            * LangTag.pm: Warn for private use language subtags.  Error level
245            typos fixed.  Support for Suppress-Script field.
246    
247            * mklangreg.pl: Support for dumping of nested structure.
248    
249    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
250    
251            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
252            recommended case as per RFC 4646.
253    
254    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
255    
256            * LangTag.pm (check_rfc4646_langtag): New method.
257    
258    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
259    
260            * mklangreg.pl: New script.
261    
262            * Makefile: Updated for creation of the module for language subtag
263            registry.
264            
265    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
266    
267            * Makefile: WebIDL.html added.
268    
269            * WebIDL.pod: New documentation.
270    
271    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
272    
273            * WebIDL.pm: Checker's error types are redefined.
274    
275    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
276    
277            * WebIDL.pm: Parser's error types are redefined.  Some forward
278            compatible parsing bugs are fixed.  Some unreachable codes are
279            commented out.
280    
281    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
282    
283            * WebIDL.pm: Support for the reminding extended attributes are
284            added.  It does not satisfy the definition that a forward
285            interface declaration has an extended attribute.  It seems that
286            unless explicitly allowed multiple extended attributes with the
287            same name is not allowed, though it is not explicitly mentioned in
288            the spec.
289    
290    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
291    
292            * WebIDL.pm: Unescapes extended attribute names and extended
293            attribute identifiers.  Preserve whether an extended attribute has
294            an argument list of not.  Support for extended attributes:
295            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
296            NameGetter, NameSetter, and Null.
297            (has_argument_list): New attribute.
298            (idl_text): Stringifies argument lists, if any, even if it is
299            empty.
300    
301    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
304            |{s_kwd}| in DATA_STATE as default.
305    
306    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
309            by |{s_kwd}| in DATA_STATE.
310    
311    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
312    
313            * HTML.pm.src: Shorten keys.
314    
315    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: Remove checking for control character, surrogate
318            pair, or noncharacter code points and non-Unicode code
319            points (they should be handled by Whatpm::Charset::UnicodeChecker).
320            (parse_char_stream): Support for the |$get_wrapper| argument and
321            character stream error handlers.
322    
323    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
324    
325            * ContentChecker.pm: Don't call |loda_ns_module|
326            for null-namespace elements/attributes.
327    
328            * HTML.pm.src: Fact out $disallowed_control_chars
329            as a hash.
330    
331    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
332    
333            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
334            and |{next_char}| initializations are moved to initialization
335            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
336            with |parse_char_stream|.
337    
338    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src (parse_char_stream): Make |set_next_char|
341            invoke |manakai_read_until|, not only |read|, where
342            possible, to decrease the number of |read| method calls.
343    
344            * mkhtmlparser.pl: Related changes to the aforementioned
345            modification.
346    
347    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
350            would report character error from now.
351    
352    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
353    
354            * HTML.pm.src: White-space-leaded non-white-space character
355            tokens in "before head insertion mode" was not
356            correctly handled.
357            (set_inner_html): Reimplemented using CharString decodehandle
358            class.  Support for $get_wrapper argument.  Support
359            for |{read_until}| feature.
360    
361    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTML.pm.src: Make a "bare ero" error for unknown
364            entities point the "&" character.
365    
366    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: It turns out that U+FFFD don't have to
369            be added to the list of excluded characters.
370    
371    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
372    
373            * HTML.pm.src ($char_onerror): Have character decoder's |line|
374            and |column| a higher priority than the one set by the
375            tokenizer's input handler.
376            ($self->{read_until}): Exclude U+FFFD (but this might
377            not be necessary, since now we do line/column fixup in
378            the character decode handle).
379    
380    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
381    
382            * HTML.pm.src: Use |{read_until}| where possible.
383    
384    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
385    
386            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
387            and |manakai_getc_until| to |manakai_read_until| to
388            reduce the number of string copies.
389    
390    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
391    
392            * HTML.pm.src (parse_char_string): Use newly created
393            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
394            standard feature to |open| a string as a filehandle,
395            since Perl's string filehandle seems not supporting |ungetc|
396            method correctly.
397            (parse_char_stream): Define |{getc_until}| method.
398            (DATA_STATE): Experimental support for |getc_until| feature.
399    
400    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
401    
402            * HTML.pm.src: Check points added to newly added branches.
403    
404    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
405    
406            * HTML.pm.src: Remove |{char}|, which is no longer used.
407            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
408            and replaced by |{prev_state}|.
409    
410            * mkhtmlparser.pl: Remove |{char}| feature.
411            Remove |!!!back-next-input-character;| macro.
412    
413    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
416            entity related tokenizer states in favor of new states
417            implementing the consume character reference algorithm.
418    
419    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTML.pm.src: "Consume a character reference" algorithm is
422            now implemented as a tokenizer's state, rather than
423            a method, with minimum changes (more changes will
424            be made, in due course).  "Bogus comment state"'s inner
425            loop gets removed.
426    
427    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
430            into their own tokenizer states.
431    
432    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
435            is split into three states.
436    
437    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
438    
439            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
440            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
441            no longer does the tokenizer have to push back next input
442            characters in those states.
443    
444    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
445    
446            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
447            into four states so that no longer does the tokenizer have to push
448            back next input characters in that state.
449    
450    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
451    
452            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
453            which can be used to insert some wrapper between the character
454            stream handle and the tokenizer.  (It is currently not supported
455            for |set_inner_html| for |Element|s).
456    
457    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src: Ignore punctuations in charset names.
460    
461    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
462    
463            * ContentChecker.pm: Support for charset-layer error levels.
464    
465            * HTML.pm.src: Don't specify |text| argument for the
466            |chardecode:fallback| error, since it is not the encoding
467            being used alternatively.
468    
469    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
470    
471            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
472    
473    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
474    
475            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
476    
477    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
478    
479            * HTML.pm.src: Bug fix and sync with the spec with regard
480            to after after frameset insertion mode processing (HTML5
481            revision 1909).  Note that the implementation was wrong
482            per the old spec before the r1909 changes.
483    
484    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
485    
486            * HTMLTable.pm: scope=auto algorithm fix synced with the
487            spec (HTML5 revision 2093).
488            ($process_row): Algorithm step numbers synced with the
489            spec (HTML5 revision 2092).
490    
491    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
492    
493            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
494            revision 2094).
495    
496    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
497    
498            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
499    
500    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
501    
502            * HTML.pm.src: '"' and "'" at the end of attribute
503            name (after another attribute) now raise parse error (HTML5
504            revision 2123).  Empty unquoted attribute values are no
505            longer allowed (HTML5 revision 2122).
506    
507    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
508    
509            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
510            revision 2130).
511    
512  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
513    
514          * ContentChecker.pm: |xml:lang| attribute value must be same          * ContentChecker.pm: |xml:lang| attribute value must be same

Legend:
Removed from v.1.281  
changed lines
  Added in v.1.365

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24