Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.273 by wakaba, Fri Aug 15 14:13:42 2008 UTC revision 1.364 by wakaba, Sat Dec 6 10:00:53 2008 UTC
# Line 1  Line 1 
1    2008-12-06  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (document_uri): New attribute.
4    
5            * ContentChecker.pm: Don't use methods not implemented by NanoDOM.
6    
7    2008-11-07  Wakaba  <wakaba@suika.fam.cx>
8    
9            * NanoDOM.pm (text_content): Don't create a Text node if the new
10            value is empty.
11    
12    2008-11-06  Wakaba  <wakaba@suika.fam.cx>
13    
14            * SWML/: New directory.
15    
16    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * NanoDOM.pm (specified, all_declarations_processed,
19            manakai_attribute_type): New attributes.
20    
21    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
22    
23            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
24            array.
25    
26    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
27    
28            * NanoDOM.pm (notation_name): New attribute.
29    
30    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
31    
32            * NanoDOM.pm (public_id, system_id): New attributes.
33    
34    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
35    
36            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
37            implemented.
38            (allowed_tokens, default_type, declared_type): Implemented.
39    
40    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
41    
42            * NanoDOM.pm (node_name): New attribute.
43            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
44            constants.
45            (create_element_type_definition_node, create_attribute_definition,
46            create_notation, create_general_entity,
47            get_element_type_definition_node,
48            set_element_type_definition_node, get_general_entity_node,
49            set_general_entity_node, get_notation_node, set_notation_node,
50            get_attribute_definition_node, set_attribute_definition_node): New
51            methods.
52            (element_types, entities, notations, attribute_definitions): New
53            attributes.
54            (DocumentType): Support for child nodes, entities, notations, and
55            element types.
56            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
57            New classes.
58    
59            * Dumper.pm: Support for general entities, notations, element type
60            definitions, and attribute definitions.
61    
62    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
63    
64            * NanoDOM.pm (create_processing_instruction): New method.
65            (xml_version, xml_encoding, xml_standalone): New attributes.
66            (ProcessingInstruction): New class.
67    
68    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: Handling of end tags in the foreign content
71            insertion mode was partially wrong, because of wrong bit
72            operations.
73    
74    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
75    
76            * NanoDOM.pm (dom_config): New attribute (do nothing), for
77            Whatpm::XML::Parser support.
78    
79    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
80    
81            * Makefile: New rule for making HTML/Tokenizer.pm added.
82    
83            * HTML.pm.src: Tokenizer part moved to another file.
84    
85    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
88    
89    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: Element category constants redefined.
92            
93    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
96            construction stage synced with the spec (HTML5 revisions 2139 and
97            2302).
98    
99    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentChecker.pm: New error level "html5_fact" added, which
102            should be tentatively used until all of the requirements are properly
103            specced as RFC 2119 "MUST" in HTML5.
104    
105    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
106    
107            * ContentChecker.pod: Note on internal flags is added.
108    
109    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: An AAA bug fixed.
112    
113    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
114    
115            * HTML.pm.src: If another node is inserted by the parser, don't
116            reuse existing Text node to append a character (HTML5 revision
117            2124).
118    
119    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
122            revisions 1731 and 2128).
123    
124    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
127            Support for end tags of camelCase SVG elements was broken.  A
128            wrong error type text fixed.
129    
130    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
133    
134    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
135    
136            * HTML.pm.src: Support for new definition of |param| and |source|
137            start tag parsing (HTML5 revision 1731).
138    
139    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
142    
143    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
146            revisions 1731 and 1831).
147    
148    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
151            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
152            revision 1731).
153    
154    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
157            revision 1731).  End tags of |option| and |optgroup| elements are
158            now optional (HTML5 revision 1731).
159    
160    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src: New "special" elements added to the list (HTML5
163            revision 1778).  "strile" -> "strike".
164    
165    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
166    
167            * ContentType.pm (get_sniffed_type): Support for the "better"
168            content sniffing (HTML5 revision 1927).  Fixed a case where the
169            official type was not returned when the method is invoked in
170            list context.
171    
172    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src: Character references for non-space C0 characters,
175            including U+000B VT, DEL character, noncharacter code points, are
176            now converted to the U+FFFD character (cf. HTML5 revision 2138).
177    
178    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
179    
180            * ContentChecker.pm: |form=""| check support added.
181    
182    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
183    
184            * ContentChecker.pm: |contextmenu| validness is now checked using
185            |id| and |id_type| properties, and |menu| property is removed.
186    
187    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
188    
189            * ContentChecker.pm: Prepare for |form| |name| attribute's
190            duplication checking.
191    
192    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
195            unsupported charset="" parameter value (HTML5 revision 2131).
196    
197    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Remaining places where U+000B was allowed as a space
200            character are fixed (cf. HTML5 revision 1738).
201    
202            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
203            space characters (HTML5 revision 1738).
204    
205    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
206    
207            * HTML.pm.src: The "anything else" case for the "after after body"
208            insertion mode was not updated to switch to the "in body"
209            insertion mode.  U+000B is no longer a space character for the
210            purpose of tree construction phase (HTML5 revision 1738).
211    
212    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: U+000B is no longer a space character (HTML5
215            revision 1738).
216    
217    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentType.pm: 0x0B is no longer a space character (HTML5
220            revision 1738).
221    
222            * HTML.pm.src: U+000B is no longer a space character for the
223            algorithm for extracting an encoding from a Content-Type (HTML5
224            revision 1738).
225    
226    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
227    
228            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
229    
230    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
231    
232            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
233    
234            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
235            matching.
236    
237    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
238    
239            * LangTag.pm: Warn for private use language subtags.  Error level
240            typos fixed.  Support for Suppress-Script field.
241    
242            * mklangreg.pl: Support for dumping of nested structure.
243    
244    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
245    
246            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
247            recommended case as per RFC 4646.
248    
249    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
250    
251            * LangTag.pm (check_rfc4646_langtag): New method.
252    
253    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
254    
255            * mklangreg.pl: New script.
256    
257            * Makefile: Updated for creation of the module for language subtag
258            registry.
259            
260    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
261    
262            * Makefile: WebIDL.html added.
263    
264            * WebIDL.pod: New documentation.
265    
266    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
267    
268            * WebIDL.pm: Checker's error types are redefined.
269    
270    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
271    
272            * WebIDL.pm: Parser's error types are redefined.  Some forward
273            compatible parsing bugs are fixed.  Some unreachable code is
274            commented out.
275    
276    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
277    
278            * WebIDL.pm: Support for the remaining extended attributes is
279            added.  It does not satisfy the definition that a forward
280            interface declaration has an extended attribute.  It seems that
281            unless explicitly allowed, multiple extended attributes with the
282            same name are not allowed, though it is not explicitly mentioned in
283            the spec.
284    
285    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
286    
287            * WebIDL.pm: Unescapes extended attribute names and extended
288            attribute identifiers.  Preserve whether an extended attribute has
289            an argument list or not.  Support for extended attributes:
290            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
291            NameGetter, NameSetter, and Null.
292            (has_argument_list): New attribute.
293            (idl_text): Stringifies argument lists, if any, even if it is
294            empty.
295    
296    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
297    
298            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
299            |{s_kwd}| in DATA_STATE as default.
300    
301    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
304            by |{s_kwd}| in DATA_STATE.
305    
306    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Shorten keys.
309    
310    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: Remove checking for control character, surrogate
313            pair, or noncharacter code points and non-Unicode code
314            points (they should be handled by Whatpm::Charset::UnicodeChecker).
315            (parse_char_stream): Support for the |$get_wrapper| argument and
316            character stream error handlers.
317    
318    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
319    
320            * ContentChecker.pm: Don't call |load_ns_module|
321            for null-namespace elements/attributes.
322    
323            * HTML.pm.src: Factor out $disallowed_control_chars
324            as a hash.
325    
326    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
329            and |{next_char}| initializations are moved to initialization
330            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
331            with |parse_char_stream|.
332    
333    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src (parse_char_stream): Make |set_next_char|
336            invoke |manakai_read_until|, not only |read|, where
337            possible, to decrease the number of |read| method calls.
338    
339            * mkhtmlparser.pl: Related changes to the aforementioned
340            modification.
341    
342    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
343    
344            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
345            now reports character errors.
346    
347    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: White-space-led non-white-space character
350            tokens in the "before head" insertion mode were not
351            correctly handled.
352            (set_inner_html): Reimplemented using CharString decodehandle
353            class.  Support for $get_wrapper argument.  Support
354            for |{read_until}| feature.
355    
356    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src: Make a "bare ero" error for unknown
359            entities point to the "&" character.
360    
361    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
362    
363            * HTML.pm.src: It turns out that U+FFFD doesn't have to
364            be added to the list of excluded characters.
365    
366    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
369            and |column| a higher priority than those set by the
370            tokenizer's input handler.
371            ($self->{read_until}): Exclude U+FFFD (but this might
372            not be necessary, since now we do line/column fixup in
373            the character decode handle).
374    
375    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
376    
377            * HTML.pm.src: Use |{read_until}| where possible.
378    
379    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
382            and |manakai_getc_until| to |manakai_read_until| to
383            reduce the number of string copies.
384    
385    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
386    
387            * HTML.pm.src (parse_char_string): Use newly created
388            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
389            standard feature to |open| a string as a filehandle,
390            since Perl's string filehandle does not seem to support the
391            |ungetc| method correctly.
392            (parse_char_stream): Define |{getc_until}| method.
393            (DATA_STATE): Experimental support for |getc_until| feature.
394    
395    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
396    
397            * HTML.pm.src: Check points added to newly added branches.
398    
399    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
400    
401            * HTML.pm.src: Remove |{char}|, which is no longer used.
402            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
403            with |{prev_state}|.
404    
405            * mkhtmlparser.pl: Remove |{char}| feature.
406            Remove |!!!back-next-input-character;| macro.
407    
408    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
409    
410            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
411            entity related tokenizer states in favor of new states
412            implementing the consume character reference algorithm.
413    
414    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
415    
416            * HTML.pm.src: "Consume a character reference" algorithm is
417            now implemented as a tokenizer's state, rather than
418            a method, with minimum changes (more changes will
419            be made, in due course).  "Bogus comment state"'s inner
420            loop gets removed.
421    
422    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
423    
424            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
425            into their own tokenizer states.
426    
427    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
428    
429            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
430            is split into three states.
431    
432    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
433    
434            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
435            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
436            no longer does the tokenizer have to push back next input
437            characters in those states.
438    
439    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
440    
441            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
442            into four states so that no longer does the tokenizer have to push
443            back next input characters in that state.
444    
445    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
446    
447            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
448            which can be used to insert some wrapper between the character
449            stream handle and the tokenizer.  (It is currently not supported
450            for |set_inner_html| for |Element|s).
451    
452    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
453    
454            * HTML.pm.src: Ignore punctuation in charset names.
455    
456    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
457    
458            * ContentChecker.pm: Support for charset-layer error levels.
459    
460            * HTML.pm.src: Don't specify |text| argument for the
461            |chardecode:fallback| error, since it is not the encoding
462            being used alternatively.
463    
464    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
465    
466            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
467    
468    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
469    
470            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
471    
472    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
473    
474            * HTML.pm.src: Bug fix and sync with the spec with regard
475            to after after frameset insertion mode processing (HTML5
476            revision 1909).  Note that the implementation was wrong
477            per the old spec before the r1909 changes.
478    
479    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
480    
481            * HTMLTable.pm: scope=auto algorithm fix synced with the
482            spec (HTML5 revision 2093).
483            ($process_row): Algorithm step numbers synced with the
484            spec (HTML5 revision 2092).
485    
486    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
487    
488            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
489            revision 2094).
490    
491    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
492    
493            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
494    
495    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
496    
497            * HTML.pm.src: '"' and "'" at the end of attribute
498            name (after another attribute) now raise parse error (HTML5
499            revision 2123).  Empty unquoted attribute values are no
500            longer allowed (HTML5 revision 2122).
501    
502    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
503    
504            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
505            revision 2130).
506    
507    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
508    
509            * ContentChecker.pm: |xml:lang| attribute value must be the same
510            as |lang| attribute value for HTML elements (HTML5 revision 2062
511            and so on).
512    
513    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
514    
515            * ContentChecker.pm: Error level definition for |xml_id_error|
516            was missing.
517    
518            * URIChecker.pm: The end of the URL should be marked as the
519            error location for an empty path error.  The position
520            between the userinfo and the port components should be
521            marked as the error location for an empty host error.
522    
523    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
524    
525            * URIChecker.pm: Set parameters representing where in the
526            value the error occurs for errors.  Report unknown
527            address format error in warning level, since address
528            formats are rarely added.  Path segments starting with "/.."
529            were misinterpreted as a dot-segment.
530    
531    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
532    
533            * URIChecker.pm (check_iri_reference): Requires
534            |Message::DOM::DOMImplementation|.
535    
536    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
537    
538            * IMTChecker.pm: Updated for the new error reporting architecture.
539    
540            * ContentChecker.pm: Error levels for IMTs are added.
541    
542    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
543    
544            * H2H.pm (_shift_token): Support for unquoted HTML attribute
545            values.
546    
547    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
548    
549            * CacheManifest.pm: Support for new style of error
550            reports.
551    
552            * HTML.pm.src: Set line=1, column=1 to the document node.
553    
554    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
555    
556            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
557            and URL checkers.  Support for more error levels for bogus
558            language tag and URL "standards".
559    
560            * LangTag.pm, URIChecker.pm: Support for new style error
561            level reporting.
562    
563    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
564    
565            * ContentChecker.pm: Support for RDF/XML error levels.
