Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.282 by wakaba, Sat Aug 30 12:57:05 2008 UTC revision 1.361 by wakaba, Mon Oct 20 04:21:18 2008 UTC
# Line 1  Line 1 
1    2008-10-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (specified, all_declarations_processed,
4            manakai_attribute_type): New attributes.
5    
6    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
7    
8            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
9            array.
10    
11    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
12    
13            * NanoDOM.pm (notation_name): New attribute.
14    
15    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
16    
17            * NanoDOM.pm (public_id, system_id): New attributes.
18    
19    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
20    
21            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
22            implemented.
23            (allowed_tokens, default_type, declared_type): Implemented.
24    
25    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
26    
27            * NanoDOM.pm (node_name): New attribute.
28            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
29            constants.
30            (create_element_type_definition_node, create_attribute_definition,
31            create_notation, create_general_entity,
32            get_element_type_definition_node,
33            set_element_type_definition_node, get_general_entity_node,
34            set_general_entity_node, get_notation_node, set_notation_node,
35            get_attribute_definition_node, set_attribute_definition_node): New
36            methods.
37            (element_types, entities, notations, attribute_definitions): New
38            attributes.
39            (DocumentType): Support for child nodes, entities, notations, and
40            element types.
41            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
42            New classes.
43    
44            * Dumper.pm: Support for general entities, notations, element type
45            definitions, and attribute definitions.
46    
47    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
48    
49            * NanoDOM.pm (create_processing_instruction): New method.
50            (xml_version, xml_encoding, xml_standalone): New attributes.
51            (ProcessingInstruction): New class.
52    
53    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Handling of end tags in the foreign content
56            insertion mode was partially wrong, because of wrong bit
57            operations.
58    
59    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
60    
61            * NanoDOM.pm (dom_config): New attribute (does nothing), for
62            Whatpm::XML::Parser support.
63    
64    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
65    
66            * Makefile: New rule to make HTML/Tokenizer.pm is added.
67    
68            * HTML.pm.src: Tokenizer part moved to another file.
69    
70    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
73    
74    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: Element category constants redefined.
77            
78    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
81            construction stage synced with the spec (HTML5 revisions 2139 and
82            2302).
83    
84    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentChecker.pm: New error level "html5_fact" added, which
87            should be tentatively used until all of requirements are properly
88            specced as RFC 2119 "MUST" in HTML5.
89    
90    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
91    
92            * ContentChecker.pod: Note on internal flags is added.
93    
94    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: A bug in the adoption agency algorithm (AAA) fixed.
97    
98    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
99    
100            * HTML.pm.src: If another node is inserted by the parser, don't
101            reuse existing Text node to append a character (HTML5 revision
102            2124).
103    
104    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
107            revisions 1731 and 2128).
108    
109    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
112            Support for end tags of camelCase SVG elements was broken.  A
113            wrong error type text fixed.
114    
115    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
118    
119    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: Support for new definition of |param| and |source|
122            start tag parsing (HTML5 revision 1731).
123    
124    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
127    
128    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
131            revisions 1731 and 1831).
132    
133    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
136            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
137            revision 1731).
138    
139    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
142            revision 1731).  End tags of |option| and |optgroup| elements are
143            now optional (HTML5 revision 1731).
144    
145    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: New "special" elements added to the list (HTML5
148            revision 1778).  "strile" -> "strike".
149    
150    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
151    
152            * ContentType.pm (get_sniffed_type): Support for the "better"
153            content sniffing (HTML5 revision 1927).  In some cases the official
154            type was not returned when the method was invoked in list
155            context.
156    
157    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: Character references for non-space C0 characters,
160            including U+000B VT, DEL character, noncharacter code points, are
161            now converted to the U+FFFD character (cf. HTML5 revision 2138).
162    
163    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
164    
165            * ContentChecker.pm: |form=""| check support added.
166    
167    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: |contextmenu| validness is now checked using
170            |id| and |id_type| properties, and |menu| property is removed.
171    
172    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
173    
174            * ContentChecker.pm: Prepare for |form| |name| attribute's
175            duplication checking.
176    
177    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
180            unsupported charset="" parameter value (HTML5 revision 2131).
181    
182    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
183    
184            * HTML.pm.src: Remaining places where U+000B was allowed as a space
185            character are fixed (cf. HTML5 revision 1738).
186    
187            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
188            space characters (HTML5 revision 1738).
189    
190    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: The "anything else" case for the "after after body"
193            insertion mode was not updated to switch to the "in body"
194            insertion mode.  U+000B is no longer a space character for the
195            purpose of tree construction phase (HTML5 revision 1738).
196    
197    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: U+000B is no longer a space character (HTML5
200            revision 1738).
201    
202    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
203    
204            * ContentType.pm: 0x0B is no longer a space character (HTML5
205            revision 1738).
206    
207            * HTML.pm.src: U+000B is no longer a space character for the
208            algorithm for extracting an encoding from a Content-Type (HTML5
209            revision 1738).
210    
211    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
212    
213            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
214    
215    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
216    
217            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
218    
219            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
220            matching.
221    
222    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
223    
224            * LangTag.pm: Warn for private use language subtags.  Error level
225            typos fixed.  Support for Suppress-Script field.
226    
227            * mklangreg.pl: Support for dumping of nested structure.
228    
229    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
230    
231            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
232            recommended case as per RFC 4646.
233    
234    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
235    
236            * LangTag.pm (check_rfc4646_langtag): New method.
237    
238    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
239    
240            * mklangreg.pl: New script.
241    
242            * Makefile: Updated for creation of the module for language subtag
243            registry.
244            
245    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
246    
247            * Makefile: WebIDL.html added.
248    
249            * WebIDL.pod: New documentation.
250    
251    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
252    
253            * WebIDL.pm: Checker's error types are redefined.
254    
255    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
256    
257            * WebIDL.pm: Parser's error types are redefined.  Some
258            forward-compatible parsing bugs are fixed.  Some unreachable code
259            is commented out.
260    
261    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
262    
263            * WebIDL.pm: Support for the remaining extended attributes is
264            added.  It does not satisfy the definition that a forward
265            interface declaration has an extended attribute.  It seems that,
266            unless explicitly allowed, multiple extended attributes with the
267            same name are not allowed, though it is not explicitly mentioned in
268            the spec.
269    
270    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
271    
272            * WebIDL.pm: Unescape extended attribute names and extended
273            attribute identifiers.  Preserve whether an extended attribute has
274            an argument list or not.  Support for extended attributes:
275            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
276            NameGetter, NameSetter, and Null.
277            (has_argument_list): New attribute.
278            (idl_text): Stringify the argument list, if any, even if it is
279            empty.
280    
281    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
284            |{s_kwd}| in DATA_STATE as default.
285    
286    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
289            by |{s_kwd}| in DATA_STATE.
290    
291    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: Shorten keys.
294    
295    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Remove checking for control character, surrogate
298            pair, or noncharacter code points and non-Unicode code
299            points (they should be handled by Whatpm::Charset::UnicodeChecker).
300            (parse_char_stream): Support for the |$get_wrapper| argument and
301            character stream error handlers.
302    
303    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
304    
305            * ContentChecker.pm: Don't call |load_ns_module|
306            for null-namespace elements/attributes.
307    
308            * HTML.pm.src: Factor out $disallowed_control_chars
309            as a hash.
310    
311    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
312    
313            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
314            and |{next_char}| initializations are moved to initialization
315            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
316            with |parse_char_stream|.
317    
318    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src (parse_char_stream): Make |set_next_char|
321            invoke |manakai_read_until|, not only |read|, where
322            possible, to decrease the number of |read| method calls.
323    
324            * mkhtmlparser.pl: Related changes to the aforementioned
325            modification.
326    
327    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
330            now reports character errors.
331    
332    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src: Non-white-space character tokens with leading
335            white space were not correctly handled in the "before head"
336            insertion mode.
337            (set_inner_html): Reimplemented using CharString decodehandle
338            class.  Support for $get_wrapper argument.  Support
339            for |{read_until}| feature.
340    
341    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: Make a "bare ero" error for unknown
344            entities point to the "&" character.
345    
346    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src: It turns out that U+FFFD doesn't have to
349            be added to the list of excluded characters.
350    
351    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
352    
353            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
354            and |column| a higher priority than the ones set by the
355            tokenizer's input handler.
356            ($self->{read_until}): Exclude U+FFFD (but this might
357            not be necessary, since now we do line/column fixup in
358            the character decode handle).
359    
360    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Use |{read_until}| where possible.
363    
364    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
365    
366            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
367            and |manakai_getc_until| to |manakai_read_until| to
368            reduce the number of string copies.
369    
370    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
371    
372            * HTML.pm.src (parse_char_string): Use newly created
373            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
374            standard feature to |open| a string as a filehandle,
375            since Perl's string filehandle does not seem to support the |ungetc|
376            method correctly.
377            (parse_char_stream): Define |{getc_until}| method.
378            (DATA_STATE): Experimental support for |getc_until| feature.
379    
380    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
381    
382            * HTML.pm.src: Check points added to newly added branches.
383    
384    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
385    
386            * HTML.pm.src: Remove |{char}|, which is no longer used.
387            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
388            by |{prev_state}|.
389    
390            * mkhtmlparser.pl: Remove |{char}| feature.
391            Remove |!!!back-next-input-character;| macro.
392    
393    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
396            entity related tokenizer states in favor of new states
397            implementing the consume character reference algorithm.
398    
399    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
400    
401            * HTML.pm.src: "Consume a character reference" algorithm is
402            now implemented as a tokenizer's state, rather than
403            a method, with minimum changes (more changes will
404            be made, in due course).  "Bogus comment state"'s inner
405            loop gets removed.
406    
407    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
408    
409            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
410            into their own tokenizer states.
411    
412    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
415            is split into three states.
416    
417    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
418    
419            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
420            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
421            the tokenizer no longer has to push back next input
422            characters in those states.
423    
424    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
425    
426            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
427            into four states so that the tokenizer no longer has to push
428            back next input characters in that state.
429    
430    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
431    
432            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
433            which can be used to insert some wrapper between the character
434            stream handle and the tokenizer.  (It is currently not supported
435            for |set_inner_html| for |Element|s).
436    
437    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
438    
439            * HTML.pm.src: Ignore punctuation in charset names.
440    
441    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
442    
443            * ContentChecker.pm: Support for charset-layer error levels.
444    
445            * HTML.pm.src: Don't specify |text| argument for the
446            |chardecode:fallback| error, since it is not the encoding
447            being used as the alternative.
448    
449    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
450    
451            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
452    
453    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
454    
455            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
456    
457    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
458    
459            * HTML.pm.src: Bug fix and sync with the spec with regard
460            to after after frameset insertion mode processing (HTML5
461            revision 1909).  Note that the implementation was wrong
462            per the old spec before the r1909 changes.
463    
464    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
465    
466            * HTMLTable.pm: scope=auto algorithm fix synced with the
467            spec (HTML5 revision 2093).
468            ($process_row): Algorithm step numbers synced with the
469            spec (HTML5 revision 2092).
470    
471    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
472    
473            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
474            revision 2094).
475    
476    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
477    
478            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
479    
480    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
481    
482            * HTML.pm.src: '"' and "'" at the end of attribute
483            name (after another attribute) now raise a parse error (HTML5
484            revision 2123).  Empty unquoted attribute values are no
485            longer allowed (HTML5 revision 2122).
486    
487  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
488    
489          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
