Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.288 by wakaba, Fri Sep 5 17:57:47 2008 UTC
revision 1.356 by wakaba, Fri Oct 17 07:14:29 2008 UTC
# Line 1  Line 1 
1    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (node_name): New attribute.
4            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
5            constants.
6            (create_element_type_definition_node, create_attribute_definition,
7            create_notation, create_general_entity,
8            get_element_type_definition_node,
9            set_element_type_definition_node, get_general_entity_node,
10            set_general_entity_node, get_notation_node, set_notation_node,
11            get_attribute_definition_node, set_attribute_definition_node): New
12            methods.
13            (element_types, entities, notations, attribute_definitions): New
14            attributes.
15            (DocumentType): Support for child nodes, entities, notations, and
16            element types.
17            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
18            New classes.
19    
20            * Dumper.pm: Support for general entities, notations, element type
21            definitions, and attribute definitions.
22    
23    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
24    
25            * NanoDOM.pm (create_processing_instruction): New method.
26            (xml_version, xml_encoding, xml_standalone): New attributes.
27            (ProcessingInstruction): New class.
28    
29    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: Handling of end tags in the foreign content
32            insertion mode was partially wrong, because of wrong bit
33            operations.
34    
35    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
36    
37            * NanoDOM.pm (dom_config): New attribute (do nothing), for
38            Whatpm::XML::Parser support.
39    
40    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
41    
42            * Makefile: New rule to make HTML/Tokenizer.pm is added.
43    
44            * HTML.pm.src: Tokenizer part moved to another file.
45    
46    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
49    
50    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
51    
52            * HTML.pm.src: Element category constants redefined.
53            
54    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
57            construction stage synced with the spec (HTML5 revisions 2139 and
58            2302).
59    
60    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
61    
62            * ContentChecker.pm: New error level "html5_fact" added, which
63            should be tentatively used until all of requirements are properly
64            specced as RFC 2119 "MUST" in HTML5.
65    
66    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
67    
68            * ContentChecker.pod: Note on internal flags is added.
69    
70    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: An AAA bug fixed.
73    
74    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src: If another node is inserted by the parser, don't
77            reuse existing Text node to append a character (HTML5 revision
78            2124).
79    
80    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
83            revisions 1731 and 2128).
84    
85    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
88            Support for end tags of camelCase SVG elements was broken.  A
89            wrong error type text fixed.
90    
91    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
94    
95    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
96    
97            * HTML.pm.src: Support for new definition of |param| and |source|
98            start tag parsing (HTML5 revision 1731).
99    
100    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
103    
104    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
107            revisions 1731 and 1831).
108    
109    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
112            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
113            revision 1731).
114    
115    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
118            revision 1731).  End tags of |option| and |optgroup| elements are
119            now optional (HTML5 revision 1731).
120    
121    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
122    
123            * HTML.pm.src: New "special" elements added to the list (HTML5
124            revision 1778).  "strile" -> "strike".
125    
126    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
127    
128            * ContentType.pm (get_sniffed_type): Support for the "better"
129            content sniffing (HTML5 revision 1927).  In some cases the official
130            type was not returned when the method was invoked in list
131            context.
132    
133    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Character references for non-space C0 characters,
136            including U+000B VT, DEL character, noncharacter code points, are
137            now converted to the U+FFFD character (cf. HTML5 revision 2138).
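
The replacement rule in the entry above can be sketched roughly as follows. This is an illustrative Python sketch, not the code in HTML.pm.src: the function names are hypothetical, the set of "space" code points that are kept is an assumption, and the parts of the character reference algorithm that the entry does not mention (surrogates, out-of-range values, the 0x80-0x9F mapping) are left out.

```python
REPLACEMENT = "\uFFFD"

# Assumption: these C0 code points count as "space" characters and are kept.
KEPT_C0 = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for the last two code points of each plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def resolve_numeric_reference(cp):
    """Return the character for a numeric reference (hypothetical helper).

    Assumes 0 <= cp <= 0x10FFFF and that cp is not a surrogate.
    """
    if cp in KEPT_C0:
        return chr(cp)
    if cp < 0x20 or cp == 0x7F or is_noncharacter(cp):
        # Non-space C0 (including U+000B VT), DEL, and noncharacters.
        return REPLACEMENT
    return chr(cp)
```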
138    
139    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
140    
141            * ContentChecker.pm: |form=""| check support added.
142    
143    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
144    
145            * ContentChecker.pm: |contextmenu| validness is now checked using
146            |id| and |id_type| properties, and |menu| property is removed.
147    
148    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
149    
150            * ContentChecker.pm: Prepare for |form| |name| attribute's
151            duplication checking.
152    
153    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
154    
155            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
156            unsupported charset="" parameter value (HTML5 revision 2131).
157    
158    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
159    
160            * HTML.pm.src: Remaining places where U+000B was allowed as a space
161            character are fixed (cf. HTML5 revision 1738).
162    
163            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
164            space characters (HTML5 revision 1738).
165    
166    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
167    
168            * HTML.pm.src: The "anything else" case for the "after after body"
169            insertion mode was not updated to switch to the "in body"
170            insertion mode.  U+000B is no longer a space character for the
171            purpose of tree construction phase (HTML5 revision 1738).
172    
173    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
174    
175            * HTML.pm.src: U+000B is no longer a space character (HTML5
176            revision 1738).
177    
178    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
179    
180            * ContentType.pm: 0x0B is no longer a space character (HTML5
181            revision 1738).
182    
183            * HTML.pm.src: U+000B is no longer a space character for the
184            algorithm for extracting an encoding from a Content-Type (HTML5
185            revision 1738).
186    
187    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
188    
189            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
190    
191    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
192    
193            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
194    
195            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
196            matching.
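
One way that sorting 'Prefix' values by length might ease matching is sketched below. The entry does not state the sort direction; longest-first is an assumption made here so that the first hit is the most specific prefix. The function names are hypothetical and the sketch is in Python rather than the Perl of mklangreg.pl.

```python
def sort_prefixes(prefixes):
    """Longest prefixes first; ties broken alphabetically for stable output."""
    return sorted(prefixes, key=lambda p: (-len(p), p))


def first_matching_prefix(tag, prefixes):
    """Return the most specific registered prefix that the tag starts with.

    Matching is case-insensitive and only succeeds on subtag boundaries.
    """
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        prefix = prefix.lower()
        if tag == prefix or tag.startswith(prefix + "-"):
            return prefix
    return None
```

With the list pre-sorted, the loop can stop at the first match instead of comparing every candidate.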
197    
198    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
199    
200            * LangTag.pm: Warn for private use language subtags.  Error level
201            typos fixed.  Support for Suppress-Script field.
202    
203            * mklangreg.pl: Support for dumping of nested structure.
204    
205    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
206    
207            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
208            recommended case as per RFC 4646.
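
The recommended case in RFC 4646 is lowercase for language subtags, an initial capital for script subtags, and uppercase for region subtags. A check along those lines could look like the Python sketch below. It is an approximation under stated assumptions, not the logic of check_rfc4646_langtag: subtags are classified by position and length only, and the function names are hypothetical.

```python
def recommended_case(tag):
    """Return the tag rewritten in the conventional case (approximate)."""
    parts = []
    seen_singleton = False
    for index, subtag in enumerate(tag.split("-")):
        if len(subtag) == 1:
            # Everything after a singleton (extension or private use)
            # is left in lowercase.
            seen_singleton = True
        if index > 0 and not seen_singleton and len(subtag) == 4 and subtag.isalpha():
            parts.append(subtag.capitalize())  # script, e.g. "Hant"
        elif index > 0 and not seen_singleton and len(subtag) == 2:
            parts.append(subtag.upper())       # region, e.g. "CN"
        else:
            parts.append(subtag.lower())
    return "-".join(parts)


def is_recommended_case(tag):
    return tag == recommended_case(tag)
```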
209    
210    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
211    
212            * LangTag.pm (check_rfc4646_langtag): New method.
213    
214    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
215    
216            * mklangreg.pl: New script.
217    
218            * Makefile: Updated for creation of the module for language subtag
219            registry.
220            
221    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
222    
223            * Makefile: WebIDL.html added.
224    
225            * WebIDL.pod: New documentation.
226    
227    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
228    
229            * WebIDL.pm: Checker's error types are redefined.
230    
231    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
232    
233            * WebIDL.pm: Parser's error types are redefined.  Some forward
234            compatible parsing bugs are fixed.  Some unreachable code is
235            commented out.
236    
237    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
238    
239            * WebIDL.pm: Support for the remaining extended attributes is
240            added.  It does not satisfy the definition that a forward
241            interface declaration has an extended attribute.  It seems that,
242            unless explicitly allowed, multiple extended attributes with the
243            same name are not allowed, though it is not explicitly mentioned in
244            the spec.
245    
246    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
247    
248            * WebIDL.pm: Unescapes extended attribute names and extended
249            attribute identifiers.  Preserve whether an extended attribute has
250            an argument list or not.  Support for extended attributes:
251            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
252            NameGetter, NameSetter, and Null.
253            (has_argument_list): New attribute.
254            (idl_text): Stringifies argument lists, if any, even if it is
255            empty.
256    
257    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
260            |{s_kwd}| in DATA_STATE as default.
261    
262    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
265            by |{s_kwd}| in DATA_STATE.
266    
267    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src: Shorten keys.
270    
271    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
272    
273            * HTML.pm.src: Remove checking for control character, surrogate
274            pair, or noncharacter code points and non-Unicode code
275            points (they should be handled by Whatpm::Charset::UnicodeChecker).
276            (parse_char_stream): Support for the |$get_wrapper| argument and
277            character stream error handlers.
278    
279    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm: Don't call |load_ns_module|
282            for null-namespace elements/attributes.
283    
284            * HTML.pm.src: Factor out $disallowed_control_chars
285            as a hash.
286    
287    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
290            and |{next_char}| initializations are moved to initialization
291            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
292            with |parse_char_stream|.
293    
294    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src (parse_char_stream): Make |set_next_char|
297            invoke |manakai_read_until|, not only |read|, where
298            possible, to decrease the number of |read| method calls.
299    
300            * mkhtmlparser.pl: Related changes to the aforementioned
301            modification.
302    
303    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
304    
305            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
306            now reports character errors.
307    
308    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: Character tokens with leading white space
311            followed by non-white-space characters were not correctly
312            handled in the "before head" insertion mode.
313            (set_inner_html): Reimplemented using CharString decodehandle
314            class.  Support for $get_wrapper argument.  Support
315            for |{read_until}| feature.
316    
317    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
318    
319            * HTML.pm.src: Make a "bare ero" error for unknown
320            entities point to the "&" character.
321    
322    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: It turns out that U+FFFD doesn't have to
325            be added to the list of excluded characters.
326    
327    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
330            and |column| a higher priority than those set by the
331            tokenizer's input handler.
332            ($self->{read_until}): Exclude U+FFFD (but this might
333            not be necessary, since now we do line/column fixup in
334            the character decode handle).
335    
336    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: Use |{read_until}| where possible.
339    
340    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
343            and |manakai_getc_until| to |manakai_read_until| to
344            reduce the number of string copies.
345    
346    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src (parse_char_string): Use newly created
349            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
350            standard feature to |open| a string as a filehandle,
351            since Perl's string filehandle does not seem to support the
352            |ungetc| method correctly.
353            (parse_char_stream): Define |{getc_until}| method.
354            (DATA_STATE): Experimental support for |getc_until| feature.
355    
356    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
357    
358            * HTML.pm.src: Check points added to newly added branches.
359    
360    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Remove |{char}|, which is no longer used.
363            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
364            with |{prev_state}|.
365    
366            * mkhtmlparser.pl: Remove |{char}| feature.
367            Remove |!!!back-next-input-character;| macro.
368    
369    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
370    
371            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
372            entity related tokenizer states in favor of new states
373            implementing the consume character reference algorithm.
374    
375    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
376    
377            * HTML.pm.src: "Consume a character reference" algorithm is
378            now implemented as a tokenizer's state, rather than
379            a method, with minimum changes (more changes will
380            be made, in due course).  "Bogus comment state"'s inner
381            loop gets removed.
382    
383    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
386            into their own tokenizer states.
387    
388    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
391            is split into three states.
392    
393    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
394    
395            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
396            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
397            no longer does the tokenizer have to push back next input
398            characters in those states.
399    
400    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
401    
402            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
403            into four states so that no longer does the tokenizer have to push
404            back next input characters in that state.
405    
406    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
407    
408            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
409            which can be used to insert some wrapper between the character
410            stream handle and the tokenizer.  (It is currently not supported
411            for |set_inner_html| for |Element|s).
412    
413    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
414    
415            * HTML.pm.src: Ignore punctuation in charset names.
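
A loose charset-name comparison of this kind might be sketched as below. The entry does not say which characters are treated as punctuation, so dropping everything except ASCII letters and digits is an assumption made here. The function names are hypothetical and the sketch is in Python, not the Perl of HTML.pm.src.

```python
import re


def normalize_charset_name(name):
    """Lowercase the name and drop every character that is not a-z or 0-9."""
    return re.sub(r"[^0-9a-z]", "", name.lower())


def same_charset_name(a, b):
    """Compare two charset labels while ignoring case and punctuation."""
    return normalize_charset_name(a) == normalize_charset_name(b)
```

Under this rule labels such as "UTF-8", "utf8", and "utf_8" compare equal.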
416    
417    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
418    
419            * ContentChecker.pm: Support for charset-layer error levels.
420    
421            * HTML.pm.src: Don't specify |text| argument for the
422            |chardecode:fallback| error, since it is not the encoding
423            being used alternatively.
424    
425  2008-09-06  Wakaba  <wakaba@suika.fam.cx>
426    
427          * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
