Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.295 by wakaba, Sat Sep 13 08:21:35 2008 UTC revision 1.360 by wakaba, Sun Oct 19 07:19:00 2008 UTC
# Line 1  Line 1 
1    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (Entity->new): Initialize ->child_nodes as an empty
4            array.
5    
6    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
7    
8            * NanoDOM.pm (notation_name): New attribute.
9    
10    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
11    
12            * NanoDOM.pm (public_id, system_id): New attributes.
13    
14    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
15    
16            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
17            implemented.
18            (allowed_tokens, default_type, declared_type): Implemented.
19    
20    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
21    
22            * NanoDOM.pm (node_name): New attribute.
23            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
24            constants.
25            (create_element_type_definition_node, create_attribute_definition,
26            create_notation, create_general_entity,
27            get_element_type_definition_node,
28            set_element_type_definition_node, get_general_entity_node,
29            set_general_entity_node, get_notation_node, set_notation_node,
30            get_attribute_definition_node, set_attribute_definition_node): New
31            methods.
32            (element_types, entities, notations, attribute_definitions): New
33            attributes.
34            (DocumentType): Support for child nodes, entities, notations, and
35            element types.
36            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
37            New classes.
38    
39            * Dumper.pm: Support for general entities, notations, element type
40            definitions, and attribute definitions.
41    
42    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
43    
44            * NanoDOM.pm (create_processing_instruction): New method.
45            (xml_version, xml_encoding, xml_standalone): New attributes.
46            (ProcessingInstruction): New class.
47    
48    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: Handling of end tags in the foreign content
51            insertion mode was partially wrong, because of wrong bit
52            operations.
53    
54    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
55    
56            * NanoDOM.pm (dom_config): New attribute (do nothing), for
57            Whatpm::XML::Parser support.
58    
59    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
60    
61            * Makefile: New rule to make HTML/Tokenizer.pm is added.
62    
63            * HTML.pm.src: Tokenizer part moved to another file.
64    
65    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
68    
69    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
70    
71            * HTML.pm.src: Element category constants redefined.
72            
73    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
76            construction stage synced with the spec (HTML5 revisions 2139 and
77            2302).
78    
79    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
80    
81            * ContentChecker.pm: New error level "html5_fact" added, which
82            should be tentatively used until all of requirements are properly
83            specced as RFC 2119 "MUST" in HTML5.
84    
85    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
86    
87            * ContentChecker.pod: Note on internal flags is added.
88    
89    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
90    
91            * HTML.pm.src: An AAA bug fixed.
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: If another node is inserted by the parser, don't
96            reuse existing Text node to append a character (HTML5 revision
97            2124).
98    
99    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
102            revisions 1731 and 2128).
103    
104    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
105    
106            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
107            Support for end tags of camelCase SVG elements was broken.  A
108            wrong error type text fixed.
109    
110    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
113    
114    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: Support for new definition of |param| and |source|
117            start tag parsing (HTML5 revision 1731).
118    
119    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
122    
123    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
126            revisions 1731 and 1831).
127    
128    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
129    
130            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
131            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
132            revision 1731).
133    
134    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
135    
136            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
137            revision 1731).  End tags of |option| and |optgroup| elements are
138            now optional (HTML5 revision 1731).
139    
140    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: New "special" elements added to the list (HTML5
143            revision 1778).  "strile" -> "strike".
144    
145    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm (get_sniffed_type): Support for the "better"
148            content sniffing (HTML5 revision 1927).  Fixed a case in which the
149            official type was not returned when the method was invoked in list
150            context.
151    
152    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: Character references for non-space C0 characters,
155            including U+000B VT, DEL character, noncharacter code points, are
156            now converted to the U+FFFD character (cf. HTML5 revision 2138).
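
The mapping described in the 2008-09-22 entry can be illustrated with a minimal Python sketch. It is not the Perl code from HTML.pm.src: the function names and the exact set of C0 characters treated as spaces (tab, LF, FF, CR) are assumptions made for illustration.

```python
REPLACEMENT_CHARACTER = 0xFFFD

# Assumption: the C0 characters still treated as "space" are
# U+0009, U+000A, U+000C and U+000D (U+000B VT is excluded).
SPACE_C0 = frozenset({0x09, 0x0A, 0x0C, 0x0D})


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and the last two code points of each plane."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def map_char_ref(cp):
    """Map the code point of a numeric character reference (hypothetical helper)."""
    if cp < 0x20 and cp not in SPACE_C0:
        return REPLACEMENT_CHARACTER  # non-space C0 control, including U+000B
    if cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT_CHARACTER  # DEL or a noncharacter code point
    return cp
```

For example, `map_char_ref(0x0B)` yields U+FFFD under these assumptions, while `map_char_ref(0x0A)` is returned unchanged.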
157    
158    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
159    
160            * ContentChecker.pm: |form=""| check support added.
161    
162    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
163    
164            * ContentChecker.pm: |contextmenu| validness is now checked using
165            |id| and |id_type| properties, and |menu| property is removed.
166    
167    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: Prepare for |form| |name| attribute's
170            duplication checking.
171    
172    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
175            unsupported charset="" parameter value (HTML5 revision 2131).
176    
177    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Remaining places where U+000B is allowed as a space
180            character are fixed (cf. HTML5 revision 1738).
181    
182            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
183            space characters (HTML5 revision 1738).
184    
185    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: The "anything else" case for the "after after body"
188            insertion mode was not updated to switch to the "in body"
189            insertion mode.  U+000B is no longer a space character for the
190            purpose of tree construction phase (HTML5 revision 1738).
191    
192    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: U+000B is no longer a space character (HTML5
195            revision 1738).
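
The effect of dropping U+000B from the space characters can be sketched as follows. This is an illustrative Python sketch, not the module's Perl code; the helper name is hypothetical. An explicit character set is used because Python's own `str.split()` still treats U+000B as whitespace.

```python
# Space characters after HTML5 revision 1738: U+000B is no longer included.
HTML5_SPACE_CHARS = frozenset("\t\n\f\r ")


def split_on_spaces(value):
    """Split a string on runs of HTML5 space characters (hypothetical helper)."""
    tokens = []
    current = []
    for ch in value:
        if ch in HTML5_SPACE_CHARS:
            if current:
                tokens.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens
```

With this definition, a U+000B inside a value stays part of its token instead of separating two tokens.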
196    
197    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
198    
199            * ContentType.pm: 0x0B is no longer a space character (HTML5
200            revision 1738).
201    
202            * HTML.pm.src: U+000B is no longer a space character for the
203            algorithm for extracting an encoding from a Content-Type (HTML5
204            revision 1738).
205    
206    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
207    
208            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
209    
210    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
211    
212            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
213    
214            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
215            matching.
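
Why sorting 'Prefix' values by length eases matching can be seen in a small Python sketch. The sort direction (longest first) and both function names are assumptions; the entry above does not state how mklangreg.pl orders the values.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first (assumed direction), so the most specific wins."""
    return sorted(prefixes, key=len, reverse=True)


def first_matching_prefix(tag, prefixes):
    """Return the first prefix that matches the tag on a subtag boundary, or None."""
    tag_lc = tag.lower()
    for prefix in sort_prefixes(prefixes):
        prefix_lc = prefix.lower()
        if tag_lc == prefix_lc or tag_lc.startswith(prefix_lc + "-"):
            return prefix
    return None
```

With the longest prefix tried first, the first hit is also the most specific one, so the caller can stop at the first match.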
216    
217    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
218    
219            * LangTag.pm: Warn for private use language subtags.  Error level
220            typos fixed.  Support for Suppress-Script field.
221    
222            * mklangreg.pl: Support for dumping of nested structure.
223    
224    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
225    
226            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
227            recommended case as per RFC 4646.
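
A simplified Python sketch of the recommended-case check follows. It approximates the RFC 4646 convention (lowercase language, titlecase script, uppercase region) by subtag position and length only; it is not the logic of check_rfc4646_langtag, and the function names are hypothetical.

```python
def recommended_case(tag):
    """Return the tag in (approximately) the recommended case.

    Simplification: a four-letter subtag is treated as a script and a
    two-letter subtag as a region, unless it is the first subtag or
    follows a singleton (extension or private use).
    """
    result = []
    after_singleton = False
    for index, subtag in enumerate(tag.split("-")):
        positional = index > 0 and not after_singleton and subtag.isalpha()
        if positional and len(subtag) == 4:
            result.append(subtag.title())  # script, e.g. "Hant"
        elif positional and len(subtag) == 2:
            result.append(subtag.upper())  # region, e.g. "TW"
        else:
            result.append(subtag.lower())
        if len(subtag) == 1:
            after_singleton = True
    return "-".join(result)


def is_recommended_case(tag):
    """True if the tag is already written in the recommended case."""
    return tag == recommended_case(tag)
```

A checker built this way can warn, rather than raise an error, when `is_recommended_case` is false, since tags are matched case-insensitively.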
228    
229    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
230    
231            * LangTag.pm (check_rfc4646_langtag): New method.
232    
233    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
234    
235            * mklangreg.pl: New script.
236    
237            * Makefile: Updated for creation of the module for language subtag
238            registry.
239            
240    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
241    
242            * Makefile: WebIDL.html added.
243    
244            * WebIDL.pod: New documentation.
245    
246    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
247    
248            * WebIDL.pm: Checker's error types are redefined.
249    
250    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
251    
252            * WebIDL.pm: Parser's error types are redefined.  Some forward
253            compatible parsing bugs are fixed.  Some unreachable code is
254            commented out.
255    
256    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
257    
258            * WebIDL.pm: Support for the remaining extended attributes is
259            added.  It does not satisfy the definition that a forward
260            interface declaration has an extended attribute.  It seems that,
261            unless explicitly allowed, multiple extended attributes with the
262            same name are not allowed, though this is not explicitly mentioned in
263            the spec.
264    
265    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
266    
267            * WebIDL.pm: Unescapes extended attribute names and extended
268            attribute identifiers.  Preserve whether an extended attribute has
269            an argument list or not.  Support for extended attributes:
270            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
271            NameGetter, NameSetter, and Null.
272            (has_argument_list): New attribute.
273            (idl_text): Stringifies the argument list, if any, even if it is
274            empty.
275    
276    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
279            |{s_kwd}| in DATA_STATE as default.
280    
281    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
284            by |{s_kwd}| in DATA_STATE.
285    
286    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Shorten keys.
289    
290    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTML.pm.src: Remove checking for control character, surrogate
293            pair, or noncharacter code points and non-Unicode code
294            points (they should be handled by Whatpm::Charset::UnicodeChecker).
295            (parse_char_stream): Support for the |$get_wrapper| argument and
296            character stream error handlers.
297    
298    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
299    
300            * ContentChecker.pm: Don't call |load_ns_module|
301            for null-namespace elements/attributes.
302    
303            * HTML.pm.src: Factor out $disallowed_control_chars
304            as a hash.
305    
306    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
309            and |{next_char}| initializations are moved to initialization
310            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
311            with |parse_char_stream|.
312    
313    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
314    
315            * HTML.pm.src (parse_char_stream): Make |set_next_char|
316            invoke |manakai_read_until|, not only |read|, where
317            possible, to decrease the number of |read| method calls.
318    
319            * mkhtmlparser.pl: Related changes to the aforementioned
320            modification.
321    
322    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
325            now reports character errors.
326    
327    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src: White-space-led non-white-space character
330            tokens in the "before head" insertion mode were not
331            correctly handled.
332            (set_inner_html): Reimplemented using CharString decodehandle
333            class.  Support for $get_wrapper argument.  Support
334            for |{read_until}| feature.
335    
336    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: Make a "bare ero" error for unknown
339            entities point to the "&" character.
340    
341    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: It turns out that U+FFFD doesn't have to
344            be added to the list of excluded characters.
345    
346    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
349            and |column| a higher priority than those set by the
350            tokenizer's input handler.
351            ($self->{read_until}): Exclude U+FFFD (but this might
352            not be necessary, since now we do line/column fixup in
353            the character decode handle).
354    
355    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
356    
357            * HTML.pm.src: Use |{read_until}| where possible.
358    
359    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
362            and |manakai_getc_until| to |manakai_read_until| to
363            reduce the number of string copies.
364    
365    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src (parse_char_string): Use newly created
368            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
369            standard feature to |open| a string as a filehandle,
370            since Perl's string filehandle does not seem to support the |ungetc|
371            method correctly.
372            (parse_char_stream): Define |{getc_until}| method.
373            (DATA_STATE): Experimental support for |getc_until| feature.
374    
375    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
376    
377            * HTML.pm.src: Check points added to newly added branches.
378    
379    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
380    
381            * HTML.pm.src: Remove |{char}|, which is no longer used.
382            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
383            with |{prev_state}|.
384    
385            * mkhtmlparser.pl: Remove |{char}| feature.
386            Remove |!!!back-next-input-character;| macro.
387    
388    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
389    
390            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
391            entity related tokenizer states in favor of new states
392            implementing the consume character reference algorithm.
393    
394    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
395    
396            * HTML.pm.src: "Consume a character reference" algorithm is
397            now implemented as a tokenizer's state, rather than
398            a method, with minimum changes (more changes will
399            be made, in due course).  "Bogus comment state"'s inner
400            loop gets removed.
401    
402  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
403    
404          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
405          into their own tokenizer states.          into their own tokenizer states.
406    
407  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
