/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.295 by wakaba, Sat Sep 13 08:21:35 2008 UTC revision 1.359 by wakaba, Sun Oct 19 06:14:56 2008 UTC
# Line 1  Line 1 
1    2008-10-19  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (notation_name): New attribute.
4    
5    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
6    
7            * NanoDOM.pm (public_id, system_id): New attributes.a
8    
9    2008-10-18  Wakaba  <wakaba@suika.fam.cx>
10    
11            * NanoDOM.pm (text_content): Moved to Node from Element.  Setter
12            implemented.
13            (allowed_tokens, default_type, declared_type): Implemented.
14    
15    2008-10-17  Wakaba  <wakaba@suika.fam.cx>
16    
17            * NanoDOM.pm (node_name): New attribute.
18            (ELEMENT_TYPE_DEFINITION_NODE, ATTRIBUTE_DEFINITION_NODE): New
19            constants.
20            (create_element_type_definition_node, create_attribute_definition,
21            create_notation, create_general_entity,
22            get_element_type_definition_node,
23            set_element_type_definition_node, get_general_entity_node,
24            set_general_entity_node, get_notation_node, set_notation_node,
25            get_attribute_definition_node, set_attribute_definition_node): New
26            methods.
27            (element_types, entities, notations, attribute_definitions): New
28            attributes.
29            (DocumentType): Support for child nodes, entities, notations, and
30            element types.
31            (Entity, Notation, ElementTypeDefinition, AttributeDefinition):
32            New classes.
33    
34            * Dumper.pm: Support for general entities, notations, element type
35            definitions, and attribute definitions.
36    
37    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
38    
39            * NanoDOM.pm (create_processing_instruction): New method.
40            (xml_version, xml_encoding, xml_standalone): New attributes.
41            (ProcessingInstruction): New class.
42    
43    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Handling of end tags in the foreign content
46            insertion mode was partially wrong, because of wrong bit
47            operations.
48    
49    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
50    
51            * NanoDOM.pm (dom_config): New attribute (do nothing), for
52            Whatpm::XML::Parser support.
53    
54    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
55    
56            * Makefile: New rule to make HTML/Tokenizer.pm is added.
57    
58            * HTML.pm.src: Tokenizer part moved to another file.
59    
60    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
63    
64    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src: Element category constants redefined.
67            
68    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
71            construction stage synced with the spec (HTML5 revisions 2139 and
72            2302).
73    
74    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
75    
76            * ContentChecker.pm: New error level "html5_fact" added, which
77            should be tentatively used until all of requirements are properly
78            specced as RFC 2119 "MUST" in HTML5.
79    
80    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
81    
82            * ContentChecker.pod: Note on internal flags is added.
83    
84    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: An AAA bug fixed.
87    
88    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
89    
90            * HTML.pm.src: If another node is inserted by the parser, don't
91            reuse existing Text node to append a character (HTML5 revision
92            2124).
93    
94    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
97            revisions 1731 and 2128).
98    
99    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
102            Support for end tags of camelCase SVG elements were broken.  A
103            wrong error type text fixed.
104    
105    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
108    
109    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Support for new definition of |param| and |source|
112            start tag parsing (HTML5 revision 1731).
113    
114    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
117    
118    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
121            revisions 1731 and 1831).
122    
123    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
124    
125            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
126            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
127            revision 1731).
128    
129    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
132            revision 1731).  End tags of |option| and |optgroup| elements are
133            now optional (HTML5 revision 1731).
134    
135    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: New "special" elements added to the list (HTML5
138            revision 1778).  "strile" -> "strike".
139    
140    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
141    
142            * ContentType.pm (get_sniffed_type): Support for the "better"
143            content sniffing (HTML5 revision 1927).  In a case the official
144            type was not returned when the method is invoked in the list
145            context.
146    
147    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Character references for non-space C0 characters,
150            including U+000B VT, DEL character, noncharacter code points, are
151            now converted to the U+FFFD character (cf. HTML5 revision 2138).
152    
153    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
154    
155            * ContentChecker.pm: |form=""| check support added.
156    
157    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
158    
159            * ContentChecker.pm: |contextmenu| validness is now checked using
160            |id| and |id_type| properties, and |menu| property is removed.
161    
162    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
163    
164            * ContentChecker.pm: Prepare for |form| |name| attribute's
165            duplication checking.
166    
167    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
168    
169            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
170            unsupported charset="" parameter value (HTML5 revision 2131).
171    
172    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src: Reminding places where U+000B is allowed as a space
175            character is fixed (cf. HTML5 revision 1738).
176    
177            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
178            space characters (HTML5 revision 1738).
179    
180    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src: The "anything else" case for the "after after body"
183            insertion mode was not updated to swtich to the "in body"
184            insertion mode.  U+000B is no longer a space character for the
185            purpose of tree construction phase (HTML5 revision 1738).
186    
187    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
188    
189            * HTML.pm.src: U+000B is no longer a space character (HTML5
190            revision 1738).
191    
192    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
193    
194            * ContentType.pm: 0x0B is no longer a space character (HTML5
195            revision 1738).
196    
197            * HTML.pm.src: U+000B is no longer a space character for the
198            algorithm for extracting an encoding from a Content-Type (HTML5
199            revision 1738).
200    
201    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
202    
203            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
204    
205    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
206    
207            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
208    
209            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
210            matching.
211    
212    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
213    
214            * LangTag.pm: Warn for private use language subtags.  Error level
215            typos fixed.  Support for Suppress-Script field.
216    
217            * mklangreg.pl: Support for dumping of nested structure.
218    
219    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
220    
221            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
222            recommended case as per RFC 4646.
223    
224    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
225    
226            * LangTag.pm (check_rfc4646_langtag): New method.
227    
228    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
229    
230            * mklangreg.pl: New script.
231    
232            * Makefile: Updated for creation of the module for language subtag
233            registry.
234            
235    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
236    
237            * Makefile: WebIDL.html added.
238    
239            * WebIDL.pod: New documentation.
240    
241    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
242    
243            * WebIDL.pm: Checker's error types are redefined.
244    
245    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
246    
247            * WebIDL.pm: Parser's error types are redefined.  Some forward
248            compatible parsing bugs are fixed.  Some unreachable codes are
249            commented out.
250    
251    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
252    
253            * WebIDL.pm: Support for the reminding extended attributes are
254            added.  It does not satisfy the definition that a forward
255            interface declaration has an extended attribute.  It seems that
256            unless explicitly allowed multiple extended attributes with the
257            same name is not allowed, though it is not explicitly mentioned in
258            the spec.
259    
260    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
261    
262            * WebIDL.pm: Unescapes extended attribute names and extended
263            attribute identifiers.  Preserve whether an extended attribute has
264            an argument list of not.  Support for extended attributes:
265            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
266            NameGetter, NameSetter, and Null.
267            (has_argument_list): New attribute.
268            (idl_text): Stringifies argument lists, if any, even if it is
269            empty.
270    
271    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
272    
273            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
274            |{s_kwd}| in DATA_STATE as default.
275    
276    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
279            by |{s_kwd}| in DATA_STATE.
280    
281    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Shorten keys.
284    
285    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: Remove checking for control character, surrogate
288            pair, or noncharacter code points and non-Unicode code
289            points (they should be handled by Whatpm::Charset::UnicodeChecker).
290            (parse_char_stream): Support for the |$get_wrapper| argument and
291            character stream error handlers.
292    
293    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
294    
295            * ContentChecker.pm: Don't call |loda_ns_module|
296            for null-namespace elements/attributes.
297    
298            * HTML.pm.src: Fact out $disallowed_control_chars
299            as a hash.
300    
301    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
302    
303            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
304            and |{next_char}| initializations are moved to initialization
305            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
306            with |parse_char_stream|.
307    
308    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src (parse_char_stream): Make |set_next_char|
311            invoke |manakai_read_until|, not only |read|, where
312            possible, to decrease the number of |read| method calls.
313    
314            * mkhtmlparser.pl: Related changes to the aforementioned
315            modification.
316    
317    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
318    
319            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
320            would report character error from now.
321    
322    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: White-space-leaded non-white-space character
325            tokens in "before head insertion mode" was not
326            correctly handled.
327            (set_inner_html): Reimplemented using CharString decodehandle
328            class.  Support for $get_wrapper argument.  Support
329            for |{read_until}| feature.
330    
331    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
332    
333            * HTML.pm.src: Make a "bare ero" error for unknown
334            entities point the "&" character.
335    
336    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
337    
338            * HTML.pm.src: It turns out that U+FFFD don't have to
339            be added to the list of excluded characters.
340    
341    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src ($char_onerror): Have character decoder's |line|
344            and |column| a higher priority than the one set by the
345            tokenizer's input handler.
346            ($self->{read_until}): Exclude U+FFFD (but this might
347            not be necessary, since now we do line/column fixup in
348            the character decode handle).
349    
350    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
351    
352            * HTML.pm.src: Use |{read_until}| where possible.
353    
354    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
357            and |manakai_getc_until| to |manakai_read_until| to
358            reduce the number of string copies.
359    
360    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src (parse_char_string): Use newly created
363            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
364            standard feature to |open| a string as a filehandle,
365            since Perl's string filehandle seems not supporting |ungetc|
366            method correctly.
367            (parse_char_stream): Define |{getc_until}| method.
368            (DATA_STATE): Experimental support for |getc_until| feature.
369    
370    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
371    
372            * HTML.pm.src: Check points added to newly added branches.
373    
374    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTML.pm.src: Remove |{char}|, which is no longer used.
377            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
378            and replaced by |{prev_state}|.
379    
380            * mkhtmlparser.pl: Remove |{char}| feature.
381            Remove |!!!back-next-input-character;| macro.
382    
383    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
386            entity related tokenizer states in favor of new states
387            implementing the consume character reference algorithm.
388    
389    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
390    
391            * HTML.pm.src: "Consume a character reference" algorithm is
392            now implemented as a tokenizer's state, rather than
393            a method, with minimum changes (more changes will
394            be made, in due course).  "Bogus comment state"'s inner
395            loop gets removed.
396    
397  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
398    
399          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
400          into their own tokenizer states.          into their own tokenizer states.
401    
402  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.295  
changed lines
  Added in v.1.359

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24