Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC
revision 1.355 by wakaba, Wed Oct 15 04:38:22 2008 UTC
# Line 1  Line 1 
1    2008-10-15  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (create_processing_instruction): New method.
4            (xml_version, xml_encoding, xml_standalone): New attributes.
5            (ProcessingInstruction): New class.
6    
7    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: Handling of end tags in the foreign content
10            insertion mode was partially wrong, because of wrong bit
11            operations.
12    
13    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
14    
15            * NanoDOM.pm (dom_config): New attribute (do nothing), for
16            Whatpm::XML::Parser support.
17    
18    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
19    
20            * Makefile: New rule to make HTML/Tokenizer.pm is added.
21    
22            * HTML.pm.src: Tokenizer part moved to another file.
23    
24    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
27    
28    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: Element category constants redefined.
31            
32    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
35            construction stage synced with the spec (HTML5 revisions 2139 and
36            2302).
37    
38    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
39    
40            * ContentChecker.pm: New error level "html5_fact" added, which
41            should be tentatively used until all of the requirements are properly
42            specced as RFC 2119 "MUST" in HTML5.
43    
44    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
45    
46            * ContentChecker.pod: Note on internal flags is added.
47    
48    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: An AAA bug fixed.
51    
52    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: If another node is inserted by the parser, don't
55            reuse existing Text node to append a character (HTML5 revision
56            2124).
57    
58    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
61            revisions 1731 and 2128).
62    
63    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
66            Support for end tags of camelCase SVG elements was broken.  A
67            wrong error type text fixed.
68    
69    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
70    
71            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
72    
73    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Support for new definition of |param| and |source|
76            start tag parsing (HTML5 revision 1731).
77    
78    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
81    
82    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
85            revisions 1731 and 1831).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
90            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
91            revision 1731).
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
96            revision 1731).  End tags of |option| and |optgroup| elements are
97            now optional (HTML5 revision 1731).
98    
99    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: New "special" elements added to the list (HTML5
102            revision 1778).  "strile" -> "strike".
103    
104    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
105    
106            * ContentType.pm (get_sniffed_type): Support for the "better"
107            content sniffing (HTML5 revision 1927).  In some cases the official
108            type was not returned when the method was invoked in list
109            context.
110    
111    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: Character references for non-space C0 characters
114            (including U+000B VT), the DEL character, and noncharacter code points are
115            now converted to the U+FFFD character (cf. HTML5 revision 2138).
116    
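The conversion described in the entry above can be sketched roughly as follows. This is an illustrative Python sketch, not the Perl code in HTML.pm.src; the names `sanitize_char_ref` and `SPACE_C0` are hypothetical, and the exact set of code points that Whatpm treats as space characters is an assumption here.

```python
REPLACEMENT = 0xFFFD

# Assumption: C0 controls still treated as space characters.
# U+000B VT is deliberately absent (cf. HTML5 revision 1738).
SPACE_C0 = {0x09, 0x0A, 0x0C, 0x0D}


def is_noncharacter(cp):
    """True for U+FDD0..U+FDEF and for any code point ending in FFFE or FFFF."""
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def sanitize_char_ref(cp):
    """Return the code point a numeric character reference should yield."""
    if cp <= 0x1F and cp not in SPACE_C0:
        return REPLACEMENT  # non-space C0 control, including U+000B
    if cp == 0x7F or is_noncharacter(cp):
        return REPLACEMENT  # DEL or a noncharacter code point
    return cp
```

For example, `sanitize_char_ref(0x0B)` yields U+FFFD, while `sanitize_char_ref(0x41)` passes through unchanged.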
117    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
118    
119            * ContentChecker.pm: |form=""| check support added.
120    
121    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
122    
123            * ContentChecker.pm: |contextmenu| validity is now checked using
124            |id| and |id_type| properties, and |menu| property is removed.
125    
126    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
127    
128            * ContentChecker.pm: Prepare for |form| |name| attribute's
129            duplication checking.
130    
131    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
134            unsupported charset="" parameter value (HTML5 revision 2131).
135    
136    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: Remaining places where U+000B was allowed as a space
139            character are fixed (cf. HTML5 revision 1738).
140    
141            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
142            space characters (HTML5 revision 1738).
143    
144    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
145    
146            * HTML.pm.src: The "anything else" case for the "after after body"
147            insertion mode was not updated to switch to the "in body"
148            insertion mode.  U+000B is no longer a space character for the
149            purpose of tree construction phase (HTML5 revision 1738).
150    
151    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: U+000B is no longer a space character (HTML5
154            revision 1738).
155    
156    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
157    
158            * ContentType.pm: 0x0B is no longer a space character (HTML5
159            revision 1738).
160    
161            * HTML.pm.src: U+000B is no longer a space character for the
162            algorithm for extracting an encoding from a Content-Type (HTML5
163            revision 1738).
164    
165    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
166    
167            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
168    
169    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
170    
171            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
172    
173            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
174            matching.
175    
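Why sorting 'Prefix' values by length eases matching can be sketched as below. This is a hypothetical Python illustration, not the logic of mklangreg.pl or LangTag.pm; the entry does not say whether the order is ascending or descending, so longest-first is an assumption, and `match_prefix` is an invented name.

```python
def sort_prefixes(prefixes):
    """Order prefixes longest first (assumed order), so the first hit is the most specific."""
    return sorted(prefixes, key=len, reverse=True)


def match_prefix(tag, prefixes):
    """Return the most specific registered prefix that `tag` starts with, or None."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        p = prefix.lower()
        # A prefix matches on whole subtags only, not on partial strings.
        if tag == p or tag.startswith(p + "-"):
            return prefix
    return None
```

With the prefixes `["sl", "sl-rozaj"]`, the tag `sl-rozaj-biske` matches `sl-rozaj` on the first successful comparison, with no need to track a best candidate.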
176    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
177    
178            * LangTag.pm: Warn for private use language subtags.  Error level
179            typos fixed.  Support for Suppress-Script field.
180    
181            * mklangreg.pl: Support for dumping of nested structure.
182    
183    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
184    
185            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
186            recommended case as per RFC 4646.
187    
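The recommended-case check can be pictured with the following Python sketch. It follows the usual convention (lowercase language, title-case four-letter script, uppercase two-letter region, lowercase after a singleton); the function names are hypothetical and the actual rules applied by |check_rfc4646_langtag| may differ in detail.

```python
def recommended_case(tag):
    """Return `tag` rewritten in the conventional case for language tags."""
    out = []
    after_singleton = False
    for i, sub in enumerate(tag.split("-")):
        if i > 0 and not after_singleton and sub.isalpha() and len(sub) == 4:
            out.append(sub.title())   # script subtag, e.g. "Latn"
        elif i > 0 and not after_singleton and sub.isalpha() and len(sub) == 2:
            out.append(sub.upper())   # region subtag, e.g. "US"
        else:
            out.append(sub.lower())   # everything else is lowercase
        if len(sub) == 1:
            after_singleton = True    # extension or private use sequence follows
    return "-".join(out)


def is_recommended_case(tag):
    """True if `tag` is already written in the conventional case."""
    return tag == recommended_case(tag)
```

A checker built this way would only warn, since case is not significant in language tags.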
188    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
189    
190            * LangTag.pm (check_rfc4646_langtag): New method.
191    
192    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
193    
194            * mklangreg.pl: New script.
195    
196            * Makefile: Updated for creation of the module for language subtag
197            registry.
198            
199    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * Makefile: WebIDL.html added.
202    
203            * WebIDL.pod: New documentation.
204    
205    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
206    
207            * WebIDL.pm: Checker's error types are redefined.
208    
209    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm: Parser's error types are redefined.  Some forward
212            compatible parsing bugs are fixed.  Some unreachable code is
213            commented out.
214    
215    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
216    
217            * WebIDL.pm: Support for the remaining extended attributes is
218            added.  It does not satisfy the definition that a forward
219            interface declaration has an extended attribute.  It seems that,
220            unless explicitly allowed, multiple extended attributes with the
221            same name are not allowed, though it is not explicitly mentioned in
222            the spec.
223    
224    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
225    
226            * WebIDL.pm: Unescapes extended attribute names and extended
227            attribute identifiers.  Preserve whether an extended attribute has
228            an argument list or not.  Support for extended attributes:
229            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
230            NameGetter, NameSetter, and Null.
231            (has_argument_list): New attribute.
232            (idl_text): Stringifies the argument list, if any, even if it is
233            empty.
234    
235    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
238            |{s_kwd}| in DATA_STATE as default.
239    
240    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
241    
242            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
243            by |{s_kwd}| in DATA_STATE.
244    
245    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src: Shorten keys.
248    
249    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: Remove checking for control character, surrogate
252            pair, or noncharacter code points and non-Unicode code
253            points (they should be handled by Whatpm::Charset::UnicodeChecker).
254            (parse_char_stream): Support for the |$get_wrapper| argument and
255            character stream error handlers.
256    
257    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
258    
259            * ContentChecker.pm: Don't call |load_ns_module|
260            for null-namespace elements/attributes.
261    
262            * HTML.pm.src: Factor out $disallowed_control_chars
263            as a hash.
264    
265    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
266    
267            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
268            and |{next_char}| initializations are moved to initialization
269            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
270            with |parse_char_stream|.
271    
272    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
273    
274            * HTML.pm.src (parse_char_stream): Make |set_next_char|
275            invoke |manakai_read_until|, not only |read|, where
276            possible, to decrease the number of |read| method calls.
277    
278            * mkhtmlparser.pl: Related changes to the aforementioned
279            modification.
280    
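The motivation for preferring a "read until" call over repeated single-character reads can be sketched as follows. This is a hypothetical Python illustration: `CharReader` and its methods are invented for the example and are not the signature of |manakai_read_until| or of the decode handle classes.

```python
import re


class CharReader:
    """Toy input handle that counts how many read calls the caller makes."""

    def __init__(self, text):
        self.text = text
        self.pos = 0
        self.calls = 0

    def read(self):
        """Return the next character, or an empty string at end of input."""
        self.calls += 1
        if self.pos >= len(self.text):
            return ""
        ch = self.text[self.pos]
        self.pos += 1
        return ch

    def read_until(self, stop):
        """Return the run of characters before the first match of `stop`."""
        self.calls += 1
        m = stop.search(self.text, self.pos)
        end = m.start() if m else len(self.text)
        run = self.text[self.pos:end]
        self.pos = end
        return run


# Characters that end a run of plain text in this sketch (an assumption).
STOP = re.compile(r"[<&]")
```

Reading `"hello <b>"` with one `read_until(STOP)` followed by one `read()` consumes `"hello "` and `"<"` in two calls, where character-at-a-time reading would need seven.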
281    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
284            now reports character errors.
285    
286    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
287    
288            * HTML.pm.src: Non-white-space character tokens preceded by
289            white space in the "before head" insertion mode were not
290            correctly handled.
291            (set_inner_html): Reimplemented using CharString decodehandle
292            class.  Support for $get_wrapper argument.  Support
293            for |{read_until}| feature.
294    
295    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Make a "bare ero" error for unknown
298            entities point to the "&" character.
299    
300    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
301    
302            * HTML.pm.src: It turns out that U+FFFD doesn't have to
303            be added to the list of excluded characters.
304    
305    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
308            and |column| a higher priority than those set by the
309            tokenizer's input handler.
310            ($self->{read_until}): Exclude U+FFFD (but this might
311            not be necessary, since now we do line/column fixup in
312            the character decode handle).
313    
314    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: Use |{read_until}| where possible.
317    
318    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
321            and |manakai_getc_until| to |manakai_read_until| to
322            reduce the number of string copies.
323    
324    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
325    
326            * HTML.pm.src (parse_char_string): Use newly created
327            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
328            standard feature to |open| a string as a filehandle,
329            since Perl's string filehandle does not seem to support the |ungetc|
330            method correctly.
331            (parse_char_stream): Define |{getc_until}| method.
332            (DATA_STATE): Experimental support for |getc_until| feature.
333    
334    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
335    
336            * HTML.pm.src: Check points added to newly added branches.
337    
338    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
339    
340            * HTML.pm.src: Remove |{char}|, which is no longer used.
341            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
342            which are replaced by |{prev_state}|.
343    
344            * mkhtmlparser.pl: Remove |{char}| feature.
345            Remove |!!!back-next-input-character;| macro.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
350            entity related tokenizer states in favor of new states
351            implementing the consume character reference algorithm.
352    
353  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
354    
355          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
356          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
357          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
358          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 360 
360    
361  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
362    
363          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
364          into their own tokenizer states.          into their own tokenizer states.
365    
366  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
