/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.281 by wakaba, Sat Aug 30 12:33:36 2008 UTC revision 1.345 by wakaba, Sat Oct 4 14:31:27 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: If another node is inserted by the parser, don't
4            reuse existing Text node to append a character (HTML5 revision
5            2124).
6    
7    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
10            revisions 1731 and 2128).
11    
12    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
15            Support for end tags of camelCase SVG elements were broken.  A
16            wrong error type text fixed.
17    
18    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
19    
20            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
21    
22    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Support for new definition of |param| and |source|
25            start tag parsing (HTML5 revision 1731).
26    
27    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
30    
31    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
34            revisions 1731 and 1831).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
39            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
40            revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
45            revision 1731).  End tags of |option| and |optgroup| elements are
46            now optional (HTML5 revision 1731).
47    
48    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
49    
50            * HTML.pm.src: New "special" elements added to the list (HTML5
51            revision 1778).  "strile" -> "strike".
52    
53    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentType.pm (get_sniffed_type): Support for the "better"
56            content sniffing (HTML5 revision 1927).  In a case the official
57            type was not returned when the method is invoked in the list
58            context.
59    
60    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
61    
62            * HTML.pm.src: Character references for non-space C0 characters,
63            including U+000B VT, DEL character, noncharacter code points, are
64            now converted to the U+FFFD character (cf. HTML5 revision 2138).
65    
66    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
67    
68            * ContentChecker.pm: |form=""| check support added.
69    
70    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
71    
72            * ContentChecker.pm: |contextmenu| validness is now checked using
73            |id| and |id_type| properties, and |menu| property is removed.
74    
75    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
76    
77            * ContentChecker.pm: Prepare for |form| |name| attribute's
78            duplication checking.
79    
80    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
83            unsupported charset="" parameter value (HTML5 revision 2131).
84    
85    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: Reminding places where U+000B is allowed as a space
88            character is fixed (cf. HTML5 revision 1738).
89    
90            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
91            space characters (HTML5 revision 1738).
92    
93    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: The "anything else" case for the "after after body"
96            insertion mode was not updated to swtich to the "in body"
97            insertion mode.  U+000B is no longer a space character for the
98            purpose of tree construction phase (HTML5 revision 1738).
99    
100    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: U+000B is no longer a space character (HTML5
103            revision 1738).
104    
105    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
106    
107            * ContentType.pm: 0x0B is no longer a space character (HTML5
108            revision 1738).
109    
110            * HTML.pm.src: U+000B is no longer a space character for the
111            algorithm for extracting an encoding from a Content-Type (HTML5
112            revision 1738).
113    
114    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
115    
116            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
117    
118    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
119    
120            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
121    
122            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
123            matching.
124    
125    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
126    
127            * LangTag.pm: Warn for private use language subtags.  Error level
128            typos fixed.  Support for Suppress-Script field.
129    
130            * mklangreg.pl: Support for dumping of nested structure.
131    
132    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
133    
134            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
135            recommended case as per RFC 4646.
136    
137    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
138    
139            * LangTag.pm (check_rfc4646_langtag): New method.
140    
141    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
142    
143            * mklangreg.pl: New script.
144    
145            * Makefile: Updated for creation of the module for language subtag
146            registry.
147            
148    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
149    
150            * Makefile: WebIDL.html added.
151    
152            * WebIDL.pod: New documentation.
153    
154    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
155    
156            * WebIDL.pm: Checker's error types are redefined.
157    
158    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
159    
160            * WebIDL.pm: Parser's error types are redefined.  Some forward
161            compatible parsing bugs are fixed.  Some unreachable codes are
162            commented out.
163    
164    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
165    
166            * WebIDL.pm: Support for the reminding extended attributes are
167            added.  It does not satisfy the definition that a forward
168            interface declaration has an extended attribute.  It seems that
169            unless explicitly allowed multiple extended attributes with the
170            same name is not allowed, though it is not explicitly mentioned in
171            the spec.
172    
173    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
174    
175            * WebIDL.pm: Unescapes extended attribute names and extended
176            attribute identifiers.  Preserve whether an extended attribute has
177            an argument list of not.  Support for extended attributes:
178            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
179            NameGetter, NameSetter, and Null.
180            (has_argument_list): New attribute.
181            (idl_text): Stringifies argument lists, if any, even if it is
182            empty.
183    
184    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
187            |{s_kwd}| in DATA_STATE as default.
188    
189    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
192            by |{s_kwd}| in DATA_STATE.
193    
194    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
195    
196            * HTML.pm.src: Shorten keys.
197    
198    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Remove checking for control character, surrogate
201            pair, or noncharacter code points and non-Unicode code
202            points (they should be handled by Whatpm::Charset::UnicodeChecker).
203            (parse_char_stream): Support for the |$get_wrapper| argument and
204            character stream error handlers.
205    
206    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
207    
208            * ContentChecker.pm: Don't call |loda_ns_module|
209            for null-namespace elements/attributes.
210    
211            * HTML.pm.src: Fact out $disallowed_control_chars
212            as a hash.
213    
214    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
217            and |{next_char}| initializations are moved to initialization
218            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
219            with |parse_char_stream|.
220    
221    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
222    
223            * HTML.pm.src (parse_char_stream): Make |set_next_char|
224            invoke |manakai_read_until|, not only |read|, where
225            possible, to decrease the number of |read| method calls.
226    
227            * mkhtmlparser.pl: Related changes to the aforementioned
228            modification.
229    
230    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
231    
232            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
233            would report character error from now.
234    
235    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: White-space-leaded non-white-space character
238            tokens in "before head insertion mode" was not
239            correctly handled.
240            (set_inner_html): Reimplemented using CharString decodehandle
241            class.  Support for $get_wrapper argument.  Support
242            for |{read_until}| feature.
243    
244    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
245    
246            * HTML.pm.src: Make a "bare ero" error for unknown
247            entities point the "&" character.
248    
249    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: It turns out that U+FFFD don't have to
252            be added to the list of excluded characters.
253    
254    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTML.pm.src ($char_onerror): Have character decoder's |line|
257            and |column| a higher priority than the one set by the
258            tokenizer's input handler.
259            ($self->{read_until}): Exclude U+FFFD (but this might
260            not be necessary, since now we do line/column fixup in
261            the character decode handle).
262    
263    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: Use |{read_until}| where possible.
266    
267    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
270            and |manakai_getc_until| to |manakai_read_until| to
271            reduce the number of string copies.
272    
273    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src (parse_char_string): Use newly created
276            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
277            standard feature to |open| a string as a filehandle,
278            since Perl's string filehandle seems not supporting |ungetc|
279            method correctly.
280            (parse_char_stream): Define |{getc_until}| method.
281            (DATA_STATE): Experimental support for |getc_until| feature.
282    
283    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Check points added to newly added branches.
286    
287    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Remove |{char}|, which is no longer used.
290            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
291            and replaced by |{prev_state}|.
292    
293            * mkhtmlparser.pl: Remove |{char}| feature.
294            Remove |!!!back-next-input-character;| macro.
295    
296    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
297    
298            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
299            entity related tokenizer states in favor of new states
300            implementing the consume character reference algorithm.
301    
302    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTML.pm.src: "Consume a character reference" algorithm is
305            now implemented as a tokenizer's state, rather than
306            a method, with minimum changes (more changes will
307            be made, in due course).  "Bogus comment state"'s inner
308            loop gets removed.
309    
310    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
311    
312            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
313            into their own tokenizer states.
314    
315    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
316    
317            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
318            is split into three states.
319    
320    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
321    
322            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
323            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
324            no longer does the tokenizer have to push back next input
325            characters in those states.
326    
327    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
328    
329            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
330            into four states so that no longer does the tokenizer have to push
331            back next input characters in that state.
332    
333    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
334    
335            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
336            which can be used to insert some wrapper between the character
337            stream handle and the tokenizer.  (It is currently not supported
338            for |set_inner_html| for |Element|s).
339    
340    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
341    
342            * HTML.pm.src: Ignore punctuations in charset names.
343    
344    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
345    
346            * ContentChecker.pm: Support for charset-layer error levels.
347    
348            * HTML.pm.src: Don't specify |text| argument for the
349            |chardecode:fallback| error, since it is not the encoding
350            being used alternatively.
351    
352    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
353    
354            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
355    
356    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
357    
358            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
359    
360    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: Bug fix and sync with the spec with regard
363            to after after frameset insertion mode processing (HTML5
364            revision 1909).  Note that the implementation was wrong
365            per the old spec before the r1909 changes.
366    
367    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
368    
369            * HTMLTable.pm: scope=auto algorithm fix synced with the
370            spec (HTML5 revision 2093).
371            ($process_row): Algorithm step numbers synced with the
372            spec (HTML5 revision 2092).
373    
374    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
375    
376            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
377            revision 2094).
378    
379    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
380    
381            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
382    
383    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
384    
385            * HTML.pm.src: '"' and "'" at the end of attribute
386            name (after another attribute) now raise parse error (HTML5
387            revision 2123).  Empty unquoted attribute values are no
388            longer allowed (HTML5 revision 2122).
389    
390    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
391    
392            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
393            revision 2130).
394    
395  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
396    
397          * ContentChecker.pm: |xml:lang| attribute value must be same          * ContentChecker.pm: |xml:lang| attribute value must be same

Legend:
Removed from v.1.281  
changed lines
  Added in v.1.345

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24