/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.276 by wakaba, Sun Aug 17 05:09:12 2008 UTC revision 1.341 by wakaba, Sat Oct 4 08:58:02 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Support for new definition of |param| and |source|
4            start tag parsing (HTML5 revision 1731).
5    
6    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
9    
10    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
13            revisions 1731 and 1831).
14    
15    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
18            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
19            revision 1731).
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
24            revision 1731).  End tags of |option| and |optgroup| elements are
25            now optional (HTML5 revision 1731).
26    
27    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: New "special" elements added to the list (HTML5
30            revision 1778).  "strile" -> "strike".
31    
32    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentType.pm (get_sniffed_type): Support for the "better"
35            content sniffing (HTML5 revision 1927).  In a case the official
36            type was not returned when the method is invoked in the list
37            context.
38    
39    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src: Character references for non-space C0 characters,
42            including U+000B VT, DEL character, noncharacter code points, are
43            now converted to the U+FFFD character (cf. HTML5 revision 2138).
44    
45    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
46    
47            * ContentChecker.pm: |form=""| check support added.
48    
49    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: |contextmenu| validness is now checked using
52            |id| and |id_type| properties, and |menu| property is removed.
53    
54    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
55    
56            * ContentChecker.pm: Prepare for |form| |name| attribute's
57            duplication checking.
58    
59    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
62            unsupported charset="" parameter value (HTML5 revision 2131).
63    
64    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src: Reminding places where U+000B is allowed as a space
67            character is fixed (cf. HTML5 revision 1738).
68    
69            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
70            space characters (HTML5 revision 1738).
71    
72    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: The "anything else" case for the "after after body"
75            insertion mode was not updated to swtich to the "in body"
76            insertion mode.  U+000B is no longer a space character for the
77            purpose of tree construction phase (HTML5 revision 1738).
78    
79    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
80    
81            * HTML.pm.src: U+000B is no longer a space character (HTML5
82            revision 1738).
83    
84    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
85    
86            * ContentType.pm: 0x0B is no longer a space character (HTML5
87            revision 1738).
88    
89            * HTML.pm.src: U+000B is no longer a space character for the
90            algorithm for extracting an encoding from a Content-Type (HTML5
91            revision 1738).
92    
93    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
94    
95            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
96    
97    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
98    
99            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
100    
101            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
102            matching.
103    
104    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
105    
106            * LangTag.pm: Warn for private use language subtags.  Error level
107            typos fixed.  Support for Suppress-Script field.
108    
109            * mklangreg.pl: Support for dumping of nested structure.
110    
111    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
112    
113            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
114            recommended case as per RFC 4646.
115    
116    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
117    
118            * LangTag.pm (check_rfc4646_langtag): New method.
119    
120    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
121    
122            * mklangreg.pl: New script.
123    
124            * Makefile: Updated for creation of the module for language subtag
125            registry.
126            
127    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
128    
129            * Makefile: WebIDL.html added.
130    
131            * WebIDL.pod: New documentation.
132    
133    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
134    
135            * WebIDL.pm: Checker's error types are redefined.
136    
137    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
138    
139            * WebIDL.pm: Parser's error types are redefined.  Some forward
140            compatible parsing bugs are fixed.  Some unreachable codes are
141            commented out.
142    
143    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
144    
145            * WebIDL.pm: Support for the reminding extended attributes are
146            added.  It does not satisfy the definition that a forward
147            interface declaration has an extended attribute.  It seems that
148            unless explicitly allowed multiple extended attributes with the
149            same name is not allowed, though it is not explicitly mentioned in
150            the spec.
151    
152    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
153    
154            * WebIDL.pm: Unescapes extended attribute names and extended
155            attribute identifiers.  Preserve whether an extended attribute has
156            an argument list of not.  Support for extended attributes:
157            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
158            NameGetter, NameSetter, and Null.
159            (has_argument_list): New attribute.
160            (idl_text): Stringifies argument lists, if any, even if it is
161            empty.
162    
163    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
166            |{s_kwd}| in DATA_STATE as default.
167    
168    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
169    
170            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
171            by |{s_kwd}| in DATA_STATE.
172    
173    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
174    
175            * HTML.pm.src: Shorten keys.
176    
177    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Remove checking for control character, surrogate
180            pair, or noncharacter code points and non-Unicode code
181            points (they should be handled by Whatpm::Charset::UnicodeChecker).
182            (parse_char_stream): Support for the |$get_wrapper| argument and
183            character stream error handlers.
184    
185    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
186    
187            * ContentChecker.pm: Don't call |loda_ns_module|
188            for null-namespace elements/attributes.
189    
190            * HTML.pm.src: Fact out $disallowed_control_chars
191            as a hash.
192    
193    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
196            and |{next_char}| initializations are moved to initialization
197            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
198            with |parse_char_stream|.
199    
200    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
201    
202            * HTML.pm.src (parse_char_stream): Make |set_next_char|
203            invoke |manakai_read_until|, not only |read|, where
204            possible, to decrease the number of |read| method calls.
205    
206            * mkhtmlparser.pl: Related changes to the aforementioned
207            modification.
208    
209    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
212            would report character error from now.
213    
214    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: White-space-leaded non-white-space character
217            tokens in "before head insertion mode" was not
218            correctly handled.
219            (set_inner_html): Reimplemented using CharString decodehandle
220            class.  Support for $get_wrapper argument.  Support
221            for |{read_until}| feature.
222    
223    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: Make a "bare ero" error for unknown
226            entities point the "&" character.
227    
228    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: It turns out that U+FFFD don't have to
231            be added to the list of excluded characters.
232    
233    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src ($char_onerror): Have character decoder's |line|
236            and |column| a higher priority than the one set by the
237            tokenizer's input handler.
238            ($self->{read_until}): Exclude U+FFFD (but this might
239            not be necessary, since now we do line/column fixup in
240            the character decode handle).
241    
242    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
243    
244            * HTML.pm.src: Use |{read_until}| where possible.
245    
246    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
249            and |manakai_getc_until| to |manakai_read_until| to
250            reduce the number of string copies.
251    
252    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
253    
254            * HTML.pm.src (parse_char_string): Use newly created
255            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
256            standard feature to |open| a string as a filehandle,
257            since Perl's string filehandle seems not supporting |ungetc|
258            method correctly.
259            (parse_char_stream): Define |{getc_until}| method.
260            (DATA_STATE): Experimental support for |getc_until| feature.
261    
262    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: Check points added to newly added branches.
265    
266    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Remove |{char}|, which is no longer used.
269            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
270            and replaced by |{prev_state}|.
271    
272            * mkhtmlparser.pl: Remove |{char}| feature.
273            Remove |!!!back-next-input-character;| macro.
274    
275    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
278            entity related tokenizer states in favor of new states
279            implementing the consume character reference algorithm.
280    
281    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: "Consume a character reference" algorithm is
284            now implemented as a tokenizer's state, rather than
285            a method, with minimum changes (more changes will
286            be made, in due course).  "Bogus comment state"'s inner
287            loop gets removed.
288    
289    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
292            into their own tokenizer states.
293    
294    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
297            is split into three states.
298    
299    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
302            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
303            no longer does the tokenizer have to push back next input
304            characters in those states.
305    
306    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
309            into four states so that no longer does the tokenizer have to push
310            back next input characters in that state.
311    
312    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
315            which can be used to insert some wrapper between the character
316            stream handle and the tokenizer.  (It is currently not supported
317            for |set_inner_html| for |Element|s).
318    
319    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
320    
321            * HTML.pm.src: Ignore punctuations in charset names.
322    
323    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
324    
325            * ContentChecker.pm: Support for charset-layer error levels.
326    
327            * HTML.pm.src: Don't specify |text| argument for the
328            |chardecode:fallback| error, since it is not the encoding
329            being used alternatively.
330    
331    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
332    
333            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
334    
335    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
336    
337            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
338    
339    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTML.pm.src: Bug fix and sync with the spec with regard
342            to after after frameset insertion mode processing (HTML5
343            revision 1909).  Note that the implementation was wrong
344            per the old spec before the r1909 changes.
345    
346    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
347    
348            * HTMLTable.pm: scope=auto algorithm fix synced with the
349            spec (HTML5 revision 2093).
350            ($process_row): Algorithm step numbers synced with the
351            spec (HTML5 revision 2092).
352    
353    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
354    
355            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
356            revision 2094).
357    
358    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
359    
360            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
361    
362    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
363    
364            * HTML.pm.src: '"' and "'" at the end of attribute
365            name (after another attribute) now raise parse error (HTML5
366            revision 2123).  Empty unquoted attribute values are no
367            longer allowed (HTML5 revision 2122).
368    
369    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
370    
371            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
372            revision 2130).
373    
374    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
375    
376            * ContentChecker.pm: |xml:lang| attribute value must be same
377            as |lang| attribute value for HTML elements (HTML5 revision 2062
378            and so on).
379    
380    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
381    
382            * ContentChecker.pm: Error level definition for |xml_id_error|
383            was missing.
384    
385            * URIChecker.pm: The end of the URL should be marked as the
386            error location for an empty path error.  The position
387            between the userinfo and the port components should be
388            marked as the error location for an empty host error.
389    
390    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
391    
392            * URIChecker.pm: Set parameters representing where in the
393            value the error occurs for errors.  Report unknown
394            address format error in warning level, since address
395            formats are rarely added.  Path segments starting with "/.."
396            were misinterpreted as a dot-segment.
397    
398    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
399    
400            * URIChecker.pm (check_iri_reference): Requires
401            |Message::DOM::DOMImplementation|.
402    
403    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
404    
405            * IMTChecker.pm: Updated for the new error reporting architecture.
406    
407            * ContentChecker.pm: Error levels for IMTs are added.
408    
409  2008-08-17  Wakaba  <wakaba@suika.fam.cx>  2008-08-17  Wakaba  <wakaba@suika.fam.cx>
410    
411          * H2H.pm (_shift_token): Support for unquoted HTML attribute          * H2H.pm (_shift_token): Support for unquoted HTML attribute

Legend:
Removed from v.1.276  
changed lines
  Added in v.1.341

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24