Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.276 by wakaba, Sun Aug 17 05:09:12 2008 UTC
revision 1.338 by wakaba, Sat Oct 4 06:30:33 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
4            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
5            revision 1731).
6    
7    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
8    
9            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
10            revision 1731).  End tags of |option| and |optgroup| elements are
11            now optional (HTML5 revision 1731).
12    
13    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
14    
15            * HTML.pm.src: New "special" elements added to the list (HTML5
16            revision 1778).  "strile" -> "strike".
17    
18    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
19    
20            * ContentType.pm (get_sniffed_type): Support for the "better"
21            content sniffing (HTML5 revision 1927).  Fixed a case in which
22            the official type was not returned when the method is invoked
23            in list context.
24    
25    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Character references to non-space C0 characters
28            (including U+000B VT), the DEL character, and noncharacter code
29            points are now converted to U+FFFD (cf. HTML5 revision 2138).
30    
31    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
32    
33            * ContentChecker.pm: |form=""| check support added.
34    
35    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
36    
37            * ContentChecker.pm: |contextmenu| validity is now checked using
38            |id| and |id_type| properties, and |menu| property is removed.
39    
40    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentChecker.pm: Prepare for |form| |name| attribute's
43            duplication checking.
44    
45    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
46    
47            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
48            unsupported charset="" parameter value (HTML5 revision 2131).
49    
50    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
51    
52            * HTML.pm.src: Remaining places where U+000B was allowed as a space
53            character are fixed (cf. HTML5 revision 1738).
54    
55            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
56            space characters (HTML5 revision 1738).
57    
58    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: The "anything else" case for the "after after body"
61            insertion mode was not updated to switch to the "in body"
62            insertion mode.  U+000B is no longer a space character for the
63            purpose of tree construction phase (HTML5 revision 1738).
64    
65    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
66    
67            * HTML.pm.src: U+000B is no longer a space character (HTML5
68            revision 1738).
69    
70    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
71    
72            * ContentType.pm: 0x0B is no longer a space character (HTML5
73            revision 1738).
74    
75            * HTML.pm.src: U+000B is no longer a space character for the
76            algorithm for extracting an encoding from a Content-Type (HTML5
77            revision 1738).
78    
79    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
80    
81            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
82    
83    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
84    
85            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
86    
87            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
88            matching.
89    
90    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
91    
92            * LangTag.pm: Warn for private use language subtags.  Error level
93            typos fixed.  Support for Suppress-Script field.
94    
95            * mklangreg.pl: Support for dumping of nested structure.
96    
97    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
98    
99            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
100            recommended case as per RFC 4646.
101    
102    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
103    
104            * LangTag.pm (check_rfc4646_langtag): New method.
105    
106    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
107    
108            * mklangreg.pl: New script.
109    
110            * Makefile: Updated for creation of the module for language subtag
111            registry.
112            
113    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
114    
115            * Makefile: WebIDL.html added.
116    
117            * WebIDL.pod: New documentation.
118    
119    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
120    
121            * WebIDL.pm: Checker's error types are redefined.
122    
123    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
124    
125            * WebIDL.pm: Parser's error types are redefined.  Some forward
126            compatible parsing bugs are fixed.  Some unreachable code is
127            commented out.
128    
129    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
130    
131            * WebIDL.pm: Support for the remaining extended attributes is
132            added.  It does not satisfy the definition that a forward
133            interface declaration has an extended attribute.  It seems that,
134            unless explicitly allowed, multiple extended attributes with the
135            same name are not allowed, though this is not explicitly mentioned in
136            the spec.
137    
138    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
139    
140            * WebIDL.pm: Unescapes extended attribute names and extended
141            attribute identifiers.  Preserve whether an extended attribute has
142            an argument list or not.  Support for extended attributes:
143            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
144            NameGetter, NameSetter, and Null.
145            (has_argument_list): New attribute.
146            (idl_text): Stringifies argument lists, if any, even if they are
147            empty.
148    
149    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
152            |{s_kwd}| in DATA_STATE as default.
153    
154    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
157            by |{s_kwd}| in DATA_STATE.
158    
159    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
160    
161            * HTML.pm.src: Shorten keys.
162    
163    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: Remove checking for control character, surrogate
166            pair, or noncharacter code points and non-Unicode code
167            points (they should be handled by Whatpm::Charset::UnicodeChecker).
168            (parse_char_stream): Support for the |$get_wrapper| argument and
169            character stream error handlers.
170    
171    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
172    
173            * ContentChecker.pm: Don't call |load_ns_module|
174            for null-namespace elements/attributes.
175    
176            * HTML.pm.src: Factor out $disallowed_control_chars
177            as a hash.
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
182            and |{next_char}| initializations are moved to initialization
183            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
184            with |parse_char_stream|.
185    
186    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
187    
188            * HTML.pm.src (parse_char_stream): Make |set_next_char|
189            invoke |manakai_read_until|, not only |read|, where
190            possible, to decrease the number of |read| method calls.
191    
192            * mkhtmlparser.pl: Related changes to the aforementioned
193            modification.
194    
195    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
198            now reports character errors.
199    
200    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
201    
202            * HTML.pm.src: Character tokens with leading white space
203            followed by non-white-space characters were not correctly
204            handled in the "before head" insertion mode.
205            (set_inner_html): Reimplemented using CharString decodehandle
206            class.  Support for $get_wrapper argument.  Support
207            for |{read_until}| feature.
208    
209    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Make a "bare ero" error for unknown
212            entities point to the "&" character.
213    
214    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: It turns out that U+FFFD does not have to
217            be added to the list of excluded characters.
218    
219    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
220    
221            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
222            and |column| a higher priority than those set by the
223            tokenizer's input handler.
224            ($self->{read_until}): Exclude U+FFFD (but this might
225            not be necessary, since now we do line/column fixup in
226            the character decode handle).
227    
228    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: Use |{read_until}| where possible.
231    
232    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
233    
234            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
235            and |manakai_getc_until| to |manakai_read_until| to
236            reduce the number of string copies.
237    
238    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src (parse_char_string): Use newly created
241            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
242            standard feature to |open| a string as a filehandle,
243            since Perl's string filehandle does not seem to support the |ungetc|
244            method correctly.
245            (parse_char_stream): Define |{getc_until}| method.
246            (DATA_STATE): Experimental support for |getc_until| feature.
247    
248    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: Check points added to newly added branches.
251    
252    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
253    
254            * HTML.pm.src: Remove |{char}|, which is no longer used.
255            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
256            with |{prev_state}|.
257    
258            * mkhtmlparser.pl: Remove |{char}| feature.
259            Remove |!!!back-next-input-character;| macro.
260    
261    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
264            entity related tokenizer states in favor of new states
265            implementing the consume character reference algorithm.
266    
267    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src: "Consume a character reference" algorithm is
270            now implemented as a tokenizer's state, rather than
271            a method, with minimum changes (more changes will
272            be made, in due course).  "Bogus comment state"'s inner
273            loop gets removed.
274    
275    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
278            into their own tokenizer states.
279    
280    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
283            is split into three states.
284    
285    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
286    
287            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
288            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
289            the tokenizer no longer has to push back next input
290            characters in those states.
291    
292    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
293    
294            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is broken
295            into four states so that the tokenizer no longer has to push
296            back next input characters in that state.
297    
298    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
299    
300            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
301            which can be used to insert some wrapper between the character
302            stream handle and the tokenizer.  (It is currently not supported
303            for |set_inner_html| for |Element|s).
304    
305    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
306    
307            * HTML.pm.src: Ignore punctuation in charset names.
308    
309    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
310    
311            * ContentChecker.pm: Support for charset-layer error levels.
312    
313            * HTML.pm.src: Don't specify |text| argument for the
314            |chardecode:fallback| error, since it is not the encoding
315            being used alternatively.
316    
317    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
318    
319            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
320    
321    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
322    
323            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
324    
325    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
326    
327            * HTML.pm.src: Bug fix and sync with the spec with regard
328            to after after frameset insertion mode processing (HTML5
329            revision 1909).  Note that the implementation was wrong
330            per the old spec before the r1909 changes.
331    
332    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTMLTable.pm: scope=auto algorithm fix synced with the
335            spec (HTML5 revision 2093).
336            ($process_row): Algorithm step numbers synced with the
337            spec (HTML5 revision 2092).
338    
339    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
340    
341            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
342            revision 2094).
343    
344    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
345    
346            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
347    
348    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
349    
350            * HTML.pm.src: '"' and "'" at the end of attribute
351            name (after another attribute) now raise parse error (HTML5
352            revision 2123).  Empty unquoted attribute values are no
353            longer allowed (HTML5 revision 2122).
354    
355    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
356    
357            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
358            revision 2130).
359    
360    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
361    
362            * ContentChecker.pm: |xml:lang| attribute value must be the same
363            as |lang| attribute value for HTML elements (HTML5 revision 2062
364            and so on).
365    
366    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
367    
368            * ContentChecker.pm: Error level definition for |xml_id_error|
369            was missing.
370    
371            * URIChecker.pm: The end of the URL should be marked as the
372            error location for an empty path error.  The position
373            between the userinfo and the port components should be
374            marked as the error location for an empty host error.
375    
376    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
377    
378            * URIChecker.pm: Set parameters representing where in the
379            value the error occurs for errors.  Report unknown
380            address format error in warning level, since address
381            formats are rarely added.  Path segments starting with "/.."
382            were misinterpreted as dot-segments.
383    
384    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
385    
386            * URIChecker.pm (check_iri_reference): Requires
387            |Message::DOM::DOMImplementation|.
388    
389    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
390    
391            * IMTChecker.pm: Updated for the new error reporting architecture.
392    
393            * ContentChecker.pm: Error levels for IMTs are added.
394    
395  2008-08-17  Wakaba  <wakaba@suika.fam.cx>
396    
397          * H2H.pm (_shift_token): Support for unquoted HTML attribute
