/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.282 by wakaba, Sat Aug 30 12:57:05 2008 UTC revision 1.328 by wakaba, Sat Sep 20 10:20:47 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: The "anything else" case for the "after after body"
4            insertion mode was not updated to swtich to the "in body"
5            insertion mode.  U+000B is no longer a space character for the
6            purpose of tree construction phase (HTML5 revision 1738).
7    
8    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
9    
10            * HTML.pm.src: U+000B is no longer a space character (HTML5
11            revision 1738).
12    
13    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
14    
15            * ContentType.pm: 0x0B is no longer a space character (HTML5
16            revision 1738).
17    
18            * HTML.pm.src: U+000B is no longer a space character for the
19            algorithm for extracting an encoding from a Content-Type (HTML5
20            revision 1738).
21    
22    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
23    
24            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
25    
26    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
27    
28            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
29    
30            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
31            matching.
32    
33    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
34    
35            * LangTag.pm: Warn for private use language subtags.  Error level
36            typos fixed.  Support for Suppress-Script field.
37    
38            * mklangreg.pl: Support for dumping of nested structure.
39    
40    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
41    
42            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
43            recommended case as per RFC 4646.
44    
45    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
46    
47            * LangTag.pm (check_rfc4646_langtag): New method.
48    
49    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
50    
51            * mklangreg.pl: New script.
52    
53            * Makefile: Updated for creation of the module for language subtag
54            registry.
55            
56    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
57    
58            * Makefile: WebIDL.html added.
59    
60            * WebIDL.pod: New documentation.
61    
62    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
63    
64            * WebIDL.pm: Checker's error types are redefined.
65    
66    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
67    
68            * WebIDL.pm: Parser's error types are redefined.  Some forward
69            compatible parsing bugs are fixed.  Some unreachable codes are
70            commented out.
71    
72    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
73    
74            * WebIDL.pm: Support for the reminding extended attributes are
75            added.  It does not satisfy the definition that a forward
76            interface declaration has an extended attribute.  It seems that
77            unless explicitly allowed multiple extended attributes with the
78            same name is not allowed, though it is not explicitly mentioned in
79            the spec.
80    
81    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
82    
83            * WebIDL.pm: Unescapes extended attribute names and extended
84            attribute identifiers.  Preserve whether an extended attribute has
85            an argument list of not.  Support for extended attributes:
86            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
87            NameGetter, NameSetter, and Null.
88            (has_argument_list): New attribute.
89            (idl_text): Stringifies argument lists, if any, even if it is
90            empty.
91    
92    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
93    
94            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
95            |{s_kwd}| in DATA_STATE as default.
96    
97    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
100            by |{s_kwd}| in DATA_STATE.
101    
102    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
103    
104            * HTML.pm.src: Shorten keys.
105    
106    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Remove checking for control character, surrogate
109            pair, or noncharacter code points and non-Unicode code
110            points (they should be handled by Whatpm::Charset::UnicodeChecker).
111            (parse_char_stream): Support for the |$get_wrapper| argument and
112            character stream error handlers.
113    
114    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
115    
116            * ContentChecker.pm: Don't call |loda_ns_module|
117            for null-namespace elements/attributes.
118    
119            * HTML.pm.src: Fact out $disallowed_control_chars
120            as a hash.
121    
122    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
125            and |{next_char}| initializations are moved to initialization
126            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
127            with |parse_char_stream|.
128    
129    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src (parse_char_stream): Make |set_next_char|
132            invoke |manakai_read_until|, not only |read|, where
133            possible, to decrease the number of |read| method calls.
134    
135            * mkhtmlparser.pl: Related changes to the aforementioned
136            modification.
137    
138    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
141            would report character error from now.
142    
143    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: White-space-leaded non-white-space character
146            tokens in "before head insertion mode" was not
147            correctly handled.
148            (set_inner_html): Reimplemented using CharString decodehandle
149            class.  Support for $get_wrapper argument.  Support
150            for |{read_until}| feature.
151    
152    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
153    
154            * HTML.pm.src: Make a "bare ero" error for unknown
155            entities point the "&" character.
156    
157    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
158    
159            * HTML.pm.src: It turns out that U+FFFD don't have to
160            be added to the list of excluded characters.
161    
162    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
163    
164            * HTML.pm.src ($char_onerror): Have character decoder's |line|
165            and |column| a higher priority than the one set by the
166            tokenizer's input handler.
167            ($self->{read_until}): Exclude U+FFFD (but this might
168            not be necessary, since now we do line/column fixup in
169            the character decode handle).
170    
171    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
172    
173            * HTML.pm.src: Use |{read_until}| where possible.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
178            and |manakai_getc_until| to |manakai_read_until| to
179            reduce the number of string copies.
180    
181    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
182    
183            * HTML.pm.src (parse_char_string): Use newly created
184            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
185            standard feature to |open| a string as a filehandle,
186            since Perl's string filehandle seems not supporting |ungetc|
187            method correctly.
188            (parse_char_stream): Define |{getc_until}| method.
189            (DATA_STATE): Experimental support for |getc_until| feature.
190    
191    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
192    
193            * HTML.pm.src: Check points added to newly added branches.
194    
195    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTML.pm.src: Remove |{char}|, which is no longer used.
198            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
199            and replaced by |{prev_state}|.
200    
201            * mkhtmlparser.pl: Remove |{char}| feature.
202            Remove |!!!back-next-input-character;| macro.
203    
204    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
207            entity related tokenizer states in favor of new states
208            implementing the consume character reference algorithm.
209    
210    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
211    
212            * HTML.pm.src: "Consume a character reference" algorithm is
213            now implemented as a tokenizer's state, rather than
214            a method, with minimum changes (more changes will
215            be made, in due course).  "Bogus comment state"'s inner
216            loop gets removed.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
221            into their own tokenizer states.
222    
223    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
226            is split into three states.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
231            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
232            no longer does the tokenizer have to push back next input
233            characters in those states.
234    
235    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
236    
237            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
238            into four states so that no longer does the tokenizer have to push
239            back next input characters in that state.
240    
241    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
242    
243            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
244            which can be used to insert some wrapper between the character
245            stream handle and the tokenizer.  (It is currently not supported
246            for |set_inner_html| for |Element|s).
247    
248    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src: Ignore punctuations in charset names.
251    
252    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
253    
254            * ContentChecker.pm: Support for charset-layer error levels.
255    
256            * HTML.pm.src: Don't specify |text| argument for the
257            |chardecode:fallback| error, since it is not the encoding
258            being used alternatively.
259    
260    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
261    
262            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
263    
264    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
265    
266            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
267    
268    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Bug fix and sync with the spec with regard
271            to after after frameset insertion mode processing (HTML5
272            revision 1909).  Note that the implementation was wrong
273            per the old spec before the r1909 changes.
274    
275    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTMLTable.pm: scope=auto algorithm fix synced with the
278            spec (HTML5 revision 2093).
279            ($process_row): Algorithm step numbers synced with the
280            spec (HTML5 revision 2092).
281    
282    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
283    
284            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
285            revision 2094).
286    
287    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
288    
289            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
290    
291    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
292    
293            * HTML.pm.src: '"' and "'" at the end of attribute
294            name (after another attribute) now raise parse error (HTML5
295            revision 2123).  Empty unquoted attribute values are no
296            longer allowed (HTML5 revision 2122).
297    
298  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5

Legend:
Removed from v.1.282  
changed lines
  Added in v.1.328

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24