/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.282 by wakaba, Sat Aug 30 12:57:05 2008 UTC revision 1.329 by wakaba, Sat Sep 20 11:25:56 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Reminding places where U+000B is allowed as a space
4            character is fixed (cf. HTML5 revision 1738).
5    
6            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
7            space characters (HTML5 revision 1738).
8    
9    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
10    
11            * HTML.pm.src: The "anything else" case for the "after after body"
12            insertion mode was not updated to swtich to the "in body"
13            insertion mode.  U+000B is no longer a space character for the
14            purpose of tree construction phase (HTML5 revision 1738).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: U+000B is no longer a space character (HTML5
19            revision 1738).
20    
21    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
22    
23            * ContentType.pm: 0x0B is no longer a space character (HTML5
24            revision 1738).
25    
26            * HTML.pm.src: U+000B is no longer a space character for the
27            algorithm for extracting an encoding from a Content-Type (HTML5
28            revision 1738).
29    
30    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
31    
32            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
33    
34    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
35    
36            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
37    
38            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
39            matching.
40    
41    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
42    
43            * LangTag.pm: Warn for private use language subtags.  Error level
44            typos fixed.  Support for Suppress-Script field.
45    
46            * mklangreg.pl: Support for dumping of nested structure.
47    
48    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
49    
50            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
51            recommended case as per RFC 4646.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm (check_rfc4646_langtag): New method.
56    
57    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
58    
59            * mklangreg.pl: New script.
60    
61            * Makefile: Updated for creation of the module for language subtag
62            registry.
63            
64    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
65    
66            * Makefile: WebIDL.html added.
67    
68            * WebIDL.pod: New documentation.
69    
70    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
71    
72            * WebIDL.pm: Checker's error types are redefined.
73    
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * WebIDL.pm: Parser's error types are redefined.  Some forward
77            compatible parsing bugs are fixed.  Some unreachable codes are
78            commented out.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Support for the reminding extended attributes are
83            added.  It does not satisfy the definition that a forward
84            interface declaration has an extended attribute.  It seems that
85            unless explicitly allowed multiple extended attributes with the
86            same name is not allowed, though it is not explicitly mentioned in
87            the spec.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Unescapes extended attribute names and extended
92            attribute identifiers.  Preserve whether an extended attribute has
93            an argument list of not.  Support for extended attributes:
94            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
95            NameGetter, NameSetter, and Null.
96            (has_argument_list): New attribute.
97            (idl_text): Stringifies argument lists, if any, even if it is
98            empty.
99    
100    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
103            |{s_kwd}| in DATA_STATE as default.
104    
105    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
108            by |{s_kwd}| in DATA_STATE.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Shorten keys.
113    
114    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: Remove checking for control character, surrogate
117            pair, or noncharacter code points and non-Unicode code
118            points (they should be handled by Whatpm::Charset::UnicodeChecker).
119            (parse_char_stream): Support for the |$get_wrapper| argument and
120            character stream error handlers.
121    
122    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
123    
124            * ContentChecker.pm: Don't call |loda_ns_module|
125            for null-namespace elements/attributes.
126    
127            * HTML.pm.src: Fact out $disallowed_control_chars
128            as a hash.
129    
130    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
133            and |{next_char}| initializations are moved to initialization
134            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
135            with |parse_char_stream|.
136    
137    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src (parse_char_stream): Make |set_next_char|
140            invoke |manakai_read_until|, not only |read|, where
141            possible, to decrease the number of |read| method calls.
142    
143            * mkhtmlparser.pl: Related changes to the aforementioned
144            modification.
145    
146    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
149            would report character error from now.
150    
151    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: White-space-leaded non-white-space character
154            tokens in "before head insertion mode" was not
155            correctly handled.
156            (set_inner_html): Reimplemented using CharString decodehandle
157            class.  Support for $get_wrapper argument.  Support
158            for |{read_until}| feature.
159    
160    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src: Make a "bare ero" error for unknown
163            entities point the "&" character.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: It turns out that U+FFFD don't have to
168            be added to the list of excluded characters.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src ($char_onerror): Have character decoder's |line|
173            and |column| a higher priority than the one set by the
174            tokenizer's input handler.
175            ($self->{read_until}): Exclude U+FFFD (but this might
176            not be necessary, since now we do line/column fixup in
177            the character decode handle).
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Use |{read_until}| where possible.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
186            and |manakai_getc_until| to |manakai_read_until| to
187            reduce the number of string copies.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src (parse_char_string): Use newly created
192            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
193            standard feature to |open| a string as a filehandle,
194            since Perl's string filehandle seems not supporting |ungetc|
195            method correctly.
196            (parse_char_stream): Define |{getc_until}| method.
197            (DATA_STATE): Experimental support for |getc_until| feature.
198    
199    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src: Check points added to newly added branches.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: Remove |{char}|, which is no longer used.
206            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
207            and replaced by |{prev_state}|.
208    
209            * mkhtmlparser.pl: Remove |{char}| feature.
210            Remove |!!!back-next-input-character;| macro.
211    
212    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
215            entity related tokenizer states in favor of new states
216            implementing the consume character reference algorithm.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: "Consume a character reference" algorithm is
221            now implemented as a tokenizer's state, rather than
222            a method, with minimum changes (more changes will
223            be made, in due course).  "Bogus comment state"'s inner
224            loop gets removed.
225    
226    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
227    
228            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
229            into their own tokenizer states.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
234            is split into three states.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
239            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
240            no longer does the tokenizer have to push back next input
241            characters in those states.
242    
243    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
246            into four states so that no longer does the tokenizer have to push
247            back next input characters in that state.
248    
249    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
252            which can be used to insert some wrapper between the character
253            stream handle and the tokenizer.  (It is currently not supported
254            for |set_inner_html| for |Element|s).
255    
256    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src: Ignore punctuations in charset names.
259    
260    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
261    
262            * ContentChecker.pm: Support for charset-layer error levels.
263    
264            * HTML.pm.src: Don't specify |text| argument for the
265            |chardecode:fallback| error, since it is not the encoding
266            being used alternatively.
267    
268    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
271    
272    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
273    
274            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
275    
276    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
277    
278            * HTML.pm.src: Bug fix and sync with the spec with regard
279            to after after frameset insertion mode processing (HTML5
280            revision 1909).  Note that the implementation was wrong
281            per the old spec before the r1909 changes.
282    
283    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTMLTable.pm: scope=auto algorithm fix synced with the
286            spec (HTML5 revision 2093).
287            ($process_row): Algorithm step numbers synced with the
288            spec (HTML5 revision 2092).
289    
290    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
293            revision 2094).
294    
295    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
296    
297            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
298    
299    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src: '"' and "'" at the end of attribute
302            name (after another attribute) now raise parse error (HTML5
303            revision 2123).  Empty unquoted attribute values are no
304            longer allowed (HTML5 revision 2122).
305    
306  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
307    
308          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5

Legend:
Removed from v.1.282  
changed lines
  Added in v.1.329

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24