/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.282 by wakaba, Sat Aug 30 12:57:05 2008 UTC revision 1.333 by wakaba, Sun Sep 21 12:37:09 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: |form=""| check support added.
4    
5    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
6    
7            * ContentChecker.pm: |contextmenu| validness is now checked using
8            |id| and |id_type| properties, and |menu| property is removed.
9    
10    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
11    
12            * ContentChecker.pm: Prepare for |form| |name| attribute's
13            duplication checking.
14    
15    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
18            unsupported charset="" parameter value (HTML5 revision 2131).
19    
20    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Reminding places where U+000B is allowed as a space
23            character is fixed (cf. HTML5 revision 1738).
24    
25            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
26            space characters (HTML5 revision 1738).
27    
28    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
29    
30            * HTML.pm.src: The "anything else" case for the "after after body"
31            insertion mode was not updated to swtich to the "in body"
32            insertion mode.  U+000B is no longer a space character for the
33            purpose of tree construction phase (HTML5 revision 1738).
34    
35    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
36    
37            * HTML.pm.src: U+000B is no longer a space character (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentType.pm: 0x0B is no longer a space character (HTML5
43            revision 1738).
44    
45            * HTML.pm.src: U+000B is no longer a space character for the
46            algorithm for extracting an encoding from a Content-Type (HTML5
47            revision 1738).
48    
49    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
56    
57            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
58            matching.
59    
60    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
61    
62            * LangTag.pm: Warn for private use language subtags.  Error level
63            typos fixed.  Support for Suppress-Script field.
64    
65            * mklangreg.pl: Support for dumping of nested structure.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
70            recommended case as per RFC 4646.
71    
72    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
73    
74            * LangTag.pm (check_rfc4646_langtag): New method.
75    
76    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
77    
78            * mklangreg.pl: New script.
79    
80            * Makefile: Updated for creation of the module for language subtag
81            registry.
82            
83    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
84    
85            * Makefile: WebIDL.html added.
86    
87            * WebIDL.pod: New documentation.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Checker's error types are redefined.
92    
93    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
94    
95            * WebIDL.pm: Parser's error types are redefined.  Some forward
96            compatible parsing bugs are fixed.  Some unreachable codes are
97            commented out.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Support for the reminding extended attributes are
102            added.  It does not satisfy the definition that a forward
103            interface declaration has an extended attribute.  It seems that
104            unless explicitly allowed multiple extended attributes with the
105            same name is not allowed, though it is not explicitly mentioned in
106            the spec.
107    
108    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
109    
110            * WebIDL.pm: Unescapes extended attribute names and extended
111            attribute identifiers.  Preserve whether an extended attribute has
112            an argument list of not.  Support for extended attributes:
113            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
114            NameGetter, NameSetter, and Null.
115            (has_argument_list): New attribute.
116            (idl_text): Stringifies argument lists, if any, even if it is
117            empty.
118    
119    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
122            |{s_kwd}| in DATA_STATE as default.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
127            by |{s_kwd}| in DATA_STATE.
128    
129    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
130    
131            * HTML.pm.src: Shorten keys.
132    
133    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTML.pm.src: Remove checking for control character, surrogate
136            pair, or noncharacter code points and non-Unicode code
137            points (they should be handled by Whatpm::Charset::UnicodeChecker).
138            (parse_char_stream): Support for the |$get_wrapper| argument and
139            character stream error handlers.
140    
141    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
142    
143            * ContentChecker.pm: Don't call |loda_ns_module|
144            for null-namespace elements/attributes.
145    
146            * HTML.pm.src: Fact out $disallowed_control_chars
147            as a hash.
148    
149    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
152            and |{next_char}| initializations are moved to initialization
153            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
154            with |parse_char_stream|.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src (parse_char_stream): Make |set_next_char|
159            invoke |manakai_read_until|, not only |read|, where
160            possible, to decrease the number of |read| method calls.
161    
162            * mkhtmlparser.pl: Related changes to the aforementioned
163            modification.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
168            would report character error from now.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: White-space-leaded non-white-space character
173            tokens in "before head insertion mode" was not
174            correctly handled.
175            (set_inner_html): Reimplemented using CharString decodehandle
176            class.  Support for $get_wrapper argument.  Support
177            for |{read_until}| feature.
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Make a "bare ero" error for unknown
182            entities point the "&" character.
183    
184    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
185    
186            * HTML.pm.src: It turns out that U+FFFD don't have to
187            be added to the list of excluded characters.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src ($char_onerror): Have character decoder's |line|
192            and |column| a higher priority than the one set by the
193            tokenizer's input handler.
194            ($self->{read_until}): Exclude U+FFFD (but this might
195            not be necessary, since now we do line/column fixup in
196            the character decode handle).
197    
198    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
199    
200            * HTML.pm.src: Use |{read_until}| where possible.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
205            and |manakai_getc_until| to |manakai_read_until| to
206            reduce the number of string copies.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src (parse_char_string): Use newly created
211            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
212            standard feature to |open| a string as a filehandle,
213            since Perl's string filehandle seems not supporting |ungetc|
214            method correctly.
215            (parse_char_stream): Define |{getc_until}| method.
216            (DATA_STATE): Experimental support for |getc_until| feature.
217    
218    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Check points added to newly added branches.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Remove |{char}|, which is no longer used.
225            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
226            and replaced by |{prev_state}|.
227    
228            * mkhtmlparser.pl: Remove |{char}| feature.
229            Remove |!!!back-next-input-character;| macro.
230    
231    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
232    
233            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
234            entity related tokenizer states in favor of new states
235            implementing the consume character reference algorithm.
236    
237    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: "Consume a character reference" algorithm is
240            now implemented as a tokenizer's state, rather than
241            a method, with minimum changes (more changes will
242            be made, in due course).  "Bogus comment state"'s inner
243            loop gets removed.
244    
245    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
246    
247            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
248            into their own tokenizer states.
249    
250    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
253            is split into three states.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
258            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
259            no longer does the tokenizer have to push back next input
260            characters in those states.
261    
262    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
263    
264            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
265            into four states so that no longer does the tokenizer have to push
266            back next input characters in that state.
267    
268    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
271            which can be used to insert some wrapper between the character
272            stream handle and the tokenizer.  (It is currently not supported
273            for |set_inner_html| for |Element|s).
274    
275    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Ignore punctuations in charset names.
278    
279    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
280    
281            * ContentChecker.pm: Support for charset-layer error levels.
282    
283            * HTML.pm.src: Don't specify |text| argument for the
284            |chardecode:fallback| error, since it is not the encoding
285            being used alternatively.
286    
287    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
288    
289            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
290    
291    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
292    
293            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
294    
295    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
296    
297            * HTML.pm.src: Bug fix and sync with the spec with regard
298            to after after frameset insertion mode processing (HTML5
299            revision 1909).  Note that the implementation was wrong
300            per the old spec before the r1909 changes.
301    
302    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTMLTable.pm: scope=auto algorithm fix synced with the
305            spec (HTML5 revision 2093).
306            ($process_row): Algorithm step numbers synced with the
307            spec (HTML5 revision 2092).
308    
309    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
312            revision 2094).
313    
314    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
315    
316            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
317    
318    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src: '"' and "'" at the end of attribute
321            name (after another attribute) now raise parse error (HTML5
322            revision 2123).  Empty unquoted attribute values are no
323            longer allowed (HTML5 revision 2122).
324    
325  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
326    
327          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5

Legend:
Removed from v.1.282  
changed lines
  Added in v.1.333

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24