/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.334 by wakaba, Mon Sep 22 06:04:28 2008 UTC
# Line 1  Line 1 
1    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Character references for non-space C0 characters,
4            including U+000B VT, DEL character, noncharacter code points, are
5            now converted to the U+FFFD character (cf. HTML5 revision 2138).
6    
7    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
8    
9            * ContentChecker.pm: |form=""| check support added.
10    
11    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
12    
13            * ContentChecker.pm: |contextmenu| validness is now checked using
14            |id| and |id_type| properties, and |menu| property is removed.
15    
16    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
17    
18            * ContentChecker.pm: Prepare for |form| |name| attribute's
19            duplication checking.
20    
21    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
24            unsupported charset="" parameter value (HTML5 revision 2131).
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Reminding places where U+000B is allowed as a space
29            character is fixed (cf. HTML5 revision 1738).
30    
31            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
32            space characters (HTML5 revision 1738).
33    
34    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
35    
36            * HTML.pm.src: The "anything else" case for the "after after body"
37            insertion mode was not updated to swtich to the "in body"
38            insertion mode.  U+000B is no longer a space character for the
39            purpose of tree construction phase (HTML5 revision 1738).
40    
41    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: U+000B is no longer a space character (HTML5
44            revision 1738).
45    
46    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
47    
48            * ContentType.pm: 0x0B is no longer a space character (HTML5
49            revision 1738).
50    
51            * HTML.pm.src: U+000B is no longer a space character for the
52            algorithm for extracting an encoding from a Content-Type (HTML5
53            revision 1738).
54    
55    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
56    
57            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
58    
59    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
60    
61            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
62    
63            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
64            matching.
65    
66    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
67    
68            * LangTag.pm: Warn for private use language subtags.  Error level
69            typos fixed.  Support for Suppress-Script field.
70    
71            * mklangreg.pl: Support for dumping of nested structure.
72    
73    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
74    
75            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
76            recommended case as per RFC 4646.
77    
78    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
79    
80            * LangTag.pm (check_rfc4646_langtag): New method.
81    
82    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
83    
84            * mklangreg.pl: New script.
85    
86            * Makefile: Updated for creation of the module for language subtag
87            registry.
88            
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * Makefile: WebIDL.html added.
92    
93            * WebIDL.pod: New documentation.
94    
95    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
96    
97            * WebIDL.pm: Checker's error types are redefined.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Parser's error types are redefined.  Some forward
102            compatible parsing bugs are fixed.  Some unreachable codes are
103            commented out.
104    
105    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
106    
107            * WebIDL.pm: Support for the reminding extended attributes are
108            added.  It does not satisfy the definition that a forward
109            interface declaration has an extended attribute.  It seems that
110            unless explicitly allowed multiple extended attributes with the
111            same name is not allowed, though it is not explicitly mentioned in
112            the spec.
113    
114    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
115    
116            * WebIDL.pm: Unescapes extended attribute names and extended
117            attribute identifiers.  Preserve whether an extended attribute has
118            an argument list of not.  Support for extended attributes:
119            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
120            NameGetter, NameSetter, and Null.
121            (has_argument_list): New attribute.
122            (idl_text): Stringifies argument lists, if any, even if it is
123            empty.
124    
125    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
128            |{s_kwd}| in DATA_STATE as default.
129    
130    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
133            by |{s_kwd}| in DATA_STATE.
134    
135    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
136    
137            * HTML.pm.src: Shorten keys.
138    
139    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Remove checking for control character, surrogate
142            pair, or noncharacter code points and non-Unicode code
143            points (they should be handled by Whatpm::Charset::UnicodeChecker).
144            (parse_char_stream): Support for the |$get_wrapper| argument and
145            character stream error handlers.
146    
147    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
148    
149            * ContentChecker.pm: Don't call |loda_ns_module|
150            for null-namespace elements/attributes.
151    
152            * HTML.pm.src: Fact out $disallowed_control_chars
153            as a hash.
154    
155    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
158            and |{next_char}| initializations are moved to initialization
159            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
160            with |parse_char_stream|.
161    
162    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
163    
164            * HTML.pm.src (parse_char_stream): Make |set_next_char|
165            invoke |manakai_read_until|, not only |read|, where
166            possible, to decrease the number of |read| method calls.
167    
168            * mkhtmlparser.pl: Related changes to the aforementioned
169            modification.
170    
171    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
172    
173            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
174            would report character error from now.
175    
176    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
177    
178            * HTML.pm.src: White-space-leaded non-white-space character
179            tokens in "before head insertion mode" was not
180            correctly handled.
181            (set_inner_html): Reimplemented using CharString decodehandle
182            class.  Support for $get_wrapper argument.  Support
183            for |{read_until}| feature.
184    
185    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
186    
187            * HTML.pm.src: Make a "bare ero" error for unknown
188            entities point the "&" character.
189    
190    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
191    
192            * HTML.pm.src: It turns out that U+FFFD don't have to
193            be added to the list of excluded characters.
194    
195    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTML.pm.src ($char_onerror): Have character decoder's |line|
198            and |column| a higher priority than the one set by the
199            tokenizer's input handler.
200            ($self->{read_until}): Exclude U+FFFD (but this might
201            not be necessary, since now we do line/column fixup in
202            the character decode handle).
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src: Use |{read_until}| where possible.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
211            and |manakai_getc_until| to |manakai_read_until| to
212            reduce the number of string copies.
213    
214    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src (parse_char_string): Use newly created
217            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
218            standard feature to |open| a string as a filehandle,
219            since Perl's string filehandle seems not supporting |ungetc|
220            method correctly.
221            (parse_char_stream): Define |{getc_until}| method.
222            (DATA_STATE): Experimental support for |getc_until| feature.
223    
224    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: Check points added to newly added branches.
227    
228    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src: Remove |{char}|, which is no longer used.
231            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
232            and replaced by |{prev_state}|.
233    
234            * mkhtmlparser.pl: Remove |{char}| feature.
235            Remove |!!!back-next-input-character;| macro.
236    
237    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
240            entity related tokenizer states in favor of new states
241            implementing the consume character reference algorithm.
242    
243  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
244    
245          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
246          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
247          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
248          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 250 
250    
251  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
252    
253          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
254          into their own tokenizer states.          into their own tokenizer states.
255    
256  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.296  
changed lines
  Added in v.1.334

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24