Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.331 by wakaba, Sun Sep 21 09:45:02 2008 UTC
# Line 1  Line 1 
1    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm: Prepare for |form| |name| attribute's
4            duplication checking.
5    
6    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
9            unsupported charset="" parameter value (HTML5 revision 2131).
10    
11    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src: Remaining places where U+000B was allowed as a space
14            character are fixed (cf. HTML5 revision 1738).
15    
16            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
17            space characters (HTML5 revision 1738).
18    
19    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: The "anything else" case for the "after after body"
22            insertion mode was not updated to switch to the "in body"
23            insertion mode.  U+000B is no longer a space character for the
24            purpose of tree construction phase (HTML5 revision 1738).
25    
26    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: U+000B is no longer a space character (HTML5
29            revision 1738).
30    
31    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
32    
33            * ContentType.pm: 0x0B is no longer a space character (HTML5
34            revision 1738).
35    
36            * HTML.pm.src: U+000B is no longer a space character for the
37            algorithm for extracting an encoding from a Content-Type (HTML5
38            revision 1738).
39    
40    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
41    
42            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
43    
44    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
45    
46            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
47    
48            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
49            matching.
50    
51    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
52    
53            * LangTag.pm: Warn for private use language subtags.  Error level
54            typos fixed.  Support for Suppress-Script field.
55    
56            * mklangreg.pl: Support for dumping of nested structure.
57    
58    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
59    
60            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
61            recommended case as per RFC 4646.
62    
63    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
64    
65            * LangTag.pm (check_rfc4646_langtag): New method.
66    
67    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
68    
69            * mklangreg.pl: New script.
70    
71            * Makefile: Updated for creation of the module for language subtag
72            registry.
73            
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * Makefile: WebIDL.html added.
77    
78            * WebIDL.pod: New documentation.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Checker's error types are redefined.
83    
84    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
85    
86            * WebIDL.pm: Parser's error types are redefined.  Some
87            forward-compatible parsing bugs are fixed.  Some unreachable code
88            is commented out.
89    
90    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
91    
92            * WebIDL.pm: Support for the remaining extended attributes is
93            added.  It does not satisfy the definition that a forward
94            interface declaration has an extended attribute.  It seems that,
95            unless explicitly allowed, multiple extended attributes with the
96            same name are not allowed, though this is not explicitly mentioned
97            in the spec.
98    
99    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
100    
101            * WebIDL.pm: Unescape extended attribute names and extended
102            attribute identifiers.  Preserve whether an extended attribute has
103            an argument list or not.  Support for extended attributes:
104            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
105            NameGetter, NameSetter, and Null.
106            (has_argument_list): New attribute.
107            (idl_text): Stringify the argument list, if any, even if it is
108            empty.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
113            |{s_kwd}| in DATA_STATE as default.
114    
115    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
118            by |{s_kwd}| in DATA_STATE.
119    
120    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
121    
122            * HTML.pm.src: Shorten keys.
123    
124    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Remove checking for control character, surrogate
127            pair, or noncharacter code points and non-Unicode code
128            points (they should be handled by Whatpm::Charset::UnicodeChecker).
129            (parse_char_stream): Support for the |$get_wrapper| argument and
130            character stream error handlers.
131    
132    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
133    
134            * ContentChecker.pm: Don't call |load_ns_module|
135            for null-namespace elements/attributes.
136    
137            * HTML.pm.src: Factor out $disallowed_control_chars
138            as a hash.
139    
140    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
143            and |{next_char}| initializations are moved to initialization
144            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
145            with |parse_char_stream|.
146    
147    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src (parse_char_stream): Make |set_next_char|
150            invoke |manakai_read_until|, not only |read|, where
151            possible, to decrease the number of |read| method calls.
152    
153            * mkhtmlparser.pl: Related changes to the aforementioned
154            modification.
155    
156    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
157    
158            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
159            now reports character errors.
160    
161    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Character tokens with leading white space followed
164            by non-white-space characters in the "before head" insertion mode
165            were not handled correctly.
166            (set_inner_html): Reimplemented using CharString decodehandle
167            class.  Support for $get_wrapper argument.  Support
168            for |{read_until}| feature.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Make a "bare ero" error for unknown
173            entities point to the "&" character.
174    
175    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
176    
177            * HTML.pm.src: It turns out that U+FFFD doesn't have to
178            be added to the list of excluded characters.
179    
180    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
183            and |column| a higher priority than those set by the
184            tokenizer's input handler.
185            ($self->{read_until}): Exclude U+FFFD (but this might
186            not be necessary, since now we do line/column fixup in
187            the character decode handle).
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: Use |{read_until}| where possible.
192    
193    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
194    
195            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
196            and |manakai_getc_until| to |manakai_read_until| to
197            reduce the number of string copies.
198    
199    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src (parse_char_string): Use newly created
202            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
203            standard feature to |open| a string as a filehandle,
204            since Perl's string filehandle does not seem to support the
205            |ungetc| method correctly.
206            (parse_char_stream): Define |{getc_until}| method.
207            (DATA_STATE): Experimental support for |getc_until| feature.
208    
209    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
210    
211            * HTML.pm.src: Check points added to newly added branches.
212    
213    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src: Remove |{char}|, which is no longer used.
216            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
217            with |{prev_state}|.
218    
219            * mkhtmlparser.pl: Remove |{char}| feature.
220            Remove |!!!back-next-input-character;| macro.
221    
222    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
223    
224            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
225            entity related tokenizer states in favor of new states
226            implementing the consume character reference algorithm.
227    
228  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
229    
230          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
231          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
232          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
233          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 235 
235    
236  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
239          into their own tokenizer states.          into their own tokenizer states.
240    
241  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

