/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.329 by wakaba, Sat Sep 20 11:25:56 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Reminding places where U+000B is allowed as a space
4            character is fixed (cf. HTML5 revision 1738).
5    
6            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
7            space characters (HTML5 revision 1738).
8    
9    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
10    
11            * HTML.pm.src: The "anything else" case for the "after after body"
12            insertion mode was not updated to swtich to the "in body"
13            insertion mode.  U+000B is no longer a space character for the
14            purpose of tree construction phase (HTML5 revision 1738).
15    
16    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: U+000B is no longer a space character (HTML5
19            revision 1738).
20    
21    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
22    
23            * ContentType.pm: 0x0B is no longer a space character (HTML5
24            revision 1738).
25    
26            * HTML.pm.src: U+000B is no longer a space character for the
27            algorithm for extracting an encoding from a Content-Type (HTML5
28            revision 1738).
29    
30    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
31    
32            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
33    
34    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
35    
36            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
37    
38            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
39            matching.
40    
41    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
42    
43            * LangTag.pm: Warn for private use language subtags.  Error level
44            typos fixed.  Support for Suppress-Script field.
45    
46            * mklangreg.pl: Support for dumping of nested structure.
47    
48    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
49    
50            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
51            recommended case as per RFC 4646.
52    
53    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
54    
55            * LangTag.pm (check_rfc4646_langtag): New method.
56    
57    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
58    
59            * mklangreg.pl: New script.
60    
61            * Makefile: Updated for creation of the module for language subtag
62            registry.
63            
64    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
65    
66            * Makefile: WebIDL.html added.
67    
68            * WebIDL.pod: New documentation.
69    
70    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
71    
72            * WebIDL.pm: Checker's error types are redefined.
73    
74    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
75    
76            * WebIDL.pm: Parser's error types are redefined.  Some forward
77            compatible parsing bugs are fixed.  Some unreachable codes are
78            commented out.
79    
80    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
81    
82            * WebIDL.pm: Support for the reminding extended attributes are
83            added.  It does not satisfy the definition that a forward
84            interface declaration has an extended attribute.  It seems that
85            unless explicitly allowed multiple extended attributes with the
86            same name is not allowed, though it is not explicitly mentioned in
87            the spec.
88    
89    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
90    
91            * WebIDL.pm: Unescapes extended attribute names and extended
92            attribute identifiers.  Preserve whether an extended attribute has
93            an argument list of not.  Support for extended attributes:
94            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
95            NameGetter, NameSetter, and Null.
96            (has_argument_list): New attribute.
97            (idl_text): Stringifies argument lists, if any, even if it is
98            empty.
99    
100    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
101    
102            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
103            |{s_kwd}| in DATA_STATE as default.
104    
105    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
108            by |{s_kwd}| in DATA_STATE.
109    
110    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Shorten keys.
113    
114    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
115    
116            * HTML.pm.src: Remove checking for control character, surrogate
117            pair, or noncharacter code points and non-Unicode code
118            points (they should be handled by Whatpm::Charset::UnicodeChecker).
119            (parse_char_stream): Support for the |$get_wrapper| argument and
120            character stream error handlers.
121    
122    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
123    
124            * ContentChecker.pm: Don't call |loda_ns_module|
125            for null-namespace elements/attributes.
126    
127            * HTML.pm.src: Fact out $disallowed_control_chars
128            as a hash.
129    
130    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
133            and |{next_char}| initializations are moved to initialization
134            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
135            with |parse_char_stream|.
136    
137    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src (parse_char_stream): Make |set_next_char|
140            invoke |manakai_read_until|, not only |read|, where
141            possible, to decrease the number of |read| method calls.
142    
143            * mkhtmlparser.pl: Related changes to the aforementioned
144            modification.
145    
146    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
147    
148            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
149            would report character error from now.
150    
151    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: White-space-leaded non-white-space character
154            tokens in "before head insertion mode" was not
155            correctly handled.
156            (set_inner_html): Reimplemented using CharString decodehandle
157            class.  Support for $get_wrapper argument.  Support
158            for |{read_until}| feature.
159    
160    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src: Make a "bare ero" error for unknown
163            entities point the "&" character.
164    
165    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
166    
167            * HTML.pm.src: It turns out that U+FFFD don't have to
168            be added to the list of excluded characters.
169    
170    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src ($char_onerror): Have character decoder's |line|
173            and |column| a higher priority than the one set by the
174            tokenizer's input handler.
175            ($self->{read_until}): Exclude U+FFFD (but this might
176            not be necessary, since now we do line/column fixup in
177            the character decode handle).
178    
179    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Use |{read_until}| where possible.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
186            and |manakai_getc_until| to |manakai_read_until| to
187            reduce the number of string copies.
188    
189    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src (parse_char_string): Use newly created
192            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
193            standard feature to |open| a string as a filehandle,
194            since Perl's string filehandle seems not supporting |ungetc|
195            method correctly.
196            (parse_char_stream): Define |{getc_until}| method.
197            (DATA_STATE): Experimental support for |getc_until| feature.
198    
199    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
200    
201            * HTML.pm.src: Check points added to newly added branches.
202    
203    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
204    
205            * HTML.pm.src: Remove |{char}|, which is no longer used.
206            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
207            and replaced by |{prev_state}|.
208    
209            * mkhtmlparser.pl: Remove |{char}| feature.
210            Remove |!!!back-next-input-character;| macro.
211    
212    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
213    
214            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
215            entity related tokenizer states in favor of new states
216            implementing the consume character reference algorithm.
217    
218  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
219    
220          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
221          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
222          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
223          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 225 
225    
226  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
227    
228          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
229          into their own tokenizer states.          into their own tokenizer states.
230    
231  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.296  
changed lines
  Added in v.1.329

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24