Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.275 by wakaba, Sat Aug 16 07:35:22 2008 UTC
revision 1.325 by wakaba, Sat Sep 20 06:10:17 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
4    
5    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
6    
7            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
8    
9            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
10            matching.
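
A minimal sketch of why sorting 'Prefix' values by length can ease matching, written in Python for illustration only (mklangreg.pl itself is a Perl script). It assumes a longest-first order, which the entry above does not state, and `sort_prefixes` and `best_prefix` are hypothetical names:

```python
def sort_prefixes(prefixes):
    """Return the prefixes ordered longest first (an assumed direction)."""
    return sorted(prefixes, key=len, reverse=True)


def best_prefix(tag, prefixes):
    """Return the first prefix that matches `tag` on a subtag boundary."""
    tag = tag.lower()
    for prefix in sort_prefixes(prefixes):
        candidate = prefix.lower()
        # A prefix matches if it equals the tag or is followed by "-".
        if tag == candidate or tag.startswith(candidate + "-"):
            return prefix
    return None
```

With the list sorted this way, the first hit is also the most specific one, so the caller can stop at the first match.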
11    
12    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
13    
14            * LangTag.pm: Warn for private use language subtags.  Error level
15            typos fixed.  Support for Suppress-Script field.
16    
17            * mklangreg.pl: Support for dumping of nested structure.
18    
19    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
20    
21            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
22            recommended case as per RFC 4646.
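
The case check can be pictured with the following Python sketch (illustrative only; the module is Perl and its actual logic is not shown in this log). It assumes the usual positional convention: lowercase by default, uppercase for two-letter subtags and title case for four-letter subtags that are neither first nor after a singleton. `recommended_case` and `is_recommended_case` are hypothetical names:

```python
def recommended_case(tag):
    """Return `tag` rewritten in the conventional case (simplified rule)."""
    out = []
    seen_singleton = False
    for index, subtag in enumerate(tag.split("-")):
        lowered = subtag.lower()
        positional = index > 0 and not seen_singleton
        if positional and len(subtag) == 2:
            out.append(lowered.upper())        # region, e.g. "US"
        elif positional and len(subtag) == 4:
            out.append(lowered.capitalize())   # script, e.g. "Latn"
        else:
            out.append(lowered)
        if len(subtag) == 1:
            seen_singleton = True              # extension or private use
    return "-".join(out)


def is_recommended_case(tag):
    """True if `tag` is already written in the conventional case."""
    return tag == recommended_case(tag)
```

A checker built this way would only warn about case, since language tags are compared case-insensitively.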
23    
24    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
25    
26            * LangTag.pm (check_rfc4646_langtag): New method.
27    
28    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
29    
30            * mklangreg.pl: New script.
31    
32            * Makefile: Updated for creation of the module for language subtag
33            registry.
34            
35    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
36    
37            * Makefile: WebIDL.html added.
38    
39            * WebIDL.pod: New documentation.
40    
41    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
42    
43            * WebIDL.pm: Checker's error types are redefined.
44    
45    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
46    
47            * WebIDL.pm: Parser's error types are redefined.  Some
48            forward-compatible parsing bugs are fixed.  Some unreachable code
49            is commented out.
50    
51    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
52    
53            * WebIDL.pm: Support for the remaining extended attributes is
54            added.  It does not satisfy the definition that a forward
55            interface declaration has an extended attribute.  It seems that,
56            unless explicitly allowed, multiple extended attributes with the
57            same name are not allowed, though it is not explicitly mentioned in
58            the spec.
59    
60    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
61    
62            * WebIDL.pm: Unescapes extended attribute names and extended
63            attribute identifiers.  Preserve whether an extended attribute has
64            an argument list or not.  Support for extended attributes:
65            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
66            NameGetter, NameSetter, and Null.
67            (has_argument_list): New attribute.
68            (idl_text): Stringifies argument lists, if any, even if it is
69            empty.
70    
71    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
72    
73            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
74            |{s_kwd}| in DATA_STATE as default.
75    
76    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
79            by |{s_kwd}| in DATA_STATE.
80    
81    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Shorten keys.
84    
85    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src: Remove checking for control character, surrogate
88            pair, or noncharacter code points and non-Unicode code
89            points (they should be handled by Whatpm::Charset::UnicodeChecker).
90            (parse_char_stream): Support for the |$get_wrapper| argument and
91            character stream error handlers.
92    
93    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
94    
95            * ContentChecker.pm: Don't call |load_ns_module|
96            for null-namespace elements/attributes.
97    
98            * HTML.pm.src: Factor out $disallowed_control_chars
99            as a hash.
100    
101    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
104            and |{next_char}| initializations are moved to initialization
105            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
106            with |parse_char_stream|.
107    
108    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
109    
110            * HTML.pm.src (parse_char_stream): Make |set_next_char|
111            invoke |manakai_read_until|, not only |read|, where
112            possible, to decrease the number of |read| method calls.
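
The effect on the number of |read| calls can be illustrated with a small Python sketch (the parser itself is Perl; this is not its code). `CharStream`, `next_char`, `read_until`, and the chunk size of 64 are assumptions made for the example:

```python
import io


class CharStream:
    """Wrap a text handle and count calls to its read() method."""

    def __init__(self, text):
        self._handle = io.StringIO(text)
        self._buffer = ""
        self.read_calls = 0

    def _fill(self, size):
        # One underlying read() call per refill.
        self.read_calls += 1
        self._buffer += self._handle.read(size)

    def next_char(self):
        """Return one character, reading one at a time when unbuffered."""
        if not self._buffer:
            self._fill(1)
        char, self._buffer = self._buffer[:1], self._buffer[1:]
        return char

    def read_until(self, stop_chars, chunk_size=64):
        """Return the buffered run of characters before the first stop char.

        The run may be shorter than the full run in the input if it
        crosses a chunk boundary; the caller simply calls again.
        """
        if not self._buffer:
            self._fill(chunk_size)
        end = 0
        while end < len(self._buffer) and self._buffer[end] not in stop_chars:
            end += 1
        run, self._buffer = self._buffer[:end], self._buffer[end:]
        return run
```

Reading "hello world" one character at a time costs one read per character, while a single `read_until("<&")` call fetches the whole run and leaves the stop character buffered for the tokenizer.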
113    
114            * mkhtmlparser.pl: Related changes to the aforementioned
115            modification.
116    
117    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
120            now reports character errors.
121    
122    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
123    
124            * HTML.pm.src: Character tokens with leading white space
125            followed by non-white-space characters were not correctly
126            handled in the "before head" insertion mode.
127            (set_inner_html): Reimplemented using CharString decodehandle
128            class.  Support for $get_wrapper argument.  Support
129            for |{read_until}| feature.
130    
131    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: Make a "bare ero" error for unknown
134            entities point to the "&" character.
135    
136    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
137    
138            * HTML.pm.src: It turns out that U+FFFD does not have to
139            be added to the list of excluded characters.
140    
141    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
142    
143            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
144            and |column| a higher priority than the ones set by the
145            tokenizer's input handler.
146            ($self->{read_until}): Exclude U+FFFD (but this might
147            not be necessary, since now we do line/column fixup in
148            the character decode handle).
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src: Use |{read_until}| where possible.
153    
154    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
155    
156            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
157            and |manakai_getc_until| to |manakai_read_until| to
158            reduce the number of string copies.
159    
160    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
161    
162            * HTML.pm.src (parse_char_string): Use newly created
163            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
164            standard feature to |open| a string as a filehandle,
165            since Perl's string filehandle does not seem to support the
166            |ungetc| method correctly.
167            (parse_char_stream): Define |{getc_until}| method.
168            (DATA_STATE): Experimental support for |getc_until| feature.
169    
170    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
171    
172            * HTML.pm.src: Check points added to newly added branches.
173    
174    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src: Remove |{char}|, which is no longer used.
177            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
178            with |{prev_state}|.
179    
180            * mkhtmlparser.pl: Remove |{char}| feature.
181            Remove |!!!back-next-input-character;| macro.
182    
183    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
186            entity related tokenizer states in favor of new states
187            implementing the consume character reference algorithm.
188    
189    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
190    
191            * HTML.pm.src: "Consume a character reference" algorithm is
192            now implemented as a tokenizer's state, rather than
193            a method, with minimum changes (more changes will
194            be made, in due course).  "Bogus comment state"'s inner
195            loop gets removed.
196    
197    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
200            into their own tokenizer states.
201    
202    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
205            is split into three states.
206    
207    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
208    
209            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
210            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
211            the tokenizer no longer has to push back next input
212            characters in those states.
213    
214    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
215    
216            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
217            into four states so that the tokenizer no longer has to push
218            back next input characters in that state.
219    
220    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: Methods now accept an additional parameter, $get_wrapper,
223            which can be used to insert some wrapper between the character
224            stream handle and the tokenizer.  (It is currently not supported
225            for |set_inner_html| for |Element|s).
226    
227    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Ignore punctuation in charset names.
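
One way to picture this comparison is the Python sketch below (illustrative only, not the parser's Perl code). It assumes "punctuation" means the ASCII punctuation characters and that matching is also case-insensitive; `normalize_charset_name` and `same_charset` are hypothetical names:

```python
import string

# Translation table that deletes every ASCII punctuation character.
_STRIP_PUNCTUATION = str.maketrans("", "", string.punctuation)


def normalize_charset_name(name):
    """Drop ASCII punctuation and fold case before comparing names."""
    return name.translate(_STRIP_PUNCTUATION).lower()


def same_charset(left, right):
    """True if two charset labels are equal once punctuation is ignored."""
    return normalize_charset_name(left) == normalize_charset_name(right)
```

Under these assumptions, labels such as "UTF-8" and "utf8" compare equal, while "utf-8" and "utf-16" still differ.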
230    
231    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
232    
233            * ContentChecker.pm: Support for charset-layer error levels.
234    
235            * HTML.pm.src: Don't specify |text| argument for the
236            |chardecode:fallback| error, since it is not the encoding
237            being used alternatively.
238    
239    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
240    
241            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
242    
243    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
244    
245            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
246    
247    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
248    
249            * HTML.pm.src: Bug fix and sync with the spec with regard
250            to after after frameset insertion mode processing (HTML5
251            revision 1909).  Note that the implementation was wrong
252            per the old spec before the r1909 changes.
253    
254    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
255    
256            * HTMLTable.pm: scope=auto algorithm fix synced with the
257            spec (HTML5 revision 2093).
258            ($process_row): Algorithm step numbers synced with the
259            spec (HTML5 revision 2092).
260    
261    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
264            revision 2094).
265    
266    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
267    
268            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
269    
270    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: '"' and "'" at the end of attribute
273            name (after another attribute) now raise parse error (HTML5
274            revision 2123).  Empty unquoted attribute values are no
275            longer allowed (HTML5 revision 2122).
276    
277    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
278    
279            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
280            revision 2130).
281    
282    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
283    
284            * ContentChecker.pm: |xml:lang| attribute value must be the same
285            as |lang| attribute value for HTML elements (HTML5 revision 2062
286            and so on).
287    
288    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
289    
290            * ContentChecker.pm: Error level definition for |xml_id_error|
291            was missing.
292    
293            * URIChecker.pm: The end of the URL should be marked as the
294            error location for an empty path error.  The position
295            between the userinfo and the port components should be
296            marked as the error location for an empty host error.
297    
298    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
299    
300            * URIChecker.pm: For errors, set parameters indicating where
301            in the value the error occurs.  Report unknown
302            address format errors at warning level, since address
303            formats are rarely added.  Path segments starting with "/.."
304            were misinterpreted as dot-segments.
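
The dot-segment distinction can be sketched in Python as follows (illustrative only; URIChecker.pm is Perl and its code is not shown here). A segment counts as a dot-segment only when it is exactly "." or "..", so a segment that merely starts with ".." does not qualify. `is_dot_segment` and `dot_segments` are hypothetical names:

```python
def is_dot_segment(segment):
    """A dot-segment is exactly "." or "..", not a longer segment."""
    return segment in (".", "..")


def dot_segments(path):
    """Return the dot-segments found in a slash-separated path."""
    return [segment for segment in path.split("/") if is_dot_segment(segment)]
```

For example, "/a/../b" contains one dot-segment, whereas "/a/..b" contains none.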
305    
306    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
307    
308            * URIChecker.pm (check_iri_reference): Requires
309            |Message::DOM::DOMImplementation|.
310    
311    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
312    
313            * IMTChecker.pm: Updated for the new error reporting architecture.
314    
315            * ContentChecker.pm: Error levels for IMTs are added.
316    
317    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
318    
319            * H2H.pm (_shift_token): Support for unquoted HTML attribute
320            values.
321    
322    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
323    
324            * CacheManifest.pm: Support for new style of error
