/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.268 by wakaba, Sun Aug 3 05:43:11 2008 UTC revision 1.305 by wakaba, Sun Sep 14 07:19:47 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Make a "bare ero" error for unknown
4            entities point the "&" character.
5    
6    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: It turns out that U+FFFD don't have to
9            be added to the list of excluded characters.
10    
11    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src ($char_onerror): Have character decoder's |line|
14            and |column| a higher priority than the one set by the
15            tokenizer's input handler.
16            ($self->{read_until}): Exclude U+FFFD (but this might
17            not be necessary, since now we do line/column fixup in
18            the character decode handle).
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Use |{read_until}| where possible.
23    
24    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
27            and |manakai_getc_until| to |manakai_read_until| to
28            reduce the number of string copies.
29    
30    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src (parse_char_string): Use newly created
33            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
34            standard feature to |open| a string as a filehandle,
35            since Perl's string filehandle seems not supporting |ungetc|
36            method correctly.
37            (parse_char_stream): Define |{getc_until}| method.
38            (DATA_STATE): Experimental support for |getc_until| feature.
39    
40    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
41    
42            * HTML.pm.src: Check points added to newly added branches.
43    
44    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: Remove |{char}|, which is no longer used.
47            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
48            and replaced by |{prev_state}|.
49    
50            * mkhtmlparser.pl: Remove |{char}| feature.
51            Remove |!!!back-next-input-character;| macro.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
56            entity related tokenizer states in favor of new states
57            implementing the consume character reference algorithm.
58    
59    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src: "Consume a character reference" algorithm is
62            now implemented as a tokenizer's state, rather than
63            a method, with minimum changes (more changes will
64            be made, in due course).  "Bogus comment state"'s inner
65            loop gets removed.
66    
67    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
70            into their own tokenizer states.
71    
72    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
75            is split into three states.
76    
77    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
80            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
81            no longer does the tokenizer have to push back next input
82            characters in those states.
83    
84    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
87            into four states so that no longer does the tokenizer have to push
88            back next input characters in that state.
89    
90    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
93            which can be used to insert some wrapper between the character
94            stream handle and the tokenizer.  (It is currently not supported
95            for |set_inner_html| for |Element|s).
96    
97    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Ignore punctuations in charset names.
100    
101    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
102    
103            * ContentChecker.pm: Support for charset-layer error levels.
104    
105            * HTML.pm.src: Don't specify |text| argument for the
106            |chardecode:fallback| error, since it is not the encoding
107            being used alternatively.
108    
109    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
112    
113    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
114    
115            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
116    
117    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: Bug fix and sync with the spec with regard
120            to after after frameset insertion mode processing (HTML5
121            revision 1909).  Note that the implementation was wrong
122            per the old spec before the r1909 changes.
123    
124    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTMLTable.pm: scope=auto algorithm fix synced with the
127            spec (HTML5 revision 2093).
128            ($process_row): Algorithm step numbers synced with the
129            spec (HTML5 revision 2092).
130    
131    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
134            revision 2094).
135    
136    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
137    
138            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: '"' and "'" at the end of attribute
143            name (after another attribute) now raise parse error (HTML5
144            revision 2123).  Empty unquoted attribute values are no
145            longer allowed (HTML5 revision 2122).
146    
147    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
148    
149            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
150            revision 2130).
151    
152    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
153    
154            * ContentChecker.pm: |xml:lang| attribute value must be same
155            as |lang| attribute value for HTML elements (HTML5 revision 2062
156            and so on).
157    
158    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
159    
160            * ContentChecker.pm: Error level definition for |xml_id_error|
161            was missing.
162    
163            * URIChecker.pm: The end of the URL should be marked as the
164            error location for an empty path error.  The position
165            between the userinfo and the port components should be
166            marked as the error location for an empty host error.
167    
168    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
169    
170            * URIChecker.pm: Set parameters representing where in the
171            value the error occurs for errors.  Report unknown
172            address format error in warning level, since address
173            formats are rarely added.  Path segments starting with "/.."
174            were misinterpreted as a dot-segment.
175    
176    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
177    
178            * URIChecker.pm (check_iri_reference): Requires
179            |Message::DOM::DOMImplementation|.
180    
181    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
182    
183            * IMTChecker.pm: Updated for the new error reporting architecture.
184    
185            * ContentChecker.pm: Error levels for IMTs are added.
186    
187    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
188    
189            * H2H.pm (_shift_token): Support for unquoted HTML attribute
190            values.
191    
192    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
193    
194            * CacheManifest.pm: Support for new style of error
195            reports.
196    
197            * HTML.pm.src: Set line=1, column=1 to the document node.
198    
199    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
202            and URL checkers.  Support for more error levels for bogus
203            langauge tag and URL "standards".
204    
205            * LangTag.pm, URIChecker.pm: Support for new style error
206            level reporting.
207    
208    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
209    
210            * ContentChecker.pm: Support for RDF/XML error levels.
211    
212            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
213            specifying.  Error types are revised.
214    
215    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
216    
217            * ContentChecker.pm: All error reporting method calls are
218            renewed.
219    
220    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: All error type names and "text" parameters
223            are revised.  Use new style for "level" specification.
224    
225            * mkhtmlparser.pl: Use new style for "level" specification.
226    
227    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
228    
229            * WebIDL.pm (parse_char_string): Simplified error
230            reporting process for broken ignored valuetype definition.
231            (Valuetype idl_text): Support for special "DOMString" name.
232    
233    2008-08-03  Wakaba  <wakaba@suika.fam.cx>
234    
235            * WebIDL.pm ($get_scoped_name): Append "::::" if the last
236            terminal of the ScopedName is "DOMString", such that whether
237            the last part of the scoped name is "DOMString" or "_DOMString"
238            later.  It is necessary to determine whether a |typedef|
239            definition should be ignored or not.
240            (parse_char_string): Unescape the identifier of
241            exception members.
242            ($resolve): Return undef for builtin types and sequence<T>
243            types (we might not have to do this, however...).
244            (check): Support checking for Exceptions, Valuetypes,
245            and Typedefs.
246            ($serialize_type): Support for "DOMString::::" syntax.
247            (Typedef idl_text): Output Type as "DOMString" if it
248            is really "DOMString" (i.e. its internal representation
249            is "::DOMString::").
250    
251  2008-08-03  Wakaba  <wakaba@suika.fam.cx>  2008-08-03  Wakaba  <wakaba@suika.fam.cx>
252    
253          * WebIDL.pm ($resolve): New code, based on resolve code          * WebIDL.pm ($resolve): New code, based on resolve code

Legend:
Removed from v.1.268  
changed lines
  Added in v.1.305

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24