/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.271 by wakaba, Fri Aug 15 08:32:41 2008 UTC revision 1.306 by wakaba, Sun Sep 14 09:05:54 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: White-space-leaded non-white-space character
4            tokens in "before head insertion mode" was not
5            correctly handled.
6            (set_inner_html): Reimplemented using CharString decodehandle
7            class.  Support for $get_wrapper argument.  Support
8            for |{read_until}| feature.
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Make a "bare ero" error for unknown
13            entities point the "&" character.
14    
15    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: It turns out that U+FFFD don't have to
18            be added to the list of excluded characters.
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src ($char_onerror): Have character decoder's |line|
23            and |column| a higher priority than the one set by the
24            tokenizer's input handler.
25            ($self->{read_until}): Exclude U+FFFD (but this might
26            not be necessary, since now we do line/column fixup in
27            the character decode handle).
28    
29    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: Use |{read_until}| where possible.
32    
33    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36            and |manakai_getc_until| to |manakai_read_until| to
37            reduce the number of string copies.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src (parse_char_string): Use newly created
42            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43            standard feature to |open| a string as a filehandle,
44            since Perl's string filehandle seems not supporting |ungetc|
45            method correctly.
46            (parse_char_stream): Define |{getc_until}| method.
47            (DATA_STATE): Experimental support for |getc_until| feature.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: Check points added to newly added branches.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Remove |{char}|, which is no longer used.
56            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
57            and replaced by |{prev_state}|.
58    
59            * mkhtmlparser.pl: Remove |{char}| feature.
60            Remove |!!!back-next-input-character;| macro.
61    
62    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
65            entity related tokenizer states in favor of new states
66            implementing the consume character reference algorithm.
67    
68    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: "Consume a character reference" algorithm is
71            now implemented as a tokenizer's state, rather than
72            a method, with minimum changes (more changes will
73            be made, in due course).  "Bogus comment state"'s inner
74            loop gets removed.
75    
76    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
79            into their own tokenizer states.
80    
81    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
84            is split into three states.
85    
86    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90            no longer does the tokenizer have to push back next input
91            characters in those states.
92    
93    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96            into four states so that no longer does the tokenizer have to push
97            back next input characters in that state.
98    
99    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
102            which can be used to insert some wrapper between the character
103            stream handle and the tokenizer.  (It is currently not supported
104            for |set_inner_html| for |Element|s).
105    
106    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Ignore punctuations in charset names.
109    
110    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
111    
112            * ContentChecker.pm: Support for charset-layer error levels.
113    
114            * HTML.pm.src: Don't specify |text| argument for the
115            |chardecode:fallback| error, since it is not the encoding
116            being used alternatively.
117    
118    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121    
122    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
123    
124            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125    
126    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Bug fix and sync with the spec with regard
129            to after after frameset insertion mode processing (HTML5
130            revision 1909).  Note that the implementation was wrong
131            per the old spec before the r1909 changes.
132    
133    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTMLTable.pm: scope=auto algorithm fix synced with the
136            spec (HTML5 revision 2093).
137            ($process_row): Algorithm step numbers synced with the
138            spec (HTML5 revision 2092).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143            revision 2094).
144    
145    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148    
149    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: '"' and "'" at the end of attribute
152            name (after another attribute) now raise parse error (HTML5
153            revision 2123).  Empty unquoted attribute values are no
154            longer allowed (HTML5 revision 2122).
155    
156    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
157    
158            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159            revision 2130).
160    
161    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163            * ContentChecker.pm: |xml:lang| attribute value must be same
164            as |lang| attribute value for HTML elements (HTML5 revision 2062
165            and so on).
166    
167    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
168    
169            * ContentChecker.pm: Error level definition for |xml_id_error|
170            was missing.
171    
172            * URIChecker.pm: The end of the URL should be marked as the
173            error location for an empty path error.  The position
174            between the userinfo and the port components should be
175            marked as the error location for an empty host error.
176    
177    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
178    
179            * URIChecker.pm: Set parameters representing where in the
180            value the error occurs for errors.  Report unknown
181            address format error in warning level, since address
182            formats are rarely added.  Path segments starting with "/.."
183            were misinterpreted as a dot-segment.
184    
185    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
186    
187            * URIChecker.pm (check_iri_reference): Requires
188            |Message::DOM::DOMImplementation|.
189    
190    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
191    
192            * IMTChecker.pm: Updated for the new error reporting architecture.
193    
194            * ContentChecker.pm: Error levels for IMTs are added.
195    
196    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
197    
198            * H2H.pm (_shift_token): Support for unquoted HTML attribute
199            values.
200    
201    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
202    
203            * CacheManifest.pm: Support for new style of error
204            reports.
205    
206            * HTML.pm.src: Set line=1, column=1 to the document node.
207    
208    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
209    
210            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
211            and URL checkers.  Support for more error levels for bogus
212            langauge tag and URL "standards".
213    
214            * LangTag.pm, URIChecker.pm: Support for new style error
215            level reporting.
216    
217    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
218    
219            * ContentChecker.pm: Support for RDF/XML error levels.
220    
221            * HTMLTable.pm, RDFXML.pm: Support for new style of error level
222            specifying.  Error types are revised.
223    
224    2008-08-15  Wakaba  <wakaba@suika.fam.cx>
225    
226            * ContentChecker.pm: All error reporting method calls are
227            renewed.
228    
229  2008-08-15  Wakaba  <wakaba@suika.fam.cx>  2008-08-15  Wakaba  <wakaba@suika.fam.cx>
230    
231          * HTML.pm.src: All error type names and "text" parameters          * HTML.pm.src: All error type names and "text" parameters

Legend:
Removed from v.1.271  
changed lines
  Added in v.1.306

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24