/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.282 by wakaba, Sat Aug 30 12:57:05 2008 UTC revision 1.315 by wakaba, Tue Sep 16 04:20:52 2008 UTC
# Line 1  Line 1 
1    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * WebIDL.pm: Unescapes extended attribute names and extended
4            attribute identifiers.  Preserve whether an extended attribute has
5            an argument list of not.  Support for extended attributes:
6            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
7            NameGetter, NameSetter, and Null.
8            (has_argument_list): New attribute.
9            (idl_text): Stringifies argument lists, if any, even if it is
10            empty.
11    
12    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
15            |{s_kwd}| in DATA_STATE as default.
16    
17    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
20            by |{s_kwd}| in DATA_STATE.
21    
22    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Shorten keys.
25    
26    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Remove checking for control character, surrogate
29            pair, or noncharacter code points and non-Unicode code
30            points (they should be handled by Whatpm::Charset::UnicodeChecker).
31            (parse_char_stream): Support for the |$get_wrapper| argument and
32            character stream error handlers.
33    
34    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
35    
36            * ContentChecker.pm: Don't call |loda_ns_module|
37            for null-namespace elements/attributes.
38    
39            * HTML.pm.src: Fact out $disallowed_control_chars
40            as a hash.
41    
42    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
45            and |{next_char}| initializations are moved to initialization
46            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
47            with |parse_char_stream|.
48    
49    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src (parse_char_stream): Make |set_next_char|
52            invoke |manakai_read_until|, not only |read|, where
53            possible, to decrease the number of |read| method calls.
54    
55            * mkhtmlparser.pl: Related changes to the aforementioned
56            modification.
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
61            would report character error from now.
62    
63    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: White-space-leaded non-white-space character
66            tokens in "before head insertion mode" was not
67            correctly handled.
68            (set_inner_html): Reimplemented using CharString decodehandle
69            class.  Support for $get_wrapper argument.  Support
70            for |{read_until}| feature.
71    
72    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: Make a "bare ero" error for unknown
75            entities point the "&" character.
76    
77    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: It turns out that U+FFFD don't have to
80            be added to the list of excluded characters.
81    
82    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src ($char_onerror): Have character decoder's |line|
85            and |column| a higher priority than the one set by the
86            tokenizer's input handler.
87            ($self->{read_until}): Exclude U+FFFD (but this might
88            not be necessary, since now we do line/column fixup in
89            the character decode handle).
90    
91    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: Use |{read_until}| where possible.
94    
95    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
96    
97            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
98            and |manakai_getc_until| to |manakai_read_until| to
99            reduce the number of string copies.
100    
101    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src (parse_char_string): Use newly created
104            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
105            standard feature to |open| a string as a filehandle,
106            since Perl's string filehandle seems not supporting |ungetc|
107            method correctly.
108            (parse_char_stream): Define |{getc_until}| method.
109            (DATA_STATE): Experimental support for |getc_until| feature.
110    
111    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: Check points added to newly added branches.
114    
115    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Remove |{char}|, which is no longer used.
118            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
119            and replaced by |{prev_state}|.
120    
121            * mkhtmlparser.pl: Remove |{char}| feature.
122            Remove |!!!back-next-input-character;| macro.
123    
124    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
127            entity related tokenizer states in favor of new states
128            implementing the consume character reference algorithm.
129    
130    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: "Consume a character reference" algorithm is
133            now implemented as a tokenizer's state, rather than
134            a method, with minimum changes (more changes will
135            be made, in due course).  "Bogus comment state"'s inner
136            loop gets removed.
137    
138    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
141            into their own tokenizer states.
142    
143    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
146            is split into three states.
147    
148    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
151            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
152            no longer does the tokenizer have to push back next input
153            characters in those states.
154    
155    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
158            into four states so that no longer does the tokenizer have to push
159            back next input characters in that state.
160    
161    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
164            which can be used to insert some wrapper between the character
165            stream handle and the tokenizer.  (It is currently not supported
166            for |set_inner_html| for |Element|s).
167    
168    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
169    
170            * HTML.pm.src: Ignore punctuations in charset names.
171    
172    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
173    
174            * ContentChecker.pm: Support for charset-layer error levels.
175    
176            * HTML.pm.src: Don't specify |text| argument for the
177            |chardecode:fallback| error, since it is not the encoding
178            being used alternatively.
179    
180    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
181    
182            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
183    
184    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
185    
186            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
187    
188    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Bug fix and sync with the spec with regard
191            to after after frameset insertion mode processing (HTML5
192            revision 1909).  Note that the implementation was wrong
193            per the old spec before the r1909 changes.
194    
195    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
196    
197            * HTMLTable.pm: scope=auto algorithm fix synced with the
198            spec (HTML5 revision 2093).
199            ($process_row): Algorithm step numbers synced with the
200            spec (HTML5 revision 2092).
201    
202    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
205            revision 2094).
206    
207    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
208    
209            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
210    
211    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
212    
213            * HTML.pm.src: '"' and "'" at the end of attribute
214            name (after another attribute) now raise parse error (HTML5
215            revision 2123).  Empty unquoted attribute values are no
216            longer allowed (HTML5 revision 2122).
217    
218  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
219    
220          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5          * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5

Legend:
Removed from v.1.282  
changed lines
  Added in v.1.315

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24