/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.292 by wakaba, Sat Sep 13 04:19:55 2008 UTC revision 1.336 by wakaba, Sat Oct 4 04:06:32 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: New "special" elements added to the list (HTML5
4            revision 1778).  "strile" -> "strike".
5    
6    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
7    
8            * ContentType.pm (get_sniffed_type): Support for the "better"
9            content sniffing (HTML5 revision 1927).  In a case the official
10            type was not returned when the method is invoked in the list
11            context.
12    
13    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
14    
15            * HTML.pm.src: Character references for non-space C0 characters,
16            including U+000B VT, DEL character, noncharacter code points, are
17            now converted to the U+FFFD character (cf. HTML5 revision 2138).
18    
19    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
20    
21            * ContentChecker.pm: |form=""| check support added.
22    
23    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
24    
25            * ContentChecker.pm: |contextmenu| validness is now checked using
26            |id| and |id_type| properties, and |menu| property is removed.
27    
28    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
29    
30            * ContentChecker.pm: Prepare for |form| |name| attribute's
31            duplication checking.
32    
33    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
36            unsupported charset="" parameter value (HTML5 revision 2131).
37    
38    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
39    
40            * HTML.pm.src: Reminding places where U+000B is allowed as a space
41            character is fixed (cf. HTML5 revision 1738).
42    
43            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
44            space characters (HTML5 revision 1738).
45    
46    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: The "anything else" case for the "after after body"
49            insertion mode was not updated to swtich to the "in body"
50            insertion mode.  U+000B is no longer a space character for the
51            purpose of tree construction phase (HTML5 revision 1738).
52    
53    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: U+000B is no longer a space character (HTML5
56            revision 1738).
57    
58    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
59    
60            * ContentType.pm: 0x0B is no longer a space character (HTML5
61            revision 1738).
62    
63            * HTML.pm.src: U+000B is no longer a space character for the
64            algorithm for extracting an encoding from a Content-Type (HTML5
65            revision 1738).
66    
67    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
68    
69            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
70    
71    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
72    
73            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
74    
75            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
76            matching.
77    
78    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
79    
80            * LangTag.pm: Warn for private use language subtags.  Error level
81            typos fixed.  Support for Suppress-Script field.
82    
83            * mklangreg.pl: Support for dumping of nested structure.
84    
85    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
86    
87            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
88            recommended case as per RFC 4646.
89    
90    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
91    
92            * LangTag.pm (check_rfc4646_langtag): New method.
93    
94    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
95    
96            * mklangreg.pl: New script.
97    
98            * Makefile: Updated for creation of the module for language subtag
99            registry.
100            
101    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
102    
103            * Makefile: WebIDL.html added.
104    
105            * WebIDL.pod: New documentation.
106    
107    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
108    
109            * WebIDL.pm: Checker's error types are redefined.
110    
111    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
112    
113            * WebIDL.pm: Parser's error types are redefined.  Some forward
114            compatible parsing bugs are fixed.  Some unreachable codes are
115            commented out.
116    
117    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
118    
119            * WebIDL.pm: Support for the reminding extended attributes are
120            added.  It does not satisfy the definition that a forward
121            interface declaration has an extended attribute.  It seems that
122            unless explicitly allowed multiple extended attributes with the
123            same name is not allowed, though it is not explicitly mentioned in
124            the spec.
125    
126    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
127    
128            * WebIDL.pm: Unescapes extended attribute names and extended
129            attribute identifiers.  Preserve whether an extended attribute has
130            an argument list of not.  Support for extended attributes:
131            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
132            NameGetter, NameSetter, and Null.
133            (has_argument_list): New attribute.
134            (idl_text): Stringifies argument lists, if any, even if it is
135            empty.
136    
137    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
138    
139            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
140            |{s_kwd}| in DATA_STATE as default.
141    
142    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
143    
144            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
145            by |{s_kwd}| in DATA_STATE.
146    
147    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
148    
149            * HTML.pm.src: Shorten keys.
150    
151    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
152    
153            * HTML.pm.src: Remove checking for control character, surrogate
154            pair, or noncharacter code points and non-Unicode code
155            points (they should be handled by Whatpm::Charset::UnicodeChecker).
156            (parse_char_stream): Support for the |$get_wrapper| argument and
157            character stream error handlers.
158    
159    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentChecker.pm: Don't call |loda_ns_module|
162            for null-namespace elements/attributes.
163    
164            * HTML.pm.src: Fact out $disallowed_control_chars
165            as a hash.
166    
167    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
168    
169            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
170            and |{next_char}| initializations are moved to initialization
171            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
172            with |parse_char_stream|.
173    
174    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
175    
176            * HTML.pm.src (parse_char_stream): Make |set_next_char|
177            invoke |manakai_read_until|, not only |read|, where
178            possible, to decrease the number of |read| method calls.
179    
180            * mkhtmlparser.pl: Related changes to the aforementioned
181            modification.
182    
183    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
186            would report character error from now.
187    
188    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: White-space-leaded non-white-space character
191            tokens in "before head insertion mode" was not
192            correctly handled.
193            (set_inner_html): Reimplemented using CharString decodehandle
194            class.  Support for $get_wrapper argument.  Support
195            for |{read_until}| feature.
196    
197    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Make a "bare ero" error for unknown
200            entities point the "&" character.
201    
202    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
203    
204            * HTML.pm.src: It turns out that U+FFFD don't have to
205            be added to the list of excluded characters.
206    
207    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
208    
209            * HTML.pm.src ($char_onerror): Have character decoder's |line|
210            and |column| a higher priority than the one set by the
211            tokenizer's input handler.
212            ($self->{read_until}): Exclude U+FFFD (but this might
213            not be necessary, since now we do line/column fixup in
214            the character decode handle).
215    
216    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
217    
218            * HTML.pm.src: Use |{read_until}| where possible.
219    
220    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
221    
222            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
223            and |manakai_getc_until| to |manakai_read_until| to
224            reduce the number of string copies.
225    
226    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
227    
228            * HTML.pm.src (parse_char_string): Use newly created
229            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
230            standard feature to |open| a string as a filehandle,
231            since Perl's string filehandle seems not supporting |ungetc|
232            method correctly.
233            (parse_char_stream): Define |{getc_until}| method.
234            (DATA_STATE): Experimental support for |getc_until| feature.
235    
236    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
237    
238            * HTML.pm.src: Check points added to newly added branches.
239    
240    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
241    
242            * HTML.pm.src: Remove |{char}|, which is no longer used.
243            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
244            and replaced by |{prev_state}|.
245    
246            * mkhtmlparser.pl: Remove |{char}| feature.
247            Remove |!!!back-next-input-character;| macro.
248    
249    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
250    
251            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
252            entity related tokenizer states in favor of new states
253            implementing the consume character reference algorithm.
254    
255    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
256    
257            * HTML.pm.src: "Consume a character reference" algorithm is
258            now implemented as a tokenizer's state, rather than
259            a method, with minimum changes (more changes will
260            be made, in due course).  "Bogus comment state"'s inner
261            loop gets removed.
262    
263    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
264    
265            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
266            into their own tokenizer states.
267    
268    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
269    
270            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
271            is split into three states.
272    
273    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
274    
275            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
276            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
277            no longer does the tokenizer have to push back next input
278            characters in those states.
279    
280  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
281    
282          * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken          * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken

Legend:
Removed from v.1.292  
changed lines
  Added in v.1.336

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24