/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.291 by wakaba, Thu Sep 11 09:12:27 2008 UTC revision 1.344 by wakaba, Sat Oct 4 12:20:35 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
4            revisions 1731 and 2128).
5    
6    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
9            Support for end tags of camelCase SVG elements were broken.  A
10            wrong error type text fixed.
11    
12    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
15    
16    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Support for new definition of |param| and |source|
19            start tag parsing (HTML5 revision 1731).
20    
21    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
22    
23            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
28            revisions 1731 and 1831).
29    
30    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
33            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
34            revision 1731).
35    
36    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
39            revision 1731).  End tags of |option| and |optgroup| elements are
40            now optional (HTML5 revision 1731).
41    
42    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: New "special" elements added to the list (HTML5
45            revision 1778).  "strile" -> "strike".
46    
47    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
48    
49            * ContentType.pm (get_sniffed_type): Support for the "better"
50            content sniffing (HTML5 revision 1927).  In a case the official
51            type was not returned when the method is invoked in the list
52            context.
53    
54    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
55    
56            * HTML.pm.src: Character references for non-space C0 characters,
57            including U+000B VT, DEL character, noncharacter code points, are
58            now converted to the U+FFFD character (cf. HTML5 revision 2138).
59    
60    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
61    
62            * ContentChecker.pm: |form=""| check support added.
63    
64    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
65    
66            * ContentChecker.pm: |contextmenu| validness is now checked using
67            |id| and |id_type| properties, and |menu| property is removed.
68    
69    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
70    
71            * ContentChecker.pm: Prepare for |form| |name| attribute's
72            duplication checking.
73    
74    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
75    
76            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
77            unsupported charset="" parameter value (HTML5 revision 2131).
78    
79    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
80    
81            * HTML.pm.src: Reminding places where U+000B is allowed as a space
82            character is fixed (cf. HTML5 revision 1738).
83    
84            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
85            space characters (HTML5 revision 1738).
86    
87    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: The "anything else" case for the "after after body"
90            insertion mode was not updated to swtich to the "in body"
91            insertion mode.  U+000B is no longer a space character for the
92            purpose of tree construction phase (HTML5 revision 1738).
93    
94    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: U+000B is no longer a space character (HTML5
97            revision 1738).
98    
99    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
100    
101            * ContentType.pm: 0x0B is no longer a space character (HTML5
102            revision 1738).
103    
104            * HTML.pm.src: U+000B is no longer a space character for the
105            algorithm for extracting an encoding from a Content-Type (HTML5
106            revision 1738).
107    
108    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
109    
110            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
111    
112    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
113    
114            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
115    
116            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
117            matching.
118    
119    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
120    
121            * LangTag.pm: Warn for private use language subtags.  Error level
122            typos fixed.  Support for Suppress-Script field.
123    
124            * mklangreg.pl: Support for dumping of nested structure.
125    
126    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
127    
128            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
129            recommended case as per RFC 4646.
130    
131    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
132    
133            * LangTag.pm (check_rfc4646_langtag): New method.
134    
135    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
136    
137            * mklangreg.pl: New script.
138    
139            * Makefile: Updated for creation of the module for language subtag
140            registry.
141            
142    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
143    
144            * Makefile: WebIDL.html added.
145    
146            * WebIDL.pod: New documentation.
147    
148    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
149    
150            * WebIDL.pm: Checker's error types are redefined.
151    
152    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
153    
154            * WebIDL.pm: Parser's error types are redefined.  Some forward
155            compatible parsing bugs are fixed.  Some unreachable codes are
156            commented out.
157    
158    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
159    
160            * WebIDL.pm: Support for the reminding extended attributes are
161            added.  It does not satisfy the definition that a forward
162            interface declaration has an extended attribute.  It seems that
163            unless explicitly allowed multiple extended attributes with the
164            same name is not allowed, though it is not explicitly mentioned in
165            the spec.
166    
167    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
168    
169            * WebIDL.pm: Unescapes extended attribute names and extended
170            attribute identifiers.  Preserve whether an extended attribute has
171            an argument list of not.  Support for extended attributes:
172            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
173            NameGetter, NameSetter, and Null.
174            (has_argument_list): New attribute.
175            (idl_text): Stringifies argument lists, if any, even if it is
176            empty.
177    
178    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
179    
180            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
181            |{s_kwd}| in DATA_STATE as default.
182    
183    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
186            by |{s_kwd}| in DATA_STATE.
187    
188    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
189    
190            * HTML.pm.src: Shorten keys.
191    
192    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: Remove checking for control character, surrogate
195            pair, or noncharacter code points and non-Unicode code
196            points (they should be handled by Whatpm::Charset::UnicodeChecker).
197            (parse_char_stream): Support for the |$get_wrapper| argument and
198            character stream error handlers.
199    
200    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
201    
202            * ContentChecker.pm: Don't call |loda_ns_module|
203            for null-namespace elements/attributes.
204    
205            * HTML.pm.src: Fact out $disallowed_control_chars
206            as a hash.
207    
208    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
209    
210            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
211            and |{next_char}| initializations are moved to initialization
212            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
213            with |parse_char_stream|.
214    
215    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
216    
217            * HTML.pm.src (parse_char_stream): Make |set_next_char|
218            invoke |manakai_read_until|, not only |read|, where
219            possible, to decrease the number of |read| method calls.
220    
221            * mkhtmlparser.pl: Related changes to the aforementioned
222            modification.
223    
224    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
225    
226            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
227            would report character error from now.
228    
229    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: White-space-leaded non-white-space character
232            tokens in "before head insertion mode" was not
233            correctly handled.
234            (set_inner_html): Reimplemented using CharString decodehandle
235            class.  Support for $get_wrapper argument.  Support
236            for |{read_until}| feature.
237    
238    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
239    
240            * HTML.pm.src: Make a "bare ero" error for unknown
241            entities point the "&" character.
242    
243    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: It turns out that U+FFFD don't have to
246            be added to the list of excluded characters.
247    
248    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
249    
250            * HTML.pm.src ($char_onerror): Have character decoder's |line|
251            and |column| a higher priority than the one set by the
252            tokenizer's input handler.
253            ($self->{read_until}): Exclude U+FFFD (but this might
254            not be necessary, since now we do line/column fixup in
255            the character decode handle).
256    
257    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
258    
259            * HTML.pm.src: Use |{read_until}| where possible.
260    
261    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
262    
263            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
264            and |manakai_getc_until| to |manakai_read_until| to
265            reduce the number of string copies.
266    
267    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
268    
269            * HTML.pm.src (parse_char_string): Use newly created
270            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
271            standard feature to |open| a string as a filehandle,
272            since Perl's string filehandle seems not supporting |ungetc|
273            method correctly.
274            (parse_char_stream): Define |{getc_until}| method.
275            (DATA_STATE): Experimental support for |getc_until| feature.
276    
277    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
278    
279            * HTML.pm.src: Check points added to newly added branches.
280    
281    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
282    
283            * HTML.pm.src: Remove |{char}|, which is no longer used.
284            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
285            and replaced by |{prev_state}|.
286    
287            * mkhtmlparser.pl: Remove |{char}| feature.
288            Remove |!!!back-next-input-character;| macro.
289    
290    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
291    
292            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
293            entity related tokenizer states in favor of new states
294            implementing the consume character reference algorithm.
295    
296    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
297    
298            * HTML.pm.src: "Consume a character reference" algorithm is
299            now implemented as a tokenizer's state, rather than
300            a method, with minimum changes (more changes will
301            be made, in due course).  "Bogus comment state"'s inner
302            loop gets removed.
303    
304    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
305    
306            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
307            into their own tokenizer states.
308    
309    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
310    
311            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
312            is split into three states.
313    
314    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
315    
316            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
317            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
318            no longer does the tokenizer have to push back next input
319            characters in those states.
320    
321    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
322    
323            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
324            into four states so that no longer does the tokenizer have to push
325            back next input characters in that state.
326    
327  2008-09-11  Wakaba  <wakaba@suika.fam.cx>  2008-09-11  Wakaba  <wakaba@suika.fam.cx>
328    
329          * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,          * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,

Legend:
Removed from v.1.291  
changed lines
  Added in v.1.344

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24