/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.354 by wakaba, Tue Oct 14 13:24:52 2008 UTC
# Line 1  Line 1 
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Handling of end tags in the foreign content
4            insertion mode was partially wrong, because of wrong bit
5            operations.
6    
7    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
8    
9            * NanoDOM.pm (dom_config): New attribute (do nothing), for
10            Whatpm::XML::Parser support.
11    
12    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
13    
14            * Makefile: New rule to make HTML/Tokenizer.pm is added.
15    
16            * HTML.pm.src: Tokenizer part moved to another file.
17    
18    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
19    
20            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
21    
22    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Element category constants redefined.
25            
26    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
29            construction stage synced with the spec (HTML5 revisions 2139 and
30            2302).
31    
32    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentChecker.pm: New error level "html5_fact" added, which
35            should be tentatively used until all of requirements are properly
36            specced as RFC 2119 "MUST" in HTML5.
37    
38    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
39    
40            * ContentChecker.pod: Note on internal flags is added.
41    
42    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: An AAA bug fixed.
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: If another node is inserted by the parser, don't
49            reuse existing Text node to append a character (HTML5 revision
50            2124).
51    
52    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
55            revisions 1731 and 2128).
56    
57    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
60            Support for end tags of camelCase SVG elements were broken.  A
61            wrong error type text fixed.
62    
63    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
66    
67    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Support for new definition of |param| and |source|
70            start tag parsing (HTML5 revision 1731).
71    
72    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
75    
76    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
79            revisions 1731 and 1831).
80    
81    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
84            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
85            revision 1731).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
90            revision 1731).  End tags of |option| and |optgroup| elements are
91            now optional (HTML5 revision 1731).
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: New "special" elements added to the list (HTML5
96            revision 1778).  "strile" -> "strike".
97    
98    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
99    
100            * ContentType.pm (get_sniffed_type): Support for the "better"
101            content sniffing (HTML5 revision 1927).  In a case the official
102            type was not returned when the method is invoked in the list
103            context.
104    
105    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Character references for non-space C0 characters,
108            including U+000B VT, DEL character, noncharacter code points, are
109            now converted to the U+FFFD character (cf. HTML5 revision 2138).
110    
111    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
112    
113            * ContentChecker.pm: |form=""| check support added.
114    
115    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
116    
117            * ContentChecker.pm: |contextmenu| validness is now checked using
118            |id| and |id_type| properties, and |menu| property is removed.
119    
120    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
121    
122            * ContentChecker.pm: Prepare for |form| |name| attribute's
123            duplication checking.
124    
125    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
128            unsupported charset="" parameter value (HTML5 revision 2131).
129    
130    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Reminding places where U+000B is allowed as a space
133            character is fixed (cf. HTML5 revision 1738).
134    
135            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
136            space characters (HTML5 revision 1738).
137    
138    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: The "anything else" case for the "after after body"
141            insertion mode was not updated to swtich to the "in body"
142            insertion mode.  U+000B is no longer a space character for the
143            purpose of tree construction phase (HTML5 revision 1738).
144    
145    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: U+000B is no longer a space character (HTML5
148            revision 1738).
149    
150    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
151    
152            * ContentType.pm: 0x0B is no longer a space character (HTML5
153            revision 1738).
154    
155            * HTML.pm.src: U+000B is no longer a space character for the
156            algorithm for extracting an encoding from a Content-Type (HTML5
157            revision 1738).
158    
159    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
162    
163    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
164    
165            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
166    
167            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
168            matching.
169    
170    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
171    
172            * LangTag.pm: Warn for private use language subtags.  Error level
173            typos fixed.  Support for Suppress-Script field.
174    
175            * mklangreg.pl: Support for dumping of nested structure.
176    
177    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
178    
179            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
180            recommended case as per RFC 4646.
181    
182    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
183    
184            * LangTag.pm (check_rfc4646_langtag): New method.
185    
186    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
187    
188            * mklangreg.pl: New script.
189    
190            * Makefile: Updated for creation of the module for language subtag
191            registry.
192            
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * Makefile: WebIDL.html added.
196    
197            * WebIDL.pod: New documentation.
198    
199    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * WebIDL.pm: Checker's error types are redefined.
202    
203    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
204    
205            * WebIDL.pm: Parser's error types are redefined.  Some forward
206            compatible parsing bugs are fixed.  Some unreachable codes are
207            commented out.
208    
209    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm: Support for the reminding extended attributes are
212            added.  It does not satisfy the definition that a forward
213            interface declaration has an extended attribute.  It seems that
214            unless explicitly allowed multiple extended attributes with the
215            same name is not allowed, though it is not explicitly mentioned in
216            the spec.
217    
218    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
219    
220            * WebIDL.pm: Unescapes extended attribute names and extended
221            attribute identifiers.  Preserve whether an extended attribute has
222            an argument list of not.  Support for extended attributes:
223            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
224            NameGetter, NameSetter, and Null.
225            (has_argument_list): New attribute.
226            (idl_text): Stringifies argument lists, if any, even if it is
227            empty.
228    
229    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
232            |{s_kwd}| in DATA_STATE as default.
233    
234    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
237            by |{s_kwd}| in DATA_STATE.
238    
239    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
240    
241            * HTML.pm.src: Shorten keys.
242    
243    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Remove checking for control character, surrogate
246            pair, or noncharacter code points and non-Unicode code
247            points (they should be handled by Whatpm::Charset::UnicodeChecker).
248            (parse_char_stream): Support for the |$get_wrapper| argument and
249            character stream error handlers.
250    
251    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
252    
253            * ContentChecker.pm: Don't call |loda_ns_module|
254            for null-namespace elements/attributes.
255    
256            * HTML.pm.src: Fact out $disallowed_control_chars
257            as a hash.
258    
259    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
262            and |{next_char}| initializations are moved to initialization
263            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
264            with |parse_char_stream|.
265    
266    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src (parse_char_stream): Make |set_next_char|
269            invoke |manakai_read_until|, not only |read|, where
270            possible, to decrease the number of |read| method calls.
271    
272            * mkhtmlparser.pl: Related changes to the aforementioned
273            modification.
274    
275    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
278            would report character error from now.
279    
280    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: White-space-leaded non-white-space character
283            tokens in "before head insertion mode" was not
284            correctly handled.
285            (set_inner_html): Reimplemented using CharString decodehandle
286            class.  Support for $get_wrapper argument.  Support
287            for |{read_until}| feature.
288    
289    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Make a "bare ero" error for unknown
292            entities point the "&" character.
293    
294    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: It turns out that U+FFFD don't have to
297            be added to the list of excluded characters.
298    
299    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src ($char_onerror): Have character decoder's |line|
302            and |column| a higher priority than the one set by the
303            tokenizer's input handler.
304            ($self->{read_until}): Exclude U+FFFD (but this might
305            not be necessary, since now we do line/column fixup in
306            the character decode handle).
307    
308    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: Use |{read_until}| where possible.
311    
312    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
315            and |manakai_getc_until| to |manakai_read_until| to
316            reduce the number of string copies.
317    
318    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src (parse_char_string): Use newly created
321            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
322            standard feature to |open| a string as a filehandle,
323            since Perl's string filehandle seems not supporting |ungetc|
324            method correctly.
325            (parse_char_stream): Define |{getc_until}| method.
326            (DATA_STATE): Experimental support for |getc_until| feature.
327    
328    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: Check points added to newly added branches.
331    
332    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src: Remove |{char}|, which is no longer used.
335            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
336            and replaced by |{prev_state}|.
337    
338            * mkhtmlparser.pl: Remove |{char}| feature.
339            Remove |!!!back-next-input-character;| macro.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
344            entity related tokenizer states in favor of new states
345            implementing the consume character reference algorithm.
346    
347  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
350          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
351          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
352          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 354 
354    
355  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
356    
357          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
358          into their own tokenizer states.          into their own tokenizer states.
359    
360  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.296  
changed lines
  Added in v.1.354

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24