/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.290 by wakaba, Wed Sep 10 10:46:50 2008 UTC revision 1.353 by wakaba, Tue Oct 14 09:00:57 2008 UTC
# Line 1  Line 1 
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * NanoDOM.pm (dom_config): New attribute (do nothing), for
4            Whatpm::XML::Parser support.
5    
6    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
7    
8            * Makefile: New rule to make HTML/Tokenizer.pm is added.
9    
10            * HTML.pm.src: Tokenizer part moved to another file.
11    
12    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
15    
16    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
17    
18            * HTML.pm.src: Element category constants redefined.
19            
20    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
23            construction stage synced with the spec (HTML5 revisions 2139 and
24            2302).
25    
26    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
27    
28            * ContentChecker.pm: New error level "html5_fact" added, which
29            should be tentatively used until all of requirements are properly
30            specced as RFC 2119 "MUST" in HTML5.
31    
32    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentChecker.pod: Note on internal flags is added.
35    
36    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
37    
38            * HTML.pm.src: An AAA bug fixed.
39    
40    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
41    
42            * HTML.pm.src: If another node is inserted by the parser, don't
43            reuse existing Text node to append a character (HTML5 revision
44            2124).
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
49            revisions 1731 and 2128).
50    
51    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
52    
53            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
54            Support for end tags of camelCase SVG elements were broken.  A
55            wrong error type text fixed.
56    
57    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
60    
61    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
62    
63            * HTML.pm.src: Support for new definition of |param| and |source|
64            start tag parsing (HTML5 revision 1731).
65    
66    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
67    
68            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
69    
70    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
71    
72            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
73            revisions 1731 and 1831).
74    
75    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
76    
77            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
78            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
79            revision 1731).
80    
81    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
84            revision 1731).  End tags of |option| and |optgroup| elements are
85            now optional (HTML5 revision 1731).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: New "special" elements added to the list (HTML5
90            revision 1778).  "strile" -> "strike".
91    
92    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
93    
94            * ContentType.pm (get_sniffed_type): Support for the "better"
95            content sniffing (HTML5 revision 1927).  In a case the official
96            type was not returned when the method is invoked in the list
97            context.
98    
99    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Character references for non-space C0 characters,
102            including U+000B VT, DEL character, noncharacter code points, are
103            now converted to the U+FFFD character (cf. HTML5 revision 2138).
104    
105    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
106    
107            * ContentChecker.pm: |form=""| check support added.
108    
109    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
110    
111            * ContentChecker.pm: |contextmenu| validness is now checked using
112            |id| and |id_type| properties, and |menu| property is removed.
113    
114    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
115    
116            * ContentChecker.pm: Prepare for |form| |name| attribute's
117            duplication checking.
118    
119    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
120    
121            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
122            unsupported charset="" parameter value (HTML5 revision 2131).
123    
124    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Reminding places where U+000B is allowed as a space
127            character is fixed (cf. HTML5 revision 1738).
128    
129            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
130            space characters (HTML5 revision 1738).
131    
132    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
133    
134            * HTML.pm.src: The "anything else" case for the "after after body"
135            insertion mode was not updated to swtich to the "in body"
136            insertion mode.  U+000B is no longer a space character for the
137            purpose of tree construction phase (HTML5 revision 1738).
138    
139    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: U+000B is no longer a space character (HTML5
142            revision 1738).
143    
144    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
145    
146            * ContentType.pm: 0x0B is no longer a space character (HTML5
147            revision 1738).
148    
149            * HTML.pm.src: U+000B is no longer a space character for the
150            algorithm for extracting an encoding from a Content-Type (HTML5
151            revision 1738).
152    
153    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
154    
155            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
156    
157    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
158    
159            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
160    
161            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
162            matching.
163    
164    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
165    
166            * LangTag.pm: Warn for private use language subtags.  Error level
167            typos fixed.  Support for Suppress-Script field.
168    
169            * mklangreg.pl: Support for dumping of nested structure.
170    
171    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
172    
173            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
174            recommended case as per RFC 4646.
175    
176    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
177    
178            * LangTag.pm (check_rfc4646_langtag): New method.
179    
180    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
181    
182            * mklangreg.pl: New script.
183    
184            * Makefile: Updated for creation of the module for language subtag
185            registry.
186            
187    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
188    
189            * Makefile: WebIDL.html added.
190    
191            * WebIDL.pod: New documentation.
192    
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * WebIDL.pm: Checker's error types are redefined.
196    
197    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
198    
199            * WebIDL.pm: Parser's error types are redefined.  Some forward
200            compatible parsing bugs are fixed.  Some unreachable codes are
201            commented out.
202    
203    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
204    
205            * WebIDL.pm: Support for the reminding extended attributes are
206            added.  It does not satisfy the definition that a forward
207            interface declaration has an extended attribute.  It seems that
208            unless explicitly allowed multiple extended attributes with the
209            same name is not allowed, though it is not explicitly mentioned in
210            the spec.
211    
212    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
213    
214            * WebIDL.pm: Unescapes extended attribute names and extended
215            attribute identifiers.  Preserve whether an extended attribute has
216            an argument list of not.  Support for extended attributes:
217            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
218            NameGetter, NameSetter, and Null.
219            (has_argument_list): New attribute.
220            (idl_text): Stringifies argument lists, if any, even if it is
221            empty.
222    
223    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
224    
225            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
226            |{s_kwd}| in DATA_STATE as default.
227    
228    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
229    
230            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
231            by |{s_kwd}| in DATA_STATE.
232    
233    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
234    
235            * HTML.pm.src: Shorten keys.
236    
237    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src: Remove checking for control character, surrogate
240            pair, or noncharacter code points and non-Unicode code
241            points (they should be handled by Whatpm::Charset::UnicodeChecker).
242            (parse_char_stream): Support for the |$get_wrapper| argument and
243            character stream error handlers.
244    
245    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
246    
247            * ContentChecker.pm: Don't call |loda_ns_module|
248            for null-namespace elements/attributes.
249    
250            * HTML.pm.src: Fact out $disallowed_control_chars
251            as a hash.
252    
253    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
254    
255            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
256            and |{next_char}| initializations are moved to initialization
257            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
258            with |parse_char_stream|.
259    
260    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
261    
262            * HTML.pm.src (parse_char_stream): Make |set_next_char|
263            invoke |manakai_read_until|, not only |read|, where
264            possible, to decrease the number of |read| method calls.
265    
266            * mkhtmlparser.pl: Related changes to the aforementioned
267            modification.
268    
269    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
270    
271            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
272            would report character error from now.
273    
274    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
275    
276            * HTML.pm.src: White-space-leaded non-white-space character
277            tokens in "before head insertion mode" was not
278            correctly handled.
279            (set_inner_html): Reimplemented using CharString decodehandle
280            class.  Support for $get_wrapper argument.  Support
281            for |{read_until}| feature.
282    
283    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
284    
285            * HTML.pm.src: Make a "bare ero" error for unknown
286            entities point the "&" character.
287    
288    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
289    
290            * HTML.pm.src: It turns out that U+FFFD don't have to
291            be added to the list of excluded characters.
292    
293    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
294    
295            * HTML.pm.src ($char_onerror): Have character decoder's |line|
296            and |column| a higher priority than the one set by the
297            tokenizer's input handler.
298            ($self->{read_until}): Exclude U+FFFD (but this might
299            not be necessary, since now we do line/column fixup in
300            the character decode handle).
301    
302    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
303    
304            * HTML.pm.src: Use |{read_until}| where possible.
305    
306    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
307    
308            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
309            and |manakai_getc_until| to |manakai_read_until| to
310            reduce the number of string copies.
311    
312    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src (parse_char_string): Use newly created
315            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
316            standard feature to |open| a string as a filehandle,
317            since Perl's string filehandle seems not supporting |ungetc|
318            method correctly.
319            (parse_char_stream): Define |{getc_until}| method.
320            (DATA_STATE): Experimental support for |getc_until| feature.
321    
322    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
323    
324            * HTML.pm.src: Check points added to newly added branches.
325    
326    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
327    
328            * HTML.pm.src: Remove |{char}|, which is no longer used.
329            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
330            and replaced by |{prev_state}|.
331    
332            * mkhtmlparser.pl: Remove |{char}| feature.
333            Remove |!!!back-next-input-character;| macro.
334    
335    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
336    
337            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
338            entity related tokenizer states in favor of new states
339            implementing the consume character reference algorithm.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: "Consume a character reference" algorithm is
344            now implemented as a tokenizer's state, rather than
345            a method, with minimum changes (more changes will
346            be made, in due course).  "Bogus comment state"'s inner
347            loop gets removed.
348    
349    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
350    
351            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
352            into their own tokenizer states.
353    
354    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
355    
356            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
357            is split into three states.
358    
359    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
360    
361            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
362            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
363            no longer does the tokenizer have to push back next input
364            characters in those states.
365    
366    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
367    
368            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
369            into four states so that no longer does the tokenizer have to push
370            back next input characters in that state.
371    
372    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
375            which can be used to insert some wrapper between the character
376            stream handle and the tokenizer.  (It is currently not supported
377            for |set_inner_html| for |Element|s).
378    
379  2008-09-10  Wakaba  <wakaba@suika.fam.cx>  2008-09-10  Wakaba  <wakaba@suika.fam.cx>
380    
381          * HTML.pm.src: Ignore punctuations in charset names.          * HTML.pm.src: Ignore punctuations in charset names.

Legend:
Removed from v.1.290  
changed lines
  Added in v.1.353

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24