/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.273 by wakaba, Fri Aug 15 14:13:42 2008 UTC revision 1.354 by wakaba, Tue Oct 14 13:24:52 2008 UTC
# Line 1  Line 1 
1    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Handling of end tags in the foreign content
4            insertion mode was partially wrong, because of wrong bit
5            operations.
6    
7    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
8    
9            * NanoDOM.pm (dom_config): New attribute (do nothing), for
10            Whatpm::XML::Parser support.
11    
12    2008-10-14  Wakaba  <wakaba@suika.fam.cx>
13    
14            * Makefile: New rule to make HTML/Tokenizer.pm is added.
15    
16            * HTML.pm.src: Tokenizer part moved to another file.
17    
18    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
19    
20            * HTML.pm.src: Merge |DT_EL| and |DD_EL| as |DTDD_EL|.
21    
22    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Element category constants redefined.
25            
26    2008-10-13  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Steps for CDATA/RCDATA elements in tree
29            construction stage synced with the spec (HTML5 revisions 2139 and
30            2302).
31    
32    2008-10-07  Wakaba  <wakaba@suika.fam.cx>
33    
34            * ContentChecker.pm: New error level "html5_fact" added, which
35            should be tentatively used until all of requirements are properly
36            specced as RFC 2119 "MUST" in HTML5.
37    
38    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
39    
40            * ContentChecker.pod: Note on internal flags is added.
41    
42    2008-10-05  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: An AAA bug fixed.
45    
46    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
47    
48            * HTML.pm.src: If another node is inserted by the parser, don't
49            reuse existing Text node to append a character (HTML5 revision
50            2124).
51    
52    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
53    
54            * HTML.pm.src: Support for <option> and <optgroup> in body (HTML5
55            revisions 1731 and 2128).
56    
57    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Make <foreignObject> scoping (HTML5 revision 1837).
60            Support for end tags of camelCase SVG elements were broken.  A
61            wrong error type text fixed.
62    
63    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
66    
67    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Support for new definition of |param| and |source|
70            start tag parsing (HTML5 revision 1731).
71    
72    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
75    
76    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
79            revisions 1731 and 1831).
80    
81    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
84            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
85            revision 1731).
86    
87    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
90            revision 1731).  End tags of |option| and |optgroup| elements are
91            now optional (HTML5 revision 1731).
92    
93    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: New "special" elements added to the list (HTML5
96            revision 1778).  "strile" -> "strike".
97    
98    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
99    
100            * ContentType.pm (get_sniffed_type): Support for the "better"
101            content sniffing (HTML5 revision 1927).  In a case the official
102            type was not returned when the method is invoked in the list
103            context.
104    
105    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
106    
107            * HTML.pm.src: Character references for non-space C0 characters,
108            including U+000B VT, DEL character, noncharacter code points, are
109            now converted to the U+FFFD character (cf. HTML5 revision 2138).
110    
111    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
112    
113            * ContentChecker.pm: |form=""| check support added.
114    
115    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
116    
117            * ContentChecker.pm: |contextmenu| validness is now checked using
118            |id| and |id_type| properties, and |menu| property is removed.
119    
120    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
121    
122            * ContentChecker.pm: Prepare for |form| |name| attribute's
123            duplication checking.
124    
125    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
126    
127            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
128            unsupported charset="" parameter value (HTML5 revision 2131).
129    
130    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Reminding places where U+000B is allowed as a space
133            character is fixed (cf. HTML5 revision 1738).
134    
135            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
136            space characters (HTML5 revision 1738).
137    
138    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: The "anything else" case for the "after after body"
141            insertion mode was not updated to swtich to the "in body"
142            insertion mode.  U+000B is no longer a space character for the
143            purpose of tree construction phase (HTML5 revision 1738).
144    
145    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: U+000B is no longer a space character (HTML5
148            revision 1738).
149    
150    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
151    
152            * ContentType.pm: 0x0B is no longer a space character (HTML5
153            revision 1738).
154    
155            * HTML.pm.src: U+000B is no longer a space character for the
156            algorithm for extracting an encoding from a Content-Type (HTML5
157            revision 1738).
158    
159    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
160    
161            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
162    
163    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
164    
165            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
166    
167            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
168            matching.
169    
170    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
171    
172            * LangTag.pm: Warn for private use language subtags.  Error level
173            typos fixed.  Support for Suppress-Script field.
174    
175            * mklangreg.pl: Support for dumping of nested structure.
176    
177    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
178    
179            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
180            recommended case as per RFC 4646.
181    
182    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
183    
184            * LangTag.pm (check_rfc4646_langtag): New method.
185    
186    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
187    
188            * mklangreg.pl: New script.
189    
190            * Makefile: Updated for creation of the module for language subtag
191            registry.
192            
193    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
194    
195            * Makefile: WebIDL.html added.
196    
197            * WebIDL.pod: New documentation.
198    
199    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
200    
201            * WebIDL.pm: Checker's error types are redefined.
202    
203    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
204    
205            * WebIDL.pm: Parser's error types are redefined.  Some forward
206            compatible parsing bugs are fixed.  Some unreachable codes are
207            commented out.
208    
209    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
210    
211            * WebIDL.pm: Support for the reminding extended attributes are
212            added.  It does not satisfy the definition that a forward
213            interface declaration has an extended attribute.  It seems that
214            unless explicitly allowed multiple extended attributes with the
215            same name is not allowed, though it is not explicitly mentioned in
216            the spec.
217    
218    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
219    
220            * WebIDL.pm: Unescapes extended attribute names and extended
221            attribute identifiers.  Preserve whether an extended attribute has
222            an argument list of not.  Support for extended attributes:
223            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
224            NameGetter, NameSetter, and Null.
225            (has_argument_list): New attribute.
226            (idl_text): Stringifies argument lists, if any, even if it is
227            empty.
228    
229    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
230    
231            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
232            |{s_kwd}| in DATA_STATE as default.
233    
234    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
235    
236            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
237            by |{s_kwd}| in DATA_STATE.
238    
239    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
240    
241            * HTML.pm.src: Shorten keys.
242    
243    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
244    
245            * HTML.pm.src: Remove checking for control character, surrogate
246            pair, or noncharacter code points and non-Unicode code
247            points (they should be handled by Whatpm::Charset::UnicodeChecker).
248            (parse_char_stream): Support for the |$get_wrapper| argument and
249            character stream error handlers.
250    
251    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
252    
253            * ContentChecker.pm: Don't call |loda_ns_module|
254            for null-namespace elements/attributes.
255    
256            * HTML.pm.src: Fact out $disallowed_control_chars
257            as a hash.
258    
259    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
260    
261            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
262            and |{next_char}| initializations are moved to initialization
263            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
264            with |parse_char_stream|.
265    
266    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src (parse_char_stream): Make |set_next_char|
269            invoke |manakai_read_until|, not only |read|, where
270            possible, to decrease the number of |read| method calls.
271    
272            * mkhtmlparser.pl: Related changes to the aforementioned
273            modification.
274    
275    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
276    
277            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
278            would report character error from now.
279    
280    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
281    
282            * HTML.pm.src: White-space-leaded non-white-space character
283            tokens in "before head insertion mode" was not
284            correctly handled.
285            (set_inner_html): Reimplemented using CharString decodehandle
286            class.  Support for $get_wrapper argument.  Support
287            for |{read_until}| feature.
288    
289    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
290    
291            * HTML.pm.src: Make a "bare ero" error for unknown
292            entities point the "&" character.
293    
294    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
295    
296            * HTML.pm.src: It turns out that U+FFFD don't have to
297            be added to the list of excluded characters.
298    
299    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
300    
301            * HTML.pm.src ($char_onerror): Have character decoder's |line|
302            and |column| a higher priority than the one set by the
303            tokenizer's input handler.
304            ($self->{read_until}): Exclude U+FFFD (but this might
305            not be necessary, since now we do line/column fixup in
306            the character decode handle).
307    
308    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
309    
310            * HTML.pm.src: Use |{read_until}| where possible.
311    
312    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
313    
314            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
315            and |manakai_getc_until| to |manakai_read_until| to
316            reduce the number of string copies.
317    
318    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
319    
320            * HTML.pm.src (parse_char_string): Use newly created
321            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
322            standard feature to |open| a string as a filehandle,
323            since Perl's string filehandle seems not supporting |ungetc|
324            method correctly.
325            (parse_char_stream): Define |{getc_until}| method.
326            (DATA_STATE): Experimental support for |getc_until| feature.
327    
328    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
329    
330            * HTML.pm.src: Check points added to newly added branches.
331    
332    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
333    
334            * HTML.pm.src: Remove |{char}|, which is no longer used.
335            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
336            and replaced by |{prev_state}|.
337    
338            * mkhtmlparser.pl: Remove |{char}| feature.
339            Remove |!!!back-next-input-character;| macro.
340    
341    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
342    
343            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
344            entity related tokenizer states in favor of new states
345            implementing the consume character reference algorithm.
346    
347    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
348    
349            * HTML.pm.src: "Consume a character reference" algorithm is
350            now implemented as a tokenizer's state, rather than
351            a method, with minimum changes (more changes will
352            be made, in due course).  "Bogus comment state"'s inner
353            loop gets removed.
354    
355    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
356    
357            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
358            into their own tokenizer states.
359    
360    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
361    
362            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
363            is split into three states.
364    
365    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
366    
367            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
368            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
369            no longer does the tokenizer have to push back next input
370            characters in those states.
371    
372    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
373    
374            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
375            into four states so that no longer does the tokenizer have to push
376            back next input characters in that state.
377    
378    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
379    
380            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
381            which can be used to insert some wrapper between the character
382            stream handle and the tokenizer.  (It is currently not supported
383            for |set_inner_html| for |Element|s).
384    
385    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
386    
387            * HTML.pm.src: Ignore punctuations in charset names.
388    
389    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
390    
391            * ContentChecker.pm: Support for charset-layer error levels.
392    
393            * HTML.pm.src: Don't specify |text| argument for the
394            |chardecode:fallback| error, since it is not the encoding
395            being used alternatively.
396    
397    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
398    
399            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
400    
401    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
402    
403            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
404    
405    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
406    
407            * HTML.pm.src: Bug fix and sync with the spec with regard
408            to after after frameset insertion mode processing (HTML5
409            revision 1909).  Note that the implementation was wrong
410            per the old spec before the r1909 changes.
411    
412    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
413    
414            * HTMLTable.pm: scope=auto algorithm fix synced with the
415            spec (HTML5 revision 2093).
416            ($process_row): Algorithm step numbers synced with the
417            spec (HTML5 revision 2092).
418    
419    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
420    
421            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
422            revision 2094).
423    
424    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
425    
426            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
427    
428    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
429    
430            * HTML.pm.src: '"' and "'" at the end of attribute
431            name (after another attribute) now raise parse error (HTML5
432            revision 2123).  Empty unquoted attribute values are no
433            longer allowed (HTML5 revision 2122).
434    
435    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
436    
437            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
438            revision 2130).
439    
440    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
441    
442            * ContentChecker.pm: |xml:lang| attribute value must be same
443            as |lang| attribute value for HTML elements (HTML5 revision 2062
444            and so on).
445    
446    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
447    
448            * ContentChecker.pm: Error level definition for |xml_id_error|
449            was missing.
450    
451            * URIChecker.pm: The end of the URL should be marked as the
452            error location for an empty path error.  The position
453            between the userinfo and the port components should be
454            marked as the error location for an empty host error.
455    
456    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
457    
458            * URIChecker.pm: Set parameters representing where in the
459            value the error occurs for errors.  Report unknown
460            address format error in warning level, since address
461            formats are rarely added.  Path segments starting with "/.."
462            were misinterpreted as a dot-segment.
463    
464    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
465    
466            * URIChecker.pm (check_iri_reference): Requires
467            |Message::DOM::DOMImplementation|.
468    
469    2008-08-29  Wakaba  <wakaba@suika.fam.cx>
470    
471            * IMTChecker.pm: Updated for the new error reporting architecture.
472    
473            * ContentChecker.pm: Error levels for IMTs are added.
474    
475    2008-08-17  Wakaba  <wakaba@suika.fam.cx>
476    
477            * H2H.pm (_shift_token): Support for unquoted HTML attribute
478            values.
479    
480    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
481    
482            * CacheManifest.pm: Support for new style of error
483            reports.
484    
485            * HTML.pm.src: Set line=1, column=1 to the document node.
486    
487    2008-08-16  Wakaba  <wakaba@suika.fam.cx>
488    
489            * ContentChecker.pm, RDFXML.pm: Pass {level} object to language tag
490            and URL checkers.  Support for more error levels for bogus
491            langauge tag and URL "standards".
492    
493            * LangTag.pm, URIChecker.pm: Support for new style error
494            level reporting.
495    
496  2008-08-15  Wakaba  <wakaba@suika.fam.cx>  2008-08-15  Wakaba  <wakaba@suika.fam.cx>
497    
498          * ContentChecker.pm: Support for RDF/XML error levels.          * ContentChecker.pm: Support for RDF/XML error levels.

Legend:
Removed from v.1.273  
changed lines
  Added in v.1.354

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24