Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.342 by wakaba, Sat Oct 4 09:17:54 2008 UTC
# Line 1  Line 1 
1    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Drop redundant code (HTML5 revision 1731).
4    
5    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
6    
7            * HTML.pm.src: Support for new definition of |param| and |source|
8            start tag parsing (HTML5 revision 1731).
9    
10    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: <p> steps reimplemented (HTML5 revision 1731).
13    
14    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
15    
16            * HTML.pm.src: <li>, <dt>, and <dd> steps reimplemented (HTML5
17            revisions 1731 and 1831).
18    
19    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
20    
21            * HTML.pm.src: Support for new flow (but not phrasing) elements (HTML5
22            revisions 1731 and 1778).  Support for the </sarcasm> end tag (HTML5
23            revision 1731).
24    
25    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
26    
27            * HTML.pm.src: Support for |command| and |eventsource| elements (HTML5
28            revision 1731).  End tags of |option| and |optgroup| elements are
29            now optional (HTML5 revision 1731).
30    
31    2008-10-04  Wakaba  <wakaba@suika.fam.cx>
32    
33            * HTML.pm.src: New "special" elements added to the list (HTML5
34            revision 1778).  "strile" -> "strike".
35    
36    2008-10-02  Wakaba  <wakaba@suika.fam.cx>
37    
38            * ContentType.pm (get_sniffed_type): Support for the "better"
39            content sniffing (HTML5 revision 1927).  Fix a case where the
40            official type was not returned when the method was invoked in
41            list context.
42    
43    2008-09-22  Wakaba  <wakaba@suika.fam.cx>
44    
45            * HTML.pm.src: Character references for non-space C0 characters
46            (including U+000B VT), the DEL character, and noncharacter code
47            points are now converted to the U+FFFD character (cf. HTML5 revision 2138).
48    
49    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: |form=""| check support added.
52    
53    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
54    
55            * ContentChecker.pm: |contextmenu| validity is now checked using
56            the |id| and |id_type| properties, and the |menu| property is removed.
57    
58    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
59    
60            * ContentChecker.pm: Prepare for |form| |name| attribute's
61            duplication checking.
62    
63    2008-09-21  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src (parse_byte_stream): Support (or non-support) for
66            unsupported charset="" parameter value (HTML5 revision 2131).
67    
68    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: Remaining places where U+000B was allowed as a space
71            character are fixed (cf. HTML5 revision 1738).
72    
73            * ContentChecker.pm, HTMLTable.pm: U+000B is no longer part of
74            space characters (HTML5 revision 1738).
75    
76    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: The "anything else" case for the "after after body"
79            insertion mode was not updated to switch to the "in body"
80            insertion mode.  U+000B is no longer a space character for the
81            purpose of tree construction phase (HTML5 revision 1738).
82    
83    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
84    
85            * HTML.pm.src: U+000B is no longer a space character (HTML5
86            revision 1738).
87    
88    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
89    
90            * ContentType.pm: 0x0B is no longer a space character (HTML5
91            revision 1738).
92    
93            * HTML.pm.src: U+000B is no longer a space character for the
94            algorithm for extracting an encoding from a Content-Type (HTML5
95            revision 1738).
96    
97    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
98    
99            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
100    
101    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
102    
103            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
104    
105            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
106            matching.
107    
108    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
109    
110            * LangTag.pm: Warn for private use language subtags.  Error level
111            typos fixed.  Support for Suppress-Script field.
112    
113            * mklangreg.pl: Support for dumping of nested structure.
114    
115    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
116    
117            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
118            recommended case as per RFC 4646.
119    
120    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
121    
122            * LangTag.pm (check_rfc4646_langtag): New method.
123    
124    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
125    
126            * mklangreg.pl: New script.
127    
128            * Makefile: Updated for creation of the module for language subtag
129            registry.
130            
131    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
132    
133            * Makefile: WebIDL.html added.
134    
135            * WebIDL.pod: New documentation.
136    
137    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
138    
139            * WebIDL.pm: Checker's error types are redefined.
140    
141    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
142    
143            * WebIDL.pm: Parser's error types are redefined.  Some forward
144            compatible parsing bugs are fixed.  Some unreachable code is
145            commented out.
146    
147    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
148    
149            * WebIDL.pm: Support for the remaining extended attributes is
150            added.  It does not satisfy the definition that a forward
151            interface declaration has an extended attribute.  It seems that,
152            unless explicitly allowed, multiple extended attributes with the
153            same name are not allowed, though this is not explicitly mentioned in
154            the spec.
155    
156    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
157    
158            * WebIDL.pm: Unescapes extended attribute names and extended
159            attribute identifiers.  Preserve whether an extended attribute has
160            an argument list or not.  Support for extended attributes:
161            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
162            NameGetter, NameSetter, and Null.
163            (has_argument_list): New attribute.
164            (idl_text): Stringifies argument lists, if any, even if they are
165            empty.
166    
167    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
168    
169            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
170            |{s_kwd}| in DATA_STATE as default.
171    
172    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
173    
174            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
175            by |{s_kwd}| in DATA_STATE.
176    
177    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
178    
179            * HTML.pm.src: Shorten keys.
180    
181    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
182    
183            * HTML.pm.src: Remove checking for control character, surrogate
184            pair, or noncharacter code points and non-Unicode code
185            points (they should be handled by Whatpm::Charset::UnicodeChecker).
186            (parse_char_stream): Support for the |$get_wrapper| argument and
187            character stream error handlers.
188    
189    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
190    
191            * ContentChecker.pm: Don't call |load_ns_module|
192            for null-namespace elements/attributes.
193    
194            * HTML.pm.src: Factor out $disallowed_control_chars
195            as a hash.
196    
197    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
198    
199            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
200            and |{next_char}| initializations are moved to initialization
201            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
202            with |parse_char_stream|.
203    
204    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
205    
206            * HTML.pm.src (parse_char_stream): Make |set_next_char|
207            invoke |manakai_read_until|, not only |read|, where
208            possible, to decrease the number of |read| method calls.
209    
210            * mkhtmlparser.pl: Related changes to the aforementioned
211            modification.
212    
213    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
214    
215            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
216            now reports character errors.
217    
218    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
219    
220            * HTML.pm.src: Non-white-space character tokens with leading
221            white space in the "before head" insertion mode were not
222            correctly handled.
223            (set_inner_html): Reimplemented using CharString decodehandle
224            class.  Support for $get_wrapper argument.  Support
225            for |{read_until}| feature.
226    
227    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
228    
229            * HTML.pm.src: Make a "bare ero" error for unknown
230            entities point to the "&" character.
231    
232    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
233    
234            * HTML.pm.src: It turns out that U+FFFD doesn't have to
235            be added to the list of excluded characters.
236    
237    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
238    
239            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
240            and |column| higher priority than those set by the
241            tokenizer's input handler.
242            ($self->{read_until}): Exclude U+FFFD (but this might
243            not be necessary, since now we do line/column fixup in
244            the character decode handle).
245    
246    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
247    
248            * HTML.pm.src: Use |{read_until}| where possible.
249    
250    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
251    
252            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
253            and |manakai_getc_until| to |manakai_read_until| to
254            reduce the number of string copies.
255    
256    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
257    
258            * HTML.pm.src (parse_char_string): Use newly created
259            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
260            standard feature to |open| a string as a filehandle,
261            since Perl's string filehandle does not seem to support the
262            |ungetc| method correctly.
263            (parse_char_stream): Define |{getc_until}| method.
264            (DATA_STATE): Experimental support for |getc_until| feature.
265    
266    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
267    
268            * HTML.pm.src: Check points added to newly added branches.
269    
270    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
271    
272            * HTML.pm.src: Remove |{char}|, which is no longer used.
273            Remove |{entity_in_attr}| and |{last_attribute_value_state}|,
274            replacing them with |{prev_state}|.
275    
276            * mkhtmlparser.pl: Remove |{char}| feature.
277            Remove |!!!back-next-input-character;| macro.
278    
279    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
280    
281            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
282            entity related tokenizer states in favor of new states
283            implementing the consume character reference algorithm.
284    
285  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
286    
287          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
288          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
289          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
290          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 292 
292    
293  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
294    
295          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
296          into their own tokenizer states.          into their own tokenizer states.
297    
298  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
