/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.296 by wakaba, Sat Sep 13 09:02:28 2008 UTC revision 1.326 by wakaba, Sat Sep 20 07:54:47 2008 UTC
# Line 1  Line 1 
1    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
2    
3            * ContentType.pm: 0x0B is no longer a space character (HTML5
4            revision 1738).
5    
6            * HTML.pm.src: U+000B is no longer a space character for the
7            algorithm for extracting an encoding from a Content-Type (HTML5
8            revision 1738).
9    
10    2008-09-20  Wakaba  <wakaba@suika.fam.cx>
11    
12            * ContentChecker.pm ($IsInHTMLInteractiveContent): New.
13    
14    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
15    
16            * LangTag.pm: Add checks for remaining requirements from RFC 4646.
17    
18            * mklangreg.pl: Sort 'Prefix' values by their length, to ease
19            matching.
20    
21    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
22    
23            * LangTag.pm: Warn for private use language subtags.  Error level
24            typos fixed.  Support for Suppress-Script field.
25    
26            * mklangreg.pl: Support for dumping of nested structure.
27    
28    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
29    
30            * LangTag.pm (check_rfc4646_langtag): Check if a tag is in the
31            recommended case as per RFC 4646.
32    
33    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
34    
35            * LangTag.pm (check_rfc4646_langtag): New method.
36    
37    2008-09-18  Wakaba  <wakaba@suika.fam.cx>
38    
39            * mklangreg.pl: New script.
40    
41            * Makefile: Updated for creation of the module for language subtag
42            registry.
43            
44    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
45    
46            * Makefile: WebIDL.html added.
47    
48            * WebIDL.pod: New documentation.
49    
50    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
51    
52            * WebIDL.pm: Checker's error types are redefined.
53    
54    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
55    
56            * WebIDL.pm: Parser's error types are redefined.  Some forward
57            compatible parsing bugs are fixed.  Some unreachable codes are
58            commented out.
59    
60    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
61    
62            * WebIDL.pm: Support for the reminding extended attributes are
63            added.  It does not satisfy the definition that a forward
64            interface declaration has an extended attribute.  It seems that
65            unless explicitly allowed multiple extended attributes with the
66            same name is not allowed, though it is not explicitly mentioned in
67            the spec.
68    
69    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
70    
71            * WebIDL.pm: Unescapes extended attribute names and extended
72            attribute identifiers.  Preserve whether an extended attribute has
73            an argument list of not.  Support for extended attributes:
74            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
75            NameGetter, NameSetter, and Null.
76            (has_argument_list): New attribute.
77            (idl_text): Stringifies argument lists, if any, even if it is
78            empty.
79    
80    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
81    
82            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
83            |{s_kwd}| in DATA_STATE as default.
84    
85    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
86    
87            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
88            by |{s_kwd}| in DATA_STATE.
89    
90    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Shorten keys.
93    
94    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
95    
96            * HTML.pm.src: Remove checking for control character, surrogate
97            pair, or noncharacter code points and non-Unicode code
98            points (they should be handled by Whatpm::Charset::UnicodeChecker).
99            (parse_char_stream): Support for the |$get_wrapper| argument and
100            character stream error handlers.
101    
102    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
103    
104            * ContentChecker.pm: Don't call |loda_ns_module|
105            for null-namespace elements/attributes.
106    
107            * HTML.pm.src: Fact out $disallowed_control_chars
108            as a hash.
109    
110    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
113            and |{next_char}| initializations are moved to initialization
114            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
115            with |parse_char_stream|.
116    
117    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src (parse_char_stream): Make |set_next_char|
120            invoke |manakai_read_until|, not only |read|, where
121            possible, to decrease the number of |read| method calls.
122    
123            * mkhtmlparser.pl: Related changes to the aforementioned
124            modification.
125    
126    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
129            would report character error from now.
130    
131    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
132    
133            * HTML.pm.src: White-space-leaded non-white-space character
134            tokens in "before head insertion mode" was not
135            correctly handled.
136            (set_inner_html): Reimplemented using CharString decodehandle
137            class.  Support for $get_wrapper argument.  Support
138            for |{read_until}| feature.
139    
140    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTML.pm.src: Make a "bare ero" error for unknown
143            entities point the "&" character.
144    
145    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: It turns out that U+FFFD don't have to
148            be added to the list of excluded characters.
149    
150    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
151    
152            * HTML.pm.src ($char_onerror): Have character decoder's |line|
153            and |column| a higher priority than the one set by the
154            tokenizer's input handler.
155            ($self->{read_until}): Exclude U+FFFD (but this might
156            not be necessary, since now we do line/column fixup in
157            the character decode handle).
158    
159    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
160    
161            * HTML.pm.src: Use |{read_until}| where possible.
162    
163    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
166            and |manakai_getc_until| to |manakai_read_until| to
167            reduce the number of string copies.
168    
169    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
170    
171            * HTML.pm.src (parse_char_string): Use newly created
172            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
173            standard feature to |open| a string as a filehandle,
174            since Perl's string filehandle seems not supporting |ungetc|
175            method correctly.
176            (parse_char_stream): Define |{getc_until}| method.
177            (DATA_STATE): Experimental support for |getc_until| feature.
178    
179    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
180    
181            * HTML.pm.src: Check points added to newly added branches.
182    
183    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
184    
185            * HTML.pm.src: Remove |{char}|, which is no longer used.
186            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
187            and replaced by |{prev_state}|.
188    
189            * mkhtmlparser.pl: Remove |{char}| feature.
190            Remove |!!!back-next-input-character;| macro.
191    
192    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
193    
194            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
195            entity related tokenizer states in favor of new states
196            implementing the consume character reference algorithm.
197    
198  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
199    
200          * HTML.pm: "Consume a character reference" algorithm is          * HTML.pm.src: "Consume a character reference" algorithm is
201          now implemented as a tokenizer's state, rather than          now implemented as a tokenizer's state, rather than
202          a method, with minimum changes (more changes will          a method, with minimum changes (more changes will
203          be made, in due course).  "Bogus comment state"'s inner          be made, in due course).  "Bogus comment state"'s inner
# Line 8  Line 205 
205    
206  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
207    
208          * HTML.pm: Make |PUBLIC| and |SYSTEM| keyword tokenizing          * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
209          into their own tokenizer states.          into their own tokenizer states.
210    
211  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>

Legend:
Removed from v.1.296  
changed lines
  Added in v.1.326

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24