/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.292 by wakaba, Sat Sep 13 04:19:55 2008 UTC revision 1.317 by wakaba, Tue Sep 16 13:07:54 2008 UTC
# Line 1  Line 1 
1    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * WebIDL.pm: Parser's error types are redefined.  Some forward
4            compatible parsing bugs are fixed.  Some unreachable codes are
5            commented out.
6    
7    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
8    
9            * WebIDL.pm: Support for the reminding extended attributes are
10            added.  It does not satisfy the definition that a forward
11            interface declaration has an extended attribute.  It seems that
12            unless explicitly allowed multiple extended attributes with the
13            same name is not allowed, though it is not explicitly mentioned in
14            the spec.
15    
16    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
17    
18            * WebIDL.pm: Unescapes extended attribute names and extended
19            attribute identifiers.  Preserve whether an extended attribute has
20            an argument list of not.  Support for extended attributes:
21            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
22            NameGetter, NameSetter, and Null.
23            (has_argument_list): New attribute.
24            (idl_text): Stringifies argument lists, if any, even if it is
25            empty.
26    
27    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
28    
29            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
30            |{s_kwd}| in DATA_STATE as default.
31    
32    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
33    
34            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
35            by |{s_kwd}| in DATA_STATE.
36    
37    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
38    
39            * HTML.pm.src: Shorten keys.
40    
41    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
42    
43            * HTML.pm.src: Remove checking for control character, surrogate
44            pair, or noncharacter code points and non-Unicode code
45            points (they should be handled by Whatpm::Charset::UnicodeChecker).
46            (parse_char_stream): Support for the |$get_wrapper| argument and
47            character stream error handlers.
48    
49    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
50    
51            * ContentChecker.pm: Don't call |loda_ns_module|
52            for null-namespace elements/attributes.
53    
54            * HTML.pm.src: Fact out $disallowed_control_chars
55            as a hash.
56    
57    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
58    
59            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
60            and |{next_char}| initializations are moved to initialization
61            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
62            with |parse_char_stream|.
63    
64    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
65    
66            * HTML.pm.src (parse_char_stream): Make |set_next_char|
67            invoke |manakai_read_until|, not only |read|, where
68            possible, to decrease the number of |read| method calls.
69    
70            * mkhtmlparser.pl: Related changes to the aforementioned
71            modification.
72    
73    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
74    
75            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
76            would report character error from now.
77    
78    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
79    
80            * HTML.pm.src: White-space-leaded non-white-space character
81            tokens in "before head insertion mode" was not
82            correctly handled.
83            (set_inner_html): Reimplemented using CharString decodehandle
84            class.  Support for $get_wrapper argument.  Support
85            for |{read_until}| feature.
86    
87    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
88    
89            * HTML.pm.src: Make a "bare ero" error for unknown
90            entities point the "&" character.
91    
92    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
93    
94            * HTML.pm.src: It turns out that U+FFFD don't have to
95            be added to the list of excluded characters.
96    
97    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src ($char_onerror): Have character decoder's |line|
100            and |column| a higher priority than the one set by the
101            tokenizer's input handler.
102            ($self->{read_until}): Exclude U+FFFD (but this might
103            not be necessary, since now we do line/column fixup in
104            the character decode handle).
105    
106    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Use |{read_until}| where possible.
109    
110    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
111    
112            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
113            and |manakai_getc_until| to |manakai_read_until| to
114            reduce the number of string copies.
115    
116    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
117    
118            * HTML.pm.src (parse_char_string): Use newly created
119            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
120            standard feature to |open| a string as a filehandle,
121            since Perl's string filehandle seems not supporting |ungetc|
122            method correctly.
123            (parse_char_stream): Define |{getc_until}| method.
124            (DATA_STATE): Experimental support for |getc_until| feature.
125    
126    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Check points added to newly added branches.
129    
130    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: Remove |{char}|, which is no longer used.
133            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
134            and replaced by |{prev_state}|.
135    
136            * mkhtmlparser.pl: Remove |{char}| feature.
137            Remove |!!!back-next-input-character;| macro.
138    
139    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
140    
141            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
142            entity related tokenizer states in favor of new states
143            implementing the consume character reference algorithm.
144    
145    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
146    
147            * HTML.pm.src: "Consume a character reference" algorithm is
148            now implemented as a tokenizer's state, rather than
149            a method, with minimum changes (more changes will
150            be made, in due course).  "Bogus comment state"'s inner
151            loop gets removed.
152    
153    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
154    
155            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
156            into their own tokenizer states.
157    
158    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
159    
160            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
161            is split into three states.
162    
163    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
164    
165            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
166            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
167            no longer does the tokenizer have to push back next input
168            characters in those states.
169    
170  2008-09-13  Wakaba  <wakaba@suika.fam.cx>  2008-09-13  Wakaba  <wakaba@suika.fam.cx>
171    
172          * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken          * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken

Legend:
Removed from v.1.292  
changed lines
  Added in v.1.317

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24