/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.290 by wakaba, Wed Sep 10 10:46:50 2008 UTC revision 1.315 by wakaba, Tue Sep 16 04:20:52 2008 UTC
# Line 1  Line 1 
1    2008-09-16  Wakaba  <wakaba@suika.fam.cx>
2    
3            * WebIDL.pm: Unescapes extended attribute names and extended
4            attribute identifiers.  Preserve whether an extended attribute has
5            an argument list of not.  Support for extended attributes:
6            Constructor, ExceptionConsts, IndexGetter, IndexSetter,
7            NameGetter, NameSetter, and Null.
8            (has_argument_list): New attribute.
9            (idl_text): Stringifies argument lists, if any, even if it is
10            empty.
11    
12    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
13    
14            * HTML.pm.src: New state |PCDATA_STATE|.  Use an empty string for
15            |{s_kwd}| in DATA_STATE as default.
16    
17    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
18    
19            * HTML.pm.src, mkhtmlparser.pl: Replace |{prev_char}|
20            by |{s_kwd}| in DATA_STATE.
21    
22    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
23    
24            * HTML.pm.src: Shorten keys.
25    
26    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
27    
28            * HTML.pm.src: Remove checking for control character, surrogate
29            pair, or noncharacter code points and non-Unicode code
30            points (they should be handled by Whatpm::Charset::UnicodeChecker).
31            (parse_char_stream): Support for the |$get_wrapper| argument and
32            character stream error handlers.
33    
34    2008-09-15  Wakaba  <wakaba@suika.fam.cx>
35    
36            * ContentChecker.pm: Don't call |loda_ns_module|
37            for null-namespace elements/attributes.
38    
39            * HTML.pm.src: Fact out $disallowed_control_chars
40            as a hash.
41    
42    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
43    
44            * HTML.pm.src: Regexp typo fixed.  |{prev_char}|
45            and |{next_char}| initializations are moved to initialization
46            method.  |{read_until}| now supports buffering.  Sync |set_inner_html|
47            with |parse_char_stream|.
48    
49    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src (parse_char_stream): Make |set_next_char|
52            invoke |manakai_read_until|, not only |read|, where
53            possible, to decrease the number of |read| method calls.
54    
55            * mkhtmlparser.pl: Related changes to the aforementioned
56            modification.
57    
58    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
59    
60            * HTML.pm.src: Use |read| instead of |getc|.  |set_inner_html|
61            would report character error from now.
62    
63    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
64    
65            * HTML.pm.src: White-space-leaded non-white-space character
66            tokens in "before head insertion mode" was not
67            correctly handled.
68            (set_inner_html): Reimplemented using CharString decodehandle
69            class.  Support for $get_wrapper argument.  Support
70            for |{read_until}| feature.
71    
72    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: Make a "bare ero" error for unknown
75            entities point the "&" character.
76    
77    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: It turns out that U+FFFD don't have to
80            be added to the list of excluded characters.
81    
82    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
83    
84            * HTML.pm.src ($char_onerror): Have character decoder's |line|
85            and |column| a higher priority than the one set by the
86            tokenizer's input handler.
87            ($self->{read_until}): Exclude U+FFFD (but this might
88            not be necessary, since now we do line/column fixup in
89            the character decode handle).
90    
91    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
92    
93            * HTML.pm.src: Use |{read_until}| where possible.
94    
95    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
96    
97            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
98            and |manakai_getc_until| to |manakai_read_until| to
99            reduce the number of string copies.
100    
101    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
102    
103            * HTML.pm.src (parse_char_string): Use newly created
104            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
105            standard feature to |open| a string as a filehandle,
106            since Perl's string filehandle seems not supporting |ungetc|
107            method correctly.
108            (parse_char_stream): Define |{getc_until}| method.
109            (DATA_STATE): Experimental support for |getc_until| feature.
110    
111    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
112    
113            * HTML.pm.src: Check points added to newly added branches.
114    
115    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
116    
117            * HTML.pm.src: Remove |{char}|, which is no longer used.
118            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
119            and replaced by |{prev_state}|.
120    
121            * mkhtmlparser.pl: Remove |{char}| feature.
122            Remove |!!!back-next-input-character;| macro.
123    
124    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
125    
126            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
127            entity related tokenizer states in favor of new states
128            implementing the consume character reference algorithm.
129    
130    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
131    
132            * HTML.pm.src: "Consume a character reference" algorithm is
133            now implemented as a tokenizer's state, rather than
134            a method, with minimum changes (more changes will
135            be made, in due course).  "Bogus comment state"'s inner
136            loop gets removed.
137    
138    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
139    
140            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
141            into their own tokenizer states.
142    
143    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
144    
145            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
146            is split into three states.
147    
148    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
149    
150            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
151            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
152            no longer does the tokenizer have to push back next input
153            characters in those states.
154    
155    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
156    
157            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
158            into four states so that no longer does the tokenizer have to push
159            back next input characters in that state.
160    
161    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
162    
163            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
164            which can be used to insert some wrapper between the character
165            stream handle and the tokenizer.  (It is currently not supported
166            for |set_inner_html| for |Element|s).
167    
168  2008-09-10  Wakaba  <wakaba@suika.fam.cx>  2008-09-10  Wakaba  <wakaba@suika.fam.cx>
169    
170          * HTML.pm.src: Ignore punctuations in charset names.          * HTML.pm.src: Ignore punctuations in charset names.

Legend:
Removed from v.1.290  
changed lines
  Added in v.1.315

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24