/[suikacvs]/markup/html/whatpm/Whatpm/ChangeLog
Suika

Diff of /markup/html/whatpm/Whatpm/ChangeLog

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.281 by wakaba, Sat Aug 30 12:33:36 2008 UTC revision 1.306 by wakaba, Sun Sep 14 09:05:54 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: White-space-leaded non-white-space character
4            tokens in "before head insertion mode" was not
5            correctly handled.
6            (set_inner_html): Reimplemented using CharString decodehandle
7            class.  Support for $get_wrapper argument.  Support
8            for |{read_until}| feature.
9    
10    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
11    
12            * HTML.pm.src: Make a "bare ero" error for unknown
13            entities point the "&" character.
14    
15    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
16    
17            * HTML.pm.src: It turns out that U+FFFD don't have to
18            be added to the list of excluded characters.
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src ($char_onerror): Have character decoder's |line|
23            and |column| a higher priority than the one set by the
24            tokenizer's input handler.
25            ($self->{read_until}): Exclude U+FFFD (but this might
26            not be necessary, since now we do line/column fixup in
27            the character decode handle).
28    
29    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
30    
31            * HTML.pm.src: Use |{read_until}| where possible.
32    
33    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
34    
35            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
36            and |manakai_getc_until| to |manakai_read_until| to
37            reduce the number of string copies.
38    
39    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
40    
41            * HTML.pm.src (parse_char_string): Use newly created
42            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
43            standard feature to |open| a string as a filehandle,
44            since Perl's string filehandle seems not supporting |ungetc|
45            method correctly.
46            (parse_char_stream): Define |{getc_until}| method.
47            (DATA_STATE): Experimental support for |getc_until| feature.
48    
49    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
50    
51            * HTML.pm.src: Check points added to newly added branches.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Remove |{char}|, which is no longer used.
56            Remove |{entity_in_attr}| and |{last_attribute_value_state}|
57            and replaced by |{prev_state}|.
58    
59            * mkhtmlparser.pl: Remove |{char}| feature.
60            Remove |!!!back-next-input-character;| macro.
61    
62    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
63    
64            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
65            entity related tokenizer states in favor of new states
66            implementing the consume character reference algorithm.
67    
68    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
69    
70            * HTML.pm.src: "Consume a character reference" algorithm is
71            now implemented as a tokenizer's state, rather than
72            a method, with minimum changes (more changes will
73            be made, in due course).  "Bogus comment state"'s inner
74            loop gets removed.
75    
76    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
77    
78            * HTML.pm.src: Make |PUBLIC| and |SYSTEM| keyword tokenizing
79            into their own tokenizer states.
80    
81    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
82    
83            * HTML.pm.src: |CDATA_SECTION_STATE| (formally |CDATA_BLOCK_STATE|
84            is split into three states.
85    
86    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
87    
88            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is broken into
89            itself and new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
90            no longer does the tokenizer have to push back next input
91            characters in those states.
92    
93    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
94    
95            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| broken
96            into four states so that no longer does the tokenizer have to push
97            back next input characters in that state.
98    
99    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
100    
101            * HTML.pm.src: Methods now accept additional parameter, $get_wrapper,
102            which can be used to insert some wrapper between the character
103            stream handle and the tokenizer.  (It is currently not supported
104            for |set_inner_html| for |Element|s).
105    
106    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
107    
108            * HTML.pm.src: Ignore punctuations in charset names.
109    
110    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
111    
112            * ContentChecker.pm: Support for charset-layer error levels.
113    
114            * HTML.pm.src: Don't specify |text| argument for the
115            |chardecode:fallback| error, since it is not the encoding
116            being used alternatively.
117    
118    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
119    
120            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
121    
122    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
123    
124            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
125    
126    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
127    
128            * HTML.pm.src: Bug fix and sync with the spec with regard
129            to after after frameset insertion mode processing (HTML5
130            revision 1909).  Note that the implementation was wrong
131            per the old spec before the r1909 changes.
132    
133    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
134    
135            * HTMLTable.pm: scope=auto algorithm fix synced with the
136            spec (HTML5 revision 2093).
137            ($process_row): Algorithm step numbers synced with the
138            spec (HTML5 revision 2092).
139    
140    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
141    
142            * HTMLTable.pm: Zs is not what we want; we want White_Space! (HTML5
143            revision 2094).
144    
145    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
146    
147            * ContentType.pm: Support for image/svg+xml (HTML5 revision 2096).
148    
149    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
150    
151            * HTML.pm.src: '"' and "'" at the end of attribute
152            name (after another attribute) now raise parse error (HTML5
153            revision 2123).  Empty unquoted attribute values are no
154            longer allowed (HTML5 revision 2122).
155    
156    2008-08-30  Wakaba  <wakaba@suika.fam.cx>
157    
158            * mkhtmlparser.pl: Support for MathML |definitionURL| attribute (HTML5
159            revision 2130).
160    
161  2008-08-30  Wakaba  <wakaba@suika.fam.cx>  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
162    
163          * ContentChecker.pm: |xml:lang| attribute value must be same          * ContentChecker.pm: |xml:lang| attribute value must be same

Legend:
Removed from v.1.281  
changed lines
  Added in v.1.306

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24