Diff of /markup/html/whatpm/Whatpm/ChangeLog

revision 1.285 by wakaba, Sat Aug 30 15:14:32 2008 UTC
revision 1.305 by wakaba, Sun Sep 14 07:19:47 2008 UTC
# Line 1  Line 1 
1    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
2    
3            * HTML.pm.src: Make a "bare ero" error for unknown
4            entities point to the "&" character.
5    
6    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
7    
8            * HTML.pm.src: It turns out that U+FFFD doesn't have to
9            be added to the list of excluded characters.
10    
11    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
12    
13            * HTML.pm.src ($char_onerror): Give the character decoder's |line|
14            and |column| a higher priority than those set by the
15            tokenizer's input handler.
16            ($self->{read_until}): Exclude U+FFFD (but this might
17            not be necessary, since now we do line/column fixup in
18            the character decode handle).
19    
20    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
21    
22            * HTML.pm.src: Use |{read_until}| where possible.
23    
24    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
25    
26            * HTML.pm.src: Change |{getc_until}| to |{read_until}|
27            and |manakai_getc_until| to |manakai_read_until| to
28            reduce the number of string copies.
29    
30    2008-09-14  Wakaba  <wakaba@suika.fam.cx>
31    
32            * HTML.pm.src (parse_char_string): Use the newly created
33            |Whatpm::Charset::DecodeHandle::CharString| instead of Perl's
34            standard feature to |open| a string as a filehandle,
35            since Perl's string filehandle does not seem to support the
36            |ungetc| method correctly.
37            (parse_char_stream): Define |{getc_until}| method.
38            (DATA_STATE): Experimental support for |getc_until| feature.
39    
40    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
41    
42            * HTML.pm.src: Check points added to newly added branches.
43    
44    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
45    
46            * HTML.pm.src: Remove |{char}|, which is no longer used.
47            Replace |{entity_in_attr}| and |{last_attribute_value_state}|
48            with |{prev_state}|.
49    
50            * mkhtmlparser.pl: Remove |{char}| feature.
51            Remove |!!!back-next-input-character;| macro.
52    
53    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
54    
55            * HTML.pm.src: Finally we get rid of all the inner loops.  Remove
56            entity related tokenizer states in favor of new states
57            implementing the consume character reference algorithm.
58    
59    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
60    
61            * HTML.pm.src: The "consume a character reference" algorithm
62            is now implemented as a tokenizer state, rather than
63            a method, with minimal changes (more changes will
64            be made in due course).  The inner loop of the "bogus comment
65            state" is removed.
66    
67    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
68    
69            * HTML.pm.src: Move |PUBLIC| and |SYSTEM| keyword tokenizing
70            into their own tokenizer states.
71    
72    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
73    
74            * HTML.pm.src: |CDATA_SECTION_STATE| (formerly |CDATA_BLOCK_STATE|)
75            is split into three states.
76    
77    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
78    
79            * HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is split into
80            itself and the new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
81            the tokenizer no longer has to push back next input
82            characters in those states.
83    
84    2008-09-13  Wakaba  <wakaba@suika.fam.cx>
85    
86            * HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is split
87            into four states so that the tokenizer no longer has to push
88            back next input characters in that state.
89    
90    2008-09-11  Wakaba  <wakaba@suika.fam.cx>
91    
92            * HTML.pm.src: Methods now accept an additional parameter,
93            $get_wrapper, which can be used to insert a wrapper between the
94            character stream handle and the tokenizer.  (It is currently not
95            supported for |set_inner_html| on |Element|s.)
96    
97    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
98    
99            * HTML.pm.src: Ignore punctuation in charset names.
100    
101    2008-09-10  Wakaba  <wakaba@suika.fam.cx>
102    
103            * ContentChecker.pm: Support for charset-layer error levels.
104    
105            * HTML.pm.src: Don't specify |text| argument for the
106            |chardecode:fallback| error, since it is not the encoding
107            being used alternatively.
108    
109    2008-09-06  Wakaba  <wakaba@suika.fam.cx>
110    
111            * HTML.pm.src: Support for |XSLT-compat| (HTML5 revision 2141).
112    
113    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
114    
115            * CacheManifest.pm: Support for extensibility (HTML5 revision 2051).
116    
117    2008-08-31  Wakaba  <wakaba@suika.fam.cx>
118    
119            * HTML.pm.src: Bug fix and sync with the spec with regard
120            to "after after frameset" insertion mode processing (HTML5
121            revision 1909).  Note that the implementation was wrong
122            per the old spec before the r1909 changes.
123    
124  2008-08-30  Wakaba  <wakaba@suika.fam.cx>
125    
126          * HTMLTable.pm: scope=auto algorithm fix synced with the
