2008-09-13 Wakaba <wakaba@suika.fam.cx>

	* HTML.pm: The "Consume a character reference" algorithm is
	now implemented as a tokenizer state rather than a method,
	with minimal changes (more changes will be made in due
	course).  The inner loop of the "bogus comment state" is
	removed.

2008-09-13 Wakaba <wakaba@suika.fam.cx>

	* HTML.pm: Move |PUBLIC| and |SYSTEM| keyword tokenizing
	into their own tokenizer states.

2008-09-13 Wakaba <wakaba@suika.fam.cx>

	* HTML.pm.src: |CDATA_SECTION_STATE| (formerly
	|CDATA_BLOCK_STATE|) is split into three states.

2008-09-13 Wakaba <wakaba@suika.fam.cx>

	* HTML.pm.src: |CLOSE_TAG_OPEN_STATE| is split into
	itself and the new |CDATA_PCDATA_CLOSE_TAG_STATE| so that
	the tokenizer no longer has to push back next input
	characters in those states.

2008-09-13 Wakaba <wakaba@suika.fam.cx>

	* HTML.pm.src: |MARKUP_DECLARATION_OPEN_STATE| is split
	into four states so that the tokenizer no longer has to push
	back next input characters in that state.