Contents of /markup/html/whatpm/Whatpm/HTML/ChangeLog

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for EntityValue.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Dump text content of Entity nodes.

        * Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
        'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
        states such that <!ENTITY> and <!NOTATION> can be tokenized by
        those states as well.
        (BOGUS_MD_STATE): A new state; used for bogus markup declarations,
        in favor of BOGUS_COMMENT_STATE.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
        document, is now fully implemented.

        * Dumper.pm (dumptree): Output allowed tokens and default value
        always.

2008-10-17  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
        GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
        are added.  New intertion modes for markup declarations are added.

2008-10-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
        New states DOCTYPE_TAG_STATE and
        BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added.  (Bogus
        string after the internal subset, which was handled by the state
        BOGUS_DOCTYPE_STATE, are now handled by the new state.)  Support
        for comments, bogus comments, and processing instructions in the
        internal subset.  If there is the internal subset, then emit the
        doctype token before the internal subset (with its
        $token->{has_internal_subset} flag set) and an
        END_OF_DOCTYPE_TOKEN after the internal subset.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
        renamed as $self->{kwd} to avoid confliction.  Don't raise
        case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
        Support for internal subsets (internal subset itself only; no
        declaration in them is supported yet).  Raise a parse error for
        non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode.  Raise a
        parse error if no system identifier is specified for a DOCTYPE
        declaration with a public identifier.  Don't close the DOCTYPE
        declaration by a ">" character in the system declaration in XML
        mode.
        
2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set index attribute to each attribute token,
        for ignoring namespaced duplicate attribute at the XML namespace
        parser layer.  Raise a parse error if the attribute value is
        omitted, in XML mode.  Raise a parse error if the attribute value
        is not quoted, in XML mode.  Raise a parse error if "<" character
        is found in a quoted attribute value, in XML mode.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for end
        tags.  Support for the short end tag syntax of XML5.  Raise a
        parse erorr for a lowercase <!doctype> in XML.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for start
        tags.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for XML processing instructions.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
        as such, for the support of XML parse error.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Parse error if CDATA section is not closed or
        is placed outside of the root element.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
        CDATA section end.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for case-insensitive XML attribute
        names.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Typo fixed.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: New module.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
        support in XML.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make *_TOKEN (token type constants)
        exportable.  New token types, PI_TOKEN for XML and ABORT_TOKEN for
        document.write() or incremental parsing, are added for future
        extensions.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New file.

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Don't escape |"| in
        content (HTML5 revision 1592).

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Append "\n" after the start
        tag of a |listing| element (HTML5 revision 1675).

2008-03-02  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Typo fixed.

2008-03-01  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
        1277).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pod: New file.

        * Makefile: New file.

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm: New module (split from ../HTML.pm.src).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * ChangeLog: New file.
        

1	wakaba	1.26	2008-10-19 Wakaba <wakaba@suika.fam.cx>
2
3	wakaba	1.28	* Tokenizer.pm.src: Support for EntityValue.
4
5			2008-10-19 Wakaba <wakaba@suika.fam.cx>
6
7	wakaba	1.27	* Dumper.pm: Dump text content of Entity nodes.
8
9			* Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.
10
11			2008-10-19 Wakaba <wakaba@suika.fam.cx>
12
13	wakaba	1.26	* Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
14			'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.
15
16	wakaba	1.24	2008-10-18 Wakaba <wakaba@suika.fam.cx>
17
18	wakaba	1.25	* Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
19			states such that <!ENTITY> and <!NOTATION> can be tokenized by
20			those states as well.
21			(BOGUS_MD_STATE): A new state; used for bogus markup declarations,
22			in favor of BOGUS_COMMENT_STATE.
23
24			2008-10-18 Wakaba <wakaba@suika.fam.cx>
25
26	wakaba	1.24	* Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
27			document, is now fully implemented.
28
29			* Dumper.pm (dumptree): Output allowed tokens and default value
30			always.
31
32	wakaba	1.23	2008-10-17 Wakaba <wakaba@suika.fam.cx>
33
34			* Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
35			GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
36			are added. New intertion modes for markup declarations are added.
37
38	wakaba	1.22	2008-10-16 Wakaba <wakaba@suika.fam.cx>
39
40			* Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
41			New states DOCTYPE_TAG_STATE and
42			BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added. (Bogus
43			string after the internal subset, which was handled by the state
44			BOGUS_DOCTYPE_STATE, are now handled by the new state.) Support
45			for comments, bogus comments, and processing instructions in the
46			internal subset. If there is the internal subset, then emit the
47			doctype token before the internal subset (with its
48			$token->{has_internal_subset} flag set) and an
49			END_OF_DOCTYPE_TOKEN after the internal subset.
50
51	wakaba	1.16	2008-10-15 Wakaba <wakaba@suika.fam.cx>
52
53	wakaba	1.21	* Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
54			renamed as $self->{kwd} to avoid confliction. Don't raise
55			case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
56			Support for internal subsets (internal subset itself only; no
57			declaration in them is supported yet). Raise a parse error for
58			non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode. Raise a
59			parse error if no system identifier is specified for a DOCTYPE
60			declaration with a public identifier. Don't close the DOCTYPE
61			declaration by a ">" character in the system declaration in XML
62			mode.
63
64			2008-10-15 Wakaba <wakaba@suika.fam.cx>
65
66	wakaba	1.20	* Tokenizer.pm.src: Set index attribute to each attribute token,
67			for ignoring namespaced duplicate attribute at the XML namespace
68			parser layer. Raise a parse error if the attribute value is
69			omitted, in XML mode. Raise a parse error if the attribute value
70			is not quoted, in XML mode. Raise a parse error if "<" character
71			is found in a quoted attribute value, in XML mode.
72
73			2008-10-15 Wakaba <wakaba@suika.fam.cx>
74
75	wakaba	1.19	* Tokenizer.pm.src: XML tag name start character support for end
76			tags. Support for the short end tag syntax of XML5. Raise a
77			parse erorr for a lowercase <!doctype> in XML.
78
79			2008-10-15 Wakaba <wakaba@suika.fam.cx>
80
81			* Tokenizer.pm.src: XML tag name start character support for start
82	wakaba	1.18	tags.
83
84			2008-10-15 Wakaba <wakaba@suika.fam.cx>
85
86	wakaba	1.17	* Tokenizer.pm.src: Support for XML processing instructions.
87
88			2008-10-15 Wakaba <wakaba@suika.fam.cx>
89
90	wakaba	1.16	* Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
91			as such, for the support of XML parse error.
92
93	wakaba	1.8	2008-10-14 Wakaba <wakaba@suika.fam.cx>
94
95	wakaba	1.15	* Tokenizer.pm.src: Parse error if CDATA section is not closed or
96			is placed outside of the root element.
97
98			2008-10-14 Wakaba <wakaba@suika.fam.cx>
99
100	wakaba	1.14	* Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
101			CDATA section end.
102
103			2008-10-14 Wakaba <wakaba@suika.fam.cx>
104
105	wakaba	1.13	* Tokenizer.pm.src: Support for case-insensitive XML attribute
106			names.
107
108			2008-10-14 Wakaba <wakaba@suika.fam.cx>
109
110	wakaba	1.12	* Dumper.pm: Typo fixed.
111
112			2008-10-14 Wakaba <wakaba@suika.fam.cx>
113
114	wakaba	1.11	* Dumper.pm: New module.
115
116			2008-10-14 Wakaba <wakaba@suika.fam.cx>
117
118	wakaba	1.10	* Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
119			support in XML.
120
121			2008-10-14 Wakaba <wakaba@suika.fam.cx>
122
123	wakaba	1.9	* Tokenizer.pm.src: Make *_TOKEN (token type constants)
124			exportable. New token types, PI_TOKEN for XML and ABORT_TOKEN for
125			document.write() or incremental parsing, are added for future
126			extensions.
127
128			2008-10-14 Wakaba <wakaba@suika.fam.cx>
129
130	wakaba	1.8	* Tokenizer.pm.src: New file.
131
132	wakaba	1.5	2008-05-24 Wakaba <wakaba@suika.fam.cx>
133
134	wakaba	1.7	* Serializer.pm (get_inner_html): Don't escape \|"\| in
135			content (HTML5 revision 1592).
136
137			2008-05-24 Wakaba <wakaba@suika.fam.cx>
138
139	wakaba	1.5	* Serializer.pm (get_inner_html): Append "\n" after the start
140	wakaba	1.6	tag of a \|listing\| element (HTML5 revision 1675).
141	wakaba	1.5
142	wakaba	1.4	2008-03-02 Wakaba <wakaba@suika.fam.cx>
143
144			* Serializer.pm (get_inner_html): Typo fixed.
145
146	wakaba	1.3	2008-03-01 Wakaba <wakaba@suika.fam.cx>
147
148			* Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
149			1277).
150
151	wakaba	1.2	2007-11-11 Wakaba <wakaba@suika.fam.cx>
152
153			* Serializer.pod: New file.
154
155			* Makefile: New file.
156
157			2007-11-11 Wakaba <wakaba@suika.fam.cx>
158
159			* Serializer.pm: New module (split from ../HTML.pm.src).
160
161			2007-11-11 Wakaba <wakaba@suika.fam.cx>
162
163			* ChangeLog: New file.
164
165