Contents of /markup/html/whatpm/Whatpm/HTML/ChangeLog

2009-09-06  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Added |keygen| to the list of
        void elements (HTML5 revision 2960).

2009-09-05  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Changed to keep non-normal character
        references as is (HTML5 revision 3374).

2009-09-05  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Discard unclosed tags (HTML5 revision 2990).

2009-09-05  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Implemented the "comment end
        space state" (HTML5 revision 3195).

2009-09-05  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Implemented the "comment end
        bang state" (HTML5 revision 3191).

2009-08-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Any "<" character in attribute names become
        parse error (HTML5 revision 3354).

2009-08-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Lowercase-fold doctype names (HTML5 revision
        2501, cf. HTML5 revision 3571).

2009-07-05  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Reduced the number of parse errors on broken
        DOCTYPE (HTML5 revision 3121).

2009-07-03  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Reduced a parse error (HTML5 revision 3194).

2009-07-03  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: "<" in unquoted attribute values is now
        treated as parse error (HTML5 revision 3206).

2008-11-07  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm (dumptree): Support for namespace abbreviation for
        SWML namespaces.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Normalize white space characters in attribute
        value literals in XML documents.  Don't apply character reference
        mapping table for non-NULL non-surrogate code points.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set the "stop_processing" flag true when a
        parameter entity occurs in a standalone="no" document.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Column number counting fixed.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for '&' that does not
        introduce a reference in XML.  Support for non-ASCII entity
        reference names.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
        Remove the limitation of entity name length.  Enable replacement
        of text-only general entities.  Raise a parse error for an
        unparsed entity reference.  Raise a parse error for a general
        entity reference to an undefined entity.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for <!ELEMENT>.
        (AFTER_NOTATION_NAME_STATE): Renamed as |AFTER_MD_DEF_STATE| (i.e.
        after markup declaration definition state).

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for EntityValue.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Dump text content of Entity nodes.

        * Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
        'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
        states such that <!ENTITY> and <!NOTATION> can be tokenized by
        those states as well.
        (BOGUS_MD_STATE): A new state; used for bogus markup declarations,
        in favor of BOGUS_COMMENT_STATE.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
        document, is now fully implemented.

        * Dumper.pm (dumptree): Output allowed tokens and default value
        always.

2008-10-17  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
        GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
        are added.  New intertion modes for markup declarations are added.

2008-10-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
        New states DOCTYPE_TAG_STATE and
        BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added.  (Bogus
        string after the internal subset, which was handled by the state
        BOGUS_DOCTYPE_STATE, are now handled by the new state.)  Support
        for comments, bogus comments, and processing instructions in the
        internal subset.  If there is the internal subset, then emit the
        doctype token before the internal subset (with its
        $token->{has_internal_subset} flag set) and an
        END_OF_DOCTYPE_TOKEN after the internal subset.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
        renamed as $self->{kwd} to avoid confliction.  Don't raise
        case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
        Support for internal subsets (internal subset itself only; no
        declaration in them is supported yet).  Raise a parse error for
        non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode.  Raise a
        parse error if no system identifier is specified for a DOCTYPE
        declaration with a public identifier.  Don't close the DOCTYPE
        declaration by a ">" character in the system declaration in XML
        mode.
        
2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set index attribute to each attribute token,
        for ignoring namespaced duplicate attribute at the XML namespace
        parser layer.  Raise a parse error if the attribute value is
        omitted, in XML mode.  Raise a parse error if the attribute value
        is not quoted, in XML mode.  Raise a parse error if "<" character
        is found in a quoted attribute value, in XML mode.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for end
        tags.  Support for the short end tag syntax of XML5.  Raise a
        parse erorr for a lowercase <!doctype> in XML.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for start
        tags.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for XML processing instructions.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
        as such, for the support of XML parse error.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Parse error if CDATA section is not closed or
        is placed outside of the root element.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
        CDATA section end.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for case-insensitive XML attribute
        names.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Typo fixed.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: New module.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
        support in XML.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make *_TOKEN (token type constants)
        exportable.  New token types, PI_TOKEN for XML and ABORT_TOKEN for
        document.write() or incremental parsing, are added for future
        extensions.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New file.

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Don't escape |"| in
        content (HTML5 revision 1592).

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Append "\n" after the start
        tag of a |listing| element (HTML5 revision 1675).

2008-03-02  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Typo fixed.

2008-03-01  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
        1277).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pod: New file.

        * Makefile: New file.

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm: New module (split from ../HTML.pm.src).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * ChangeLog: New file.
        

1	2009-09-06 Wakaba <wakaba@suika.fam.cx>
2
3	* Serializer.pm (get_inner_html): Added \|keygen\| to the list of
4	void elements (HTML5 revision 2960).
5
6	2009-09-05 Wakaba <wakaba@suika.fam.cx>
7
8	* Tokenizer.pm.src: Changed to keep non-normal character
9	references as is (HTML5 revision 3374).
10
11	2009-09-05 Wakaba <wakaba@suika.fam.cx>
12
13	* Tokenizer.pm.src: Discard unclosed tags (HTML5 revision 2990).
14
15	2009-09-05 Wakaba <wakaba@suika.fam.cx>
16
17	* Tokenizer.pm.src (_get_next_token): Implemented the "comment end
18	space state" (HTML5 revision 3195).
19
20	2009-09-05 Wakaba <wakaba@suika.fam.cx>
21
22	* Tokenizer.pm.src (_get_next_token): Implemented the "comment end
23	bang state" (HTML5 revision 3191).
24
25	2009-08-16 Wakaba <wakaba@suika.fam.cx>
26
27	* Tokenizer.pm.src: Any "<" character in attribute names become
28	parse error (HTML5 revision 3354).
29
30	2009-08-16 Wakaba <wakaba@suika.fam.cx>
31
32	* Tokenizer.pm.src: Lowercase-fold doctype names (HTML5 revision
33	2501, cf. HTML5 revision 3571).
34
35	2009-07-05 Wakaba <wakaba@suika.fam.cx>
36
37	* Tokenizer.pm.src: Reduced the number of parse errors on broken
38	DOCTYPE (HTML5 revision 3121).
39
40	2009-07-03 Wakaba <wakaba@suika.fam.cx>
41
42	* Tokenizer.pm.src: Reduced a parse error (HTML5 revision 3194).
43
44	2009-07-03 Wakaba <wakaba@suika.fam.cx>
45
46	* Tokenizer.pm.src: "<" in unquoted attribute values is now
47	treated as parse error (HTML5 revision 3206).
48
49	2008-11-07 Wakaba <wakaba@suika.fam.cx>
50
51	* Dumper.pm (dumptree): Support for namespace abbreviation for
52	SWML namespaces.
53
54	2008-10-19 Wakaba <wakaba@suika.fam.cx>
55
56	* Tokenizer.pm.src: Normalize white space characters in attribute
57	value literals in XML documents. Don't apply character reference
58	mapping table for non-NULL non-surrogate code points.
59
60	2008-10-19 Wakaba <wakaba@suika.fam.cx>
61
62	* Tokenizer.pm.src: Set the "stop_processing" flag true when a
63	parameter entity occurs in a standalone="no" document.
64
65	2008-10-19 Wakaba <wakaba@suika.fam.cx>
66
67	* Tokenizer.pm.src: Column number counting fixed.
68
69	2008-10-19 Wakaba <wakaba@suika.fam.cx>
70
71	* Tokenizer.pm.src: Raise a parse error for '&' that does not
72	introduce a reference in XML. Support for non-ASCII entity
73	reference names.
74
75	2008-10-19 Wakaba <wakaba@suika.fam.cx>
76
77	* Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
78	Remove the limitation of entity name length. Enable replacement
79	of text-only general entities. Raise a parse error for an
80	unparsed entity reference. Raise a parse error for a general
81	entity reference to an undefined entity.
82
83	2008-10-19 Wakaba <wakaba@suika.fam.cx>
84
85	* Tokenizer.pm.src: Support for <!ELEMENT>.
86	(AFTER_NOTATION_NAME_STATE): Renamed as \|AFTER_MD_DEF_STATE\| (i.e.
87	after markup declaration definition state).
88
89	2008-10-19 Wakaba <wakaba@suika.fam.cx>
90
91	* Tokenizer.pm.src: Support for EntityValue.
92
93	2008-10-19 Wakaba <wakaba@suika.fam.cx>
94
95	* Dumper.pm: Dump text content of Entity nodes.
96
97	* Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.
98
99	2008-10-19 Wakaba <wakaba@suika.fam.cx>
100
101	* Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
102	'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.
103
104	2008-10-18 Wakaba <wakaba@suika.fam.cx>
105
106	* Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
107	states such that <!ENTITY> and <!NOTATION> can be tokenized by
108	those states as well.
109	(BOGUS_MD_STATE): A new state; used for bogus markup declarations,
110	in favor of BOGUS_COMMENT_STATE.
111
112	2008-10-18 Wakaba <wakaba@suika.fam.cx>
113
114	* Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
115	document, is now fully implemented.
116
117	* Dumper.pm (dumptree): Output allowed tokens and default value
118	always.
119
120	2008-10-17 Wakaba <wakaba@suika.fam.cx>
121
122	* Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
123	GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
124	are added. New intertion modes for markup declarations are added.
125
126	2008-10-16 Wakaba <wakaba@suika.fam.cx>
127
128	* Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
129	New states DOCTYPE_TAG_STATE and
130	BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added. (Bogus
131	string after the internal subset, which was handled by the state
132	BOGUS_DOCTYPE_STATE, are now handled by the new state.) Support
133	for comments, bogus comments, and processing instructions in the
134	internal subset. If there is the internal subset, then emit the
135	doctype token before the internal subset (with its
136	$token->{has_internal_subset} flag set) and an
137	END_OF_DOCTYPE_TOKEN after the internal subset.
138
139	2008-10-15 Wakaba <wakaba@suika.fam.cx>
140
141	* Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
142	renamed as $self->{kwd} to avoid confliction. Don't raise
143	case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
144	Support for internal subsets (internal subset itself only; no
145	declaration in them is supported yet). Raise a parse error for
146	non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode. Raise a
147	parse error if no system identifier is specified for a DOCTYPE
148	declaration with a public identifier. Don't close the DOCTYPE
149	declaration by a ">" character in the system declaration in XML
150	mode.
151
152	2008-10-15 Wakaba <wakaba@suika.fam.cx>
153
154	* Tokenizer.pm.src: Set index attribute to each attribute token,
155	for ignoring namespaced duplicate attribute at the XML namespace
156	parser layer. Raise a parse error if the attribute value is
157	omitted, in XML mode. Raise a parse error if the attribute value
158	is not quoted, in XML mode. Raise a parse error if "<" character
159	is found in a quoted attribute value, in XML mode.
160
161	2008-10-15 Wakaba <wakaba@suika.fam.cx>
162
163	* Tokenizer.pm.src: XML tag name start character support for end
164	tags. Support for the short end tag syntax of XML5. Raise a
165	parse erorr for a lowercase <!doctype> in XML.
166
167	2008-10-15 Wakaba <wakaba@suika.fam.cx>
168
169	* Tokenizer.pm.src: XML tag name start character support for start
170	tags.
171
172	2008-10-15 Wakaba <wakaba@suika.fam.cx>
173
174	* Tokenizer.pm.src: Support for XML processing instructions.
175
176	2008-10-15 Wakaba <wakaba@suika.fam.cx>
177
178	* Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
179	as such, for the support of XML parse error.
180
181	2008-10-14 Wakaba <wakaba@suika.fam.cx>
182
183	* Tokenizer.pm.src: Parse error if CDATA section is not closed or
184	is placed outside of the root element.
185
186	2008-10-14 Wakaba <wakaba@suika.fam.cx>
187
188	* Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
189	CDATA section end.
190
191	2008-10-14 Wakaba <wakaba@suika.fam.cx>
192
193	* Tokenizer.pm.src: Support for case-insensitive XML attribute
194	names.
195
196	2008-10-14 Wakaba <wakaba@suika.fam.cx>
197
198	* Dumper.pm: Typo fixed.
199
200	2008-10-14 Wakaba <wakaba@suika.fam.cx>
201
202	* Dumper.pm: New module.
203
204	2008-10-14 Wakaba <wakaba@suika.fam.cx>
205
206	* Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
207	support in XML.
208
209	2008-10-14 Wakaba <wakaba@suika.fam.cx>
210
211	* Tokenizer.pm.src: Make *_TOKEN (token type constants)
212	exportable. New token types, PI_TOKEN for XML and ABORT_TOKEN for
213	document.write() or incremental parsing, are added for future
214	extensions.
215
216	2008-10-14 Wakaba <wakaba@suika.fam.cx>
217
218	* Tokenizer.pm.src: New file.
219
220	2008-05-24 Wakaba <wakaba@suika.fam.cx>
221
222	* Serializer.pm (get_inner_html): Don't escape \|"\| in
223	content (HTML5 revision 1592).
224
225	2008-05-24 Wakaba <wakaba@suika.fam.cx>
226
227	* Serializer.pm (get_inner_html): Append "\n" after the start
228	tag of a \|listing\| element (HTML5 revision 1675).
229
230	2008-03-02 Wakaba <wakaba@suika.fam.cx>
231
232	* Serializer.pm (get_inner_html): Typo fixed.
233
234	2008-03-01 Wakaba <wakaba@suika.fam.cx>
235
236	* Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
237	1277).
238
239	2007-11-11 Wakaba <wakaba@suika.fam.cx>
240
241	* Serializer.pod: New file.
242
243	* Makefile: New file.
244
245	2007-11-11 Wakaba <wakaba@suika.fam.cx>
246
247	* Serializer.pm: New module (split from ../HTML.pm.src).
248
249	2007-11-11 Wakaba <wakaba@suika.fam.cx>
250
251	* ChangeLog: New file.
252
253