Contents of /markup/html/whatpm/Whatpm/HTML/ChangeLog

2008-11-07  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm (dumptree): Support for namespace abbreviation for
        SWML namespaces.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Normalize white space characters in attribute
        value literals in XML documents.  Don't apply character reference
        mapping table for non-NULL non-surrogate code points.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set the "stop_processing" flag true when a
        parameter entity occurs in a standalone="no" document.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Column number counting fixed.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for '&' that does not
        introduce a reference in XML.  Support for non-ASCII entity
        reference names.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
        Remove the limitation of entity name length.  Enable replacement
        of text-only general entities.  Raise a parse error for an
        unparsed entity reference.  Raise a parse error for a general
        entity reference to an undefined entity.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for <!ELEMENT>.
        (AFTER_NOTATION_NAME_STATE): Renamed as |AFTER_MD_DEF_STATE| (i.e.
        after markup declaration definition state).

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for EntityValue.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Dump text content of Entity nodes.

        * Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
        'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
        states such that <!ENTITY> and <!NOTATION> can be tokenized by
        those states as well.
        (BOGUS_MD_STATE): A new state; used for bogus markup declarations,
        in favor of BOGUS_COMMENT_STATE.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
        document, is now fully implemented.

        * Dumper.pm (dumptree): Output allowed tokens and default value
        always.

2008-10-17  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
        GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
        are added.  New intertion modes for markup declarations are added.

2008-10-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
        New states DOCTYPE_TAG_STATE and
        BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added.  (Bogus
        string after the internal subset, which was handled by the state
        BOGUS_DOCTYPE_STATE, are now handled by the new state.)  Support
        for comments, bogus comments, and processing instructions in the
        internal subset.  If there is the internal subset, then emit the
        doctype token before the internal subset (with its
        $token->{has_internal_subset} flag set) and an
        END_OF_DOCTYPE_TOKEN after the internal subset.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
        renamed as $self->{kwd} to avoid confliction.  Don't raise
        case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
        Support for internal subsets (internal subset itself only; no
        declaration in them is supported yet).  Raise a parse error for
        non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode.  Raise a
        parse error if no system identifier is specified for a DOCTYPE
        declaration with a public identifier.  Don't close the DOCTYPE
        declaration by a ">" character in the system declaration in XML
        mode.
        
2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set index attribute to each attribute token,
        for ignoring namespaced duplicate attribute at the XML namespace
        parser layer.  Raise a parse error if the attribute value is
        omitted, in XML mode.  Raise a parse error if the attribute value
        is not quoted, in XML mode.  Raise a parse error if "<" character
        is found in a quoted attribute value, in XML mode.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for end
        tags.  Support for the short end tag syntax of XML5.  Raise a
        parse erorr for a lowercase <!doctype> in XML.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for start
        tags.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for XML processing instructions.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
        as such, for the support of XML parse error.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Parse error if CDATA section is not closed or
        is placed outside of the root element.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
        CDATA section end.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for case-insensitive XML attribute
        names.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Typo fixed.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: New module.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
        support in XML.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make *_TOKEN (token type constants)
        exportable.  New token types, PI_TOKEN for XML and ABORT_TOKEN for
        document.write() or incremental parsing, are added for future
        extensions.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New file.

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Don't escape |"| in
        content (HTML5 revision 1592).

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Append "\n" after the start
        tag of a |listing| element (HTML5 revision 1675).

2008-03-02  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Typo fixed.

2008-03-01  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
        1277).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pod: New file.

        * Makefile: New file.

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm: New module (split from ../HTML.pm.src).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * ChangeLog: New file.
        

1	2008-11-07 Wakaba <wakaba@suika.fam.cx>
2
3	* Dumper.pm (dumptree): Support for namespace abbreviation for
4	SWML namespaces.
5
6	2008-10-19 Wakaba <wakaba@suika.fam.cx>
7
8	* Tokenizer.pm.src: Normalize white space characters in attribute
9	value literals in XML documents. Don't apply character reference
10	mapping table for non-NULL non-surrogate code points.
11
12	2008-10-19 Wakaba <wakaba@suika.fam.cx>
13
14	* Tokenizer.pm.src: Set the "stop_processing" flag true when a
15	parameter entity occurs in a standalone="no" document.
16
17	2008-10-19 Wakaba <wakaba@suika.fam.cx>
18
19	* Tokenizer.pm.src: Column number counting fixed.
20
21	2008-10-19 Wakaba <wakaba@suika.fam.cx>
22
23	* Tokenizer.pm.src: Raise a parse error for '&' that does not
24	introduce a reference in XML. Support for non-ASCII entity
25	reference names.
26
27	2008-10-19 Wakaba <wakaba@suika.fam.cx>
28
29	* Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
30	Remove the limitation of entity name length. Enable replacement
31	of text-only general entities. Raise a parse error for an
32	unparsed entity reference. Raise a parse error for a general
33	entity reference to an undefined entity.
34
35	2008-10-19 Wakaba <wakaba@suika.fam.cx>
36
37	* Tokenizer.pm.src: Support for <!ELEMENT>.
38	(AFTER_NOTATION_NAME_STATE): Renamed as \|AFTER_MD_DEF_STATE\| (i.e.
39	after markup declaration definition state).
40
41	2008-10-19 Wakaba <wakaba@suika.fam.cx>
42
43	* Tokenizer.pm.src: Support for EntityValue.
44
45	2008-10-19 Wakaba <wakaba@suika.fam.cx>
46
47	* Dumper.pm: Dump text content of Entity nodes.
48
49	* Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.
50
51	2008-10-19 Wakaba <wakaba@suika.fam.cx>
52
53	* Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
54	'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.
55
56	2008-10-18 Wakaba <wakaba@suika.fam.cx>
57
58	* Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
59	states such that <!ENTITY> and <!NOTATION> can be tokenized by
60	those states as well.
61	(BOGUS_MD_STATE): A new state; used for bogus markup declarations,
62	in favor of BOGUS_COMMENT_STATE.
63
64	2008-10-18 Wakaba <wakaba@suika.fam.cx>
65
66	* Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
67	document, is now fully implemented.
68
69	* Dumper.pm (dumptree): Output allowed tokens and default value
70	always.
71
72	2008-10-17 Wakaba <wakaba@suika.fam.cx>
73
74	* Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
75	GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
76	are added. New intertion modes for markup declarations are added.
77
78	2008-10-16 Wakaba <wakaba@suika.fam.cx>
79
80	* Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
81	New states DOCTYPE_TAG_STATE and
82	BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added. (Bogus
83	string after the internal subset, which was handled by the state
84	BOGUS_DOCTYPE_STATE, are now handled by the new state.) Support
85	for comments, bogus comments, and processing instructions in the
86	internal subset. If there is the internal subset, then emit the
87	doctype token before the internal subset (with its
88	$token->{has_internal_subset} flag set) and an
89	END_OF_DOCTYPE_TOKEN after the internal subset.
90
91	2008-10-15 Wakaba <wakaba@suika.fam.cx>
92
93	* Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
94	renamed as $self->{kwd} to avoid confliction. Don't raise
95	case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
96	Support for internal subsets (internal subset itself only; no
97	declaration in them is supported yet). Raise a parse error for
98	non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode. Raise a
99	parse error if no system identifier is specified for a DOCTYPE
100	declaration with a public identifier. Don't close the DOCTYPE
101	declaration by a ">" character in the system declaration in XML
102	mode.
103
104	2008-10-15 Wakaba <wakaba@suika.fam.cx>
105
106	* Tokenizer.pm.src: Set index attribute to each attribute token,
107	for ignoring namespaced duplicate attribute at the XML namespace
108	parser layer. Raise a parse error if the attribute value is
109	omitted, in XML mode. Raise a parse error if the attribute value
110	is not quoted, in XML mode. Raise a parse error if "<" character
111	is found in a quoted attribute value, in XML mode.
112
113	2008-10-15 Wakaba <wakaba@suika.fam.cx>
114
115	* Tokenizer.pm.src: XML tag name start character support for end
116	tags. Support for the short end tag syntax of XML5. Raise a
117	parse erorr for a lowercase <!doctype> in XML.
118
119	2008-10-15 Wakaba <wakaba@suika.fam.cx>
120
121	* Tokenizer.pm.src: XML tag name start character support for start
122	tags.
123
124	2008-10-15 Wakaba <wakaba@suika.fam.cx>
125
126	* Tokenizer.pm.src: Support for XML processing instructions.
127
128	2008-10-15 Wakaba <wakaba@suika.fam.cx>
129
130	* Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
131	as such, for the support of XML parse error.
132
133	2008-10-14 Wakaba <wakaba@suika.fam.cx>
134
135	* Tokenizer.pm.src: Parse error if CDATA section is not closed or
136	is placed outside of the root element.
137
138	2008-10-14 Wakaba <wakaba@suika.fam.cx>
139
140	* Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
141	CDATA section end.
142
143	2008-10-14 Wakaba <wakaba@suika.fam.cx>
144
145	* Tokenizer.pm.src: Support for case-insensitive XML attribute
146	names.
147
148	2008-10-14 Wakaba <wakaba@suika.fam.cx>
149
150	* Dumper.pm: Typo fixed.
151
152	2008-10-14 Wakaba <wakaba@suika.fam.cx>
153
154	* Dumper.pm: New module.
155
156	2008-10-14 Wakaba <wakaba@suika.fam.cx>
157
158	* Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
159	support in XML.
160
161	2008-10-14 Wakaba <wakaba@suika.fam.cx>
162
163	* Tokenizer.pm.src: Make *_TOKEN (token type constants)
164	exportable. New token types, PI_TOKEN for XML and ABORT_TOKEN for
165	document.write() or incremental parsing, are added for future
166	extensions.
167
168	2008-10-14 Wakaba <wakaba@suika.fam.cx>
169
170	* Tokenizer.pm.src: New file.
171
172	2008-05-24 Wakaba <wakaba@suika.fam.cx>
173
174	* Serializer.pm (get_inner_html): Don't escape \|"\| in
175	content (HTML5 revision 1592).
176
177	2008-05-24 Wakaba <wakaba@suika.fam.cx>
178
179	* Serializer.pm (get_inner_html): Append "\n" after the start
180	tag of a \|listing\| element (HTML5 revision 1675).
181
182	2008-03-02 Wakaba <wakaba@suika.fam.cx>
183
184	* Serializer.pm (get_inner_html): Typo fixed.
185
186	2008-03-01 Wakaba <wakaba@suika.fam.cx>
187
188	* Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
189	1277).
190
191	2007-11-11 Wakaba <wakaba@suika.fam.cx>
192
193	* Serializer.pod: New file.
194
195	* Makefile: New file.
196
197	2007-11-11 Wakaba <wakaba@suika.fam.cx>
198
199	* Serializer.pm: New module (split from ../HTML.pm.src).
200
201	2007-11-11 Wakaba <wakaba@suika.fam.cx>
202
203	* ChangeLog: New file.
204
205