Contents of /markup/html/whatpm/Whatpm/HTML/ChangeLog

2009-07-03  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: "<" in unquoted attribute values is now
        treated as parse error (HTML5 revision 3206).

2008-11-07  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm (dumptree): Support for namespace abbreviation for
        SWML namespaces.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Normalize white space characters in attribute
        value literals in XML documents.  Don't apply character reference
        mapping table for non-NULL non-surrogate code points.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set the "stop_processing" flag true when a
        parameter entity occurs in a standalone="no" document.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Column number counting fixed.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for '&' that does not
        introduce a reference in XML.  Support for non-ASCII entity
        reference names.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
        Remove the limitation of entity name length.  Enable replacement
        of text-only general entities.  Raise a parse error for an
        unparsed entity reference.  Raise a parse error for a general
        entity reference to an undefined entity.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for <!ELEMENT>.
        (AFTER_NOTATION_NAME_STATE): Renamed as |AFTER_MD_DEF_STATE| (i.e.
        after markup declaration definition state).

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for EntityValue.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Dump text content of Entity nodes.

        * Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
        'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
        states such that <!ENTITY> and <!NOTATION> can be tokenized by
        those states as well.
        (BOGUS_MD_STATE): A new state; used for bogus markup declarations,
        in favor of BOGUS_COMMENT_STATE.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
        document, is now fully implemented.

        * Dumper.pm (dumptree): Output allowed tokens and default value
        always.

2008-10-17  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
        GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
        are added.  New intertion modes for markup declarations are added.

2008-10-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
        New states DOCTYPE_TAG_STATE and
        BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added.  (Bogus
        string after the internal subset, which was handled by the state
        BOGUS_DOCTYPE_STATE, are now handled by the new state.)  Support
        for comments, bogus comments, and processing instructions in the
        internal subset.  If there is the internal subset, then emit the
        doctype token before the internal subset (with its
        $token->{has_internal_subset} flag set) and an
        END_OF_DOCTYPE_TOKEN after the internal subset.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
        renamed as $self->{kwd} to avoid confliction.  Don't raise
        case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
        Support for internal subsets (internal subset itself only; no
        declaration in them is supported yet).  Raise a parse error for
        non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode.  Raise a
        parse error if no system identifier is specified for a DOCTYPE
        declaration with a public identifier.  Don't close the DOCTYPE
        declaration by a ">" character in the system declaration in XML
        mode.
        
2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set index attribute to each attribute token,
        for ignoring namespaced duplicate attribute at the XML namespace
        parser layer.  Raise a parse error if the attribute value is
        omitted, in XML mode.  Raise a parse error if the attribute value
        is not quoted, in XML mode.  Raise a parse error if "<" character
        is found in a quoted attribute value, in XML mode.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for end
        tags.  Support for the short end tag syntax of XML5.  Raise a
        parse erorr for a lowercase <!doctype> in XML.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for start
        tags.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for XML processing instructions.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
        as such, for the support of XML parse error.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Parse error if CDATA section is not closed or
        is placed outside of the root element.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
        CDATA section end.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for case-insensitive XML attribute
        names.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Typo fixed.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: New module.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
        support in XML.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make *_TOKEN (token type constants)
        exportable.  New token types, PI_TOKEN for XML and ABORT_TOKEN for
        document.write() or incremental parsing, are added for future
        extensions.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New file.

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Don't escape |"| in
        content (HTML5 revision 1592).

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Append "\n" after the start
        tag of a |listing| element (HTML5 revision 1675).

2008-03-02  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Typo fixed.

2008-03-01  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
        1277).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pod: New file.

        * Makefile: New file.

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm: New module (split from ../HTML.pm.src).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * ChangeLog: New file.
        

1	wakaba	1.36	2009-07-03 Wakaba <wakaba@suika.fam.cx>
2
3			* Tokenizer.pm.src: "<" in unquoted attribute values is now
4			treated as parse error (HTML5 revision 3206).
5
6	wakaba	1.35	2008-11-07 Wakaba <wakaba@suika.fam.cx>
7
8			* Dumper.pm (dumptree): Support for namespace abbreviation for
9			SWML namespaces.
10
11	wakaba	1.26	2008-10-19 Wakaba <wakaba@suika.fam.cx>
12
13	wakaba	1.34	* Tokenizer.pm.src: Normalize white space characters in attribute
14			value literals in XML documents. Don't apply character reference
15			mapping table for non-NULL non-surrogate code points.
16
17			2008-10-19 Wakaba <wakaba@suika.fam.cx>
18
19	wakaba	1.33	* Tokenizer.pm.src: Set the "stop_processing" flag true when a
20			parameter entity occurs in a standalone="no" document.
21
22			2008-10-19 Wakaba <wakaba@suika.fam.cx>
23
24	wakaba	1.32	* Tokenizer.pm.src: Column number counting fixed.
25
26			2008-10-19 Wakaba <wakaba@suika.fam.cx>
27
28	wakaba	1.31	* Tokenizer.pm.src: Raise a parse error for '&' that does not
29			introduce a reference in XML. Support for non-ASCII entity
30			reference names.
31
32			2008-10-19 Wakaba <wakaba@suika.fam.cx>
33
34	wakaba	1.30	* Tokenizer.pm.src: Make uppercase "&#X" in XML a parse error.
35			Remove the limitation of entity name length. Enable replacement
36			of text-only general entities. Raise a parse error for an
37			unparsed entity reference. Raise a parse error for a general
38			entity reference to an undefined entity.
39
40			2008-10-19 Wakaba <wakaba@suika.fam.cx>
41
42	wakaba	1.29	* Tokenizer.pm.src: Support for <!ELEMENT>.
43			(AFTER_NOTATION_NAME_STATE): Renamed as \|AFTER_MD_DEF_STATE\| (i.e.
44			after markup declaration definition state).
45
46			2008-10-19 Wakaba <wakaba@suika.fam.cx>
47
48	wakaba	1.28	* Tokenizer.pm.src: Support for EntityValue.
49
50			2008-10-19 Wakaba <wakaba@suika.fam.cx>
51
52	wakaba	1.27	* Dumper.pm: Dump text content of Entity nodes.
53
54			* Tokenizer.pm.src: Support for <!ENTITY ... NDATA>.
55
56			2008-10-19 Wakaba <wakaba@suika.fam.cx>
57
58	wakaba	1.26	* Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
59			'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.
60
61	wakaba	1.24	2008-10-18 Wakaba <wakaba@suika.fam.cx>
62
63	wakaba	1.25	* Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
64			states such that <!ENTITY> and <!NOTATION> can be tokenized by
65			those states as well.
66			(BOGUS_MD_STATE): A new state; used for bogus markup declarations,
67			in favor of BOGUS_COMMENT_STATE.
68
69			2008-10-18 Wakaba <wakaba@suika.fam.cx>
70
71	wakaba	1.24	* Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
72			document, is now fully implemented.
73
74			* Dumper.pm (dumptree): Output allowed tokens and default value
75			always.
76
77	wakaba	1.23	2008-10-17 Wakaba <wakaba@suika.fam.cx>
78
79			* Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
80			GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
81			are added. New intertion modes for markup declarations are added.
82
83	wakaba	1.22	2008-10-16 Wakaba <wakaba@suika.fam.cx>
84
85			* Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
86			New states DOCTYPE_TAG_STATE and
87			BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added. (Bogus
88			string after the internal subset, which was handled by the state
89			BOGUS_DOCTYPE_STATE, are now handled by the new state.) Support
90			for comments, bogus comments, and processing instructions in the
91			internal subset. If there is the internal subset, then emit the
92			doctype token before the internal subset (with its
93			$token->{has_internal_subset} flag set) and an
94			END_OF_DOCTYPE_TOKEN after the internal subset.
95
96	wakaba	1.16	2008-10-15 Wakaba <wakaba@suika.fam.cx>
97
98	wakaba	1.21	* Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
99			renamed as $self->{kwd} to avoid confliction. Don't raise
100			case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
101			Support for internal subsets (internal subset itself only; no
102			declaration in them is supported yet). Raise a parse error for
103			non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode. Raise a
104			parse error if no system identifier is specified for a DOCTYPE
105			declaration with a public identifier. Don't close the DOCTYPE
106			declaration by a ">" character in the system declaration in XML
107			mode.
108
109			2008-10-15 Wakaba <wakaba@suika.fam.cx>
110
111	wakaba	1.20	* Tokenizer.pm.src: Set index attribute to each attribute token,
112			for ignoring namespaced duplicate attribute at the XML namespace
113			parser layer. Raise a parse error if the attribute value is
114			omitted, in XML mode. Raise a parse error if the attribute value
115			is not quoted, in XML mode. Raise a parse error if "<" character
116			is found in a quoted attribute value, in XML mode.
117
118			2008-10-15 Wakaba <wakaba@suika.fam.cx>
119
120	wakaba	1.19	* Tokenizer.pm.src: XML tag name start character support for end
121			tags. Support for the short end tag syntax of XML5. Raise a
122			parse erorr for a lowercase <!doctype> in XML.
123
124			2008-10-15 Wakaba <wakaba@suika.fam.cx>
125
126			* Tokenizer.pm.src: XML tag name start character support for start
127	wakaba	1.18	tags.
128
129			2008-10-15 Wakaba <wakaba@suika.fam.cx>
130
131	wakaba	1.17	* Tokenizer.pm.src: Support for XML processing instructions.
132
133			2008-10-15 Wakaba <wakaba@suika.fam.cx>
134
135	wakaba	1.16	* Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
136			as such, for the support of XML parse error.
137
138	wakaba	1.8	2008-10-14 Wakaba <wakaba@suika.fam.cx>
139
140	wakaba	1.15	* Tokenizer.pm.src: Parse error if CDATA section is not closed or
141			is placed outside of the root element.
142
143			2008-10-14 Wakaba <wakaba@suika.fam.cx>
144
145	wakaba	1.14	* Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
146			CDATA section end.
147
148			2008-10-14 Wakaba <wakaba@suika.fam.cx>
149
150	wakaba	1.13	* Tokenizer.pm.src: Support for case-insensitive XML attribute
151			names.
152
153			2008-10-14 Wakaba <wakaba@suika.fam.cx>
154
155	wakaba	1.12	* Dumper.pm: Typo fixed.
156
157			2008-10-14 Wakaba <wakaba@suika.fam.cx>
158
159	wakaba	1.11	* Dumper.pm: New module.
160
161			2008-10-14 Wakaba <wakaba@suika.fam.cx>
162
163	wakaba	1.10	* Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
164			support in XML.
165
166			2008-10-14 Wakaba <wakaba@suika.fam.cx>
167
168	wakaba	1.9	* Tokenizer.pm.src: Make *_TOKEN (token type constants)
169			exportable. New token types, PI_TOKEN for XML and ABORT_TOKEN for
170			document.write() or incremental parsing, are added for future
171			extensions.
172
173			2008-10-14 Wakaba <wakaba@suika.fam.cx>
174
175	wakaba	1.8	* Tokenizer.pm.src: New file.
176
177	wakaba	1.5	2008-05-24 Wakaba <wakaba@suika.fam.cx>
178
179	wakaba	1.7	* Serializer.pm (get_inner_html): Don't escape \|"\| in
180			content (HTML5 revision 1592).
181
182			2008-05-24 Wakaba <wakaba@suika.fam.cx>
183
184	wakaba	1.5	* Serializer.pm (get_inner_html): Append "\n" after the start
185	wakaba	1.6	tag of a \|listing\| element (HTML5 revision 1675).
186	wakaba	1.5
187	wakaba	1.4	2008-03-02 Wakaba <wakaba@suika.fam.cx>
188
189			* Serializer.pm (get_inner_html): Typo fixed.
190
191	wakaba	1.3	2008-03-01 Wakaba <wakaba@suika.fam.cx>
192
193			* Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
194			1277).
195
196	wakaba	1.2	2007-11-11 Wakaba <wakaba@suika.fam.cx>
197
198			* Serializer.pod: New file.
199
200			* Makefile: New file.
201
202			2007-11-11 Wakaba <wakaba@suika.fam.cx>
203
204			* Serializer.pm: New module (split from ../HTML.pm.src).
205
206			2007-11-11 Wakaba <wakaba@suika.fam.cx>
207
208			* ChangeLog: New file.
209
210