Contents of /markup/html/whatpm/Whatpm/HTML/ChangeLog

2008-10-19  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
        'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
        states such that <!ENTITY> and <!NOTATION> can be tokenized by
        those states as well.
        (BOGUS_MD_STATE): A new state; used for bogus markup declarations,
        in favor of BOGUS_COMMENT_STATE.

2008-10-18  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
        document, is now fully implemented.

        * Dumper.pm (dumptree): Output allowed tokens and default value
        always.

2008-10-17  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
        GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
        are added.  New intertion modes for markup declarations are added.

2008-10-16  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
        New states DOCTYPE_TAG_STATE and
        BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added.  (Bogus
        string after the internal subset, which was handled by the state
        BOGUS_DOCTYPE_STATE, are now handled by the new state.)  Support
        for comments, bogus comments, and processing instructions in the
        internal subset.  If there is the internal subset, then emit the
        doctype token before the internal subset (with its
        $token->{has_internal_subset} flag set) and an
        END_OF_DOCTYPE_TOKEN after the internal subset.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
        renamed as $self->{kwd} to avoid confliction.  Don't raise
        case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
        Support for internal subsets (internal subset itself only; no
        declaration in them is supported yet).  Raise a parse error for
        non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode.  Raise a
        parse error if no system identifier is specified for a DOCTYPE
        declaration with a public identifier.  Don't close the DOCTYPE
        declaration by a ">" character in the system declaration in XML
        mode.
        
2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Set index attribute to each attribute token,
        for ignoring namespaced duplicate attribute at the XML namespace
        parser layer.  Raise a parse error if the attribute value is
        omitted, in XML mode.  Raise a parse error if the attribute value
        is not quoted, in XML mode.  Raise a parse error if "<" character
        is found in a quoted attribute value, in XML mode.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for end
        tags.  Support for the short end tag syntax of XML5.  Raise a
        parse erorr for a lowercase <!doctype> in XML.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: XML tag name start character support for start
        tags.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for XML processing instructions.

2008-10-15  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
        as such, for the support of XML parse error.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Parse error if CDATA section is not closed or
        is placed outside of the root element.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
        CDATA section end.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Support for case-insensitive XML attribute
        names.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: Typo fixed.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Dumper.pm: New module.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
        support in XML.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: Make *_TOKEN (token type constants)
        exportable.  New token types, PI_TOKEN for XML and ABORT_TOKEN for
        document.write() or incremental parsing, are added for future
        extensions.

2008-10-14  Wakaba  <wakaba@suika.fam.cx>

        * Tokenizer.pm.src: New file.

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Don't escape |"| in
        content (HTML5 revision 1592).

2008-05-24  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Append "\n" after the start
        tag of a |listing| element (HTML5 revision 1675).

2008-03-02  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Typo fixed.

2008-03-01  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
        1277).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pod: New file.

        * Makefile: New file.

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * Serializer.pm: New module (split from ../HTML.pm.src).

2007-11-11  Wakaba  <wakaba@suika.fam.cx>

        * ChangeLog: New file.
        

1	2008-10-19 Wakaba <wakaba@suika.fam.cx>
2
3	* Tokenizer.pm.src (_get_next_token): Make keywords 'ENTITY',
4	'ELEMENT', 'ATTLIST', and 'NOTATION' ASCII case-insensitive.
5
6	2008-10-18 Wakaba <wakaba@suika.fam.cx>
7
8	* Tokenizer.pm.src: Modifies PUBLIC/SYSTEM identifier tokenizer
9	states such that <!ENTITY> and <!NOTATION> can be tokenized by
10	those states as well.
11	(BOGUS_MD_STATE): A new state; used for bogus markup declarations,
12	in favor of BOGUS_COMMENT_STATE.
13
14	2008-10-18 Wakaba <wakaba@suika.fam.cx>
15
16	* Tokenizer.pm.src: <!ATTLIST> in the internal subset of an XML
17	document, is now fully implemented.
18
19	* Dumper.pm (dumptree): Output allowed tokens and default value
20	always.
21
22	2008-10-17 Wakaba <wakaba@suika.fam.cx>
23
24	* Tokenizer.pm.src: New token types AtTLIST_TOKEN, ELEMENT_TOKEN,
25	GENERAL_ENTITY_TOKEN, PARAMETER_ENTITY_TOKEN, and NOTATION_TOKEN
26	are added. New intertion modes for markup declarations are added.
27
28	2008-10-16 Wakaba <wakaba@suika.fam.cx>
29
30	* Tokenizer.pm.src: New token type END_OF_DOCTYPE_TOKEN added.
31	New states DOCTYPE_TAG_STATE and
32	BOGUS_DOCTYPE_INTERNAL_SUBSET_AFTER_STATE are added. (Bogus
33	string after the internal subset, which was handled by the state
34	BOGUS_DOCTYPE_STATE, are now handled by the new state.) Support
35	for comments, bogus comments, and processing instructions in the
36	internal subset. If there is the internal subset, then emit the
37	doctype token before the internal subset (with its
38	$token->{has_internal_subset} flag set) and an
39	END_OF_DOCTYPE_TOKEN after the internal subset.
40
41	2008-10-15 Wakaba <wakaba@suika.fam.cx>
42
43	* Tokenizer.pm.src: $self->{s_kwd} for non-DATA_STATE states are
44	renamed as $self->{kwd} to avoid confliction. Don't raise
45	case-sensitivity error for the keyword "DOCTYPE" in HTML mode.
46	Support for internal subsets (internal subset itself only; no
47	declaration in them is supported yet). Raise a parse error for
48	non-uppercase keywords "PUBLIC" and "SYSTEM" in XML mode. Raise a
49	parse error if no system identifier is specified for a DOCTYPE
50	declaration with a public identifier. Don't close the DOCTYPE
51	declaration by a ">" character in the system declaration in XML
52	mode.
53
54	2008-10-15 Wakaba <wakaba@suika.fam.cx>
55
56	* Tokenizer.pm.src: Set index attribute to each attribute token,
57	for ignoring namespaced duplicate attribute at the XML namespace
58	parser layer. Raise a parse error if the attribute value is
59	omitted, in XML mode. Raise a parse error if the attribute value
60	is not quoted, in XML mode. Raise a parse error if "<" character
61	is found in a quoted attribute value, in XML mode.
62
63	2008-10-15 Wakaba <wakaba@suika.fam.cx>
64
65	* Tokenizer.pm.src: XML tag name start character support for end
66	tags. Support for the short end tag syntax of XML5. Raise a
67	parse erorr for a lowercase <!doctype> in XML.
68
69	2008-10-15 Wakaba <wakaba@suika.fam.cx>
70
71	* Tokenizer.pm.src: XML tag name start character support for start
72	tags.
73
74	2008-10-15 Wakaba <wakaba@suika.fam.cx>
75
76	* Tokenizer.pm.src: Support for XML processing instructions.
77
78	2008-10-15 Wakaba <wakaba@suika.fam.cx>
79
80	* Tokenizer.pm.src: Mark CHARACTER_TOKEN with character reference
81	as such, for the support of XML parse error.
82
83	2008-10-14 Wakaba <wakaba@suika.fam.cx>
84
85	* Tokenizer.pm.src: Parse error if CDATA section is not closed or
86	is placed outside of the root element.
87
88	2008-10-14 Wakaba <wakaba@suika.fam.cx>
89
90	* Tokenizer.pm.src: Raise a parse error for XML "]]>" other than
91	CDATA section end.
92
93	2008-10-14 Wakaba <wakaba@suika.fam.cx>
94
95	* Tokenizer.pm.src: Support for case-insensitive XML attribute
96	names.
97
98	2008-10-14 Wakaba <wakaba@suika.fam.cx>
99
100	* Dumper.pm: Typo fixed.
101
102	2008-10-14 Wakaba <wakaba@suika.fam.cx>
103
104	* Dumper.pm: New module.
105
106	2008-10-14 Wakaba <wakaba@suika.fam.cx>
107
108	* Tokenizer.pm.src: Introduced "in_xml" flag for CDATA section
109	support in XML.
110
111	2008-10-14 Wakaba <wakaba@suika.fam.cx>
112
113	* Tokenizer.pm.src: Make *_TOKEN (token type constants)
114	exportable. New token types, PI_TOKEN for XML and ABORT_TOKEN for
115	document.write() or incremental parsing, are added for future
116	extensions.
117
118	2008-10-14 Wakaba <wakaba@suika.fam.cx>
119
120	* Tokenizer.pm.src: New file.
121
122	2008-05-24 Wakaba <wakaba@suika.fam.cx>
123
124	* Serializer.pm (get_inner_html): Don't escape \|"\| in
125	content (HTML5 revision 1592).
126
127	2008-05-24 Wakaba <wakaba@suika.fam.cx>
128
129	* Serializer.pm (get_inner_html): Append "\n" after the start
130	tag of a \|listing\| element (HTML5 revision 1675).
131
132	2008-03-02 Wakaba <wakaba@suika.fam.cx>
133
134	* Serializer.pm (get_inner_html): Typo fixed.
135
136	2008-03-01 Wakaba <wakaba@suika.fam.cx>
137
138	* Serializer.pm (get_inner_html): Escape NBSP (HTML5 revision
139	1277).
140
141	2007-11-11 Wakaba <wakaba@suika.fam.cx>
142
143	* Serializer.pod: New file.
144
145	* Makefile: New file.
146
147	2007-11-11 Wakaba <wakaba@suika.fam.cx>
148
149	* Serializer.pm: New module (split from ../HTML.pm.src).
150
151	2007-11-11 Wakaba <wakaba@suika.fam.cx>
152
153	* ChangeLog: New file.
154
155