/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Contents of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.26 - (hide annotations) (download) (as text)
Sat Mar 29 06:35:16 2008 UTC (16 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.25: +116 -15 lines
File MIME type: text/html
How general entity declarations should be handled

1 wakaba 1.1 <!DOCTYPE html>
2     <html lang="en" class="formal-specification status-work-in-progress">
3     <head>
4 wakaba 1.2 <title>manakai's XML Conformance Checking</title>
5 wakaba 1.1 <link rel="stylesheet" href="http://suika.fam.cx/www/style/html/spec">
6     <link rel="license" href="http://suika.fam.cx/c/gnu/fdl">
7 wakaba 1.6 <style>
8     a[id]::before {
9     content: "[" attr(id) "]";
10     color: gray;
11     font-size: 70%;
12     font-family: Courier New, monospace;
13     vertical-align: 60%;
14     }
15     </style>
16 wakaba 1.1 </head>
17     <body class="has-abstract">
18    
19     <div class="header">
20 wakaba 1.12 <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21 wakaba 1.24 <h2>Working Draft <time datetime=2008-03-29>29 March 2008</time></h2>
22 wakaba 1.1
23 wakaba 1.2 <dl class="versions-uri">
24 wakaba 1.1 <dt>This Version</dt>
25 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
26     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
27 wakaba 1.1 <dt>Latest Version</dt>
28 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc"
29     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc</a>&gt;</code></dd>
30 wakaba 1.1 <dt>Latest Working Draft</dt>
31 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
32     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
33 wakaba 1.1 <!-- <dt>Previous Version</dt> -->
34     <dt>Version History</dt>
35 wakaba 1.2 <dd><code class=URI>&lt;<a href="http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html">http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html</a>&gt;</code></dd>
36 wakaba 1.1 </dl>
37     <dl class="authors" id="author">
38     <dt>Author</dt>
39     <dd><a href="http://suika.fam.cx/~wakaba/who?" rel="author"
40     lang=ja>Wakaba</a>
41     <code class="mail">&lt;<a href="mailto:w@suika.fam.cx" rel="author"
42     >w@suika.fam.cx</a>&gt;</code></dd>
43     </dl>
44    
45 wakaba 1.22 <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46 wakaba 1.1 href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47     Permission is granted to copy, distribute and/or modify this document
48     under the terms of the <a rel="license"
49     href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>,
50     Version 1.2
51     or any later version published by the <a
52     href="http://www.fsf.org/">Free Software Foundation</a>;
53     with no Invariant Sections, no Front$B!>(BCover Texts, and no Back$B!>(BCover
54     Texts. A copy of the license is
55     available at <code class="URI">&lt;<a
56     href="http://www.gnu.org/copyleft/fdl.html"
57     rel="license">http://www.gnu.org/copyleft/fdl.html</a>&gt;</code>.</p>
58     </div>
59    
60     <div id="abstract" class="section">
61     <h2>Abstract</h2>
62    
63     <div class="ed issue">...</div>
64    
65     </div>
66    
67     <div id="status" class="section">
68     <h2>Status of This Document</h2>
69    
70     <p class=section-info><em>This section describes the status of this document
71     at the time of its publication. Other documents
72     might supersede this document.</em></p>
73    
74     <p>This document is a working draft, produced as part of the
75     <a href="http://suika.fam.cx/www/markup/html/whatpm/readme">Whatpm</a>
76     subproject of the
77     <a href="http://suika.fam.cx/www/manakai-core/doc/web/">manakai</a>
78     project. It might be updated, replaced, or obsoleted by
79     other documents at any time. It is inappropriate to
80     cite this document as other than <q>work in progress</q>.</p>
81    
82     <p>Comments on this document are welcome and
83     may be sent to the <a href="#author">author</a>.</p>
84    
85     <p>Translations of thie document might be available.
86     The English version of the document is the only
87     normative version.</p>
88     </div>
89    
90     <div id="introduction" class="section">
91     <h2>Introduction</h2>
92    
93     <p class=section-info>This section is <em>non$B!>(Bnormative</em>.</p>
94    
95     <div class="issue ed">...</div>
96    
97    
98     </div>
99    
100     <div class="section" id="terminology">
101     <h2>Terminology</h2>
102    
103     <p>The key words <q><em class="rfc2119">MUST</em></q>,
104     <q><em class="rfc2119">MUST NOT</em></q>,
105     <q><em class="rfc2119">SHOULD</em></q>,
106     <q><em class="rfc2119">SHOULD NOT</em></q>, and
107     <q><em class=rfc2119>MAY</em></q> in this
108     document are to be interpreted as described in
109     <a href="urn:ietf:rfc:2119"><abbr title="Request for Comments">RFC</abbr>
110     2119</a>
111     <cite class="bibref normative">[<a href="#ref-KEYWORDS">KEYWORDS</a>]</cite>.</p>
112    
113     <!-- Strictly speaking the paragraph below is not part of terminology. -->
114     <p>All examples and notes in this specification are non$B!>(Bnormative,
115     as are all sections explicitly marked non$B!>(Bnormative. Everything
116     else in this specification is normative.</p>
117    
118     <p><span class=ed>Algorithm is normative but non-normative</span>.
119     In addition, the order in which <a href="#errors">errors</a> are
120     raised is undefined.</p>
121 wakaba 1.22
122     <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123     by hyperlinks. When the document being processed is an <abbr>XML</abbr> 1.1
124     document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125     specification should be consulted instead.</p>
126 wakaba 1.1 </div>
127    
128    
129 wakaba 1.9 <div class=section id=error-categories>
130 wakaba 1.14 <h2>Error Classification</h2>
131 wakaba 1.9
132 wakaba 1.1 <p class=ed>If a <code>Document</code> node has no
133     xml-well-formedness-error, entity-error, and unknown-error,
134     then it is well-formed. If a well-formed <code>Document</code>
135     node has no xml-validity-error, it is valid.</p>
136    
137     <div class="note memo">
138     <p>A well$B!>(Bformed <code>Document</code> can be safely serialized
139     into a well$B!>(Bformed XML document. A valid <code>Document</code>
140     can be easily serialized into a valid XML document.</p>
141     </div>
142    
143 wakaba 1.17 <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
144     ...</p>
145    
146 wakaba 1.1 <p>Errors are classified into these
147     <dfn id=error-category title="error category">error categories</dfn>:</p>
148    
149     <dl id=errors>
150     <dt><dfn id=entity-error><code>entity-error</code></dfn></dt>
151 wakaba 1.4 <dd>
152     <p class=ed>@@</p>
153     <div class="note memo">
154     <p>This <span class=ed>algorithm</span> does not support
155     <abbr>DOM</abbr> tree with one or more <code>EntityReference</code>
156     nodes. It is expected that any entity references are expanded
157     at the parse time and any unexpandable entity references
158     make parse time errors raised so that never result in
159     <abbr>DOM</abbr> tree with <code>EntityReference</code> nodes.</p>
160     </div>
161     </dd>
162 wakaba 1.1 <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
163     <dd class=ed>@@</dd>
164 wakaba 1.12 <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
165     <dd>
166     <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
167     will be raised when a construct, which might not be restored to the same
168     construct when it is serialized and then re-parsed by a conforming
169     processor, is encountered.</p>
170     <div class="example memo">
171     <p>For a <code>Comment</code> node a
172     <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
173     be raised, since <abbr>XML</abbr> processors are not required to
174     report texts of comments for applications.</p>
175     </div>
176     </dd>
177 wakaba 1.1 <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
178     <dd class=ed>@@</dd>
179     <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
180     <dd>An <abbr>XML</abbr> <i>error</i> (<abbr>XML</abbr> 1.0
181     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
182     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-error">error</a> /
183     <abbr>XML</abbr> 1.1
184     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
185     <a href="@@/#dt-error">error</a>)
186     that is not classified to any other <a href="#error-category">error
187     category</a>.</dd>
188     <dt><dfn id=xml-misc-fatal-error><code>xml-misc-fatal-error</code></dt>
189     <dd>An <abbr>XML</abbr> <i>fatal error</i> (<abbr>XML</abbr> 1.0
190     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
191     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-fatal">fatal
192     error</a> / <abbr>XML</abbr> 1.1
193     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
194     <a href="@@/#dt-fatal">fatal error</a>)
195     that is not classified to any other <a href="#error-category">error
196     category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
197 wakaba 1.13 <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
198     <dd>
199     <p>An
200     <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
201     will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
202     in <abbr>XML</abbr> specification is not met.</p>
203     </dd>
204 wakaba 1.1 <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
205     <dd>A violation of validity constraint in XML document.</dd>
206     <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
207     <dd><p>If an <code>xml-well-formedness-error</code> is raised,
208     it would not be possible to generate an XML serialization that
209     would match to the appropriate production rule and that would not
210     violate to any well$B!>(Bformedness constraint in XML
211     specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
212     <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
213 wakaba 1.26 <dt><dfn id=misc-info><code>misc-info</code></dfn>
214     <dd><p>A <code>misc-info</code> is raised when some status information
215     on parsing or checking process that are considered useful for debugging
216     and so on is available. It by no means implies the non-conformance of
217     the document.
218 wakaba 1.1 </dl>
219    
220     <div class=ed><p>@@ TODO: #dt-atuseroption at user option
221     (MAY or MUST), #dt-compat for compatibility,
222 wakaba 1.23 #dt-interop for interoperability</p>
223    
224     <p>TODO: XML 1.1, XML Namespace 1.0/1.1, xml:base, xml:id
225     </div>
226 wakaba 1.1
227 wakaba 1.9 </div>
228 wakaba 1.14
229     <div class=section id=parsing-xml>
230     <h2>Parsing <abbr>XML</abbr> Document</h2>
231    
232 wakaba 1.22 <p>When an <abbr>XML</abbr> document is parsed, the following clauses
233     are applied:</p>
234 wakaba 1.25 <dl class=switch>
235     <dt>For the document
236 wakaba 1.22 <dd>If the <abbr>XML</abbr> document does not begin with an
237     <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
238     raise an
239     <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
240 wakaba 1.25 <dt>For the document type declaration
241     <dd class=ed>@@ read external entity
242 wakaba 1.26 <dd>The <code>entities</code> attribute of the <code>DocumentType</code>
243     node <em class=rfc2119>MUST</em> contain a <code>NamedNodeMap</code> object
244     whose first five items are as follows:
245     <ol start=0>
246     <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
247     is <code>amp</code>. It contains a <code>Text</code> node whose
248     <code>data</code> attribute is set to <code>&amp;</code>.
249     <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
250     is <code>lt</code>. It contains a <code>Text</code> node whose
251     <code>data</code> attribute is set to <code>&lt;</code>.
252     <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
253     is <code>gt</code>. It contains a <code>Text</code> node whose
254     <code>data</code> attribute is set to <code>></code>.
255     <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
256     is <code>quot</code>. It contains a <code>Text</code> node whose
257     <code>data</code> attribute is set to <code>"</code>.
258     <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
259     is <code>apos</code>. It contains a <code>Text</code> node whose
260     <code>data</code> attribute is set to <code>'</code>.
261     </ol>
262 wakaba 1.25 <dt>For each internal general entity declaration being processed by the parser
263 wakaba 1.22 <dd>If the
264     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
265     part of the general entity declaration contains a bare <code>U+003C</code>
266     <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
267     <em class=rfc2119>MUST</em> raise an
268     <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
269 wakaba 1.14 "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
270 wakaba 1.22 -->
271 wakaba 1.25 <dt>For each element type declaration being processed by the parser
272     <dd>If there is another processed element type declaration whose
273     <code>Name</code> is equal to the <code>Name</code> of the element type
274     declaration, then the parser <em class=rfc2119>MUST</em> raise an
275 wakaba 1.22 <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
276 wakaba 1.25 <dt>For each attribute definition list declaration being processed by the
277     parser
278     <dd>If there is another processed attribute defintion list declaration whose
279     <code>Name</code> is equal to the <code>Name</code> of the attribute
280     definition list declaration, then the parser <em class=rfc2119>MUST</em>
281     raise an
282     <a href="#xml-misc-warning" id=xme-attlist-unique><code>xml-misc-warning</code></a>.
283     <dd>For each attribute definition in the attribute definition list
284     declaration, if there is another processed attribute definition whose
285     <code>Name</code> is equal to the <code>Name</code> of the attribute
286     definition (whether or not in the same attribute definition list
287     declaration), then the parser <em class=rfc2119>MUST</em> raise an
288     <a href="#xml-misc-warning" id=xme-attrdef-unique><code>xml-misc-warning</code></a>.
289     <!--
290     <q>For interoperability, an XML processor <em class=rfc2119>MAY</em> at
291     user option issue a warning when more than one attribute-list declaration
292     is provided for a given element type, or more than one attribute definition
293     is provided for a given attribute, but this is not an error.</q>
294     -->
295 wakaba 1.22 <!--
296     NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
297     will not be warned.
298     -->
299 wakaba 1.25
300     <dt>For each entity declaration being processed by the parser
301 wakaba 1.26 <dd>Handle as follows:
302     <ol>
303     <li><p>If the entity declaration declares a general entity, the following
304     is applied:
305     <dl>
306     <dt>If the <code>Name</code> is <code>lt</code> or <code>amp</code>
307     <dd><p>If the entity declaration does not declare an internal entity,
308     or if the replacement text of the entity is not the escaped form of
309     <code>&lt;</code> (if <code>lt</code>) or <code>&amp;</code> (if
310     <code>amp</code>), then the parser <em class=rfc2119>MUST</em> raise an
311     <a href="#xml-misc-error" id=xme-double-escape><code>xml-misc-error</code></a>.
312    
313     <div class="note memo informative">
314     <p>In other word, the character in the <code>EntityValue</code>
315     has to be double-escaped.
316     </div>
317     <dt>If the <code>Name</code> is <code>gt</code>, <code>quot</code>, or
318     <code>apos</code>
319     <dd><p>If the entity declaration does not declare an internal entity,
320     or if the replacement text of the entity is not equal to or not the
321     escaped form of <code>></code> (if <code>gt</code>), <code>"</code> (if
322     <code>quot</code>), or <code>'</code> (if <code>apos</code>), then the
323     parser <em class=rfc2119>MUST</em> raise an
324     <a href="#xml-misc-error" id=xme-single-escape><code>xml-misc-error</code></a>.
325    
326     <div class="note memo informative">
327     <p>In other word, the character in the <code>EntityValue</code>
328     has to be single- or double-escaped.
329     </div>
330     </dl>
331     <!--
332     <q>If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless).</q>
333     -->
334    
335     <li><p>If the entity declaration has to be ignored since there has already
336     been declared an entity with the same <code>Name</code> as the declaration,
337     then the parser <em class=rfc2119>MUST</em> raise a
338     <a href="#misc-info" id=mi-ent-unique><code>misc-info</code></a>
339     and abort these steps.
340    
341     <div class="informative note memo">
342     <p>Five predefined entities, i.e. <code>amp</code>, <code>lt</code>,
343     <code>gt</code>, <code>quot</code>, and <code>apos</code>, are always
344     declared implicitly and therefore any declaration for such an entity
345     always raises an
346     <a href="#misc-info" id=mi-ent-unique><code>misc-info</code></a>.
347     </div>
348    
349     <li><p>If the entity declaration declares a parameter entity and the
350     <code>Name</code> of the entity begins with the string <code>xml</code>
351     (in any combination of upper- and lowercase letters), then the parser
352     <em class=rfc2119>MUST</em> raise an
353     <a href="#xml-misc-warning" id=xmw-reserved-pe-name><code>xml-misc-warning</code></a>.
354    
355     <li><p>If the entity declaration contains the <code>EntityValue</code>,
356     then for each occurence of any references to unparsed entities in the
357     <code>EntityValue</code>, the parser <em class=rfc2119>MUST</em> raise an
358     <a href="#xml-misc-error" id=xme-unparsed-in-ev><code>xml-misc-error</code></a>.
359     <!--
360     <q>It is an error for a reference to an unparsed entity to appear in the
361     EntityValue in an entity declaration.</q>
362     -->
363     <li><p>If the entity declaration declares a general entity, then an
364     <code>Entity</code> node <em class=rfc2119>MUST</em> be created and
365     appended to the <code>NamedNodeMap</code> object in the
366     <code>entities</code> attribute of the <code>DocumentType</code> node.
367    
368     <p class=ed>Read the external entity
369    
370     <p>If the replacement text of the entity is read, then parse the
371     replacement text as if it were referenced from the content of an
372     element (with no namespace bindings). If no <span class=ed>@@ parse error</span>
373     is raised by the parsing process, then the nodes generated by the
374     parsing <em class=rfc2119>MUST</em> be appended to the <code>Entity</code>
375     node. The parse error <em class=rfc2119>MUST NOT</em> be propagated to
376     the entire parsing process. Other kinds of errors
377     <em class=rfc2119>MUST</em> be propagated. The first parse error
378     <em class=rfc2119>MUST</em> abort the internal parsing process.
379     <span class=ed>@@ better wording</span>
380    
381     <p class=ed>@@ prop
382    
383     <p>Then, the <code>Entity</code> node and its descendant
384     <em class=rfc2119>MUST</em> be marked as read-only.
385     </ol>
386 wakaba 1.25
387     <dt>For each notation declaration being processed by the parser
388     <dd>If there is another processed notation declaration whose
389     <code>Name</code> is equal to the <code>Name</code> of the notation
390     declaration, then the parser <em class=rfc2119>MUST</em> raise an
391     <a href="#xml-validity-error" id=vc-uniquenotationname><code>xml-validity-error</code></a>.
392     <!-- <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#UniqueNotationName">Validity constraint: Unique Notation Name</a> -->
393    
394 wakaba 1.22 <dt>For each empty-element tag
395     <dd>If the <code>Name</code> of the tag is not declared by a processed
396     element type declaration as <code>EMPTY</code> content, then the parser
397     <em class=rfc2119>MUST</em> raise an
398     <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
399     <dt>For each start-tag
400     <dd>If the <code>Name</code> of the tag is declared by a processed element
401     type declaration as <code>EMPTY</code> content, then the parser
402     <em class=rfc2119>MUST</em> raise an
403     <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
404 wakaba 1.25
405     <dt>For each parameter entity reference
406     <dd>If the declaration for the entity is not read (i.e. no declaration
407     for the entity is processed or the external entity referenced by the
408     declaration cannot be retrieved), then:
409     <ul>
410     <li>If the parameter entity is contained in a declaration, then the
411     declaration <em class=rfc2119>MUST</em> be ignored <em>except</em> that
412     any error before the parameter entity <em class=rfc2119>MUST</em> be
413     raised as usual.
414     <li>If the parameter entity is contained in the status portion of a
415     conditional section, then the conditional section
416     <em class=rfc2119>MUST</em> be processed as if it were an
417     <code>IGNORE</code>d section.
418     <li>The parser <em class=rfc2119>MUST NOT</em> process any entity or
419     attribute-list declaration after the parameter entity reference in the DTD
420     <em>except</em> when the <code>standalone</code> pseudo-attribute of the
421     XML declaration (if any) is set to <code>yes</code>.
422     <!-- This requirement is enforced for internal DTD subset case in
423     XML 1.0/1.1 specification (section 5.1) but not for any other cases. -->
424     <!-- According to this definition, element type declarations, notation
425     declarations, and PIs ARE processed. -->
426     <li>If the parameter entity reference is the first reference to an entity
427     that is not read, then the parser <em class=rfc2119>MUST</em> raise an
428     <a href="#entity-error" id=ee-unread-pe><code>entity-error</code></a>.
429     <li>The <code>allDeclarationsProcessed</code> <span class=ed>@@ ref</span>
430     attribute of the <code>Document</code> node <em class=rfc2119>MUST</em> be
431     set to <code>false</code>.
432     </ul>
433     <dt>For each general entity reference in an attribute value or in the content
434     of an element
435     <dd>If the declaration for the entity is not read (i.e. no declaration for
436     the entity is processed or the external entity referenced by the declaration
437     cannot be retrieved), then:
438     <ul>
439     <li>If the general entity reference is the first reference to an entity
440     that is not read, then the parser <em class=rfc2119>MUST</em> raise an
441     <a href="#entity-error" id=ee-unread-ge><code>entity-error</code></a>.
442     <span class=ed>@@ entity declared WFC?</span>
443     <li class=ed>An unexpended entity reference node <em class=rfc2119>MUST</em> be inserted to the current node.
444     </ul>
445 wakaba 1.24 </dl>
446    
447 wakaba 1.25 <p class=ed>@@ MUST try to read external entity
448    
449     <p>In addition, the parser has to check whether the
450 wakaba 1.24 following constraints are met.
451    
452     <p><strong>Well-formedness constraints</strong>. For each violation to
453     one of constraints below, an
454     <a href="#xml-well-formedness-error"><code>xml-well-formedness-error</code></a>
455     <em class=rfc2119>MUST</em> be raised. The list of well-formedness
456     constraints is below:
457     <ul>
458     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wfc-PEinInternalSubset">Well-formedness constraint: PEs in Internal Subset</a>
459     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#GIMatch">Well-formedness constraint: Element Type Match</a>
460     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#uniqattspec">Well-formedness constraint: Unique Att Spec</a>
461     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NoExternalRefs">Well-formedness constraint: No External Entity References</a>
462     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#CleanAttrVals">Well-formedness constraint: No &lt; in Attribute Values</a>
463     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wf-Legalchar">Well-formedness constraint: Legal Character</a>
464     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#textent">Well-formedness constraint: Parsed Entity</a>
465     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#norecursion">Well-formedness constraint: No Recursion</a>
466     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#indtd">Well-formedness constraint: In DTD</a>
467     </ul>
468    
469     <p><strong>Validity constraints</strong>. For each violation to
470     one of constraints below, an
471     <a href="#xml-validity-error"><code>xml-validity-error</code></a>.
472     <em class=rfc2119>MUST</em> be raised. The list of validity
473     constraints is below:
474     <ul>
475     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinMarkupDecl">Validity constraint: Proper Declaration/PE Nesting</a>
476     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinGroup">Validity constraint: Proper Group/PE Nesting</a>
477     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#condsec-nesting">Validity constraint: Proper Conditional Section/PE Nesting</a>
478     </ul>
479    
480     <p><strong>Other creteria</strong>. For each violation to
481     one of constraints below, an
482     <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
483     <em class=rfc2119>MUST</em> be raised. The list of constraints is below:
484     <ul>
485     <li><q>For interoperability, if a parameter-entity reference appears in a
486     <code>choice</code>, <code>seq</code>, or <code>Mixed</code> construct, its
487     replacement text <em class=rfc2119>SHOULD</em> contain at least one non-blank
488     character, and neither the first nor last non-blank character of the
489     replacement text <em class=rfc2119>SHOULD</em> be a connector (<code>|</code>
490     or <code>,</code>).</q>
491     <li><q>External parsed entities <em class=rfc2119>SHOULD</em> each begin with a
492     text declaration.</q>
493     </ul>
494    
495    
496 wakaba 1.14 <!--
497    
498     @@ Need detailed review, but maybe should be in parsing phase
499    
500     #vc-check-rmd Validity constraint: Standalone Document Declaration
501    
502 wakaba 1.17 @@ Need dtailed review
503    
504     #wf-entdeclared Well-formedness constraint: Entity Declared
505     #vc-entdeclared Validity constraint: Entity Declared
506     "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
507    
508     @@ flaged and then reported in DOM check phase
509    
510     "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
511     "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
512     "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
513    
514     @@ in parsing phase
515    
516     "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
517 wakaba 1.23
518     @@ We should phrase out what the parser should do where the XML specification
519     left undefined. For example: Comment must be converted to a Comment node,
520     illegal xml:space value must be preserved, so on.
521 wakaba 1.17
522 wakaba 1.25 Warn <!ENTITY % xml... ...>
523    
524     -->
525    
526     <p>The parser <em class=rfc2119>MUST</em> raise an
527     <a href="#xml-well-formedness-error" id=wfe-syntax><code>xml-well-formedness-error</code></a>
528     for any failure to match to a production rule in the XML specification.
529     <!--
530     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#ExtSubset">Well-formedness constraint: External Subset</a>
531     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#PE-between-Decls">Well-formedness constraint: PE Between Declarations</a>
532 wakaba 1.14 -->
533     </div>
534    
535     <div class="section" id=checking-dom>
536     <h2>Checking <abbr>DOM</abbr></h2>
537    
538     <p>The following algorithms and definitions are applied to
539     <abbr>XML</abbr> documents; especially, they are not applied
540     to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
541 wakaba 1.9
542     <div class=section id=definitions>
543     <h3>Definitions</h3>
544    
545 wakaba 1.1 <p>The <dfn id=xml-version><abbr>XML</abbr> version</dfn> of a node is
546     the <abbr>XML</abbr> version of the document to which the node belongs.
547     For a <code>Documemt</code> node, the <abbr>XML</abbr> version
548     of the document is the value of the <code>xmlVersion</code>
549 wakaba 1.9 attribute of the node. For a <code>DocumentType</code> node whose
550     <code>ownerDocument</code> attribute is set to <code>null</code>,
551     the <abbr>XML</abbr> version of the document is <code>1.0</code><!--
552     since createDocumentType [DOM3CORE] assumes XML 1.0 [XML10TE] -->.
553 wakaba 1.1 For any other node, the <abbr>XML</abbr> version of the document
554     is that of the <code>Document</code> node contained in the
555     <code>ownerDocument</code> attribute of the node.</p>
556    
557 wakaba 1.9 </div>
558    
559 wakaba 1.10
560     <div class=section id=checking-components>
561 wakaba 1.19 <h3>Conformance Checking Algorithms for Components</h3>
562 wakaba 1.10
563 wakaba 1.19 <p>To
564     <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
565     validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
566     the following algorithm <em class=rfc2119>MUST</em> be used:</p>
567 wakaba 1.10 <ol>
568 wakaba 1.11 <li>If <var>s</var> contains a character that is <em>not</em> in
569     the character class <a href="#class-Char10"><code>Char10</code></a>,
570     then raise an
571     <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
572     <li>If <var>s</var> contains a character that is in the character
573     class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
574     then raise an
575     <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
576     <li>If <var>s</var> contains a character that is in the character
577     class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
578     then raise an
579     <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
580     <li class=ed>@@ XML 1.1 support</li>
581 wakaba 1.13 <li>If <var>s</var> contains a <code class=char>U+000D</code>
582     <code class=charname>CARRIAGE RETURN</code> character, then
583     raise a
584     <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
585     <span class=ed>@@ We should not raise duplicate errors for U+000D
586     in attribute values. In addition, we should support a mode where
587     U+000D will be serialized as &#x000D; (so that no round-trip-error
588     will be raised).</span></li>
589 wakaba 1.10 </ol>
590    
591 wakaba 1.19 <p>To
592     <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
593     <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
594     algorithm <em class=rfc2119>MUST</em> be used:</p>
595 wakaba 1.10 <ol>
596 wakaba 1.19 <li>If <var>s</var> is an empty string, then raise an
597     <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
598     Abort these steps.</li>
599     <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
600     <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
601     <li>If the first character in <var>s</var> is a character that is
602     <em>not</em> in the character class
603     <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
604     an
605     <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
606     <li>If a character other than the first character in <var>s</var> is a
607     character that is <em>not</em> in the character class
608     <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
609     <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
610 wakaba 1.12 <li>If <var>s</var> begins with the string <code>xml</code> (in any
611     case combination), then raise an
612     <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
613     <span class=ed>@@ except for attribute names <code>xml:lang</code>,
614 wakaba 1.25 <code>xml:space</code><!--, <code>xml:base</code>, <code>xml:id</code>,
615 wakaba 1.12 <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
616 wakaba 1.25 pi name <code>xml-stylesheet</code>-->.</span><!--
617 wakaba 1.15 "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
618 wakaba 1.19 xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
619 wakaba 1.15 --></li>
620 wakaba 1.12 <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
621     [XML11]. -->
622 wakaba 1.19 <li class=ed>@@ XML 1.1 support</li>
623 wakaba 1.10 </ol>
624    
625 wakaba 1.19 <p>To
626     <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
627     an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
628     following algorithm <em class=rfc2119>MUST</em> be used:</p>
629 wakaba 1.10 <ol>
630 wakaba 1.19 <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
631     <var>s</var> as a <code>Name</code></a>.</li>
632 wakaba 1.10 <li class=ed>@@</li>
633     </ol>
634    
635 wakaba 1.18 <p>To
636     <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
637 wakaba 1.10 a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
638 wakaba 1.18 following algorithm <em class=rfc2119>MUST</em> be used:</p>
639 wakaba 1.10 <ol>
640 wakaba 1.20 <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
641     <li>If <var>pid</var> contains a character that is <em>not</em> in the
642     character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
643     raise an
644     <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
645 wakaba 1.10 <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
646     <code class=charname>CHARACTER TABULATION</code>,
647     <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
648     and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
649     characters, if the first character of <var>pid</var> is
650     <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
651     if the last character of <var>pid</var> is <code class=char>U+0020</code>
652     <code class=charname>SPACE</code> character, or if there is a
653     <code class=char>U+0020</code> <code class=charname>SPACE</code> character
654     immediately followed by another <code class=char>U+0020</code>
655     <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
656 wakaba 1.18 <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
657     <span class=ed>Is this really a roundtripness problem? XML spec
658     does only define the way to match public identifiers in fact, no
659     canonical form.</span></li>
660 wakaba 1.10 </ol>
661    
662 wakaba 1.18 <p>To
663     <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
664     a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
665     following algorithm <em class=rfc2119>MUST</em> be used:</p>
666     <ol>
667     <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
668 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
669     <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
670 wakaba 1.18 <li>If <var>sid</var> contains both <code class=char>U+0022</code>
671     <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
672     <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
673     (<code class=char>'</code>) characters, raise an
674     <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
675     <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
676     <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
677     character, then raise an
678     <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
679     <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
680     reference, then raise a fact-level error (xml-misc-warning?).<!--
681     XML 1.0 has no conformance language for system identifier being
682     a URI reference. --></li>
683     </ol>
684    
685 wakaba 1.10 </div>
686    
687 wakaba 1.9 <div class=section id=checking-node>
688     <h3>Checking <code>Node</code></h3>
689    
690 wakaba 1.1 <p>The algorithm <dfn title="to check a node" id=algorithm-to-check-a-node>to
691     check a node (<dfn id=var-n><var>n</var></dfn>)</dfn> is defined as
692     following:</p>
693     <dl class=switch>
694     <dt>If <var>n</var> is an <code>Attr</code> node</dt>
695     <dd>
696     <ol>
697 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
698 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
699     <li>If the <code>prefix</code> attribute value is different from
700 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
701 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
702 wakaba 1.8 <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
703 wakaba 1.4 <code>childNodes</code> list of <var>n</var>,
704 wakaba 1.8 <ol>
705     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
706     <code>Text</code> or <code>EntityReference</code> node, then it is an
707     <a href="#xml-well-formedness-error" id=wfe-attr-child><code>xml-well-formedness-error</code></a>.</li>
708     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
709     <code>EntityReference</code> node, then it is an
710     <a href="#entity-error" id=ee-attr-er><code>entity-error</code></a>.</li>
711     <li>Otherwise,
712     <a href="#algorithm-to-check-a-node" title="check a node">check
713     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
714     </ol></li>
715 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
716     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
717     and <code>value</code> attribute of <var>n</var> is neither
718     <code>default</code> nor <code>preserve</code>, then it is an
719     <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
720     Note that <!ATTLIST e xml:space (default|preserve) "a">...
721     <e xml:space="default"/> is conforming and valid. --></li>
722     <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
723     successor] or an empty string, then xml-misc-warning
724     (a "fact"-level error; not an XML error).</li>
725 wakaba 1.1 <li class=ed>@@ <code>specified</code>,
726 wakaba 1.15 <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
727 wakaba 1.13 <!-- xml:space MUST be declared to be valid. -->
728 wakaba 1.16 <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
729     attribute <code>value</code> of <var>n</var>.</li>
730     <li>Validate the <var>n</var> against the <span class=ed>declared
731     type</span> as following:
732     <dl class=switch>
733     <dt class=ed><code>ID_ATTR</code></dt>
734     <dd>
735     <ol>
736     <li><span class=ed>Validate <var>v</var> as an
737     <code>Name</code>.</span> If it fails, then raise an
738     <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
739     <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
740     then raise an
741     <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
742     </ol>
743     </dd>
744     <dt class=ed>IDREF_ATTR</dt>
745     <dd>
746     <ol>
747     <li><span class=ed>Validate <var>v</var> as an
748     <code>Name</code>.</span> If it fails, then raise an
749     <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
750     <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
751     defined,</span> then raise an
752     <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
753     </ol>
754     </dd>
755     <dt class=ed>IDREFS_ATTR</dt>
756     <dd class=ed>@@</dd>
757     <dt class=ed>ENTITY_ATTR</dt>
758     <dd>
759     <ol>
760     <li><span class=ed>Validate <var>v</var> as an
761     <code>Name</code>.</span> If it fails, then raise an
762     <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
763     <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
764     defined,</span> then raise an
765     <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
766     </ol>
767     </dd>
768     <dt class=ed>ENTITIES_ATTR</dt>
769     <dd class=ed>@@</dd>
770     <dt class=ed>NMTOKEN_ATTR</dt>
771     <dd>
772     <ol>
773     <li><span class=ed>Validate <var>v</var> as an
774     <code>Nmtoken</code>.</span> If it fails, then raise an
775     <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
776     </ol>
777     </dd>
778     <dt class=ed>NMTOKENS_ATTR</dt>
779     <dd class=ed>@@</dd>
780     <dt class=ed>NOTATION_ATTR</dt>
781     <dd class=ed><var>v</var> must be one of enumerated values.
782     If not, then raise an
783     <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
784     <dt class=ed>ENUMERATED_ATTR</dt>
785     <dd class=ed><var>v</var> must be one of enumerated values.
786     If not, then raise an
787     <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
788     <dt class=ed>@@</dt>
789     </dl>
790     </li>
791 wakaba 1.17 <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
792 wakaba 1.16 raise an
793     <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
794 wakaba 1.17 <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
795     <li class=ed>@@ strict serialization error for U+000D, U+000A, and
796     U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
797 wakaba 1.1 </ol>
798     </dd>
799     <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
800 wakaba 1.8 <dd>
801     <ol>
802     <li class=ed></li>
803 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
804     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
805     and <span class=ed>its declared type is different from (default|preserve),
806 wakaba 1.22 (preserve|default), (default), or (preserve)</span>, then raise an
807 wakaba 1.13 <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
808 wakaba 1.8 <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
809     <code>childNodes</code> list of <var>n</var>,
810     <ol>
811     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
812     <code>Text</code> or <code>EntityReference</code> node, then it is an
813     <a href="#xml-well-formedness-error" id=wfe-at-child><code>xml-well-formedness-error</code></a>.</li>
814     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
815     <code>EntityReference</code> node, then it is an
816     <a href="#entity-error" id=ee-at-er><code>entity-error</code></a>.</li>
817     <li>Otherwise,
818     <a href="#algorithm-to-check-a-node" title="check a node">check
819     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
820     </ol></li>
821 wakaba 1.16 <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
822     be declared. If not, then raise an
823     <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
824     <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
825     values MUST all be distinct. If not, then raise an
826     <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
827     <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
828     element, then raise an
829     <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
830 wakaba 1.17 <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
831 wakaba 1.8 </ol>
832     </dd>
833 wakaba 1.1 <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
834     <dd>
835     <ol>
836 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
837 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
838     data</a>.</li>
839     <li>If the <code>data</code> attribute value contains
840 wakaba 1.12 a string <code>]]></code>, then raise an
841     <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
842 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
843     any nodes, they are in
844 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
845 wakaba 1.1 </ol>
846     </dd>
847     <dt>If <var>n</var> is a <code>Comment</code> node</dt>
848     <dd>
849     <ol>
850 wakaba 1.12 <li>Raise an
851     <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
852 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
853 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
854     data</a>.</li>
855     <li>If the <code>data</code> attribute value contains
856     a string <code>--</code>, or if it ends with a character
857 wakaba 1.12 <code>-</code>, then raise an
858     <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
859     "for compatibility" --></li>
860 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
861     any nodes, they are in
862 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
863 wakaba 1.1 </ol>
864     </dd>
865     <dt>If <var>n</var> is a <code>Document</code> node</dt>
866     <dd>
867     <ol>
868     <li>If <a href="#xml-version"><abbr>XML</abbr> version</a> of <var>n</var>
869     is different from <code>1.0</code> or <code>1.1</code>,
870     then it is an <code class=ed>unknown-error?</code>.</li>
871     <li>If the <code>xmlEncoding</code> attribute value does <em>not</em>
872     match to <code>[A-Za-z] ([A-Za-z0-9._] | '-')*</code>
873 wakaba 1.8 <span class=ed>@@ formal def</span>, then it is an
874 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-encoding><code>xml-well-formedness-error</code></a>.</li>
875     <li>The <code>childNodes</code> list of <var>n</var> have to
876     consist of zero or more <code>Comment</code> and/or
877     <code>ProcessingInstruction</code> nodes, followed by
878     an optional <code>DocumentType</code> node, followed
879     by zero or more <code>Comment</code> and/or
880     <code>ProcessingInstruction</code> nodes, followed by
881     an <code>Element</code> node, followed
882     by zero or more <code>Comment</code> and/or
883 wakaba 1.8 <code>ProcessingInstruction</code> nodes. Any violation to this is an
884 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-document-child><code>xml-well-formedness-error</code></a>.</li>
885 wakaba 1.8 <li>For each node <dfn id=var-document-nc><var>n<sub><var>c</var></sub></var></dfn>
886     in the <code>childNodes</code> list of <var>n</var>,
887     <ol>
888     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
889     <code>EntityReference</code> node, then
890     <a href="#algorithm-to-check-a-node" title="check a node">check
891     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
892     </ol></li>
893 wakaba 1.1 <li class=ed>@@ <code>allDeclarationsProcessed</code></li>
894     </ol>
895     </dd>
896     <dt>If <var>n</var> is a <code>DocumentFragment</code> node</dt>
897     <dd>
898     <ol>
899 wakaba 1.8 <li>For each node <dfn id=var-df-nc><var>n<sub><var>c</var></sub></var></dfn>
900     in the <code>childNodes</code> list of <var>n</var>,
901     <ol>
902     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
903     <code>Element</code>, <code>Text</code>, <code>CDATASection</code>,
904     <code>Comment</code>, <code>ProcessingInstruction</code>, or
905     <code>EntityReference</code> node, then it is an
906     <a href="#xml-well-formedness-error" id=wfe-df-child><code>xml-well-formedness-error</code></a>.</li>
907     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
908     <code>EntityReference</code> node, then it is an
909     <a href="#entity-error" id=ee-df-er><code>entity-error</code></a>.</li>
910     <li>Otherwise,
911     <a href="#algorithm-to-check-a-node" title="check a node">check
912     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
913     </ol></li>
914 wakaba 1.1 </ol>
915     </dd>
916     <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
917     <dd>
918     <ol>
919 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
920 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
921 wakaba 1.13 <li>Follow the following substeps:
922     <ol>
923     <li>If <code>ownerDocument</code> attribute of <var>n</var> is
924     <code>null</code>, then abort these substeps.</li>
925     <li>If <code>documentElement</code> attribute of the node
926     set to <code>ownerDocument</code> attribute of <var>n</var> is
927     <code>null</code>, then abort these substeps.</li>
928     <li>If <code>nodeName</code> attribute of the node set to
929     <code>documentElement</code> attribute of the node set to
930     <code>ownerDocument</code> attribute of <var>n</var> is
931     different from <code>nodeName</code> of <var>n</var>,
932     then raise an
933     <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
934     </ol>
935     </li>
936 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
937     the <code>publicId</code> attribute value as a public identifier</a>.</li>
938 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
939     the <code>systemId</code> attribute value as a system identifier</a>.</li>
940     <li>If the <code>publicId</code> attribute value of <var>n</var> is
941     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
942     value of <var>n</var> is <code>null</code>, then raise an
943     <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
944     <span class=ed>@@ publicId == null? Or, publicId == ""</span></li>
945 wakaba 1.8 <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
946     in the <code>childNodes</code> list of <var>n</var>,
947     <ol>
948     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
949     <code>ProcessingInstruction</code> node, then it is an
950     <a href="#xml-well-formedness-error" id=wfe-dt-child><code>xml-well-formedness-error</code></a>.
951     <span class=ed>@@ ref to manakai's extensions</span></li>
952     <li>Otherwise, <!-- if n_c is not an EntityReference node, -->
953     <a href="#algorithm-to-check-a-node" title="check a node">check
954     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
955     </ol></li>
956 wakaba 1.18 <li>For each node in the <code>entities</code>, <code>notations</code>,
957     and <code>elementTypes</code> lists of <var>n</var>,
958     <a href="#algorithm-to-check-a-node" title="check a node">check the
959     node</a> recursively.</li>
960     <li class=ed>@@ externally declared?</li>
961 wakaba 1.26 <li>If the <code>NamedNodeMap</code> object in the <code>entities</code>
962     attribute of <var>n</var> does not contain <code>Entity</code> nodes
963     whose <code>nodeName</code> attribute are <code>amp</code>,
964     <code>lt</code>, <code>gt</code>, <code>apos</code>, and <code>quot</code>
965     then raise
966     <a href="#xml-misc-recommentation" id=xmr-predefined><code>xml-misc-recommendation</code></a>(s).
967 wakaba 1.1 </ol>
968     </dd>
969     <dt>If <var>n</var> is an <code>Element</code> node</dt>
970     <dd>
971     <ol>
972 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
973 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
974     <li>If the <code>prefix</code> attribute value is different from
975 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
976 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
977 wakaba 1.8 <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
978     in the <code>childNodes</code> list of <var>n</var>,
979     <ol>
980     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
981     <code>Element</code>,
982     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
983     <code>ProcessingInstruction</code>, or
984     <code>EntityReference</code> node, then it is an
985     <a href="#xml-well-formedness-error" id=wfe-element-child><code>xml-well-formedness-error</code></a>.</li>
986     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
987     <code>EntityReference</code> node, then it is an
988     <a href="#entity-error" id=ee-element-er><code>entity-error</code></a>.</li>
989     <li>Otherwise,
990     <a href="#algorithm-to-check-a-node" title="check a node">check
991     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
992     </ol></li>
993 wakaba 1.15 <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
994 wakaba 1.18 <li>Let <var>attrs</var> be the value of the <code>attribute</code>
995     attribute of <var>n</var>. Check conformance of <var>attrs</var>
996     as following:
997     <ol>
998     <li>If <var>attrs</var> contains an <code>Attr</code> node whose
999     <code>nodeName</code> attribute value is equal to that of another
1000     <code>Attr</code> node in <var>attrs</var>, then raise an
1001     <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
1002     #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
1003     <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
1004     </ol>
1005     </li>
1006 wakaba 1.8 </ol>
1007     </dd>
1008     <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
1009     <dd>
1010     <ol>
1011     <li class=ed></li>
1012 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
1013 wakaba 1.8 any nodes, they are in
1014     <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
1015 wakaba 1.16 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
1016     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
1017     <li class=ed>@@ #vc-MixedChildrenUnique
1018     Validity constraint: No Duplicate Types</li>
1019     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
1020     <li>If there is more than one <code>AttributeDefinition</code> node
1021     with <span class=ed>attribute type <code>ID</code></span> in the
1022     <code>NamedNodeMap</code> list contained in the
1023     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
1024     <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
1025     <li>If there is more than one <code>AttributeDefinition</code> node
1026     with <span class=ed>attribute type <code>NOTATION</code></span> in the
1027     <code>NamedNodeMap</code> list contained in the
1028     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
1029     <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
1030     <li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
1031 wakaba 1.1 </ol>
1032     </dd>
1033 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
1034     <code>notationName</code> attribute value is <code>null</code> (i.e. a
1035     parsed entity)</dt>
1036 wakaba 1.1 <dd>
1037     <ol>
1038 wakaba 1.18 <li>Raise an
1039     <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
1040     <li><a href="#algorithm-to-validate-an-ncname">Validate the
1041 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1042 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1043     the <code>publicId</code> attribute value as a public identifier</a>.</li>
1044 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1045     the <code>systemId</code> attribute value as a system identifier</a>.</li>
1046     <li>If the <code>publicId</code> attribute value of <var>n</var> is
1047     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
1048     value of <var>n</var> is <code>null</code>, then raise an
1049     <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
1050 wakaba 1.8 <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
1051     in the <code>childNodes</code> list of <var>n</var>,
1052     <ol>
1053     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
1054     <code>Element</code>,
1055     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
1056     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
1057     node, then it is an
1058     <a href="#xml-well-formedness-error" id=wfe-entity-child><code>xml-well-formedness-error</code></a>.</li>
1059     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
1060     <code>EntityReference</code> node, then it is an
1061     <a href="#entity-error" id=ee-er-er><code>entity-error</code></a>.</li>
1062     <li>Otherwise,
1063     <a href="#algorithm-to-check-a-node" title="check a node">check
1064     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
1065     </ol></li>
1066 wakaba 1.1 </ol>
1067     </dd>
1068 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
1069     <code>notationName</code> attribute value is <em>not</em> <code>null</code>
1070     (i.e. an unparsed entity)</dt>
1071     <dd>
1072     <ol>
1073     <li><a href="#algorithm-to-validate-an-ncname">Validate the
1074     <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1075     <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1076     the <code>publicId</code> attribute value as a public identifier</a>.</li>
1077     <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1078     the <code>systemId</code> attribute value as a system identifier</a>.</li>
1079     <li>If the <code>systemId</code> attribute value of <var>n</var> is
1080     <code>null</code>, then raise an
1081     <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
1082     <li><a href="#algorithm-to-validate-an-ncname">Validate the
1083     <code>notationName</code> attribute value of <var>n</var> as an
1084     <code>NCName</code></a>.</li>
1085     <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
1086     <li>If the <code>childNodes</code> list of <var>n</var> contains
1087     any nodes, they are in
1088     <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
1089     </ol>
1090     </dd>
1091 wakaba 1.1 <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
1092     <dd>
1093     <ol>
1094     <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
1095 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
1096 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1097 wakaba 1.8 <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
1098     in the <code>childNodes</code> list of <var>n</var>,
1099     <ol>
1100     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
1101     <code>Element</code>,
1102     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
1103     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
1104     node, then it is an
1105     <a href="#xml-well-formedness-error" id=wfe-er-child><code>xml-well-formedness-error</code></a>.</li>
1106     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is <em>not</em> an
1107     <code>EntityReference</code> node, then it is an
1108     <a href="#entity-error" id=ee-entity-er><code>entity-error</code></a>.</li>
1109     <li>Otherwise,
1110     <a href="#algorithm-to-check-a-node" title="check a node">check
1111     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
1112     </ol></li>
1113 wakaba 1.1 </ol>
1114     </dd>
1115     <dt>If <var>n</var> is a <code>Notation</code> node</dt>
1116     <dd>
1117     <ol>
1118 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
1119 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1120 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1121     the <code>publicId</code> attribute value as a public identifier</a>.</li>
1122 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1123     the <code>systemId</code> attribute value as a system identifier</a>.</li>
1124 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
1125     any nodes, they are in
1126 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
1127 wakaba 1.1 </ol>
1128     </dd>
1129     <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
1130     <dd>
1131     <ol>
1132 wakaba 1.13 <li>If the <code>target</code> attribute value matches to the string
1133     <code>xml</code> in any case combination, then raise a
1134     <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
1135 wakaba 1.18 <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
1136 wakaba 1.1 <code>target</code> attribute value as an <code>NCName</code></a>.</li>
1137 wakaba 1.13 <li>Then,
1138 wakaba 1.19 <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
1139 wakaba 1.13 the <code>data</code> attribute value as an <abbr>XML</abbr> character
1140 wakaba 1.1 data</a>.</li>
1141 wakaba 1.13 <li>If the <code>data</code> attribute value contains a string
1142     <code>?></code>, then raise a
1143     <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
1144     <li>If the <code>data</code> attribute value starts with either
1145     <code class=char>U+0009</code> <code class=charname>CHARACTER
1146     TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
1147     FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
1148     RETURN</code>, or <code class=char>U+0020</code>
1149     <code class=charname>SPACE</code> character, then raise a
1150     <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
1151 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
1152 wakaba 1.13 any nodes, then raise an
1153 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
1154 wakaba 1.1 </ol>
1155     </dd>
1156     <dt>If <var>n</var> is a <code>Text</code> node</dt>
1157     <dd>
1158     <ol>
1159 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
1160 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
1161     data</a>.</li>
1162     <li>If the <code>childNodes</code> list of <var>n</var> contains
1163     any nodes, they are in
1164 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-text-child><code>xml-well-formedness-error</code></a>.</li>
1165 wakaba 1.1 </ol>
1166     </dd>
1167     <dt>Otherwise</dt>
1168     <dd><span class=ed>xml-well-formedness-error? unknown-error?</span></dd>
1169     </dl>
1170    
1171 wakaba 1.11 </div>
1172    
1173     </div>
1174    
1175     <div id=classes class=section>
1176     <h2>Character Classes</h2>
1177    
1178     <p>This section defines a couple of character classes.
1179     These classes are referred to by algorithms specified above.</p>
1180    
1181     <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
1182     contains the following characters:</p>
1183     <ul>
1184     <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1185     TABULATION</code></li>
1186     <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1187     <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1188     RETURN</code></li>
1189     <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
1190     .. <code class=char>U+D7FF</code></li>
1191     <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
1192     <code class=charname>REPLACEMENT CHARACTER</code></li>
1193     <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
1194     </ul>
1195     <div class="note memo">
1196     <p>This character class contains all characters allowed in the production rule
1197     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
1198     of <abbr>XML</abbr> 1.0
1199     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1200     </div>
1201    
1202     <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
1203     contains the following characters:</p>
1204     <ul>
1205     <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
1206     </ul>
1207    
1208     <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
1209     contains the following characters:</p>
1210     <ul>
1211     <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
1212     <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
1213     <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
1214     AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
1215     PROGRAM COMMAND</code></li>
1216     <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
1217     <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
1218     <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
1219     <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
1220     <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
1221     <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
1222     <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
1223     <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
1224     <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
1225     <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
1226     <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
1227     <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
1228     <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
1229     <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
1230     <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
1231     <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
1232     <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
1233     </ul>
1234     <div class="note memo">
1235     <p>This character class contains the characters listed in the Note in
1236     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
1237     of <abbr>XML</abbr> 1.0
1238     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
1239     as amended by
1240     <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1241 wakaba 1.1 </div>
1242 wakaba 1.19
1243     <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1244     contains the following characters:</p>
1245     <ul class=ed>
1246     </ul>
1247 wakaba 1.20 <div class="note memo">
1248     <p>This character class contains all characters allowed as the first character
1249     of a string matching to the production rule
1250     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1251     of <abbr>XML</abbr> 1.0
1252     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1253     </div>
1254 wakaba 1.19
1255     <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1256     contains the following characters:</p>
1257     <ul>
1258     <li>The characters in the character class
1259     <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1260     <li class=ed></li>
1261     </ul>
1262 wakaba 1.20 <div class="note memo">
1263     <p>This character class contains all characters allowed as the second
1264     character of a string matching to the production rule
1265     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1266     of <abbr>XML</abbr> 1.0
1267     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1268     </div>
1269    
1270     <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1271     contains the following characters:</p>
1272     <ul>
1273     <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1274     TABULATION</code></li>
1275     <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1276     <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1277     RETURN</code></li>
1278     <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1279     <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1280     (<code class=char>!</code>)</li>
1281     <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1282     (<code class=char>$</code>)</li>
1283     <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1284     (<code class=char>#</code>)</li>
1285     <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1286     (<code class=char>%</code>)</li>
1287     <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1288     (<code class=char>'</code>)</li>
1289     <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1290     (<code class=char>(</code>)</li>
1291     <li><code class=char>U+0029</code> <code class=charname>RIGHT
1292     PARENTHESIS</code> (<code class=char>)</code>)</li>
1293     <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1294     (<code class=char>*</code>)</li>
1295     <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1296     (<code class=char>+</code>)</li>
1297     <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1298     (<code class=char>,</code>)</li>
1299     <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1300     (<code class=char>-</code>)</li>
1301     <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1302     (<code class=char>.</code>)</li>
1303     <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1304     (<code class=char>/</code>)</li>
1305 wakaba 1.21 <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1306     (<code class=char>0</code>) .. <code class=char>U+0039</code>
1307     <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1308 wakaba 1.20 <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1309     (<code class=char>:</code>)</li>
1310     <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1311     (<code class=char>;</code>)</li>
1312     <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1313     (<code class=char>=</code>)</li>
1314     <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1315     (<code class=char>?</code>)</li>
1316     <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1317     (<code class=char>@</code>)</li>
1318 wakaba 1.21 <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1319     A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1320     <code class=charname>LATIN CAPITAL LETTER Z</code>
1321     (<code class=char>Z</code>)</li>
1322 wakaba 1.20 <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1323     (<code class=char>_</code>)</li>
1324 wakaba 1.21 <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1325     A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1326     <code class=charname>LATIN CAPITAL LETTER Z</code>
1327     (<code class=char>Z</code>)</li>
1328 wakaba 1.20 </ul>
1329     <div class="note memo">
1330     <p>This character class contains all characters allowed in the production rule
1331     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1332     of <abbr>XML</abbr> 1.0
1333     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1334     </div>
1335 wakaba 1.1
1336     </div>
1337    
1338     <div id="references" class="section reference">
1339     <h2>References</h2>
1340    
1341     <div id="normative-references" class="section normative reference">
1342     <h3>Normative References</h3>
1343    
1344     <dl>
1345 wakaba 1.3 <dt id=ref-DOM3CORE>DOM3CORE</dt>
1346     <dd class=ed>@@ W3C DOM Level 3 Core</dd>
1347     <dt id=ref-DOMDTDEF>DOMDTDEF</dt>
1348     <dd class=ed>@@ manakai's extension to DOM for document type definitions</dd>
1349 wakaba 1.1 <dt id="ref-KEYWORDS">KEYWORDS</dt>
1350     <dd><cite><a href="urn:ietf:rfc:2119">Key words for use in
1351     <abbr>RFCs</abbr> to Indicate Requirement Levels</a></cite>,
1352     <abbr title="Internet Engineering Task Force">IETF</abbr>
1353     <a href="urn:ietf:bcp:14"><abbr title="Best Current Practice">BCP</abbr>
1354     14</a>, <abbr title="Request for Comments">RFC</abbr> 2119,
1355     <time datetime=1997-03>March 1997</time>.
1356     <em>This</em> version of the specification is referenced.</dd>
1357     <dt id=ref-INFOSET>INFOSET</dt>
1358     <dd class=ed>@@</dd>
1359     <dt id=ref-XML10">XML10</dt>
1360     <dd><cite><a href="http://www.w3.org/TR/2006/REC-xml-20060816">Extensible
1361     Markup Language (<abbr>XML</abbr>) 1.0 (Fourth Edition)</a></cite>,
1362     <abbr>W3C</abbr> Recommendation, <time datetime=2006-08-16>16 August
1363     2006</time>, edited in place <time datetime=2006-09-29>29 September
1364     2006</time>. Tje latest version of the specification is available at
1365     <code class=uri>&lt;<a href="http://www.w3.org/TR/xml">http://www.w3.org/TR/xml</a>&gt;</code>.
1366     <em>This</em> version of the specification is referenced.</dd>
1367     <dt id=ref-XML11>XML11</dt>
1368     <dd class=ed>@@</dd>
1369     </dl>
1370     </div>
1371    
1372     <div id="informative-references" class="section informative reference">
1373     <h3>Non$B!>(Bnormative References</h3>
1374    
1375     <dl class=ed>
1376     <dt id=ref-CSS>CSS</dt>
1377     <dd><cite><a href="http://www.w3.org/TR/2007/CR-CSS21-20070719">Cascading
1378     Style Sheets Level 2 Revision 1 (<abbr>CSS</abbr> 2.1)
1379     Specification</a></cite>,
1380     <abbr title="World Wide Web Consortium">W3C</abbr> Candidate
1381     Recommendation, <time datetime=2007-07-19>19 July 2007</time>.
1382     Work in progress. The latest version of the specification is
1383     available at
1384     <code class=URI>&lt;<a href="http://www.w3.org/TR/CSS21">http://www.w3.org/TR/CSS21</a>></code>.</dd>
1385     <dt id=ref-CSSOM>CSSOM</dt>
1386     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/csswg/cssom/Overview.html?rev=1.55&amp;content-type=text/html;%20charset=utf-8">Cascading
1387     Style Sheets Object Model (<abbr>CSSOM</abbr>)</a></cite>,
1388     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1389     <time datetime=2007-06-18>18 June 2007</time>. Work in progress.
1390     The latest Editor's Draft of the specification is available at
1391     <code class=URI>&lt;<a href="http://dev.w3.org/csswg/cssom/Overview.html">http://dev.w3.org/csswg/cssom/Overview.html</a>></code>.</dd>
1392     <dt id=ref-HTML5>HTML5</dt>
1393     <dd><cite><a href="http://whatwg.org/html5"><abbr>HTML</abbr> 5</a></cite>,
1394     <abbr title="Web Hypertext Application Technology Working Group">WHATWG</abbr>
1395     Working Draft. Work in progress.</dd>
1396     <dt id=ref-SAPI>SAPI</dt>
1397     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?rev=1.28&amp;content-type=text/html;%20charset=utf-8">Selectors <abbr>API</abbr></a></cite>,
1398     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1399     <time datetime=2007-08-29>29 August 2007</time>. Work in progress.
1400     The latest Editor's Draft of the specification is available at
1401     <code class=URI>&lt;<a href="http://dev.w3.org/2006/webapi/selectors-api/Overview.html">http://dev.w3.org/2006/webapi/selectors-api/Overview.html</a>></code>.
1402     The latest published version of the specification is available at
1403     <code class=URI>&lt;<a href="http://www.w3.org/TR/selectors-api/">http://www.w3.org/TR/selectors-api/</a>></code>.</dd>
1404     <dt id=XBL2>XBL2</dt>
1405     <dd><cite><a href="http://www.mozilla.org/projects/xbl/xbl2.html"><abbr>XBL</abbr>
1406     2.0</a></cite>, Mozilla.org, <time datetime=2007-03-15>15 Mar 2007</time>.
1407     Work in progress. The latest
1408     <abbr title="World Wide Web Consortium">W3C</abbr>$B!>(Bpublished
1409     version of the specification is available at
1410     <code class=URI>&lt;<a href="http://www.w3.org/TR/xbl/">http://www.w3.org/TR/xbl/</a>></code>.</dd>
1411 wakaba 1.3 <dt id=ref-XML5>XML5</dt>
1412     <dd class=ed>@@</dd>
1413 wakaba 1.1 </dl>
1414     </div>
1415    
1416     </div>
1417    
1418     </body>
1419     </html>

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24