/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Contents of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.25 - (hide annotations) (download) (as text)
Sat Mar 29 02:22:57 2008 UTC (16 years, 7 months ago) by wakaba
Branch: MAIN
Changes since 1.24: +107 -19 lines
File MIME type: text/html
More text on parsing

1 wakaba 1.1 <!DOCTYPE html>
2     <html lang="en" class="formal-specification status-work-in-progress">
3     <head>
4 wakaba 1.2 <title>manakai's XML Conformance Checking</title>
5 wakaba 1.1 <link rel="stylesheet" href="http://suika.fam.cx/www/style/html/spec">
6     <link rel="license" href="http://suika.fam.cx/c/gnu/fdl">
7 wakaba 1.6 <style>
8     a[id]::before {
9     content: "[" attr(id) "]";
10     color: gray;
11     font-size: 70%;
12     font-family: Courier New, monospace;
13     vertical-align: 60%;
14     }
15     </style>
16 wakaba 1.1 </head>
17     <body class="has-abstract">
18    
19     <div class="header">
20 wakaba 1.12 <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21 wakaba 1.24 <h2>Working Draft <time datetime=2008-03-29>29 March 2008</time></h2>
22 wakaba 1.1
23 wakaba 1.2 <dl class="versions-uri">
24 wakaba 1.1 <dt>This Version</dt>
25 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
26     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
27 wakaba 1.1 <dt>Latest Version</dt>
28 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc"
29     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc</a>&gt;</code></dd>
30 wakaba 1.1 <dt>Latest Working Draft</dt>
31 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
32     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
33 wakaba 1.1 <!-- <dt>Previous Version</dt> -->
34     <dt>Version History</dt>
35 wakaba 1.2 <dd><code class=URI>&lt;<a href="http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html">http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html</a>&gt;</code></dd>
36 wakaba 1.1 </dl>
37     <dl class="authors" id="author">
38     <dt>Author</dt>
39     <dd><a href="http://suika.fam.cx/~wakaba/who?" rel="author"
40     lang=ja>Wakaba</a>
41     <code class="mail">&lt;<a href="mailto:w@suika.fam.cx" rel="author"
42     >w@suika.fam.cx</a>&gt;</code></dd>
43     </dl>
44    
45 wakaba 1.22 <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46 wakaba 1.1 href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47     Permission is granted to copy, distribute and/or modify this document
48     under the terms of the <a rel="license"
49     href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>,
50     Version 1.2
51     or any later version published by the <a
52     href="http://www.fsf.org/">Free Software Foundation</a>;
53     with no Invariant Sections, no Front$B!>(BCover Texts, and no Back$B!>(BCover
54     Texts. A copy of the license is
55     available at <code class="URI">&lt;<a
56     href="http://www.gnu.org/copyleft/fdl.html"
57     rel="license">http://www.gnu.org/copyleft/fdl.html</a>&gt;</code>.</p>
58     </div>
59    
60     <div id="abstract" class="section">
61     <h2>Abstract</h2>
62    
63     <div class="ed issue">...</div>
64    
65     </div>
66    
67     <div id="status" class="section">
68     <h2>Status of This Document</h2>
69    
70     <p class=section-info><em>This section describes the status of this document
71     at the time of its publication. Other documents
72     might supersede this document.</em></p>
73    
74     <p>This document is a working draft, produced as part of the
75     <a href="http://suika.fam.cx/www/markup/html/whatpm/readme">Whatpm</a>
76     subproject of the
77     <a href="http://suika.fam.cx/www/manakai-core/doc/web/">manakai</a>
78     project. It might be updated, replaced, or obsoleted by
79     other documents at any time. It is inappropriate to
80     cite this document as other than <q>work in progress</q>.</p>
81    
82     <p>Comments on this document are welcome and
83     may be sent to the <a href="#author">author</a>.</p>
84    
85     <p>Translations of thie document might be available.
86     The English version of the document is the only
87     normative version.</p>
88     </div>
89    
90     <div id="introduction" class="section">
91     <h2>Introduction</h2>
92    
93     <p class=section-info>This section is <em>non$B!>(Bnormative</em>.</p>
94    
95     <div class="issue ed">...</div>
96    
97    
98     </div>
99    
100     <div class="section" id="terminology">
101     <h2>Terminology</h2>
102    
103     <p>The key words <q><em class="rfc2119">MUST</em></q>,
104     <q><em class="rfc2119">MUST NOT</em></q>,
105     <q><em class="rfc2119">SHOULD</em></q>,
106     <q><em class="rfc2119">SHOULD NOT</em></q>, and
107     <q><em class=rfc2119>MAY</em></q> in this
108     document are to be interpreted as described in
109     <a href="urn:ietf:rfc:2119"><abbr title="Request for Comments">RFC</abbr>
110     2119</a>
111     <cite class="bibref normative">[<a href="#ref-KEYWORDS">KEYWORDS</a>]</cite>.</p>
112    
113     <!-- Strictly speaking the paragraph below is not part of terminology. -->
114     <p>All examples and notes in this specification are non$B!>(Bnormative,
115     as are all sections explicitly marked non$B!>(Bnormative. Everything
116     else in this specification is normative.</p>
117    
118     <p><span class=ed>Algorithm is normative but non-normative</span>.
119     In addition, the order in which <a href="#errors">errors</a> are
120     raised is undefined.</p>
121 wakaba 1.22
122     <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123     by hyperlinks. When the document being processed is an <abbr>XML</abbr> 1.1
124     document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125     specification should be consulted instead.</p>
126 wakaba 1.1 </div>
127    
128    
129 wakaba 1.9 <div class=section id=error-categories>
130 wakaba 1.14 <h2>Error Classification</h2>
131 wakaba 1.9
132 wakaba 1.1 <p class=ed>If a <code>Document</code> node has no
133     xml-well-formedness-error, entity-error, and unknown-error,
134     then it is well-formed. If a well-formed <code>Document</code>
135     node has no xml-validity-error, it is valid.</p>
136    
137     <div class="note memo">
138     <p>A well$B!>(Bformed <code>Document</code> can be safely serialized
139     into a well$B!>(Bformed XML document. A valid <code>Document</code>
140     can be easily serialized into a valid XML document.</p>
141     </div>
142    
143 wakaba 1.17 <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
144     ...</p>
145    
146 wakaba 1.1 <p>Errors are classified into these
147     <dfn id=error-category title="error category">error categories</dfn>:</p>
148    
149     <dl id=errors>
150     <dt><dfn id=entity-error><code>entity-error</code></dfn></dt>
151 wakaba 1.4 <dd>
152     <p class=ed>@@</p>
153     <div class="note memo">
154     <p>This <span class=ed>algorithm</span> does not support
155     <abbr>DOM</abbr> tree with one or more <code>EntityReference</code>
156     nodes. It is expected that any entity references are expanded
157     at the parse time and any unexpandable entity references
158     make parse time errors raised so that never result in
159     <abbr>DOM</abbr> tree with <code>EntityReference</code> nodes.</p>
160     </div>
161     </dd>
162 wakaba 1.1 <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
163     <dd class=ed>@@</dd>
164 wakaba 1.12 <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
165     <dd>
166     <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
167     will be raised when a construct, which might not be restored to the same
168     construct when it is serialized and then re-parsed by a conforming
169     processor, is encountered.</p>
170     <div class="example memo">
171     <p>For a <code>Comment</code> node a
172     <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
173     be raised, since <abbr>XML</abbr> processors are not required to
174     report texts of comments for applications.</p>
175     </div>
176     </dd>
177 wakaba 1.1 <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
178     <dd class=ed>@@</dd>
179     <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
180     <dd>An <abbr>XML</abbr> <i>error</i> (<abbr>XML</abbr> 1.0
181     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
182     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-error">error</a> /
183     <abbr>XML</abbr> 1.1
184     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
185     <a href="@@/#dt-error">error</a>)
186     that is not classified to any other <a href="#error-category">error
187     category</a>.</dd>
188     <dt><dfn id=xml-misc-fatal-error><code>xml-misc-fatal-error</code></dt>
189     <dd>An <abbr>XML</abbr> <i>fatal error</i> (<abbr>XML</abbr> 1.0
190     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
191     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-fatal">fatal
192     error</a> / <abbr>XML</abbr> 1.1
193     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
194     <a href="@@/#dt-fatal">fatal error</a>)
195     that is not classified to any other <a href="#error-category">error
196     category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
197 wakaba 1.13 <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
198     <dd>
199     <p>An
200     <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
201     will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
202     in <abbr>XML</abbr> specification is not met.</p>
203     </dd>
204 wakaba 1.1 <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
205     <dd>A violation of validity constraint in XML document.</dd>
206     <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
207     <dd><p>If an <code>xml-well-formedness-error</code> is raised,
208     it would not be possible to generate an XML serialization that
209     would match to the appropriate production rule and that would not
210     violate to any well$B!>(Bformedness constraint in XML
211     specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
212     <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
213     </dl>
214    
215     <div class=ed><p>@@ TODO: #dt-atuseroption at user option
216     (MAY or MUST), #dt-compat for compatibility,
217 wakaba 1.23 #dt-interop for interoperability</p>
218    
219     <p>TODO: XML 1.1, XML Namespace 1.0/1.1, xml:base, xml:id
220     </div>
221 wakaba 1.1
222 wakaba 1.9 </div>
223 wakaba 1.14
224     <div class=section id=parsing-xml>
225     <h2>Parsing <abbr>XML</abbr> Document</h2>
226    
227 wakaba 1.22 <p>When an <abbr>XML</abbr> document is parsed, the following clauses
228     are applied:</p>
229 wakaba 1.25 <dl class=switch>
230     <dt>For the document
231 wakaba 1.22 <dd>If the <abbr>XML</abbr> document does not begin with an
232     <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
233     raise an
234     <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
235 wakaba 1.25 <dt>For the document type declaration
236     <dd class=ed>@@ read external entity
237     <dt>For each internal general entity declaration being processed by the parser
238 wakaba 1.22 <dd>If the
239     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
240     part of the general entity declaration contains a bare <code>U+003C</code>
241     <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
242     <em class=rfc2119>MUST</em> raise an
243     <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
244 wakaba 1.14 "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
245 wakaba 1.22 -->
246 wakaba 1.25 <dt>For each element type declaration being processed by the parser
247     <dd>If there is another processed element type declaration whose
248     <code>Name</code> is equal to the <code>Name</code> of the element type
249     declaration, then the parser <em class=rfc2119>MUST</em> raise an
250 wakaba 1.22 <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
251 wakaba 1.25 <dt>For each attribute definition list declaration being processed by the
252     parser
253     <dd>If there is another processed attribute defintion list declaration whose
254     <code>Name</code> is equal to the <code>Name</code> of the attribute
255     definition list declaration, then the parser <em class=rfc2119>MUST</em>
256     raise an
257     <a href="#xml-misc-warning" id=xme-attlist-unique><code>xml-misc-warning</code></a>.
258     <dd>For each attribute definition in the attribute definition list
259     declaration, if there is another processed attribute definition whose
260     <code>Name</code> is equal to the <code>Name</code> of the attribute
261     definition (whether or not in the same attribute definition list
262     declaration), then the parser <em class=rfc2119>MUST</em> raise an
263     <a href="#xml-misc-warning" id=xme-attrdef-unique><code>xml-misc-warning</code></a>.
264     <!--
265     <q>For interoperability, an XML processor <em class=rfc2119>MAY</em> at
266     user option issue a warning when more than one attribute-list declaration
267     is provided for a given element type, or more than one attribute definition
268     is provided for a given attribute, but this is not an error.</q>
269     -->
270 wakaba 1.22 <!--
271     NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
272     will not be warned.
273     -->
274 wakaba 1.25
275     <dt>For each entity declaration being processed by the parser
276     <dd>If the entity declaration declares a parameter entity and the
277     <code>Name</code> of the entity begins with the string <code>xml</code>
278     (in any combination of upper- and lowercase letters), then the parser
279     <em class=rfc2119>MUST</em> raise an
280     <a href="#xml-misc-warning" id=xmw-reserved-pe-name><code>xml-misc-warning</code></a>.
281     <dd>If the entity declaration contains the <code>EntityValue</code>, then
282     for each occurence of any references to unparsed entities in the
283     <code>EntityValue</code>, the parser <em class=rfc2119>MUST</em> raise an
284     <a href="#xml-misc-error" id=xme-unparsed-in-ev><code>xml-misc-error</code></a>.
285     <!--
286     <q>It is an error for a reference to an unparsed entity to appear in the
287     EntityValue in an entity declaration.</q>
288     -->
289    
290     <dt>For each notation declaration being processed by the parser
291     <dd>If there is another processed notation declaration whose
292     <code>Name</code> is equal to the <code>Name</code> of the notation
293     declaration, then the parser <em class=rfc2119>MUST</em> raise an
294     <a href="#xml-validity-error" id=vc-uniquenotationname><code>xml-validity-error</code></a>.
295     <!-- <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#UniqueNotationName">Validity constraint: Unique Notation Name</a> -->
296    
297 wakaba 1.22 <dt>For each empty-element tag
298     <dd>If the <code>Name</code> of the tag is not declared by a processed
299     element type declaration as <code>EMPTY</code> content, then the parser
300     <em class=rfc2119>MUST</em> raise an
301     <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
302     <dt>For each start-tag
303     <dd>If the <code>Name</code> of the tag is declared by a processed element
304     type declaration as <code>EMPTY</code> content, then the parser
305     <em class=rfc2119>MUST</em> raise an
306     <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
307 wakaba 1.25
308     <dt>For each parameter entity reference
309     <dd>If the declaration for the entity is not read (i.e. no declaration
310     for the entity is processed or the external entity referenced by the
311     declaration cannot be retrieved), then:
312     <ul>
313     <li>If the parameter entity is contained in a declaration, then the
314     declaration <em class=rfc2119>MUST</em> be ignored <em>except</em> that
315     any error before the parameter entity <em class=rfc2119>MUST</em> be
316     raised as usual.
317     <li>If the parameter entity is contained in the status portion of a
318     conditional section, then the conditional section
319     <em class=rfc2119>MUST</em> be processed as if it were an
320     <code>IGNORE</code>d section.
321     <li>The parser <em class=rfc2119>MUST NOT</em> process any entity or
322     attribute-list declaration after the parameter entity reference in the DTD
323     <em>except</em> when the <code>standalone</code> pseudo-attribute of the
324     XML declaration (if any) is set to <code>yes</code>.
325     <!-- This requirement is enforced for internal DTD subset case in
326     XML 1.0/1.1 specification (section 5.1) but not for any other cases. -->
327     <!-- According to this definition, element type declarations, notation
328     declarations, and PIs ARE processed. -->
329     <li>If the parameter entity reference is the first reference to an entity
330     that is not read, then the parser <em class=rfc2119>MUST</em> raise an
331     <a href="#entity-error" id=ee-unread-pe><code>entity-error</code></a>.
332     <li>The <code>allDeclarationsProcessed</code> <span class=ed>@@ ref</span>
333     attribute of the <code>Document</code> node <em class=rfc2119>MUST</em> be
334     set to <code>false</code>.
335     </ul>
336     <dt>For each general entity reference in an attribute value or in the content
337     of an element
338     <dd>If the declaration for the entity is not read (i.e. no declaration for
339     the entity is processed or the external entity referenced by the declaration
340     cannot be retrieved), then:
341     <ul>
342     <li>If the general entity reference is the first reference to an entity
343     that is not read, then the parser <em class=rfc2119>MUST</em> raise an
344     <a href="#entity-error" id=ee-unread-ge><code>entity-error</code></a>.
345     <span class=ed>@@ entity declared WFC?</span>
346     <li class=ed>An unexpended entity reference node <em class=rfc2119>MUST</em> be inserted to the current node.
347     </ul>
348 wakaba 1.24 </dl>
349    
350 wakaba 1.25 <p class=ed>@@ MUST try to read external entity
351    
352     <p>In addition, the parser has to check whether the
353 wakaba 1.24 following constraints are met.
354    
355     <p><strong>Well-formedness constraints</strong>. For each violation to
356     one of constraints below, an
357     <a href="#xml-well-formedness-error"><code>xml-well-formedness-error</code></a>
358     <em class=rfc2119>MUST</em> be raised. The list of well-formedness
359     constraints is below:
360     <ul>
361     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wfc-PEinInternalSubset">Well-formedness constraint: PEs in Internal Subset</a>
362     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#GIMatch">Well-formedness constraint: Element Type Match</a>
363     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#uniqattspec">Well-formedness constraint: Unique Att Spec</a>
364     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NoExternalRefs">Well-formedness constraint: No External Entity References</a>
365     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#CleanAttrVals">Well-formedness constraint: No &lt; in Attribute Values</a>
366     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wf-Legalchar">Well-formedness constraint: Legal Character</a>
367     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#textent">Well-formedness constraint: Parsed Entity</a>
368     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#norecursion">Well-formedness constraint: No Recursion</a>
369     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#indtd">Well-formedness constraint: In DTD</a>
370     </ul>
371    
372     <p><strong>Validity constraints</strong>. For each violation to
373     one of constraints below, an
374     <a href="#xml-validity-error"><code>xml-validity-error</code></a>.
375     <em class=rfc2119>MUST</em> be raised. The list of validity
376     constraints is below:
377     <ul>
378     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinMarkupDecl">Validity constraint: Proper Declaration/PE Nesting</a>
379     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinGroup">Validity constraint: Proper Group/PE Nesting</a>
380     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#condsec-nesting">Validity constraint: Proper Conditional Section/PE Nesting</a>
381     </ul>
382    
383     <p><strong>Other creteria</strong>. For each violation to
384     one of constraints below, an
385     <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
386     <em class=rfc2119>MUST</em> be raised. The list of constraints is below:
387     <ul>
388     <li><q>For interoperability, if a parameter-entity reference appears in a
389     <code>choice</code>, <code>seq</code>, or <code>Mixed</code> construct, its
390     replacement text <em class=rfc2119>SHOULD</em> contain at least one non-blank
391     character, and neither the first nor last non-blank character of the
392     replacement text <em class=rfc2119>SHOULD</em> be a connector (<code>|</code>
393     or <code>,</code>).</q>
394     <li><q>External parsed entities <em class=rfc2119>SHOULD</em> each begin with a
395     text declaration.</q>
396     </ul>
397    
398    
399 wakaba 1.14 <!--
400    
401    
402     @@ Need detailed review, but maybe should be in parsing phase
403    
404     #vc-check-rmd Validity constraint: Standalone Document Declaration
405    
406 wakaba 1.17 @@ Need dtailed review
407    
408     #wf-entdeclared Well-formedness constraint: Entity Declared
409     #vc-entdeclared Validity constraint: Entity Declared
410     "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
411     "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
412    
413     @@ flaged and then reported in DOM check phase
414    
415     "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
416     "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
417     "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
418    
419     @@ in parsing phase
420    
421     "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
422 wakaba 1.23
423     @@ We should phrase out what the parser should do where the XML specification
424     left undefined. For example: Comment must be converted to a Comment node,
425     illegal xml:space value must be preserved, so on.
426 wakaba 1.17
427 wakaba 1.25 Warn <!ENTITY % xml... ...>
428    
429     -->
430    
431     <p>The parser <em class=rfc2119>MUST</em> raise an
432     <a href="#xml-well-formedness-error" id=wfe-syntax><code>xml-well-formedness-error</code></a>
433     for any failure to match to a production rule in the XML specification.
434     <!--
435     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#ExtSubset">Well-formedness constraint: External Subset</a>
436     <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#PE-between-Decls">Well-formedness constraint: PE Between Declarations</a>
437 wakaba 1.14 -->
438     </div>
439    
440     <div class="section" id=checking-dom>
441     <h2>Checking <abbr>DOM</abbr></h2>
442    
443     <p>The following algorithms and definitions are applied to
444     <abbr>XML</abbr> documents; especially, they are not applied
445     to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
446 wakaba 1.9
447     <div class=section id=definitions>
448     <h3>Definitions</h3>
449    
450 wakaba 1.1 <p>The <dfn id=xml-version><abbr>XML</abbr> version</dfn> of a node is
451     the <abbr>XML</abbr> version of the document to which the node belongs.
452     For a <code>Documemt</code> node, the <abbr>XML</abbr> version
453     of the document is the value of the <code>xmlVersion</code>
454 wakaba 1.9 attribute of the node. For a <code>DocumentType</code> node whose
455     <code>ownerDocument</code> attribute is set to <code>null</code>,
456     the <abbr>XML</abbr> version of the document is <code>1.0</code><!--
457     since createDocumentType [DOM3CORE] assumes XML 1.0 [XML10TE] -->.
458 wakaba 1.1 For any other node, the <abbr>XML</abbr> version of the document
459     is that of the <code>Document</code> node contained in the
460     <code>ownerDocument</code> attribute of the node.</p>
461    
462 wakaba 1.9 </div>
463    
464 wakaba 1.10
465     <div class=section id=checking-components>
466 wakaba 1.19 <h3>Conformance Checking Algorithms for Components</h3>
467 wakaba 1.10
468 wakaba 1.19 <p>To
469     <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
470     validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
471     the following algorithm <em class=rfc2119>MUST</em> be used:</p>
472 wakaba 1.10 <ol>
473 wakaba 1.11 <li>If <var>s</var> contains a character that is <em>not</em> in
474     the character class <a href="#class-Char10"><code>Char10</code></a>,
475     then raise an
476     <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
477     <li>If <var>s</var> contains a character that is in the character
478     class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
479     then raise an
480     <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
481     <li>If <var>s</var> contains a character that is in the character
482     class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
483     then raise an
484     <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
485     <li class=ed>@@ XML 1.1 support</li>
486 wakaba 1.13 <li>If <var>s</var> contains a <code class=char>U+000D</code>
487     <code class=charname>CARRIAGE RETURN</code> character, then
488     raise a
489     <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
490     <span class=ed>@@ We should not raise duplicate errors for U+000D
491     in attribute values. In addition, we should support a mode where
492     U+000D will be serialized as &#x000D; (so that no round-trip-error
493     will be raised).</span></li>
494 wakaba 1.10 </ol>
495    
496 wakaba 1.19 <p>To
497     <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
498     <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
499     algorithm <em class=rfc2119>MUST</em> be used:</p>
500 wakaba 1.10 <ol>
501 wakaba 1.19 <li>If <var>s</var> is an empty string, then raise an
502     <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
503     Abort these steps.</li>
504     <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
505     <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
506     <li>If the first character in <var>s</var> is a character that is
507     <em>not</em> in the character class
508     <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
509     an
510     <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
511     <li>If a character other than the first character in <var>s</var> is a
512     character that is <em>not</em> in the character class
513     <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
514     <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
515 wakaba 1.12 <li>If <var>s</var> begins with the string <code>xml</code> (in any
516     case combination), then raise an
517     <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
518     <span class=ed>@@ except for attribute names <code>xml:lang</code>,
519 wakaba 1.25 <code>xml:space</code><!--, <code>xml:base</code>, <code>xml:id</code>,
520 wakaba 1.12 <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
521 wakaba 1.25 pi name <code>xml-stylesheet</code>-->.</span><!--
522 wakaba 1.15 "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
523 wakaba 1.19 xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
524 wakaba 1.15 --></li>
525 wakaba 1.12 <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
526     [XML11]. -->
527 wakaba 1.19 <li class=ed>@@ XML 1.1 support</li>
528 wakaba 1.10 </ol>
529    
530 wakaba 1.19 <p>To
531     <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
532     an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
533     following algorithm <em class=rfc2119>MUST</em> be used:</p>
534 wakaba 1.10 <ol>
535 wakaba 1.19 <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
536     <var>s</var> as a <code>Name</code></a>.</li>
537 wakaba 1.10 <li class=ed>@@</li>
538     </ol>
539    
540 wakaba 1.18 <p>To
541     <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
542 wakaba 1.10 a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
543 wakaba 1.18 following algorithm <em class=rfc2119>MUST</em> be used:</p>
544 wakaba 1.10 <ol>
545 wakaba 1.20 <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
546     <li>If <var>pid</var> contains a character that is <em>not</em> in the
547     character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
548     raise an
549     <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
550 wakaba 1.10 <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
551     <code class=charname>CHARACTER TABULATION</code>,
552     <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
553     and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
554     characters, if the first character of <var>pid</var> is
555     <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
556     if the last character of <var>pid</var> is <code class=char>U+0020</code>
557     <code class=charname>SPACE</code> character, or if there is a
558     <code class=char>U+0020</code> <code class=charname>SPACE</code> character
559     immediately followed by another <code class=char>U+0020</code>
560     <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
561 wakaba 1.18 <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
562     <span class=ed>Is this really a roundtripness problem? XML spec
563     does only define the way to match public identifiers in fact, no
564     canonical form.</span></li>
565 wakaba 1.10 </ol>
566    
567 wakaba 1.18 <p>To
568     <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
569     a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
570     following algorithm <em class=rfc2119>MUST</em> be used:</p>
571     <ol>
572     <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
573 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
574     <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
575 wakaba 1.18 <li>If <var>sid</var> contains both <code class=char>U+0022</code>
576     <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
577     <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
578     (<code class=char>'</code>) characters, raise an
579     <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
580     <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
581     <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
582     character, then raise an
583     <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
584     <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
585     reference, then raise a fact-level error (xml-misc-warning?).<!--
586     XML 1.0 has no conformance language for system identifier being
587     a URI reference. --></li>
588     </ol>
589    
590 wakaba 1.10 </div>
591    
592 wakaba 1.9 <div class=section id=checking-node>
593     <h3>Checking <code>Node</code></h3>
594    
595 wakaba 1.1 <p>The algorithm <dfn title="to check a node" id=algorithm-to-check-a-node>to
596     check a node (<dfn id=var-n><var>n</var></dfn>)</dfn> is defined as
597     following:</p>
598     <dl class=switch>
599     <dt>If <var>n</var> is an <code>Attr</code> node</dt>
600     <dd>
601     <ol>
602 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
603 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
604     <li>If the <code>prefix</code> attribute value is different from
605 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
606 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
607 wakaba 1.8 <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
608 wakaba 1.4 <code>childNodes</code> list of <var>n</var>,
609 wakaba 1.8 <ol>
610     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
611     <code>Text</code> or <code>EntityReference</code> node, then it is an
612     <a href="#xml-well-formedness-error" id=wfe-attr-child><code>xml-well-formedness-error</code></a>.</li>
613     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
614     <code>EntityReference</code> node, then it is an
615     <a href="#entity-error" id=ee-attr-er><code>entity-error</code></a>.</li>
616     <li>Otherwise,
617     <a href="#algorithm-to-check-a-node" title="check a node">check
618     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
619     </ol></li>
620 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
621     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
622     and <code>value</code> attribute of <var>n</var> is neither
623     <code>default</code> nor <code>preserve</code>, then it is an
624     <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
625     Note that <!ATTLIST e xml:space (default|preserve) "a">...
626     <e xml:space="default"/> is conforming and valid. --></li>
627     <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
628     successor] or an empty string, then xml-misc-warning
629     (a "fact"-level error; not an XML error).</li>
630 wakaba 1.1 <li class=ed>@@ <code>specified</code>,
631 wakaba 1.15 <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
632 wakaba 1.13 <!-- xml:space MUST be declared to be valid. -->
633 wakaba 1.16 <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
634     attribute <code>value</code> of <var>n</var>.</li>
635     <li>Validate the <var>n</var> against the <span class=ed>declared
636     type</span> as following:
637     <dl class=switch>
638     <dt class=ed><code>ID_ATTR</code></dt>
639     <dd>
640     <ol>
641     <li><span class=ed>Validate <var>v</var> as an
642     <code>Name</code>.</span> If it fails, then raise an
643     <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
644     <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
645     then raise an
646     <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
647     </ol>
648     </dd>
649     <dt class=ed>IDREF_ATTR</dt>
650     <dd>
651     <ol>
652     <li><span class=ed>Validate <var>v</var> as an
653     <code>Name</code>.</span> If it fails, then raise an
654     <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
655     <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
656     defined,</span> then raise an
657     <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
658     </ol>
659     </dd>
660     <dt class=ed>IDREFS_ATTR</dt>
661     <dd class=ed>@@</dd>
662     <dt class=ed>ENTITY_ATTR</dt>
663     <dd>
664     <ol>
665     <li><span class=ed>Validate <var>v</var> as an
666     <code>Name</code>.</span> If it fails, then raise an
667     <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
668     <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
669     defined,</span> then raise an
670     <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
671     </ol>
672     </dd>
673     <dt class=ed>ENTITIES_ATTR</dt>
674     <dd class=ed>@@</dd>
675     <dt class=ed>NMTOKEN_ATTR</dt>
676     <dd>
677     <ol>
678     <li><span class=ed>Validate <var>v</var> as an
679     <code>Nmtoken</code>.</span> If it fails, then raise an
680     <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
681     </ol>
682     </dd>
683     <dt class=ed>NMTOKENS_ATTR</dt>
684     <dd class=ed>@@</dd>
685     <dt class=ed>NOTATION_ATTR</dt>
686     <dd class=ed><var>v</var> must be one of enumerated values.
687     If not, then raise an
688     <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
689     <dt class=ed>ENUMERATED_ATTR</dt>
690     <dd class=ed><var>v</var> must be one of enumerated values.
691     If not, then raise an
692     <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
693     <dt class=ed>@@</dt>
694     </dl>
695     </li>
696 wakaba 1.17 <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
697 wakaba 1.16 raise an
698     <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
699 wakaba 1.17 <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
700     <li class=ed>@@ strict serialization error for U+000D, U+000A, and
701     U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
702 wakaba 1.1 </ol>
703     </dd>
704     <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
705 wakaba 1.8 <dd>
706     <ol>
707     <li class=ed></li>
708 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
709     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
710     and <span class=ed>its declared type is different from (default|preserve),
711 wakaba 1.22 (preserve|default), (default), or (preserve)</span>, then raise an
712 wakaba 1.13 <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
713 wakaba 1.8 <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
714     <code>childNodes</code> list of <var>n</var>,
715     <ol>
716     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
717     <code>Text</code> or <code>EntityReference</code> node, then it is an
718     <a href="#xml-well-formedness-error" id=wfe-at-child><code>xml-well-formedness-error</code></a>.</li>
719     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
720     <code>EntityReference</code> node, then it is an
721     <a href="#entity-error" id=ee-at-er><code>entity-error</code></a>.</li>
722     <li>Otherwise,
723     <a href="#algorithm-to-check-a-node" title="check a node">check
724     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
725     </ol></li>
726 wakaba 1.16 <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
727     be declared. If not, then raise an
728     <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
729     <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
730     values MUST all be distinct. If not, then raise an
731     <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
732     <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
733     element, then raise an
734     <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
735 wakaba 1.17 <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
736 wakaba 1.8 </ol>
737     </dd>
738 wakaba 1.1 <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
739     <dd>
740     <ol>
741 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
742 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
743     data</a>.</li>
744     <li>If the <code>data</code> attribute value contains
745 wakaba 1.12 a string <code>]]></code>, then raise an
746     <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
747 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
748     any nodes, they are in
749 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
750 wakaba 1.1 </ol>
751     </dd>
752     <dt>If <var>n</var> is a <code>Comment</code> node</dt>
753     <dd>
754     <ol>
755 wakaba 1.12 <li>Raise an
756     <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
757 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
758 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
759     data</a>.</li>
760     <li>If the <code>data</code> attribute value contains
761     a string <code>--</code>, or if it ends with a character
762 wakaba 1.12 <code>-</code>, then raise an
763     <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
764     "for compatibility" --></li>
765 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
766     any nodes, they are in
767 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
768 wakaba 1.1 </ol>
769     </dd>
770     <dt>If <var>n</var> is a <code>Document</code> node</dt>
771     <dd>
772     <ol>
773     <li>If <a href="#xml-version"><abbr>XML</abbr> version</a> of <var>n</var>
774     is different from <code>1.0</code> or <code>1.1</code>,
775     then it is an <code class=ed>unknown-error?</code>.</li>
776     <li>If the <code>xmlEncoding</code> attribute value does <em>not</em>
777     match to <code>[A-Za-z] ([A-Za-z0-9._] | '-')*</code>
778 wakaba 1.8 <span class=ed>@@ formal def</span>, then it is an
779 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-encoding><code>xml-well-formedness-error</code></a>.</li>
780     <li>The <code>childNodes</code> list of <var>n</var> have to
781     consist of zero or more <code>Comment</code> and/or
782     <code>ProcessingInstruction</code> nodes, followed by
783     an optional <code>DocumentType</code> node, followed
784     by zero or more <code>Comment</code> and/or
785     <code>ProcessingInstruction</code> nodes, followed by
786     an <code>Element</code> node, followed
787     by zero or more <code>Comment</code> and/or
788 wakaba 1.8 <code>ProcessingInstruction</code> nodes. Any violation to this is an
789 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-document-child><code>xml-well-formedness-error</code></a>.</li>
790 wakaba 1.8 <li>For each node <dfn id=var-document-nc><var>n<sub><var>c</var></sub></var></dfn>
791     in the <code>childNodes</code> list of <var>n</var>,
792     <ol>
793     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
794     <code>EntityReference</code> node, then
795     <a href="#algorithm-to-check-a-node" title="check a node">check
796     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
797     </ol></li>
798 wakaba 1.1 <li class=ed>@@ <code>allDeclarationsProcessed</code></li>
799     </ol>
800     </dd>
801     <dt>If <var>n</var> is a <code>DocumentFragment</code> node</dt>
802     <dd>
803     <ol>
804 wakaba 1.8 <li>For each node <dfn id=var-df-nc><var>n<sub><var>c</var></sub></var></dfn>
805     in the <code>childNodes</code> list of <var>n</var>,
806     <ol>
807     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
808     <code>Element</code>, <code>Text</code>, <code>CDATASection</code>,
809     <code>Comment</code>, <code>ProcessingInstruction</code>, or
810     <code>EntityReference</code> node, then it is an
811     <a href="#xml-well-formedness-error" id=wfe-df-child><code>xml-well-formedness-error</code></a>.</li>
812     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
813     <code>EntityReference</code> node, then it is an
814     <a href="#entity-error" id=ee-df-er><code>entity-error</code></a>.</li>
815     <li>Otherwise,
816     <a href="#algorithm-to-check-a-node" title="check a node">check
817     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
818     </ol></li>
819 wakaba 1.1 </ol>
820     </dd>
821     <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
822     <dd>
823     <ol>
824 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
825 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
826 wakaba 1.13 <li>Follow the following substeps:
827     <ol>
828     <li>If <code>ownerDocument</code> attribute of <var>n</var> is
829     <code>null</code>, then abort these substeps.</li>
830     <li>If <code>documentElement</code> attribute of the node
831     set to <code>ownerDocument</code> attribute of <var>n</var> is
832     <code>null</code>, then abort these substeps.</li>
833     <li>If <code>nodeName</code> attribute of the node set to
834     <code>documentElement</code> attribute of the node set to
835     <code>ownerDocument</code> attribute of <var>n</var> is
836     different from <code>nodeName</code> of <var>n</var>,
837     then raise an
838     <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
839     </ol>
840     </li>
841 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
842     the <code>publicId</code> attribute value as a public identifier</a>.</li>
843 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
844     the <code>systemId</code> attribute value as a system identifier</a>.</li>
845     <li>If the <code>publicId</code> attribute value of <var>n</var> is
846     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
847     value of <var>n</var> is <code>null</code>, then raise an
848     <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
849     <span class=ed>@@ publicId == null? Or, publicId == ""</span></li>
850 wakaba 1.8 <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
851     in the <code>childNodes</code> list of <var>n</var>,
852     <ol>
853     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
854     <code>ProcessingInstruction</code> node, then it is an
855     <a href="#xml-well-formedness-error" id=wfe-dt-child><code>xml-well-formedness-error</code></a>.
856     <span class=ed>@@ ref to manakai's extensions</span></li>
857     <li>Otherwise, <!-- if n_c is not an EntityReference node, -->
858     <a href="#algorithm-to-check-a-node" title="check a node">check
859     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
860     </ol></li>
861 wakaba 1.18 <li>For each node in the <code>entities</code>, <code>notations</code>,
862     and <code>elementTypes</code> lists of <var>n</var>,
863     <a href="#algorithm-to-check-a-node" title="check a node">check the
864     node</a> recursively.</li>
865     <li class=ed>@@ externally declared?</li>
866 wakaba 1.1 </ol>
867     </dd>
868     <dt>If <var>n</var> is an <code>Element</code> node</dt>
869     <dd>
870     <ol>
871 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
872 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
873     <li>If the <code>prefix</code> attribute value is different from
874 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
875 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
876 wakaba 1.8 <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
877     in the <code>childNodes</code> list of <var>n</var>,
878     <ol>
879     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
880     <code>Element</code>,
881     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
882     <code>ProcessingInstruction</code>, or
883     <code>EntityReference</code> node, then it is an
884     <a href="#xml-well-formedness-error" id=wfe-element-child><code>xml-well-formedness-error</code></a>.</li>
885     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
886     <code>EntityReference</code> node, then it is an
887     <a href="#entity-error" id=ee-element-er><code>entity-error</code></a>.</li>
888     <li>Otherwise,
889     <a href="#algorithm-to-check-a-node" title="check a node">check
890     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
891     </ol></li>
892 wakaba 1.15 <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
893 wakaba 1.18 <li>Let <var>attrs</var> be the value of the <code>attribute</code>
894     attribute of <var>n</var>. Check conformance of <var>attrs</var>
895     as following:
896     <ol>
897     <li>If <var>attrs</var> contains an <code>Attr</code> node whose
898     <code>nodeName</code> attribute value is equal to that of another
899     <code>Attr</code> node in <var>attrs</var>, then raise an
900     <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
901     #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
902     <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
903     </ol>
904     </li>
905 wakaba 1.8 </ol>
906     </dd>
907     <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
908     <dd>
909     <ol>
910     <li class=ed></li>
911 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
912 wakaba 1.8 any nodes, they are in
913     <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
914 wakaba 1.16 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
915     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
916     <li class=ed>@@ #vc-MixedChildrenUnique
917     Validity constraint: No Duplicate Types</li>
918     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
919     <li>If there is more than one <code>AttributeDefinition</code> node
920     with <span class=ed>attribute type <code>ID</code></span> in the
921     <code>NamedNodeMap</code> list contained in the
922     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
923     <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
924     <li>If there is more than one <code>AttributeDefinition</code> node
925     with <span class=ed>attribute type <code>NOTATION</code></span> in the
926     <code>NamedNodeMap</code> list contained in the
927     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
928     <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
929     <li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
930 wakaba 1.1 </ol>
931     </dd>
932 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
933     <code>notationName</code> attribute value is <code>null</code> (i.e. a
934     parsed entity)</dt>
935 wakaba 1.1 <dd>
936     <ol>
937 wakaba 1.18 <li>Raise an
938     <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
939     <li><a href="#algorithm-to-validate-an-ncname">Validate the
940 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
941 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
942     the <code>publicId</code> attribute value as a public identifier</a>.</li>
943 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
944     the <code>systemId</code> attribute value as a system identifier</a>.</li>
945     <li>If the <code>publicId</code> attribute value of <var>n</var> is
946     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
947     value of <var>n</var> is <code>null</code>, then raise an
948     <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
949 wakaba 1.8 <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
950     in the <code>childNodes</code> list of <var>n</var>,
951     <ol>
952     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
953     <code>Element</code>,
954     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
955     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
956     node, then it is an
957     <a href="#xml-well-formedness-error" id=wfe-entity-child><code>xml-well-formedness-error</code></a>.</li>
958     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
959     <code>EntityReference</code> node, then it is an
960     <a href="#entity-error" id=ee-er-er><code>entity-error</code></a>.</li>
961     <li>Otherwise,
962     <a href="#algorithm-to-check-a-node" title="check a node">check
963     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
964     </ol></li>
965 wakaba 1.1 </ol>
966     </dd>
967 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
968     <code>notationName</code> attribute value is <em>not</em> <code>null</code>
969     (i.e. an unparsed entity)</dt>
970     <dd>
971     <ol>
972     <li><a href="#algorithm-to-validate-an-ncname">Validate the
973     <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
974     <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
975     the <code>publicId</code> attribute value as a public identifier</a>.</li>
976     <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
977     the <code>systemId</code> attribute value as a system identifier</a>.</li>
978     <li>If the <code>systemId</code> attribute value of <var>n</var> is
979     <code>null</code>, then raise an
980     <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
981     <li><a href="#algorithm-to-validate-an-ncname">Validate the
982     <code>notationName</code> attribute value of <var>n</var> as an
983     <code>NCName</code></a>.</li>
984     <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
985     <li>If the <code>childNodes</code> list of <var>n</var> contains
986     any nodes, they are in
987     <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
988     </ol>
989     </dd>
990 wakaba 1.1 <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
991     <dd>
992     <ol>
993     <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
994 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
995 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
996 wakaba 1.8 <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
997     in the <code>childNodes</code> list of <var>n</var>,
998     <ol>
999     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
1000     <code>Element</code>,
1001     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
1002     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
1003     node, then it is an
1004     <a href="#xml-well-formedness-error" id=wfe-er-child><code>xml-well-formedness-error</code></a>.</li>
1005     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is <em>not</em> an
1006     <code>EntityReference</code> node, then it is an
1007     <a href="#entity-error" id=ee-entity-er><code>entity-error</code></a>.</li>
1008     <li>Otherwise,
1009     <a href="#algorithm-to-check-a-node" title="check a node">check
1010     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
1011     </ol></li>
1012 wakaba 1.1 </ol>
1013     </dd>
1014     <dt>If <var>n</var> is a <code>Notation</code> node</dt>
1015     <dd>
1016     <ol>
1017 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
1018 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1019 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1020     the <code>publicId</code> attribute value as a public identifier</a>.</li>
1021 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1022     the <code>systemId</code> attribute value as a system identifier</a>.</li>
1023 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
1024     any nodes, they are in
1025 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
1026 wakaba 1.1 </ol>
1027     </dd>
1028     <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
1029     <dd>
1030     <ol>
1031 wakaba 1.13 <li>If the <code>target</code> attribute value matches to the string
1032     <code>xml</code> in any case combination, then raise a
1033     <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
1034 wakaba 1.18 <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
1035 wakaba 1.1 <code>target</code> attribute value as an <code>NCName</code></a>.</li>
1036 wakaba 1.13 <li>Then,
1037 wakaba 1.19 <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
1038 wakaba 1.13 the <code>data</code> attribute value as an <abbr>XML</abbr> character
1039 wakaba 1.1 data</a>.</li>
1040 wakaba 1.13 <li>If the <code>data</code> attribute value contains a string
1041     <code>?></code>, then raise a
1042     <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
1043     <li>If the <code>data</code> attribute value starts with either
1044     <code class=char>U+0009</code> <code class=charname>CHARACTER
1045     TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
1046     FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
1047     RETURN</code>, or <code class=char>U+0020</code>
1048     <code class=charname>SPACE</code> character, then raise a
1049     <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
1050 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
1051 wakaba 1.13 any nodes, then raise an
1052 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
1053 wakaba 1.1 </ol>
1054     </dd>
1055     <dt>If <var>n</var> is a <code>Text</code> node</dt>
1056     <dd>
1057     <ol>
1058 wakaba 1.19 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
1059 wakaba 1.1 <code>data</code> attribute value as an <abbr>XML</abbr> character
1060     data</a>.</li>
1061     <li>If the <code>childNodes</code> list of <var>n</var> contains
1062     any nodes, they are in
1063 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-text-child><code>xml-well-formedness-error</code></a>.</li>
1064 wakaba 1.1 </ol>
1065     </dd>
1066     <dt>Otherwise</dt>
1067     <dd><span class=ed>xml-well-formedness-error? unknown-error?</span></dd>
1068     </dl>
1069    
1070 wakaba 1.11 </div>
1071    
1072     </div>
1073    
1074     <div id=classes class=section>
1075     <h2>Character Classes</h2>
1076    
1077     <p>This section defines a couple of character classes.
1078     These classes are referred to by algorithms specified above.</p>
1079    
1080     <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
1081     contains the following characters:</p>
1082     <ul>
1083     <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1084     TABULATION</code></li>
1085     <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1086     <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1087     RETURN</code></li>
1088     <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
1089     .. <code class=char>U+D7FF</code></li>
1090     <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
1091     <code class=charname>REPLACEMENT CHARACTER</code></li>
1092     <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
1093     </ul>
1094     <div class="note memo">
1095     <p>This character class contains all characters allowed in the production rule
1096     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
1097     of <abbr>XML</abbr> 1.0
1098     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1099     </div>
1100    
1101     <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
1102     contains the following characters:</p>
1103     <ul>
1104     <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
1105     </ul>
1106    
1107     <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
1108     contains the following characters:</p>
1109     <ul>
1110     <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
1111     <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
1112     <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
1113     AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
1114     PROGRAM COMMAND</code></li>
1115     <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
1116     <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
1117     <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
1118     <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
1119     <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
1120     <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
1121     <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
1122     <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
1123     <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
1124     <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
1125     <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
1126     <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
1127     <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
1128     <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
1129     <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
1130     <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
1131     <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
1132     </ul>
1133     <div class="note memo">
1134     <p>This character class contains the characters listed in the Note in
1135     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
1136     of <abbr>XML</abbr> 1.0
1137     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
1138     as amended by
1139     <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1140 wakaba 1.1 </div>
1141 wakaba 1.19
1142     <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1143     contains the following characters:</p>
1144     <ul class=ed>
1145     </ul>
1146 wakaba 1.20 <div class="note memo">
1147     <p>This character class contains all characters allowed as the first character
1148     of a string matching to the production rule
1149     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1150     of <abbr>XML</abbr> 1.0
1151     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1152     </div>
1153 wakaba 1.19
1154     <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1155     contains the following characters:</p>
1156     <ul>
1157     <li>The characters in the character class
1158     <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1159     <li class=ed></li>
1160     </ul>
1161 wakaba 1.20 <div class="note memo">
1162     <p>This character class contains all characters allowed as the second
1163     character of a string matching to the production rule
1164     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1165     of <abbr>XML</abbr> 1.0
1166     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1167     </div>
1168    
1169     <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1170     contains the following characters:</p>
1171     <ul>
1172     <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1173     TABULATION</code></li>
1174     <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1175     <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1176     RETURN</code></li>
1177     <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1178     <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1179     (<code class=char>!</code>)</li>
1180     <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1181     (<code class=char>$</code>)</li>
1182     <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1183     (<code class=char>#</code>)</li>
1184     <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1185     (<code class=char>%</code>)</li>
1186     <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1187     (<code class=char>'</code>)</li>
1188     <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1189     (<code class=char>(</code>)</li>
1190     <li><code class=char>U+0029</code> <code class=charname>RIGHT
1191     PARENTHESIS</code> (<code class=char>)</code>)</li>
1192     <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1193     (<code class=char>*</code>)</li>
1194     <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1195     (<code class=char>+</code>)</li>
1196     <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1197     (<code class=char>,</code>)</li>
1198     <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1199     (<code class=char>-</code>)</li>
1200     <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1201     (<code class=char>.</code>)</li>
1202     <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1203     (<code class=char>/</code>)</li>
1204 wakaba 1.21 <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1205     (<code class=char>0</code>) .. <code class=char>U+0039</code>
1206     <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1207 wakaba 1.20 <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1208     (<code class=char>:</code>)</li>
1209     <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1210     (<code class=char>;</code>)</li>
1211     <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1212     (<code class=char>=</code>)</li>
1213     <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1214     (<code class=char>?</code>)</li>
1215     <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1216     (<code class=char>@</code>)</li>
1217 wakaba 1.21 <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1218     A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1219     <code class=charname>LATIN CAPITAL LETTER Z</code>
1220     (<code class=char>Z</code>)</li>
1221 wakaba 1.20 <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1222     (<code class=char>_</code>)</li>
1223 wakaba 1.21 <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1224     A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1225     <code class=charname>LATIN CAPITAL LETTER Z</code>
1226     (<code class=char>Z</code>)</li>
1227 wakaba 1.20 </ul>
1228     <div class="note memo">
1229     <p>This character class contains all characters allowed in the production rule
1230     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1231     of <abbr>XML</abbr> 1.0
1232     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1233     </div>
1234 wakaba 1.1
1235     </div>
1236    
1237     <div id="references" class="section reference">
1238     <h2>References</h2>
1239    
1240     <div id="normative-references" class="section normative reference">
1241     <h3>Normative References</h3>
1242    
1243     <dl>
1244 wakaba 1.3 <dt id=ref-DOM3CORE>DOM3CORE</dt>
1245     <dd class=ed>@@ W3C DOM Level 3 Core</dd>
1246     <dt id=ref-DOMDTDEF>DOMDTDEF</dt>
1247     <dd class=ed>@@ manakai's extension to DOM for document type definitions</dd>
1248 wakaba 1.1 <dt id="ref-KEYWORDS">KEYWORDS</dt>
1249     <dd><cite><a href="urn:ietf:rfc:2119">Key words for use in
1250     <abbr>RFCs</abbr> to Indicate Requirement Levels</a></cite>,
1251     <abbr title="Internet Engineering Task Force">IETF</abbr>
1252     <a href="urn:ietf:bcp:14"><abbr title="Best Current Practice">BCP</abbr>
1253     14</a>, <abbr title="Request for Comments">RFC</abbr> 2119,
1254     <time datetime=1997-03>March 1997</time>.
1255     <em>This</em> version of the specification is referenced.</dd>
1256     <dt id=ref-INFOSET>INFOSET</dt>
1257     <dd class=ed>@@</dd>
1258     <dt id=ref-XML10">XML10</dt>
1259     <dd><cite><a href="http://www.w3.org/TR/2006/REC-xml-20060816">Extensible
1260     Markup Language (<abbr>XML</abbr>) 1.0 (Fourth Edition)</a></cite>,
1261     <abbr>W3C</abbr> Recommendation, <time datetime=2006-08-16>16 August
1262     2006</time>, edited in place <time datetime=2006-09-29>29 September
1263     2006</time>. Tje latest version of the specification is available at
1264     <code class=uri>&lt;<a href="http://www.w3.org/TR/xml">http://www.w3.org/TR/xml</a>&gt;</code>.
1265     <em>This</em> version of the specification is referenced.</dd>
1266     <dt id=ref-XML11>XML11</dt>
1267     <dd class=ed>@@</dd>
1268     </dl>
1269     </div>
1270    
1271     <div id="informative-references" class="section informative reference">
1272     <h3>Non$B!>(Bnormative References</h3>
1273    
1274     <dl class=ed>
1275     <dt id=ref-CSS>CSS</dt>
1276     <dd><cite><a href="http://www.w3.org/TR/2007/CR-CSS21-20070719">Cascading
1277     Style Sheets Level 2 Revision 1 (<abbr>CSS</abbr> 2.1)
1278     Specification</a></cite>,
1279     <abbr title="World Wide Web Consortium">W3C</abbr> Candidate
1280     Recommendation, <time datetime=2007-07-19>19 July 2007</time>.
1281     Work in progress. The latest version of the specification is
1282     available at
1283     <code class=URI>&lt;<a href="http://www.w3.org/TR/CSS21">http://www.w3.org/TR/CSS21</a>></code>.</dd>
1284     <dt id=ref-CSSOM>CSSOM</dt>
1285     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/csswg/cssom/Overview.html?rev=1.55&amp;content-type=text/html;%20charset=utf-8">Cascading
1286     Style Sheets Object Model (<abbr>CSSOM</abbr>)</a></cite>,
1287     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1288     <time datetime=2007-06-18>18 June 2007</time>. Work in progress.
1289     The latest Editor's Draft of the specification is available at
1290     <code class=URI>&lt;<a href="http://dev.w3.org/csswg/cssom/Overview.html">http://dev.w3.org/csswg/cssom/Overview.html</a>></code>.</dd>
1291     <dt id=ref-HTML5>HTML5</dt>
1292     <dd><cite><a href="http://whatwg.org/html5"><abbr>HTML</abbr> 5</a></cite>,
1293     <abbr title="Web Hypertext Application Technology Working Group">WHATWG</abbr>
1294     Working Draft. Work in progress.</dd>
1295     <dt id=ref-SAPI>SAPI</dt>
1296     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?rev=1.28&amp;content-type=text/html;%20charset=utf-8">Selectors <abbr>API</abbr></a></cite>,
1297     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1298     <time datetime=2007-08-29>29 August 2007</time>. Work in progress.
1299     The latest Editor's Draft of the specification is available at
1300     <code class=URI>&lt;<a href="http://dev.w3.org/2006/webapi/selectors-api/Overview.html">http://dev.w3.org/2006/webapi/selectors-api/Overview.html</a>></code>.
1301     The latest published version of the specification is available at
1302     <code class=URI>&lt;<a href="http://www.w3.org/TR/selectors-api/">http://www.w3.org/TR/selectors-api/</a>></code>.</dd>
1303     <dt id=XBL2>XBL2</dt>
1304     <dd><cite><a href="http://www.mozilla.org/projects/xbl/xbl2.html"><abbr>XBL</abbr>
1305     2.0</a></cite>, Mozilla.org, <time datetime=2007-03-15>15 Mar 2007</time>.
1306     Work in progress. The latest
1307     <abbr title="World Wide Web Consortium">W3C</abbr>$B!>(Bpublished
1308     version of the specification is available at
1309     <code class=URI>&lt;<a href="http://www.w3.org/TR/xbl/">http://www.w3.org/TR/xbl/</a>></code>.</dd>
1310 wakaba 1.3 <dt id=ref-XML5>XML5</dt>
1311     <dd class=ed>@@</dd>
1312 wakaba 1.1 </dl>
1313     </div>
1314    
1315     </div>
1316    
1317     </body>
1318     </html>

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24