/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Contents of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.18 - (hide annotations) (download) (as text)
Sat Dec 1 14:32:30 2007 UTC (16 years, 5 months ago) by wakaba
Branch: MAIN
Changes since 1.17: +99 -33 lines
File MIME type: text/html
System id validation revised; unparsed entity validation specced; attr uniqueness check specced

1 wakaba 1.1 <!DOCTYPE html>
2     <html lang="en" class="formal-specification status-work-in-progress">
3     <head>
4 wakaba 1.2 <title>manakai's XML Conformance Checking</title>
5 wakaba 1.1 <link rel="stylesheet" href="http://suika.fam.cx/www/style/html/spec">
6     <link rel="license" href="http://suika.fam.cx/c/gnu/fdl">
7 wakaba 1.6 <style>
8     a[id]::before {
9     content: "[" attr(id) "]";
10     color: gray;
11     font-size: 70%;
12     font-family: Courier New, monospace;
13     vertical-align: 60%;
14     }
15     </style>
16 wakaba 1.1 </head>
17     <body class="has-abstract">
18    
19     <div class="header">
20 wakaba 1.12 <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21 wakaba 1.17 <h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2>
22 wakaba 1.1
23 wakaba 1.2 <dl class="versions-uri">
24 wakaba 1.1 <dt>This Version</dt>
25 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
26     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
27 wakaba 1.1 <dt>Latest Version</dt>
28 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc"
29     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc</a>&gt;</code></dd>
30 wakaba 1.1 <dt>Latest Working Draft</dt>
31 wakaba 1.2 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
32     >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
33 wakaba 1.1 <!-- <dt>Previous Version</dt> -->
34     <dt>Version History</dt>
35 wakaba 1.2 <dd><code class=URI>&lt;<a href="http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html">http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html</a>&gt;</code></dd>
36 wakaba 1.1 </dl>
37     <dl class="authors" id="author">
38     <dt>Author</dt>
39     <dd><a href="http://suika.fam.cx/~wakaba/who?" rel="author"
40     lang=ja>Wakaba</a>
41     <code class="mail">&lt;<a href="mailto:w@suika.fam.cx" rel="author"
42     >w@suika.fam.cx</a>&gt;</code></dd>
43     </dl>
44    
45     <p class="copyright" lang="en">&#xA9; <time>2007</time> <a
46     href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47     Permission is granted to copy, distribute and/or modify this document
48     under the terms of the <a rel="license"
49     href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>,
50     Version 1.2
51     or any later version published by the <a
52     href="http://www.fsf.org/">Free Software Foundation</a>;
53     with no Invariant Sections, no Front$B!>(BCover Texts, and no Back$B!>(BCover
54     Texts. A copy of the license is
55     available at <code class="URI">&lt;<a
56     href="http://www.gnu.org/copyleft/fdl.html"
57     rel="license">http://www.gnu.org/copyleft/fdl.html</a>&gt;</code>.</p>
58     </div>
59    
60     <div id="abstract" class="section">
61     <h2>Abstract</h2>
62    
63     <div class="ed issue">...</div>
64    
65     </div>
66    
67     <div id="status" class="section">
68     <h2>Status of This Document</h2>
69    
70     <p class=section-info><em>This section describes the status of this document
71     at the time of its publication. Other documents
72     might supersede this document.</em></p>
73    
74     <p>This document is a working draft, produced as part of the
75     <a href="http://suika.fam.cx/www/markup/html/whatpm/readme">Whatpm</a>
76     subproject of the
77     <a href="http://suika.fam.cx/www/manakai-core/doc/web/">manakai</a>
78     project. It might be updated, replaced, or obsoleted by
79     other documents at any time. It is inappropriate to
80     cite this document as other than <q>work in progress</q>.</p>
81    
82     <p>Comments on this document are welcome and
83     may be sent to the <a href="#author">author</a>.</p>
84    
85     <p>Translations of thie document might be available.
86     The English version of the document is the only
87     normative version.</p>
88     </div>
89    
90     <div id="introduction" class="section">
91     <h2>Introduction</h2>
92    
93     <p class=section-info>This section is <em>non$B!>(Bnormative</em>.</p>
94    
95     <div class="issue ed">...</div>
96    
97    
98     </div>
99    
100     <div class="section" id="terminology">
101     <h2>Terminology</h2>
102    
103     <p>The key words <q><em class="rfc2119">MUST</em></q>,
104     <q><em class="rfc2119">MUST NOT</em></q>,
105     <q><em class="rfc2119">SHOULD</em></q>,
106     <q><em class="rfc2119">SHOULD NOT</em></q>, and
107     <q><em class=rfc2119>MAY</em></q> in this
108     document are to be interpreted as described in
109     <a href="urn:ietf:rfc:2119"><abbr title="Request for Comments">RFC</abbr>
110     2119</a>
111     <cite class="bibref normative">[<a href="#ref-KEYWORDS">KEYWORDS</a>]</cite>.</p>
112    
113     <!-- Strictly speaking the paragraph below is not part of terminology. -->
114     <p>All examples and notes in this specification are non$B!>(Bnormative,
115     as are all sections explicitly marked non$B!>(Bnormative. Everything
116     else in this specification is normative.</p>
117    
118     <p><span class=ed>Algorithm is normative but non-normative</span>.
119     In addition, the order in which <a href="#errors">errors</a> are
120     raised is undefined.</p>
121     </div>
122    
123    
124 wakaba 1.9 <div class=section id=error-categories>
125 wakaba 1.14 <h2>Error Classification</h2>
126 wakaba 1.9
127 wakaba 1.1 <p class=ed>If a <code>Document</code> node has no
128     xml-well-formedness-error, entity-error, and unknown-error,
129     then it is well-formed. If a well-formed <code>Document</code>
130     node has no xml-validity-error, it is valid.</p>
131    
132     <div class="note memo">
133     <p>A well$B!>(Bformed <code>Document</code> can be safely serialized
134     into a well$B!>(Bformed XML document. A valid <code>Document</code>
135     can be easily serialized into a valid XML document.</p>
136     </div>
137    
138 wakaba 1.17 <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
139     ...</p>
140    
141 wakaba 1.1 <p>Errors are classified into these
142     <dfn id=error-category title="error category">error categories</dfn>:</p>
143    
144     <dl id=errors>
145     <dt><dfn id=entity-error><code>entity-error</code></dfn></dt>
146 wakaba 1.4 <dd>
147     <p class=ed>@@</p>
148     <div class="note memo">
149     <p>This <span class=ed>algorithm</span> does not support
150     <abbr>DOM</abbr> tree with one or more <code>EntityReference</code>
151     nodes. It is expected that any entity references are expanded
152     at the parse time and any unexpandable entity references
153     make parse time errors raised so that never result in
154     <abbr>DOM</abbr> tree with <code>EntityReference</code> nodes.</p>
155     </div>
156     </dd>
157 wakaba 1.1 <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
158     <dd class=ed>@@</dd>
159 wakaba 1.12 <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
160     <dd>
161     <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
162     will be raised when a construct, which might not be restored to the same
163     construct when it is serialized and then re-parsed by a conforming
164     processor, is encountered.</p>
165     <div class="example memo">
166     <p>For a <code>Comment</code> node a
167     <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
168     be raised, since <abbr>XML</abbr> processors are not required to
169     report texts of comments for applications.</p>
170     </div>
171     </dd>
172 wakaba 1.1 <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
173     <dd class=ed>@@</dd>
174     <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
175     <dd>An <abbr>XML</abbr> <i>error</i> (<abbr>XML</abbr> 1.0
176     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
177     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-error">error</a> /
178     <abbr>XML</abbr> 1.1
179     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
180     <a href="@@/#dt-error">error</a>)
181     that is not classified to any other <a href="#error-category">error
182     category</a>.</dd>
183     <dt><dfn id=xml-misc-fatal-error><code>xml-misc-fatal-error</code></dt>
184     <dd>An <abbr>XML</abbr> <i>fatal error</i> (<abbr>XML</abbr> 1.0
185     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
186     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-fatal">fatal
187     error</a> / <abbr>XML</abbr> 1.1
188     <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
189     <a href="@@/#dt-fatal">fatal error</a>)
190     that is not classified to any other <a href="#error-category">error
191     category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
192 wakaba 1.13 <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
193     <dd>
194     <p>An
195     <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
196     will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
197     in <abbr>XML</abbr> specification is not met.</p>
198     </dd>
199 wakaba 1.1 <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
200     <dd>A violation of validity constraint in XML document.</dd>
201     <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
202     <dd><p>If an <code>xml-well-formedness-error</code> is raised,
203     it would not be possible to generate an XML serialization that
204     would match to the appropriate production rule and that would not
205     violate to any well$B!>(Bformedness constraint in XML
206     specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
207     <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
208     </dl>
209    
210     <div class=ed><p>@@ TODO: #dt-atuseroption at user option
211     (MAY or MUST), #dt-compat for compatibility,
212     #dt-interop for interoperability</p></div>
213    
214 wakaba 1.9 </div>
215 wakaba 1.14
216     <div class=section id=parsing-xml>
217     <h2>Parsing <abbr>XML</abbr> Document</h2>
218    
219     <ul>
220     <li>If the <abbr>XML</abbr> document does not begin with an
221     <abbr>XML</abbr> declaration, then raise an
222     <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>
223     <li>If the replacement text of an entity declaration is
224     <code>&lt;</code>, then raise an
225     <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
226     "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
227     --></li>
228 wakaba 1.16 <li>If there is an element type declaration whose <code>Name</code>
229     value is already declared, then raise an
230     <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li>
231 wakaba 1.14 <li>If attribute definition whose <code>Name</code> is
232     <code>xml:space</code> has <span class=ed>declared type different from
233     (default|preserve), (default), or (preserve)</span>, then raise an
234     <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.
235     <span class=ed>@@ duplication with
236     <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--
237     <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>
238 wakaba 1.15 <li>If an empty-element tag is used for an element which is <em>not</em>
239     declared <code>EMPTY</code>, then raise an
240     <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li>
241     <li>If an empty-element tag is <em>not</em> used for an element which is
242     declared <code>EMPTY</code>, then raise an
243     <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li>
244 wakaba 1.14 <!--
245    
246     #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
247     #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset
248     #ExtSubset Well-formedness constraint: External Subset
249     #PE-between-Decls Well-formedness constraint: PE Between Declarations
250 wakaba 1.15 #GIMatch Well-formedness constraint: Element Type Match
251     #uniqattspec Well-formedness constraint: Unique Att Spec
252     #NoExternalRefs Well-formedness constraint: No External Entity References
253     #CleanAttrVals Well-formedness constraint: No < in Attribute Values
254 wakaba 1.16 #vc-PEinGroup Validity constraint: Proper Group/PE Nesting
255     "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."
256     "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."
257 wakaba 1.17 #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting
258     #wf-Legalchar Well-formedness constraint: Legal Character
259     #textent Well-formedness constraint: Parsed Entity
260     #norecursion Well-formedness constraint: No Recursion
261     #indtd Well-formedness constraint: In DTD
262     "External parsed entities SHOULD each begin with a text declaration."
263     "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
264     #UniqueNotationName Validity constraint: Unique Notation Name
265 wakaba 1.14
266     @@ Need detailed review, but maybe should be in parsing phase
267    
268     #vc-check-rmd Validity constraint: Standalone Document Declaration
269    
270 wakaba 1.17 @@ Need dtailed review
271    
272     #wf-entdeclared Well-formedness constraint: Entity Declared
273     #vc-entdeclared Validity constraint: Entity Declared
274     "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
275     "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
276    
277     @@ flaged and then reported in DOM check phase
278    
279     "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
280     "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
281     "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
282    
283     @@ in parsing phase
284    
285     "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
286    
287 wakaba 1.14 -->
288     </ul>
289     </div>
290    
291     <div class="section" id=checking-dom>
292     <h2>Checking <abbr>DOM</abbr></h2>
293    
294     <p>The following algorithms and definitions are applied to
295     <abbr>XML</abbr> documents; especially, they are not applied
296     to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
297 wakaba 1.9
298     <div class=section id=definitions>
299     <h3>Definitions</h3>
300    
301 wakaba 1.1 <p>The <dfn id=xml-version><abbr>XML</abbr> version</dfn> of a node is
302     the <abbr>XML</abbr> version of the document to which the node belongs.
303     For a <code>Documemt</code> node, the <abbr>XML</abbr> version
304     of the document is the value of the <code>xmlVersion</code>
305 wakaba 1.9 attribute of the node. For a <code>DocumentType</code> node whose
306     <code>ownerDocument</code> attribute is set to <code>null</code>,
307     the <abbr>XML</abbr> version of the document is <code>1.0</code><!--
308     since createDocumentType [DOM3CORE] assumes XML 1.0 [XML10TE] -->.
309 wakaba 1.1 For any other node, the <abbr>XML</abbr> version of the document
310     is that of the <code>Document</code> node contained in the
311     <code>ownerDocument</code> attribute of the node.</p>
312    
313 wakaba 1.9 </div>
314    
315 wakaba 1.10
316     <div class=section id=checking-components>
317     <h3>Checking Components</h3>
318    
319     <p>The algorithm
320     <dfn id=algorithm-to-validate-an-xml-character-data title-"to validate an XML character data">to
321     validate an <abbr>XML</abbr> character data
322     (<dfn id=var-s><var>s</var></dfn>)</dfn> is defined as following:</p>
323     <ol>
324 wakaba 1.11 <li>If <var>s</var> contains a character that is <em>not</em> in
325     the character class <a href="#class-Char10"><code>Char10</code></a>,
326     then raise an
327     <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
328     <li>If <var>s</var> contains a character that is in the character
329     class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
330     then raise an
331     <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
332     <li>If <var>s</var> contains a character that is in the character
333     class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
334     then raise an
335     <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
336     <li class=ed>@@ XML 1.1 support</li>
337 wakaba 1.13 <li>If <var>s</var> contains a <code class=char>U+000D</code>
338     <code class=charname>CARRIAGE RETURN</code> character, then
339     raise a
340     <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
341     <span class=ed>@@ We should not raise duplicate errors for U+000D
342     in attribute values. In addition, we should support a mode where
343     U+000D will be serialized as &#x000D; (so that no round-trip-error
344     will be raised).</span></li>
345 wakaba 1.10 </ol>
346    
347     <p>The algorithm
348     <dfn id=algorithm-to-validate-a-name title="to validate a Name">to validate a
349 wakaba 1.12 <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn> is defined
350 wakaba 1.10 as following:</p>
351     <ol>
352 wakaba 1.12 <li>If <var>s</var> begins with the string <code>xml</code> (in any
353     case combination), then raise an
354     <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
355     <span class=ed>@@ except for attribute names <code>xml:lang</code>,
356     <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
357     <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
358 wakaba 1.15 pi name <code>xml-stylesheet</code>.</span><!--
359     "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
360     xml:base and xml:id specifications violate to this sentense!
361     --></li>
362 wakaba 1.12 <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
363     [XML11]. -->
364 wakaba 1.10 <li class=ed>@@</li>
365     </ol>
366    
367     <p>The algorithm
368 wakaba 1.18 <dfn id=algorithm-to-validate-an-ncname title="to validate an NCName">to
369 wakaba 1.10 validate an <code>NCName</code> (<dfn id=var-name><var>name</var></dfn>)</dfn>
370     is defined as following:</p>
371     <ol>
372     <li class=ed>@@</li>
373     </ol>
374    
375 wakaba 1.18 <p>To
376     <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
377 wakaba 1.10 a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
378 wakaba 1.18 following algorithm <em class=rfc2119>MUST</em> be used:</p>
379 wakaba 1.10 <ol>
380 wakaba 1.18 <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
381 wakaba 1.10 <li>If <var>pid</var> contains any character
382     that is outside of the range of <code>#x20 | #xD | #xA |
383     [a-zA-Z0-9] | [-'()+,./:=?;!*#@$_%]</code><!-- @@ TODO: formal def -->,
384     then it is an
385     <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
386     <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
387     <code class=charname>CHARACTER TABULATION</code>,
388     <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
389     and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
390     characters, if the first character of <var>pid</var> is
391     <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
392     if the last character of <var>pid</var> is <code class=char>U+0020</code>
393     <code class=charname>SPACE</code> character, or if there is a
394     <code class=char>U+0020</code> <code class=charname>SPACE</code> character
395     immediately followed by another <code class=char>U+0020</code>
396     <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
397 wakaba 1.18 <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
398     <span class=ed>Is this really a roundtripness problem? XML spec
399     does only define the way to match public identifiers in fact, no
400     canonical form.</span></li>
401 wakaba 1.10 <li class=ed>@@ Should we check formal-public-identifierness?</li>
402     </ol>
403    
404 wakaba 1.18 <p>To
405     <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
406     a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
407     following algorithm <em class=rfc2119>MUST</em> be used:</p>
408     <ol>
409     <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
410     <li>If <var>sid</var> contains both <code class=char>U+0022</code>
411     <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
412     <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
413     (<code class=char>'</code>) characters, raise an
414     <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
415     <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
416     <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
417     character, then raise an
418     <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
419     <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
420     reference, then raise a fact-level error (xml-misc-warning?).<!--
421     XML 1.0 has no conformance language for system identifier being
422     a URI reference. --></li>
423     </ol>
424    
425 wakaba 1.10 </div>
426    
427 wakaba 1.9 <div class=section id=checking-node>
428     <h3>Checking <code>Node</code></h3>
429    
430 wakaba 1.1 <p>The algorithm <dfn title="to check a node" id=algorithm-to-check-a-node>to
431     check a node (<dfn id=var-n><var>n</var></dfn>)</dfn> is defined as
432     following:</p>
433     <dl class=switch>
434     <dt>If <var>n</var> is an <code>Attr</code> node</dt>
435     <dd>
436     <ol>
437 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
438 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
439     <li>If the <code>prefix</code> attribute value is different from
440 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
441 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
442 wakaba 1.8 <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
443 wakaba 1.4 <code>childNodes</code> list of <var>n</var>,
444 wakaba 1.8 <ol>
445     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
446     <code>Text</code> or <code>EntityReference</code> node, then it is an
447     <a href="#xml-well-formedness-error" id=wfe-attr-child><code>xml-well-formedness-error</code></a>.</li>
448     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
449     <code>EntityReference</code> node, then it is an
450     <a href="#entity-error" id=ee-attr-er><code>entity-error</code></a>.</li>
451     <li>Otherwise,
452     <a href="#algorithm-to-check-a-node" title="check a node">check
453     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
454     </ol></li>
455 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
456     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
457     and <code>value</code> attribute of <var>n</var> is neither
458     <code>default</code> nor <code>preserve</code>, then it is an
459     <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
460     Note that <!ATTLIST e xml:space (default|preserve) "a">...
461     <e xml:space="default"/> is conforming and valid. --></li>
462     <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
463     successor] or an empty string, then xml-misc-warning
464     (a "fact"-level error; not an XML error).</li>
465 wakaba 1.1 <li class=ed>@@ <code>specified</code>,
466 wakaba 1.15 <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
467 wakaba 1.13 <!-- xml:space MUST be declared to be valid. -->
468 wakaba 1.16 <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
469     attribute <code>value</code> of <var>n</var>.</li>
470     <li>Validate the <var>n</var> against the <span class=ed>declared
471     type</span> as following:
472     <dl class=switch>
473     <dt class=ed><code>ID_ATTR</code></dt>
474     <dd>
475     <ol>
476     <li><span class=ed>Validate <var>v</var> as an
477     <code>Name</code>.</span> If it fails, then raise an
478     <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
479     <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
480     then raise an
481     <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
482     </ol>
483     </dd>
484     <dt class=ed>IDREF_ATTR</dt>
485     <dd>
486     <ol>
487     <li><span class=ed>Validate <var>v</var> as an
488     <code>Name</code>.</span> If it fails, then raise an
489     <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
490     <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
491     defined,</span> then raise an
492     <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
493     </ol>
494     </dd>
495     <dt class=ed>IDREFS_ATTR</dt>
496     <dd class=ed>@@</dd>
497     <dt class=ed>ENTITY_ATTR</dt>
498     <dd>
499     <ol>
500     <li><span class=ed>Validate <var>v</var> as an
501     <code>Name</code>.</span> If it fails, then raise an
502     <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
503     <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
504     defined,</span> then raise an
505     <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
506     </ol>
507     </dd>
508     <dt class=ed>ENTITIES_ATTR</dt>
509     <dd class=ed>@@</dd>
510     <dt class=ed>NMTOKEN_ATTR</dt>
511     <dd>
512     <ol>
513     <li><span class=ed>Validate <var>v</var> as an
514     <code>Nmtoken</code>.</span> If it fails, then raise an
515     <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
516     </ol>
517     </dd>
518     <dt class=ed>NMTOKENS_ATTR</dt>
519     <dd class=ed>@@</dd>
520     <dt class=ed>NOTATION_ATTR</dt>
521     <dd class=ed><var>v</var> must be one of enumerated values.
522     If not, then raise an
523     <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
524     <dt class=ed>ENUMERATED_ATTR</dt>
525     <dd class=ed><var>v</var> must be one of enumerated values.
526     If not, then raise an
527     <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
528     <dt class=ed>@@</dt>
529     </dl>
530     </li>
531 wakaba 1.17 <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
532 wakaba 1.16 raise an
533     <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
534 wakaba 1.17 <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
535     <li class=ed>@@ strict serialization error for U+000D, U+000A, and
536     U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
537 wakaba 1.1 </ol>
538     </dd>
539     <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
540 wakaba 1.8 <dd>
541     <ol>
542     <li class=ed></li>
543 wakaba 1.13 <li>If <code>nodeName</code> attribute of <var>n</var> is
544     <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
545     and <span class=ed>its declared type is different from (default|preserve),
546     (default), or (preserve)</span>, then raise an
547     <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
548 wakaba 1.8 <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
549     <code>childNodes</code> list of <var>n</var>,
550     <ol>
551     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
552     <code>Text</code> or <code>EntityReference</code> node, then it is an
553     <a href="#xml-well-formedness-error" id=wfe-at-child><code>xml-well-formedness-error</code></a>.</li>
554     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
555     <code>EntityReference</code> node, then it is an
556     <a href="#entity-error" id=ee-at-er><code>entity-error</code></a>.</li>
557     <li>Otherwise,
558     <a href="#algorithm-to-check-a-node" title="check a node">check
559     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
560     </ol></li>
561 wakaba 1.16 <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
562     be declared. If not, then raise an
563     <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
564     <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
565     values MUST all be distinct. If not, then raise an
566     <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
567     <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
568     element, then raise an
569     <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
570 wakaba 1.17 <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
571 wakaba 1.8 </ol>
572     </dd>
573 wakaba 1.1 <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
574     <dd>
575     <ol>
576     <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the
577     <code>data</code> attribute value as an <abbr>XML</abbr> character
578     data</a>.</li>
579     <li>If the <code>data</code> attribute value contains
580 wakaba 1.12 a string <code>]]></code>, then raise an
581     <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
582 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
583     any nodes, they are in
584 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
585 wakaba 1.1 </ol>
586     </dd>
587     <dt>If <var>n</var> is a <code>Comment</code> node</dt>
588     <dd>
589     <ol>
590 wakaba 1.12 <li>Raise an
591     <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
592 wakaba 1.1 <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the
593     <code>data</code> attribute value as an <abbr>XML</abbr> character
594     data</a>.</li>
595     <li>If the <code>data</code> attribute value contains
596     a string <code>--</code>, or if it ends with a character
597 wakaba 1.12 <code>-</code>, then raise an
598     <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
599     "for compatibility" --></li>
600 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
601     any nodes, they are in
602 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
603 wakaba 1.1 </ol>
604     </dd>
605     <dt>If <var>n</var> is a <code>Document</code> node</dt>
606     <dd>
607     <ol>
608     <li>If <a href="#xml-version"><abbr>XML</abbr> version</a> of <var>n</var>
609     is different from <code>1.0</code> or <code>1.1</code>,
610     then it is an <code class=ed>unknown-error?</code>.</li>
611     <li>If the <code>xmlEncoding</code> attribute value does <em>not</em>
612     match to <code>[A-Za-z] ([A-Za-z0-9._] | '-')*</code>
613 wakaba 1.8 <span class=ed>@@ formal def</span>, then it is an
614 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-encoding><code>xml-well-formedness-error</code></a>.</li>
615     <li>The <code>childNodes</code> list of <var>n</var> have to
616     consist of zero or more <code>Comment</code> and/or
617     <code>ProcessingInstruction</code> nodes, followed by
618     an optional <code>DocumentType</code> node, followed
619     by zero or more <code>Comment</code> and/or
620     <code>ProcessingInstruction</code> nodes, followed by
621     an <code>Element</code> node, followed
622     by zero or more <code>Comment</code> and/or
623 wakaba 1.8 <code>ProcessingInstruction</code> nodes. Any violation to this is an
624 wakaba 1.1 <a href="#xml-well-formedness-error" id=wfe-document-child><code>xml-well-formedness-error</code></a>.</li>
625 wakaba 1.8 <li>For each node <dfn id=var-document-nc><var>n<sub><var>c</var></sub></var></dfn>
626     in the <code>childNodes</code> list of <var>n</var>,
627     <ol>
628     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
629     <code>EntityReference</code> node, then
630     <a href="#algorithm-to-check-a-node" title="check a node">check
631     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
632     </ol></li>
633 wakaba 1.1 <li class=ed>@@ <code>allDeclarationsProcessed</code></li>
634     </ol>
635     </dd>
636     <dt>If <var>n</var> is a <code>DocumentFragment</code> node</dt>
637     <dd>
638     <ol>
639 wakaba 1.8 <li>For each node <dfn id=var-df-nc><var>n<sub><var>c</var></sub></var></dfn>
640     in the <code>childNodes</code> list of <var>n</var>,
641     <ol>
642     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
643     <code>Element</code>, <code>Text</code>, <code>CDATASection</code>,
644     <code>Comment</code>, <code>ProcessingInstruction</code>, or
645     <code>EntityReference</code> node, then it is an
646     <a href="#xml-well-formedness-error" id=wfe-df-child><code>xml-well-formedness-error</code></a>.</li>
647     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
648     <code>EntityReference</code> node, then it is an
649     <a href="#entity-error" id=ee-df-er><code>entity-error</code></a>.</li>
650     <li>Otherwise,
651     <a href="#algorithm-to-check-a-node" title="check a node">check
652     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
653     </ol></li>
654 wakaba 1.1 </ol>
655     </dd>
656     <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
657     <dd>
658     <ol>
659 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
660 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
661 wakaba 1.13 <li>Follow the following substeps:
662     <ol>
663     <li>If <code>ownerDocument</code> attribute of <var>n</var> is
664     <code>null</code>, then abort these substeps.</li>
665     <li>If <code>documentElement</code> attribute of the node
666     set to <code>ownerDocument</code> attribute of <var>n</var> is
667     <code>null</code>, then abort these substeps.</li>
668     <li>If <code>nodeName</code> attribute of the node set to
669     <code>documentElement</code> attribute of the node set to
670     <code>ownerDocument</code> attribute of <var>n</var> is
671     different from <code>nodeName</code> of <var>n</var>,
672     then raise an
673     <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
674     </ol>
675     </li>
676 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
677     the <code>publicId</code> attribute value as a public identifier</a>.</li>
678 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
679     the <code>systemId</code> attribute value as a system identifier</a>.</li>
680     <li>If the <code>publicId</code> attribute value of <var>n</var> is
681     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
682     value of <var>n</var> is <code>null</code>, then raise an
683     <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
684     <span class=ed>@@ publicId == null? Or, publicId == ""</span></li>
685 wakaba 1.8 <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
686     in the <code>childNodes</code> list of <var>n</var>,
687     <ol>
688     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
689     <code>ProcessingInstruction</code> node, then it is an
690     <a href="#xml-well-formedness-error" id=wfe-dt-child><code>xml-well-formedness-error</code></a>.
691     <span class=ed>@@ ref to manakai's extensions</span></li>
692     <li>Otherwise, <!-- if n_c is not an EntityReference node, -->
693     <a href="#algorithm-to-check-a-node" title="check a node">check
694     <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
695     </ol></li>
696 wakaba 1.18 <li>For each node in the <code>entities</code>, <code>notations</code>,
697     and <code>elementTypes</code> lists of <var>n</var>,
698     <a href="#algorithm-to-check-a-node" title="check a node">check the
699     node</a> recursively.</li>
700     <li class=ed>@@ externally declared?</li>
701 wakaba 1.1 </ol>
702     </dd>
703     <dt>If <var>n</var> is an <code>Element</code> node</dt>
704     <dd>
705     <ol>
706 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
707 wakaba 1.1 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
708     <li>If the <code>prefix</code> attribute value is different from
709 wakaba 1.18 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
710 wakaba 1.1 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
711 wakaba 1.8 <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
712     in the <code>childNodes</code> list of <var>n</var>,
713     <ol>
714     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
715     <code>Element</code>,
716     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
717     <code>ProcessingInstruction</code>, or
718     <code>EntityReference</code> node, then it is an
719     <a href="#xml-well-formedness-error" id=wfe-element-child><code>xml-well-formedness-error</code></a>.</li>
720     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
721     <code>EntityReference</code> node, then it is an
722     <a href="#entity-error" id=ee-element-er><code>entity-error</code></a>.</li>
723     <li>Otherwise,
724     <a href="#algorithm-to-check-a-node" title="check a node">check
725     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
726     </ol></li>
727 wakaba 1.15 <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
728 wakaba 1.18 <li>Let <var>attrs</var> be the value of the <code>attribute</code>
729     attribute of <var>n</var>. Check conformance of <var>attrs</var>
730     as following:
731     <ol>
732     <li>If <var>attrs</var> contains an <code>Attr</code> node whose
733     <code>nodeName</code> attribute value is equal to that of another
734     <code>Attr</code> node in <var>attrs</var>, then raise an
735     <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
736     #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
737     <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
738     </ol>
739     </li>
740 wakaba 1.8 </ol>
741     </dd>
742     <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
743     <dd>
744     <ol>
745     <li class=ed></li>
746 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
747 wakaba 1.8 any nodes, they are in
748     <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
749 wakaba 1.16 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
750     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
751     <li class=ed>@@ #vc-MixedChildrenUnique
752     Validity constraint: No Duplicate Types</li>
753     <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
754     <li>If there is more than one <code>AttributeDefinition</code> node
755     with <span class=ed>attribute type <code>ID</code></span> in the
756     <code>NamedNodeMap</code> list contained in the
757     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
758     <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
759     <li>If there is more than one <code>AttributeDefinition</code> node
760     with <span class=ed>attribute type <code>NOTATION</code></span> in the
761     <code>NamedNodeMap</code> list contained in the
762     <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
763     <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
764     <li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
765 wakaba 1.1 </ol>
766     </dd>
767 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
768     <code>notationName</code> attribute value is <code>null</code> (i.e. a
769     parsed entity)</dt>
770 wakaba 1.1 <dd>
771     <ol>
772 wakaba 1.18 <li>Raise an
773     <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
774     <li><a href="#algorithm-to-validate-an-ncname">Validate the
775 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
776 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
777     the <code>publicId</code> attribute value as a public identifier</a>.</li>
778 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
779     the <code>systemId</code> attribute value as a system identifier</a>.</li>
780     <li>If the <code>publicId</code> attribute value of <var>n</var> is
781     <em>not</em> <code>null</code> and the <code>systemId</code> attribute
782     value of <var>n</var> is <code>null</code>, then raise an
783     <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
784 wakaba 1.8 <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
785     in the <code>childNodes</code> list of <var>n</var>,
786     <ol>
787     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
788     <code>Element</code>,
789     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
790     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
791     node, then it is an
792     <a href="#xml-well-formedness-error" id=wfe-entity-child><code>xml-well-formedness-error</code></a>.</li>
793     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
794     <code>EntityReference</code> node, then it is an
795     <a href="#entity-error" id=ee-er-er><code>entity-error</code></a>.</li>
796     <li>Otherwise,
797     <a href="#algorithm-to-check-a-node" title="check a node">check
798     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
799     </ol></li>
800 wakaba 1.1 </ol>
801     </dd>
802 wakaba 1.18 <dt>If <var>n</var> is an <code>Entity</code> node whose
803     <code>notationName</code> attribute value is <em>not</em> <code>null</code>
804     (i.e. an unparsed entity)</dt>
805     <dd>
806     <ol>
807     <li><a href="#algorithm-to-validate-an-ncname">Validate the
808     <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
809     <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
810     the <code>publicId</code> attribute value as a public identifier</a>.</li>
811     <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
812     the <code>systemId</code> attribute value as a system identifier</a>.</li>
813     <li>If the <code>systemId</code> attribute value of <var>n</var> is
814     <code>null</code>, then raise an
815     <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
816     <li><a href="#algorithm-to-validate-an-ncname">Validate the
817     <code>notationName</code> attribute value of <var>n</var> as an
818     <code>NCName</code></a>.</li>
819     <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
820     <li>If the <code>childNodes</code> list of <var>n</var> contains
821     any nodes, they are in
822     <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
823     </ol>
824     </dd>
825 wakaba 1.1 <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
826     <dd>
827     <ol>
828     <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
829 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
830 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
831 wakaba 1.8 <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
832     in the <code>childNodes</code> list of <var>n</var>,
833     <ol>
834     <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
835     <code>Element</code>,
836     <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
837     <code>ProcessingInstruction</code>, or <code>EntityReference</code>
838     node, then it is an
839     <a href="#xml-well-formedness-error" id=wfe-er-child><code>xml-well-formedness-error</code></a>.</li>
840     <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is <em>not</em> an
841     <code>EntityReference</code> node, then it is an
842     <a href="#entity-error" id=ee-entity-er><code>entity-error</code></a>.</li>
843     <li>Otherwise,
844     <a href="#algorithm-to-check-a-node" title="check a node">check
845     <var>n<sub><var>c</var></sub></var></a> recursively.</li>
846     </ol></li>
847 wakaba 1.1 </ol>
848     </dd>
849     <dt>If <var>n</var> is a <code>Notation</code> node</dt>
850     <dd>
851     <ol>
852 wakaba 1.18 <li><a href="#algorithm-to-validate-an-ncname">Validate the
853 wakaba 1.1 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
854 wakaba 1.5 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
855     the <code>publicId</code> attribute value as a public identifier</a>.</li>
856 wakaba 1.18 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
857     the <code>systemId</code> attribute value as a system identifier</a>.</li>
858 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
859     any nodes, they are in
860 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
861 wakaba 1.1 </ol>
862     </dd>
863     <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
864     <dd>
865     <ol>
866 wakaba 1.13 <li>If the <code>target</code> attribute value matches to the string
867     <code>xml</code> in any case combination, then raise a
868     <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
869 wakaba 1.18 <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
870 wakaba 1.1 <code>target</code> attribute value as an <code>NCName</code></a>.</li>
871 wakaba 1.13 <li>Then,
872     <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML character data">validate
873     the <code>data</code> attribute value as an <abbr>XML</abbr> character
874 wakaba 1.1 data</a>.</li>
875 wakaba 1.13 <li>If the <code>data</code> attribute value contains a string
876     <code>?></code>, then raise a
877     <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
878     <li>If the <code>data</code> attribute value starts with either
879     <code class=char>U+0009</code> <code class=charname>CHARACTER
880     TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
881     FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
882     RETURN</code>, or <code class=char>U+0020</code>
883     <code class=charname>SPACE</code> character, then raise a
884     <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
885 wakaba 1.1 <li>If the <code>childNodes</code> list of <var>n</var> contains
886 wakaba 1.13 any nodes, then raise an
887 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
888 wakaba 1.1 </ol>
889     </dd>
890     <dt>If <var>n</var> is a <code>Text</code> node</dt>
891     <dd>
892     <ol>
893     <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the
894     <code>data</code> attribute value as an <abbr>XML</abbr> character
895     data</a>.</li>
896     <li>If the <code>childNodes</code> list of <var>n</var> contains
897     any nodes, they are in
898 wakaba 1.8 <a href="#xml-well-formedness-error" id=wfe-text-child><code>xml-well-formedness-error</code></a>.</li>
899 wakaba 1.1 </ol>
900     </dd>
901     <dt>Otherwise</dt>
902     <dd><span class=ed>xml-well-formedness-error? unknown-error?</span></dd>
903     </dl>
904    
905 wakaba 1.11 </div>
906    
907     </div>
908    
909     <div id=classes class=section>
910     <h2>Character Classes</h2>
911    
912     <p>This section defines a couple of character classes.
913     These classes are referred to by algorithms specified above.</p>
914    
915     <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
916     contains the following characters:</p>
917     <ul>
918     <li><code class=char>U+0009</code> <code class=charname>CHARACTER
919     TABULATION</code></li>
920     <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
921     <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
922     RETURN</code></li>
923     <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
924     .. <code class=char>U+D7FF</code></li>
925     <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
926     <code class=charname>REPLACEMENT CHARACTER</code></li>
927     <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
928     </ul>
929     <div class="note memo">
930     <p>This character class contains all characters allowed in the production rule
931     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
932     of <abbr>XML</abbr> 1.0
933     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
934     </div>
935    
936     <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
937     contains the following characters:</p>
938     <ul>
939     <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
940     </ul>
941    
942     <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
943     contains the following characters:</p>
944     <ul>
945     <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
946     <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
947     <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
948     AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
949     PROGRAM COMMAND</code></li>
950     <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
951     <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
952     <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
953     <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
954     <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
955     <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
956     <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
957     <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
958     <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
959     <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
960     <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
961     <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
962     <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
963     <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
964     <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
965     <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
966     <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
967     </ul>
968     <div class="note memo">
969     <p>This character class contains the characters listed in the Note in
970     <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
971     of <abbr>XML</abbr> 1.0
972     <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
973     as amended by
974     <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
975 wakaba 1.1 </div>
976    
977     </div>
978    
979     <div id="references" class="section reference">
980     <h2>References</h2>
981    
982     <div id="normative-references" class="section normative reference">
983     <h3>Normative References</h3>
984    
985     <dl>
986 wakaba 1.3 <dt id=ref-DOM3CORE>DOM3CORE</dt>
987     <dd class=ed>@@ W3C DOM Level 3 Core</dd>
988     <dt id=ref-DOMDTDEF>DOMDTDEF</dt>
989     <dd class=ed>@@ manakai's extension to DOM for document type definitions</dd>
990 wakaba 1.1 <dt id="ref-KEYWORDS">KEYWORDS</dt>
991     <dd><cite><a href="urn:ietf:rfc:2119">Key words for use in
992     <abbr>RFCs</abbr> to Indicate Requirement Levels</a></cite>,
993     <abbr title="Internet Engineering Task Force">IETF</abbr>
994     <a href="urn:ietf:bcp:14"><abbr title="Best Current Practice">BCP</abbr>
995     14</a>, <abbr title="Request for Comments">RFC</abbr> 2119,
996     <time datetime=1997-03>March 1997</time>.
997     <em>This</em> version of the specification is referenced.</dd>
998     <dt id=ref-INFOSET>INFOSET</dt>
999     <dd class=ed>@@</dd>
1000     <dt id=ref-XML10">XML10</dt>
1001     <dd><cite><a href="http://www.w3.org/TR/2006/REC-xml-20060816">Extensible
1002     Markup Language (<abbr>XML</abbr>) 1.0 (Fourth Edition)</a></cite>,
1003     <abbr>W3C</abbr> Recommendation, <time datetime=2006-08-16>16 August
1004     2006</time>, edited in place <time datetime=2006-09-29>29 September
1005     2006</time>. Tje latest version of the specification is available at
1006     <code class=uri>&lt;<a href="http://www.w3.org/TR/xml">http://www.w3.org/TR/xml</a>&gt;</code>.
1007     <em>This</em> version of the specification is referenced.</dd>
1008     <dt id=ref-XML11>XML11</dt>
1009     <dd class=ed>@@</dd>
1010     </dl>
1011     </div>
1012    
1013     <div id="informative-references" class="section informative reference">
1014     <h3>Non$B!>(Bnormative References</h3>
1015    
1016     <dl class=ed>
1017     <dt id=ref-CSS>CSS</dt>
1018     <dd><cite><a href="http://www.w3.org/TR/2007/CR-CSS21-20070719">Cascading
1019     Style Sheets Level 2 Revision 1 (<abbr>CSS</abbr> 2.1)
1020     Specification</a></cite>,
1021     <abbr title="World Wide Web Consortium">W3C</abbr> Candidate
1022     Recommendation, <time datetime=2007-07-19>19 July 2007</time>.
1023     Work in progress. The latest version of the specification is
1024     available at
1025     <code class=URI>&lt;<a href="http://www.w3.org/TR/CSS21">http://www.w3.org/TR/CSS21</a>></code>.</dd>
1026     <dt id=ref-CSSOM>CSSOM</dt>
1027     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/csswg/cssom/Overview.html?rev=1.55&amp;content-type=text/html;%20charset=utf-8">Cascading
1028     Style Sheets Object Model (<abbr>CSSOM</abbr>)</a></cite>,
1029     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1030     <time datetime=2007-06-18>18 June 2007</time>. Work in progress.
1031     The latest Editor's Draft of the specification is available at
1032     <code class=URI>&lt;<a href="http://dev.w3.org/csswg/cssom/Overview.html">http://dev.w3.org/csswg/cssom/Overview.html</a>></code>.</dd>
1033     <dt id=ref-HTML5>HTML5</dt>
1034     <dd><cite><a href="http://whatwg.org/html5"><abbr>HTML</abbr> 5</a></cite>,
1035     <abbr title="Web Hypertext Application Technology Working Group">WHATWG</abbr>
1036     Working Draft. Work in progress.</dd>
1037     <dt id=ref-SAPI>SAPI</dt>
1038     <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?rev=1.28&amp;content-type=text/html;%20charset=utf-8">Selectors <abbr>API</abbr></a></cite>,
1039     <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1040     <time datetime=2007-08-29>29 August 2007</time>. Work in progress.
1041     The latest Editor's Draft of the specification is available at
1042     <code class=URI>&lt;<a href="http://dev.w3.org/2006/webapi/selectors-api/Overview.html">http://dev.w3.org/2006/webapi/selectors-api/Overview.html</a>></code>.
1043     The latest published version of the specification is available at
1044     <code class=URI>&lt;<a href="http://www.w3.org/TR/selectors-api/">http://www.w3.org/TR/selectors-api/</a>></code>.</dd>
1045     <dt id=XBL2>XBL2</dt>
1046     <dd><cite><a href="http://www.mozilla.org/projects/xbl/xbl2.html"><abbr>XBL</abbr>
1047     2.0</a></cite>, Mozilla.org, <time datetime=2007-03-15>15 Mar 2007</time>.
1048     Work in progress. The latest
1049     <abbr title="World Wide Web Consortium">W3C</abbr>$B!>(Bpublished
1050     version of the specification is available at
1051     <code class=URI>&lt;<a href="http://www.w3.org/TR/xbl/">http://www.w3.org/TR/xbl/</a>></code>.</dd>
1052 wakaba 1.3 <dt id=ref-XML5>XML5</dt>
1053     <dd class=ed>@@</dd>
1054 wakaba 1.1 </dl>
1055     </div>
1056    
1057     </div>
1058    
1059     </body>
1060     </html>

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24