/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Contents of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.21 - (show annotations) (download) (as text)
Sat Dec 1 15:18:43 2007 UTC (16 years, 5 months ago) by wakaba
Branch: MAIN
Changes since 1.20: +11 -0 lines
File MIME type: text/html
Oops\!  The most important characters were not part of PubidChar class

1 <!DOCTYPE html>
2 <html lang="en" class="formal-specification status-work-in-progress">
3 <head>
4 <title>manakai's XML Conformance Checking</title>
5 <link rel="stylesheet" href="http://suika.fam.cx/www/style/html/spec">
6 <link rel="license" href="http://suika.fam.cx/c/gnu/fdl">
7 <style>
8 a[id]::before {
9 content: "[" attr(id) "]";
10 color: gray;
11 font-size: 70%;
12 font-family: Courier New, monospace;
13 vertical-align: 60%;
14 }
15 </style>
16 </head>
17 <body class="has-abstract">
18
19 <div class="header">
20 <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21 <h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2>
22
23 <dl class="versions-uri">
24 <dt>This Version</dt>
25 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
26 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
27 <dt>Latest Version</dt>
28 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc"
29 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc</a>&gt;</code></dd>
30 <dt>Latest Working Draft</dt>
31 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
32 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
33 <!-- <dt>Previous Version</dt> -->
34 <dt>Version History</dt>
35 <dd><code class=URI>&lt;<a href="http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html">http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html</a>&gt;</code></dd>
36 </dl>
37 <dl class="authors" id="author">
38 <dt>Author</dt>
39 <dd><a href="http://suika.fam.cx/~wakaba/who?" rel="author"
40 lang=ja>Wakaba</a>
41 <code class="mail">&lt;<a href="mailto:w@suika.fam.cx" rel="author"
42 >w@suika.fam.cx</a>&gt;</code></dd>
43 </dl>
44
45 <p class="copyright" lang="en">&#xA9; <time>2007</time> <a
46 href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47 Permission is granted to copy, distribute and/or modify this document
48 under the terms of the <a rel="license"
49 href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>,
50 Version 1.2
51 or any later version published by the <a
52 href="http://www.fsf.org/">Free Software Foundation</a>;
53 with no Invariant Sections, no Front$B!>(BCover Texts, and no Back$B!>(BCover
54 Texts. A copy of the license is
55 available at <code class="URI">&lt;<a
56 href="http://www.gnu.org/copyleft/fdl.html"
57 rel="license">http://www.gnu.org/copyleft/fdl.html</a>&gt;</code>.</p>
58 </div>
59
60 <div id="abstract" class="section">
61 <h2>Abstract</h2>
62
63 <div class="ed issue">...</div>
64
65 </div>
66
67 <div id="status" class="section">
68 <h2>Status of This Document</h2>
69
70 <p class=section-info><em>This section describes the status of this document
71 at the time of its publication. Other documents
72 might supersede this document.</em></p>
73
74 <p>This document is a working draft, produced as part of the
75 <a href="http://suika.fam.cx/www/markup/html/whatpm/readme">Whatpm</a>
76 subproject of the
77 <a href="http://suika.fam.cx/www/manakai-core/doc/web/">manakai</a>
78 project. It might be updated, replaced, or obsoleted by
79 other documents at any time. It is inappropriate to
80 cite this document as other than <q>work in progress</q>.</p>
81
82 <p>Comments on this document are welcome and
83 may be sent to the <a href="#author">author</a>.</p>
84
85 <p>Translations of thie document might be available.
86 The English version of the document is the only
87 normative version.</p>
88 </div>
89
90 <div id="introduction" class="section">
91 <h2>Introduction</h2>
92
93 <p class=section-info>This section is <em>non$B!>(Bnormative</em>.</p>
94
95 <div class="issue ed">...</div>
96
97
98 </div>
99
100 <div class="section" id="terminology">
101 <h2>Terminology</h2>
102
103 <p>The key words <q><em class="rfc2119">MUST</em></q>,
104 <q><em class="rfc2119">MUST NOT</em></q>,
105 <q><em class="rfc2119">SHOULD</em></q>,
106 <q><em class="rfc2119">SHOULD NOT</em></q>, and
107 <q><em class=rfc2119>MAY</em></q> in this
108 document are to be interpreted as described in
109 <a href="urn:ietf:rfc:2119"><abbr title="Request for Comments">RFC</abbr>
110 2119</a>
111 <cite class="bibref normative">[<a href="#ref-KEYWORDS">KEYWORDS</a>]</cite>.</p>
112
113 <!-- Strictly speaking the paragraph below is not part of terminology. -->
114 <p>All examples and notes in this specification are non$B!>(Bnormative,
115 as are all sections explicitly marked non$B!>(Bnormative. Everything
116 else in this specification is normative.</p>
117
118 <p><span class=ed>Algorithm is normative but non-normative</span>.
119 In addition, the order in which <a href="#errors">errors</a> are
120 raised is undefined.</p>
121 </div>
122
123
124 <div class=section id=error-categories>
125 <h2>Error Classification</h2>
126
127 <p class=ed>If a <code>Document</code> node has no
128 xml-well-formedness-error, entity-error, and unknown-error,
129 then it is well-formed. If a well-formed <code>Document</code>
130 node has no xml-validity-error, it is valid.</p>
131
132 <div class="note memo">
133 <p>A well$B!>(Bformed <code>Document</code> can be safely serialized
134 into a well$B!>(Bformed XML document. A valid <code>Document</code>
135 can be easily serialized into a valid XML document.</p>
136 </div>
137
138 <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
139 ...</p>
140
141 <p>Errors are classified into these
142 <dfn id=error-category title="error category">error categories</dfn>:</p>
143
144 <dl id=errors>
145 <dt><dfn id=entity-error><code>entity-error</code></dfn></dt>
146 <dd>
147 <p class=ed>@@</p>
148 <div class="note memo">
149 <p>This <span class=ed>algorithm</span> does not support
150 <abbr>DOM</abbr> tree with one or more <code>EntityReference</code>
151 nodes. It is expected that any entity references are expanded
152 at the parse time and any unexpandable entity references
153 make parse time errors raised so that never result in
154 <abbr>DOM</abbr> tree with <code>EntityReference</code> nodes.</p>
155 </div>
156 </dd>
157 <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
158 <dd class=ed>@@</dd>
159 <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
160 <dd>
161 <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
162 will be raised when a construct, which might not be restored to the same
163 construct when it is serialized and then re-parsed by a conforming
164 processor, is encountered.</p>
165 <div class="example memo">
166 <p>For a <code>Comment</code> node a
167 <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
168 be raised, since <abbr>XML</abbr> processors are not required to
169 report texts of comments for applications.</p>
170 </div>
171 </dd>
172 <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
173 <dd class=ed>@@</dd>
174 <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
175 <dd>An <abbr>XML</abbr> <i>error</i> (<abbr>XML</abbr> 1.0
176 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
177 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-error">error</a> /
178 <abbr>XML</abbr> 1.1
179 <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
180 <a href="@@/#dt-error">error</a>)
181 that is not classified to any other <a href="#error-category">error
182 category</a>.</dd>
183 <dt><dfn id=xml-misc-fatal-error><code>xml-misc-fatal-error</code></dt>
184 <dd>An <abbr>XML</abbr> <i>fatal error</i> (<abbr>XML</abbr> 1.0
185 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
186 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-fatal">fatal
187 error</a> / <abbr>XML</abbr> 1.1
188 <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
189 <a href="@@/#dt-fatal">fatal error</a>)
190 that is not classified to any other <a href="#error-category">error
191 category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
192 <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
193 <dd>
194 <p>An
195 <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
196 will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
197 in <abbr>XML</abbr> specification is not met.</p>
198 </dd>
199 <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
200 <dd>A violation of validity constraint in XML document.</dd>
201 <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
202 <dd><p>If an <code>xml-well-formedness-error</code> is raised,
203 it would not be possible to generate an XML serialization that
204 would match to the appropriate production rule and that would not
205 violate to any well$B!>(Bformedness constraint in XML
206 specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
207 <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
208 </dl>
209
210 <div class=ed><p>@@ TODO: #dt-atuseroption at user option
211 (MAY or MUST), #dt-compat for compatibility,
212 #dt-interop for interoperability</p></div>
213
214 </div>
215
216 <div class=section id=parsing-xml>
217 <h2>Parsing <abbr>XML</abbr> Document</h2>
218
219 <ul>
220 <li>If the <abbr>XML</abbr> document does not begin with an
221 <abbr>XML</abbr> declaration, then raise an
222 <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>
223 <li>If the replacement text of an entity declaration is
224 <code>&lt;</code>, then raise an
225 <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
226 "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
227 --></li>
228 <li>If there is an element type declaration whose <code>Name</code>
229 value is already declared, then raise an
230 <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li>
231 <li>If attribute definition whose <code>Name</code> is
232 <code>xml:space</code> has <span class=ed>declared type different from
233 (default|preserve), (default), or (preserve)</span>, then raise an
234 <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.
235 <span class=ed>@@ duplication with
236 <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--
237 <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>
238 <li>If an empty-element tag is used for an element which is <em>not</em>
239 declared <code>EMPTY</code>, then raise an
240 <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li>
241 <li>If an empty-element tag is <em>not</em> used for an element which is
242 declared <code>EMPTY</code>, then raise an
243 <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li>
244 <!--
245
246 #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
247 #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset
248 #ExtSubset Well-formedness constraint: External Subset
249 #PE-between-Decls Well-formedness constraint: PE Between Declarations
250 #GIMatch Well-formedness constraint: Element Type Match
251 #uniqattspec Well-formedness constraint: Unique Att Spec
252 #NoExternalRefs Well-formedness constraint: No External Entity References
253 #CleanAttrVals Well-formedness constraint: No < in Attribute Values
254 #vc-PEinGroup Validity constraint: Proper Group/PE Nesting
255 "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."
256 "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."
257 #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting
258 #wf-Legalchar Well-formedness constraint: Legal Character
259 #textent Well-formedness constraint: Parsed Entity
260 #norecursion Well-formedness constraint: No Recursion
261 #indtd Well-formedness constraint: In DTD
262 "External parsed entities SHOULD each begin with a text declaration."
263 "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
264 #UniqueNotationName Validity constraint: Unique Notation Name
265
266 @@ Need detailed review, but maybe should be in parsing phase
267
268 #vc-check-rmd Validity constraint: Standalone Document Declaration
269
270 @@ Need dtailed review
271
272 #wf-entdeclared Well-formedness constraint: Entity Declared
273 #vc-entdeclared Validity constraint: Entity Declared
274 "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
275 "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
276
277 @@ flaged and then reported in DOM check phase
278
279 "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
280 "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
281 "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
282
283 @@ in parsing phase
284
285 "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
286
287 -->
288 </ul>
289 </div>
290
291 <div class="section" id=checking-dom>
292 <h2>Checking <abbr>DOM</abbr></h2>
293
294 <p>The following algorithms and definitions are applied to
295 <abbr>XML</abbr> documents; especially, they are not applied
296 to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
297
298 <div class=section id=definitions>
299 <h3>Definitions</h3>
300
301 <p>The <dfn id=xml-version><abbr>XML</abbr> version</dfn> of a node is
302 the <abbr>XML</abbr> version of the document to which the node belongs.
303 For a <code>Documemt</code> node, the <abbr>XML</abbr> version
304 of the document is the value of the <code>xmlVersion</code>
305 attribute of the node. For a <code>DocumentType</code> node whose
306 <code>ownerDocument</code> attribute is set to <code>null</code>,
307 the <abbr>XML</abbr> version of the document is <code>1.0</code><!--
308 since createDocumentType [DOM3CORE] assumes XML 1.0 [XML10TE] -->.
309 For any other node, the <abbr>XML</abbr> version of the document
310 is that of the <code>Document</code> node contained in the
311 <code>ownerDocument</code> attribute of the node.</p>
312
313 </div>
314
315
316 <div class=section id=checking-components>
317 <h3>Conformance Checking Algorithms for Components</h3>
318
319 <p>To
320 <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
321 validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
322 the following algorithm <em class=rfc2119>MUST</em> be used:</p>
323 <ol>
324 <li>If <var>s</var> contains a character that is <em>not</em> in
325 the character class <a href="#class-Char10"><code>Char10</code></a>,
326 then raise an
327 <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
328 <li>If <var>s</var> contains a character that is in the character
329 class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
330 then raise an
331 <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
332 <li>If <var>s</var> contains a character that is in the character
333 class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
334 then raise an
335 <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
336 <li class=ed>@@ XML 1.1 support</li>
337 <li>If <var>s</var> contains a <code class=char>U+000D</code>
338 <code class=charname>CARRIAGE RETURN</code> character, then
339 raise a
340 <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
341 <span class=ed>@@ We should not raise duplicate errors for U+000D
342 in attribute values. In addition, we should support a mode where
343 U+000D will be serialized as &#x000D; (so that no round-trip-error
344 will be raised).</span></li>
345 </ol>
346
347 <p>To
348 <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
349 <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
350 algorithm <em class=rfc2119>MUST</em> be used:</p>
351 <ol>
352 <li>If <var>s</var> is an empty string, then raise an
353 <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
354 Abort these steps.</li>
355 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
356 <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
357 <li>If the first character in <var>s</var> is a character that is
358 <em>not</em> in the character class
359 <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
360 an
361 <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
362 <li>If a character other than the first character in <var>s</var> is a
363 character that is <em>not</em> in the character class
364 <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
365 <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
366 <li>If <var>s</var> begins with the string <code>xml</code> (in any
367 case combination), then raise an
368 <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
369 <span class=ed>@@ except for attribute names <code>xml:lang</code>,
370 <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
371 <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
372 pi name <code>xml-stylesheet</code>.</span><!--
373 "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
374 xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
375 --></li>
376 <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
377 [XML11]. -->
378 <li class=ed>@@ XML 1.1 support</li>
379 </ol>
380
381 <p>To
382 <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
383 an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
384 following algorithm <em class=rfc2119>MUST</em> be used:</p>
385 <ol>
386 <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
387 <var>s</var> as a <code>Name</code></a>.</li>
388 <li class=ed>@@</li>
389 </ol>
390
391 <p>To
392 <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
393 a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
394 following algorithm <em class=rfc2119>MUST</em> be used:</p>
395 <ol>
396 <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
397 <li>If <var>pid</var> contains a character that is <em>not</em> in the
398 character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
399 raise an
400 <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
401 <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
402 <code class=charname>CHARACTER TABULATION</code>,
403 <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
404 and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
405 characters, if the first character of <var>pid</var> is
406 <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
407 if the last character of <var>pid</var> is <code class=char>U+0020</code>
408 <code class=charname>SPACE</code> character, or if there is a
409 <code class=char>U+0020</code> <code class=charname>SPACE</code> character
410 immediately followed by another <code class=char>U+0020</code>
411 <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
412 <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
413 <span class=ed>Is this really a roundtripness problem? XML spec
414 does only define the way to match public identifiers in fact, no
415 canonical form.</span></li>
416 </ol>
417
418 <p>To
419 <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
420 a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
421 following algorithm <em class=rfc2119>MUST</em> be used:</p>
422 <ol>
423 <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
424 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
425 <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
426 <li>If <var>sid</var> contains both <code class=char>U+0022</code>
427 <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
428 <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
429 (<code class=char>'</code>) characters, raise an
430 <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
431 <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
432 <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
433 character, then raise an
434 <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
435 <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
436 reference, then raise a fact-level error (xml-misc-warning?).<!--
437 XML 1.0 has no conformance language for system identifier being
438 a URI reference. --></li>
439 </ol>
440
441 </div>
442
443 <div class=section id=checking-node>
444 <h3>Checking <code>Node</code></h3>
445
446 <p>The algorithm <dfn title="to check a node" id=algorithm-to-check-a-node>to
447 check a node (<dfn id=var-n><var>n</var></dfn>)</dfn> is defined as
448 following:</p>
449 <dl class=switch>
450 <dt>If <var>n</var> is an <code>Attr</code> node</dt>
451 <dd>
452 <ol>
453 <li><a href="#algorithm-to-validate-an-ncname">Validate the
454 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
455 <li>If the <code>prefix</code> attribute value is different from
456 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
457 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
458 <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
459 <code>childNodes</code> list of <var>n</var>,
460 <ol>
461 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
462 <code>Text</code> or <code>EntityReference</code> node, then it is an
463 <a href="#xml-well-formedness-error" id=wfe-attr-child><code>xml-well-formedness-error</code></a>.</li>
464 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
465 <code>EntityReference</code> node, then it is an
466 <a href="#entity-error" id=ee-attr-er><code>entity-error</code></a>.</li>
467 <li>Otherwise,
468 <a href="#algorithm-to-check-a-node" title="check a node">check
469 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
470 </ol></li>
471 <li>If <code>nodeName</code> attribute of <var>n</var> is
472 <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
473 and <code>value</code> attribute of <var>n</var> is neither
474 <code>default</code> nor <code>preserve</code>, then it is an
475 <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
476 Note that <!ATTLIST e xml:space (default|preserve) "a">...
477 <e xml:space="default"/> is conforming and valid. --></li>
478 <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
479 successor] or an empty string, then xml-misc-warning
480 (a "fact"-level error; not an XML error).</li>
481 <li class=ed>@@ <code>specified</code>,
482 <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
483 <!-- xml:space MUST be declared to be valid. -->
484 <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
485 attribute <code>value</code> of <var>n</var>.</li>
486 <li>Validate the <var>n</var> against the <span class=ed>declared
487 type</span> as following:
488 <dl class=switch>
489 <dt class=ed><code>ID_ATTR</code></dt>
490 <dd>
491 <ol>
492 <li><span class=ed>Validate <var>v</var> as an
493 <code>Name</code>.</span> If it fails, then raise an
494 <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
495 <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
496 then raise an
497 <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
498 </ol>
499 </dd>
500 <dt class=ed>IDREF_ATTR</dt>
501 <dd>
502 <ol>
503 <li><span class=ed>Validate <var>v</var> as an
504 <code>Name</code>.</span> If it fails, then raise an
505 <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
506 <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
507 defined,</span> then raise an
508 <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
509 </ol>
510 </dd>
511 <dt class=ed>IDREFS_ATTR</dt>
512 <dd class=ed>@@</dd>
513 <dt class=ed>ENTITY_ATTR</dt>
514 <dd>
515 <ol>
516 <li><span class=ed>Validate <var>v</var> as an
517 <code>Name</code>.</span> If it fails, then raise an
518 <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
519 <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
520 defined,</span> then raise an
521 <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
522 </ol>
523 </dd>
524 <dt class=ed>ENTITIES_ATTR</dt>
525 <dd class=ed>@@</dd>
526 <dt class=ed>NMTOKEN_ATTR</dt>
527 <dd>
528 <ol>
529 <li><span class=ed>Validate <var>v</var> as an
530 <code>Nmtoken</code>.</span> If it fails, then raise an
531 <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
532 </ol>
533 </dd>
534 <dt class=ed>NMTOKENS_ATTR</dt>
535 <dd class=ed>@@</dd>
536 <dt class=ed>NOTATION_ATTR</dt>
537 <dd class=ed><var>v</var> must be one of enumerated values.
538 If not, then raise an
539 <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
540 <dt class=ed>ENUMERATED_ATTR</dt>
541 <dd class=ed><var>v</var> must be one of enumerated values.
542 If not, then raise an
543 <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
544 <dt class=ed>@@</dt>
545 </dl>
546 </li>
547 <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
548 raise an
549 <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
550 <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
551 <li class=ed>@@ strict serialization error for U+000D, U+000A, and
552 U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
553 </ol>
554 </dd>
555 <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
556 <dd>
557 <ol>
558 <li class=ed></li>
559 <li>If <code>nodeName</code> attribute of <var>n</var> is
560 <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
561 and <span class=ed>its declared type is different from (default|preserve),
562 (default), or (preserve)</span>, then raise an
563 <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
564 <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
565 <code>childNodes</code> list of <var>n</var>,
566 <ol>
567 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
568 <code>Text</code> or <code>EntityReference</code> node, then it is an
569 <a href="#xml-well-formedness-error" id=wfe-at-child><code>xml-well-formedness-error</code></a>.</li>
570 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
571 <code>EntityReference</code> node, then it is an
572 <a href="#entity-error" id=ee-at-er><code>entity-error</code></a>.</li>
573 <li>Otherwise,
574 <a href="#algorithm-to-check-a-node" title="check a node">check
575 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
576 </ol></li>
577 <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
578 be declared. If not, then raise an
579 <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
580 <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
581 values MUST all be distinct. If not, then raise an
582 <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
583 <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
584 element, then raise an
585 <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
586 <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
587 </ol>
588 </dd>
589 <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
590 <dd>
591 <ol>
592 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
593 <code>data</code> attribute value as an <abbr>XML</abbr> character
594 data</a>.</li>
595 <li>If the <code>data</code> attribute value contains
596 a string <code>]]></code>, then raise an
597 <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
598 <li>If the <code>childNodes</code> list of <var>n</var> contains
599 any nodes, they are in
600 <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
601 </ol>
602 </dd>
603 <dt>If <var>n</var> is a <code>Comment</code> node</dt>
604 <dd>
605 <ol>
606 <li>Raise an
607 <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
608 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
609 <code>data</code> attribute value as an <abbr>XML</abbr> character
610 data</a>.</li>
611 <li>If the <code>data</code> attribute value contains
612 a string <code>--</code>, or if it ends with a character
613 <code>-</code>, then raise an
614 <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
615 "for compatibility" --></li>
616 <li>If the <code>childNodes</code> list of <var>n</var> contains
617 any nodes, they are in
618 <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
619 </ol>
620 </dd>
621 <dt>If <var>n</var> is a <code>Document</code> node</dt>
622 <dd>
623 <ol>
624 <li>If <a href="#xml-version"><abbr>XML</abbr> version</a> of <var>n</var>
625 is different from <code>1.0</code> or <code>1.1</code>,
626 then it is an <code class=ed>unknown-error?</code>.</li>
627 <li>If the <code>xmlEncoding</code> attribute value does <em>not</em>
628 match to <code>[A-Za-z] ([A-Za-z0-9._] | '-')*</code>
629 <span class=ed>@@ formal def</span>, then it is an
630 <a href="#xml-well-formedness-error" id=wfe-encoding><code>xml-well-formedness-error</code></a>.</li>
631 <li>The <code>childNodes</code> list of <var>n</var> have to
632 consist of zero or more <code>Comment</code> and/or
633 <code>ProcessingInstruction</code> nodes, followed by
634 an optional <code>DocumentType</code> node, followed
635 by zero or more <code>Comment</code> and/or
636 <code>ProcessingInstruction</code> nodes, followed by
637 an <code>Element</code> node, followed
638 by zero or more <code>Comment</code> and/or
639 <code>ProcessingInstruction</code> nodes. Any violation to this is an
640 <a href="#xml-well-formedness-error" id=wfe-document-child><code>xml-well-formedness-error</code></a>.</li>
641 <li>For each node <dfn id=var-document-nc><var>n<sub><var>c</var></sub></var></dfn>
642 in the <code>childNodes</code> list of <var>n</var>,
643 <ol>
644 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
645 <code>EntityReference</code> node, then
646 <a href="#algorithm-to-check-a-node" title="check a node">check
647 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
648 </ol></li>
649 <li class=ed>@@ <code>allDeclarationsProcessed</code></li>
650 </ol>
651 </dd>
652 <dt>If <var>n</var> is a <code>DocumentFragment</code> node</dt>
653 <dd>
654 <ol>
655 <li>For each node <dfn id=var-df-nc><var>n<sub><var>c</var></sub></var></dfn>
656 in the <code>childNodes</code> list of <var>n</var>,
657 <ol>
658 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
659 <code>Element</code>, <code>Text</code>, <code>CDATASection</code>,
660 <code>Comment</code>, <code>ProcessingInstruction</code>, or
661 <code>EntityReference</code> node, then it is an
662 <a href="#xml-well-formedness-error" id=wfe-df-child><code>xml-well-formedness-error</code></a>.</li>
663 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
664 <code>EntityReference</code> node, then it is an
665 <a href="#entity-error" id=ee-df-er><code>entity-error</code></a>.</li>
666 <li>Otherwise,
667 <a href="#algorithm-to-check-a-node" title="check a node">check
668 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
669 </ol></li>
670 </ol>
671 </dd>
672 <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
673 <dd>
674 <ol>
675 <li><a href="#algorithm-to-validate-an-ncname">Validate the
676 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
677 <li>Follow the following substeps:
678 <ol>
679 <li>If <code>ownerDocument</code> attribute of <var>n</var> is
680 <code>null</code>, then abort these substeps.</li>
681 <li>If <code>documentElement</code> attribute of the node
682 set to <code>ownerDocument</code> attribute of <var>n</var> is
683 <code>null</code>, then abort these substeps.</li>
684 <li>If <code>nodeName</code> attribute of the node set to
685 <code>documentElement</code> attribute of the node set to
686 <code>ownerDocument</code> attribute of <var>n</var> is
687 different from <code>nodeName</code> of <var>n</var>,
688 then raise an
689 <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
690 </ol>
691 </li>
692 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
693 the <code>publicId</code> attribute value as a public identifier</a>.</li>
694 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
695 the <code>systemId</code> attribute value as a system identifier</a>.</li>
696 <li>If the <code>publicId</code> attribute value of <var>n</var> is
697 <em>not</em> <code>null</code> and the <code>systemId</code> attribute
698 value of <var>n</var> is <code>null</code>, then raise an
699 <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
700 <span class=ed>@@ publicId == null? Or, publicId == ""</span></li>
701 <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
702 in the <code>childNodes</code> list of <var>n</var>,
703 <ol>
704 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
705 <code>ProcessingInstruction</code> node, then it is an
706 <a href="#xml-well-formedness-error" id=wfe-dt-child><code>xml-well-formedness-error</code></a>.
707 <span class=ed>@@ ref to manakai's extensions</span></li>
708 <li>Otherwise, <!-- if n_c is not an EntityReference node, -->
709 <a href="#algorithm-to-check-a-node" title="check a node">check
710 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
711 </ol></li>
712 <li>For each node in the <code>entities</code>, <code>notations</code>,
713 and <code>elementTypes</code> lists of <var>n</var>,
714 <a href="#algorithm-to-check-a-node" title="check a node">check the
715 node</a> recursively.</li>
716 <li class=ed>@@ externally declared?</li>
717 </ol>
718 </dd>
719 <dt>If <var>n</var> is an <code>Element</code> node</dt>
720 <dd>
721 <ol>
722 <li><a href="#algorithm-to-validate-an-ncname">Validate the
723 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
724 <li>If the <code>prefix</code> attribute value is different from
725 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
726 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
727 <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
728 in the <code>childNodes</code> list of <var>n</var>,
729 <ol>
730 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
731 <code>Element</code>,
732 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
733 <code>ProcessingInstruction</code>, or
734 <code>EntityReference</code> node, then it is an
735 <a href="#xml-well-formedness-error" id=wfe-element-child><code>xml-well-formedness-error</code></a>.</li>
736 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
737 <code>EntityReference</code> node, then it is an
738 <a href="#entity-error" id=ee-element-er><code>entity-error</code></a>.</li>
739 <li>Otherwise,
740 <a href="#algorithm-to-check-a-node" title="check a node">check
741 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
742 </ol></li>
743 <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
744 <li>Let <var>attrs</var> be the value of the <code>attribute</code>
745 attribute of <var>n</var>. Check conformance of <var>attrs</var>
746 as following:
747 <ol>
748 <li>If <var>attrs</var> contains an <code>Attr</code> node whose
749 <code>nodeName</code> attribute value is equal to that of another
750 <code>Attr</code> node in <var>attrs</var>, then raise an
751 <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
752 #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
753 <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
754 </ol>
755 </li>
756 </ol>
757 </dd>
758 <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
759 <dd>
760 <ol>
761 <li class=ed></li>
762 <li>If the <code>childNodes</code> list of <var>n</var> contains
763 any nodes, they are in
764 <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
765 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
766 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
767 <li class=ed>@@ #vc-MixedChildrenUnique
768 Validity constraint: No Duplicate Types</li>
769 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
770 <li>If there is more than one <code>AttributeDefinition</code> node
771 with <span class=ed>attribute type <code>ID</code></span> in the
772 <code>NamedNodeMap</code> list contained in the
773 <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
774 <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
775 <li>If there is more than one <code>AttributeDefinition</code> node
776 with <span class=ed>attribute type <code>NOTATION</code></span> in the
777 <code>NamedNodeMap</code> list contained in the
778 <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
779 <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
780 <li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
781 </ol>
782 </dd>
783 <dt>If <var>n</var> is an <code>Entity</code> node whose
784 <code>notationName</code> attribute value is <code>null</code> (i.e. a
785 parsed entity)</dt>
786 <dd>
787 <ol>
788 <li>Raise an
789 <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
790 <li><a href="#algorithm-to-validate-an-ncname">Validate the
791 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
792 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
793 the <code>publicId</code> attribute value as a public identifier</a>.</li>
794 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
795 the <code>systemId</code> attribute value as a system identifier</a>.</li>
796 <li>If the <code>publicId</code> attribute value of <var>n</var> is
797 <em>not</em> <code>null</code> and the <code>systemId</code> attribute
798 value of <var>n</var> is <code>null</code>, then raise an
799 <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
800 <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
801 in the <code>childNodes</code> list of <var>n</var>,
802 <ol>
803 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
804 <code>Element</code>,
805 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
806 <code>ProcessingInstruction</code>, or <code>EntityReference</code>
807 node, then it is an
808 <a href="#xml-well-formedness-error" id=wfe-entity-child><code>xml-well-formedness-error</code></a>.</li>
809 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
810 <code>EntityReference</code> node, then it is an
811 <a href="#entity-error" id=ee-er-er><code>entity-error</code></a>.</li>
812 <li>Otherwise,
813 <a href="#algorithm-to-check-a-node" title="check a node">check
814 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
815 </ol></li>
816 </ol>
817 </dd>
818 <dt>If <var>n</var> is an <code>Entity</code> node whose
819 <code>notationName</code> attribute value is <em>not</em> <code>null</code>
820 (i.e. an unparsed entity)</dt>
821 <dd>
822 <ol>
823 <li><a href="#algorithm-to-validate-an-ncname">Validate the
824 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
825 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
826 the <code>publicId</code> attribute value as a public identifier</a>.</li>
827 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
828 the <code>systemId</code> attribute value as a system identifier</a>.</li>
829 <li>If the <code>systemId</code> attribute value of <var>n</var> is
830 <code>null</code>, then raise an
831 <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
832 <li><a href="#algorithm-to-validate-an-ncname">Validate the
833 <code>notationName</code> attribute value of <var>n</var> as an
834 <code>NCName</code></a>.</li>
835 <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
836 <li>If the <code>childNodes</code> list of <var>n</var> contains
837 any nodes, they are in
838 <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
839 </ol>
840 </dd>
841 <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
842 <dd>
843 <ol>
844 <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
845 <li><a href="#algorithm-to-validate-an-ncname">Validate the
846 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
847 <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
848 in the <code>childNodes</code> list of <var>n</var>,
849 <ol>
850 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
851 <code>Element</code>,
852 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
853 <code>ProcessingInstruction</code>, or <code>EntityReference</code>
854 node, then it is an
855 <a href="#xml-well-formedness-error" id=wfe-er-child><code>xml-well-formedness-error</code></a>.</li>
856 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is <em>not</em> an
857 <code>EntityReference</code> node, then it is an
858 <a href="#entity-error" id=ee-entity-er><code>entity-error</code></a>.</li>
859 <li>Otherwise,
860 <a href="#algorithm-to-check-a-node" title="check a node">check
861 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
862 </ol></li>
863 </ol>
864 </dd>
865 <dt>If <var>n</var> is a <code>Notation</code> node</dt>
866 <dd>
867 <ol>
868 <li><a href="#algorithm-to-validate-an-ncname">Validate the
869 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
870 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
871 the <code>publicId</code> attribute value as a public identifier</a>.</li>
872 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
873 the <code>systemId</code> attribute value as a system identifier</a>.</li>
874 <li>If the <code>childNodes</code> list of <var>n</var> contains
875 any nodes, they are in
876 <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
877 </ol>
878 </dd>
879 <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
880 <dd>
881 <ol>
882 <li>If the <code>target</code> attribute value matches to the string
883 <code>xml</code> in any case combination, then raise a
884 <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
885 <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
886 <code>target</code> attribute value as an <code>NCName</code></a>.</li>
887 <li>Then,
888 <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
889 the <code>data</code> attribute value as an <abbr>XML</abbr> character
890 data</a>.</li>
891 <li>If the <code>data</code> attribute value contains a string
892 <code>?></code>, then raise a
893 <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
894 <li>If the <code>data</code> attribute value starts with either
895 <code class=char>U+0009</code> <code class=charname>CHARACTER
896 TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
897 FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
898 RETURN</code>, or <code class=char>U+0020</code>
899 <code class=charname>SPACE</code> character, then raise a
900 <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
901 <li>If the <code>childNodes</code> list of <var>n</var> contains
902 any nodes, then raise an
903 <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
904 </ol>
905 </dd>
906 <dt>If <var>n</var> is a <code>Text</code> node</dt>
907 <dd>
908 <ol>
909 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
910 <code>data</code> attribute value as an <abbr>XML</abbr> character
911 data</a>.</li>
912 <li>If the <code>childNodes</code> list of <var>n</var> contains
913 any nodes, they are in
914 <a href="#xml-well-formedness-error" id=wfe-text-child><code>xml-well-formedness-error</code></a>.</li>
915 </ol>
916 </dd>
917 <dt>Otherwise</dt>
918 <dd><span class=ed>xml-well-formedness-error? unknown-error?</span></dd>
919 </dl>
920
921 </div>
922
923 </div>
924
925 <div id=classes class=section>
926 <h2>Character Classes</h2>
927
928 <p>This section defines a couple of character classes.
929 These classes are referred to by algorithms specified above.</p>
930
931 <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
932 contains the following characters:</p>
933 <ul>
934 <li><code class=char>U+0009</code> <code class=charname>CHARACTER
935 TABULATION</code></li>
936 <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
937 <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
938 RETURN</code></li>
939 <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
940 .. <code class=char>U+D7FF</code></li>
941 <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
942 <code class=charname>REPLACEMENT CHARACTER</code></li>
943 <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
944 </ul>
945 <div class="note memo">
946 <p>This character class contains all characters allowed in the production rule
947 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
948 of <abbr>XML</abbr> 1.0
949 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
950 </div>
951
952 <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
953 contains the following characters:</p>
954 <ul>
955 <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
956 </ul>
957
958 <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
959 contains the following characters:</p>
960 <ul>
961 <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
962 <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
963 <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
964 AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
965 PROGRAM COMMAND</code></li>
966 <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
967 <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
968 <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
969 <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
970 <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
971 <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
972 <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
973 <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
974 <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
975 <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
976 <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
977 <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
978 <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
979 <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
980 <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
981 <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
982 <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
983 </ul>
984 <div class="note memo">
985 <p>This character class contains the characters listed in the Note in
986 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
987 of <abbr>XML</abbr> 1.0
988 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
989 as amended by
990 <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
991 </div>
992
993 <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
994 contains the following characters:</p>
995 <ul class=ed>
996 </ul>
997 <div class="note memo">
998 <p>This character class contains all characters allowed as the first character
999 of a string matching to the production rule
1000 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1001 of <abbr>XML</abbr> 1.0
1002 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1003 </div>
1004
1005 <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1006 contains the following characters:</p>
1007 <ul>
1008 <li>The characters in the character class
1009 <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1010 <li class=ed></li>
1011 </ul>
1012 <div class="note memo">
1013 <p>This character class contains all characters allowed as the second
1014 character of a string matching to the production rule
1015 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1016 of <abbr>XML</abbr> 1.0
1017 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1018 </div>
1019
1020 <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1021 contains the following characters:</p>
1022 <ul>
1023 <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1024 TABULATION</code></li>
1025 <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1026 <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1027 RETURN</code></li>
1028 <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1029 <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1030 (<code class=char>!</code>)</li>
1031 <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1032 (<code class=char>$</code>)</li>
1033 <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1034 (<code class=char>#</code>)</li>
1035 <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1036 (<code class=char>%</code>)</li>
1037 <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1038 (<code class=char>'</code>)</li>
1039 <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1040 (<code class=char>(</code>)</li>
1041 <li><code class=char>U+0029</code> <code class=charname>RIGHT
1042 PARENTHESIS</code> (<code class=char>)</code>)</li>
1043 <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1044 (<code class=char>*</code>)</li>
1045 <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1046 (<code class=char>+</code>)</li>
1047 <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1048 (<code class=char>,</code>)</li>
1049 <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1050 (<code class=char>-</code>)</li>
1051 <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1052 (<code class=char>.</code>)</li>
1053 <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1054 (<code class=char>/</code>)</li>
1055 <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1056 (<code class=char>0</code>) .. <code class=char>U+0039</code>
1057 <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1058 <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1059 (<code class=char>:</code>)</li>
1060 <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1061 (<code class=char>;</code>)</li>
1062 <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1063 (<code class=char>=</code>)</li>
1064 <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1065 (<code class=char>?</code>)</li>
1066 <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1067 (<code class=char>@</code>)</li>
1068 <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1069 A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1070 <code class=charname>LATIN CAPITAL LETTER Z</code>
1071 (<code class=char>Z</code>)</li>
1072 <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1073 (<code class=char>_</code>)</li>
1074 <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1075 A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1076 <code class=charname>LATIN CAPITAL LETTER Z</code>
1077 (<code class=char>Z</code>)</li>
1078 </ul>
1079 <div class="note memo">
1080 <p>This character class contains all characters allowed in the production rule
1081 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1082 of <abbr>XML</abbr> 1.0
1083 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1084 </div>
1085
1086 </div>
1087
1088 <div id="references" class="section reference">
1089 <h2>References</h2>
1090
1091 <div id="normative-references" class="section normative reference">
1092 <h3>Normative References</h3>
1093
1094 <dl>
1095 <dt id=ref-DOM3CORE>DOM3CORE</dt>
1096 <dd class=ed>@@ W3C DOM Level 3 Core</dd>
1097 <dt id=ref-DOMDTDEF>DOMDTDEF</dt>
1098 <dd class=ed>@@ manakai's extension to DOM for document type definitions</dd>
1099 <dt id="ref-KEYWORDS">KEYWORDS</dt>
1100 <dd><cite><a href="urn:ietf:rfc:2119">Key words for use in
1101 <abbr>RFCs</abbr> to Indicate Requirement Levels</a></cite>,
1102 <abbr title="Internet Engineering Task Force">IETF</abbr>
1103 <a href="urn:ietf:bcp:14"><abbr title="Best Current Practice">BCP</abbr>
1104 14</a>, <abbr title="Request for Comments">RFC</abbr> 2119,
1105 <time datetime=1997-03>March 1997</time>.
1106 <em>This</em> version of the specification is referenced.</dd>
1107 <dt id=ref-INFOSET>INFOSET</dt>
1108 <dd class=ed>@@</dd>
1109 <dt id=ref-XML10">XML10</dt>
1110 <dd><cite><a href="http://www.w3.org/TR/2006/REC-xml-20060816">Extensible
1111 Markup Language (<abbr>XML</abbr>) 1.0 (Fourth Edition)</a></cite>,
1112 <abbr>W3C</abbr> Recommendation, <time datetime=2006-08-16>16 August
1113 2006</time>, edited in place <time datetime=2006-09-29>29 September
1114 2006</time>. Tje latest version of the specification is available at
1115 <code class=uri>&lt;<a href="http://www.w3.org/TR/xml">http://www.w3.org/TR/xml</a>&gt;</code>.
1116 <em>This</em> version of the specification is referenced.</dd>
1117 <dt id=ref-XML11>XML11</dt>
1118 <dd class=ed>@@</dd>
1119 </dl>
1120 </div>
1121
1122 <div id="informative-references" class="section informative reference">
1123 <h3>Non$B!>(Bnormative References</h3>
1124
1125 <dl class=ed>
1126 <dt id=ref-CSS>CSS</dt>
1127 <dd><cite><a href="http://www.w3.org/TR/2007/CR-CSS21-20070719">Cascading
1128 Style Sheets Level 2 Revision 1 (<abbr>CSS</abbr> 2.1)
1129 Specification</a></cite>,
1130 <abbr title="World Wide Web Consortium">W3C</abbr> Candidate
1131 Recommendation, <time datetime=2007-07-19>19 July 2007</time>.
1132 Work in progress. The latest version of the specification is
1133 available at
1134 <code class=URI>&lt;<a href="http://www.w3.org/TR/CSS21">http://www.w3.org/TR/CSS21</a>></code>.</dd>
1135 <dt id=ref-CSSOM>CSSOM</dt>
1136 <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/csswg/cssom/Overview.html?rev=1.55&amp;content-type=text/html;%20charset=utf-8">Cascading
1137 Style Sheets Object Model (<abbr>CSSOM</abbr>)</a></cite>,
1138 <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1139 <time datetime=2007-06-18>18 June 2007</time>. Work in progress.
1140 The latest Editor's Draft of the specification is available at
1141 <code class=URI>&lt;<a href="http://dev.w3.org/csswg/cssom/Overview.html">http://dev.w3.org/csswg/cssom/Overview.html</a>></code>.</dd>
1142 <dt id=ref-HTML5>HTML5</dt>
1143 <dd><cite><a href="http://whatwg.org/html5"><abbr>HTML</abbr> 5</a></cite>,
1144 <abbr title="Web Hypertext Application Technology Working Group">WHATWG</abbr>
1145 Working Draft. Work in progress.</dd>
1146 <dt id=ref-SAPI>SAPI</dt>
1147 <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?rev=1.28&amp;content-type=text/html;%20charset=utf-8">Selectors <abbr>API</abbr></a></cite>,
1148 <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1149 <time datetime=2007-08-29>29 August 2007</time>. Work in progress.
1150 The latest Editor's Draft of the specification is available at
1151 <code class=URI>&lt;<a href="http://dev.w3.org/2006/webapi/selectors-api/Overview.html">http://dev.w3.org/2006/webapi/selectors-api/Overview.html</a>></code>.
1152 The latest published version of the specification is available at
1153 <code class=URI>&lt;<a href="http://www.w3.org/TR/selectors-api/">http://www.w3.org/TR/selectors-api/</a>></code>.</dd>
1154 <dt id=XBL2>XBL2</dt>
1155 <dd><cite><a href="http://www.mozilla.org/projects/xbl/xbl2.html"><abbr>XBL</abbr>
1156 2.0</a></cite>, Mozilla.org, <time datetime=2007-03-15>15 Mar 2007</time>.
1157 Work in progress. The latest
1158 <abbr title="World Wide Web Consortium">W3C</abbr>$B!>(Bpublished
1159 version of the specification is available at
1160 <code class=URI>&lt;<a href="http://www.w3.org/TR/xbl/">http://www.w3.org/TR/xbl/</a>></code>.</dd>
1161 <dt id=ref-XML5>XML5</dt>
1162 <dd class=ed>@@</dd>
1163 </dl>
1164 </div>
1165
1166 </div>
1167
1168 </body>
1169 </html>

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24