/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Contents of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.23 - (show annotations) (download) (as text)
Sun Mar 16 12:37:17 2008 UTC (16 years, 8 months ago) by wakaba
Branch: MAIN
Changes since 1.22: +8 -1 lines
File MIME type: text/html
Notes TODOs

1 <!DOCTYPE html>
2 <html lang="en" class="formal-specification status-work-in-progress">
3 <head>
4 <title>manakai's XML Conformance Checking</title>
5 <link rel="stylesheet" href="http://suika.fam.cx/www/style/html/spec">
6 <link rel="license" href="http://suika.fam.cx/c/gnu/fdl">
7 <style>
8 a[id]::before {
9 content: "[" attr(id) "]";
10 color: gray;
11 font-size: 70%;
12 font-family: Courier New, monospace;
13 vertical-align: 60%;
14 }
15 </style>
16 </head>
17 <body class="has-abstract">
18
19 <div class="header">
20 <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21 <h2>Working Draft <time datetime=2008-03-16>16 March 2008</time></h2>
22
23 <dl class="versions-uri">
24 <dt>This Version</dt>
25 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
26 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
27 <dt>Latest Version</dt>
28 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc"
29 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc</a>&gt;</code></dd>
30 <dt>Latest Working Draft</dt>
31 <dd><code class="URI">&lt;<a href="http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work"
32 >http://suika.fam.cx/www/markup/xml/xmlcc/xmlcc-work</a>&gt;</code></dd>
33 <!-- <dt>Previous Version</dt> -->
34 <dt>Version History</dt>
35 <dd><code class=URI>&lt;<a href="http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html">http://suika.fam.cx/gate/cvs/markup/xml/xmlcc/xmlcc-work.en.html</a>&gt;</code></dd>
36 </dl>
37 <dl class="authors" id="author">
38 <dt>Author</dt>
39 <dd><a href="http://suika.fam.cx/~wakaba/who?" rel="author"
40 lang=ja>Wakaba</a>
41 <code class="mail">&lt;<a href="mailto:w@suika.fam.cx" rel="author"
42 >w@suika.fam.cx</a>&gt;</code></dd>
43 </dl>
44
45 <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46 href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47 Permission is granted to copy, distribute and/or modify this document
48 under the terms of the <a rel="license"
49 href="http://www.gnu.org/copyleft/fdl.html">GNU Free Documentation License</a>,
50 Version 1.2
51 or any later version published by the <a
52 href="http://www.fsf.org/">Free Software Foundation</a>;
53 with no Invariant Sections, no Front$B!>(BCover Texts, and no Back$B!>(BCover
54 Texts. A copy of the license is
55 available at <code class="URI">&lt;<a
56 href="http://www.gnu.org/copyleft/fdl.html"
57 rel="license">http://www.gnu.org/copyleft/fdl.html</a>&gt;</code>.</p>
58 </div>
59
60 <div id="abstract" class="section">
61 <h2>Abstract</h2>
62
63 <div class="ed issue">...</div>
64
65 </div>
66
67 <div id="status" class="section">
68 <h2>Status of This Document</h2>
69
70 <p class=section-info><em>This section describes the status of this document
71 at the time of its publication. Other documents
72 might supersede this document.</em></p>
73
74 <p>This document is a working draft, produced as part of the
75 <a href="http://suika.fam.cx/www/markup/html/whatpm/readme">Whatpm</a>
76 subproject of the
77 <a href="http://suika.fam.cx/www/manakai-core/doc/web/">manakai</a>
78 project. It might be updated, replaced, or obsoleted by
79 other documents at any time. It is inappropriate to
80 cite this document as other than <q>work in progress</q>.</p>
81
82 <p>Comments on this document are welcome and
83 may be sent to the <a href="#author">author</a>.</p>
84
85 <p>Translations of thie document might be available.
86 The English version of the document is the only
87 normative version.</p>
88 </div>
89
90 <div id="introduction" class="section">
91 <h2>Introduction</h2>
92
93 <p class=section-info>This section is <em>non$B!>(Bnormative</em>.</p>
94
95 <div class="issue ed">...</div>
96
97
98 </div>
99
100 <div class="section" id="terminology">
101 <h2>Terminology</h2>
102
103 <p>The key words <q><em class="rfc2119">MUST</em></q>,
104 <q><em class="rfc2119">MUST NOT</em></q>,
105 <q><em class="rfc2119">SHOULD</em></q>,
106 <q><em class="rfc2119">SHOULD NOT</em></q>, and
107 <q><em class=rfc2119>MAY</em></q> in this
108 document are to be interpreted as described in
109 <a href="urn:ietf:rfc:2119"><abbr title="Request for Comments">RFC</abbr>
110 2119</a>
111 <cite class="bibref normative">[<a href="#ref-KEYWORDS">KEYWORDS</a>]</cite>.</p>
112
113 <!-- Strictly speaking the paragraph below is not part of terminology. -->
114 <p>All examples and notes in this specification are non$B!>(Bnormative,
115 as are all sections explicitly marked non$B!>(Bnormative. Everything
116 else in this specification is normative.</p>
117
118 <p><span class=ed>Algorithm is normative but non-normative</span>.
119 In addition, the order in which <a href="#errors">errors</a> are
120 raised is undefined.</p>
121
122 <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123 by hyperlinks. When the document being processed is an <abbr>XML</abbr> 1.1
124 document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125 specification should be consulted instead.</p>
126 </div>
127
128
129 <div class=section id=error-categories>
130 <h2>Error Classification</h2>
131
132 <p class=ed>If a <code>Document</code> node has no
133 xml-well-formedness-error, entity-error, and unknown-error,
134 then it is well-formed. If a well-formed <code>Document</code>
135 node has no xml-validity-error, it is valid.</p>
136
137 <div class="note memo">
138 <p>A well$B!>(Bformed <code>Document</code> can be safely serialized
139 into a well$B!>(Bformed XML document. A valid <code>Document</code>
140 can be easily serialized into a valid XML document.</p>
141 </div>
142
143 <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
144 ...</p>
145
146 <p>Errors are classified into these
147 <dfn id=error-category title="error category">error categories</dfn>:</p>
148
149 <dl id=errors>
150 <dt><dfn id=entity-error><code>entity-error</code></dfn></dt>
151 <dd>
152 <p class=ed>@@</p>
153 <div class="note memo">
154 <p>This <span class=ed>algorithm</span> does not support
155 <abbr>DOM</abbr> tree with one or more <code>EntityReference</code>
156 nodes. It is expected that any entity references are expanded
157 at the parse time and any unexpandable entity references
158 make parse time errors raised so that never result in
159 <abbr>DOM</abbr> tree with <code>EntityReference</code> nodes.</p>
160 </div>
161 </dd>
162 <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
163 <dd class=ed>@@</dd>
164 <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
165 <dd>
166 <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
167 will be raised when a construct, which might not be restored to the same
168 construct when it is serialized and then re-parsed by a conforming
169 processor, is encountered.</p>
170 <div class="example memo">
171 <p>For a <code>Comment</code> node a
172 <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
173 be raised, since <abbr>XML</abbr> processors are not required to
174 report texts of comments for applications.</p>
175 </div>
176 </dd>
177 <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
178 <dd class=ed>@@</dd>
179 <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
180 <dd>An <abbr>XML</abbr> <i>error</i> (<abbr>XML</abbr> 1.0
181 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
182 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-error">error</a> /
183 <abbr>XML</abbr> 1.1
184 <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
185 <a href="@@/#dt-error">error</a>)
186 that is not classified to any other <a href="#error-category">error
187 category</a>.</dd>
188 <dt><dfn id=xml-misc-fatal-error><code>xml-misc-fatal-error</code></dt>
189 <dd>An <abbr>XML</abbr> <i>fatal error</i> (<abbr>XML</abbr> 1.0
190 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>
191 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-fatal">fatal
192 error</a> / <abbr>XML</abbr> 1.1
193 <cite class="bibref normative">[<a href="#ref-XML11">XML11</a>]</cite>
194 <a href="@@/#dt-fatal">fatal error</a>)
195 that is not classified to any other <a href="#error-category">error
196 category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
197 <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
198 <dd>
199 <p>An
200 <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
201 will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
202 in <abbr>XML</abbr> specification is not met.</p>
203 </dd>
204 <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
205 <dd>A violation of validity constraint in XML document.</dd>
206 <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
207 <dd><p>If an <code>xml-well-formedness-error</code> is raised,
208 it would not be possible to generate an XML serialization that
209 would match to the appropriate production rule and that would not
210 violate to any well$B!>(Bformedness constraint in XML
211 specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
212 <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
213 </dl>
214
215 <div class=ed><p>@@ TODO: #dt-atuseroption at user option
216 (MAY or MUST), #dt-compat for compatibility,
217 #dt-interop for interoperability</p>
218
219 <p>TODO: XML 1.1, XML Namespace 1.0/1.1, xml:base, xml:id
220 </div>
221
222 </div>
223
224 <div class=section id=parsing-xml>
225 <h2>Parsing <abbr>XML</abbr> Document</h2>
226
227 <p>When an <abbr>XML</abbr> document is parsed, the following clauses
228 are applied:</p>
229 <dl>
230 <dt>For each document
231 <dd>If the <abbr>XML</abbr> document does not begin with an
232 <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
233 raise an
234 <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
235 <dt>For each internal general entity declaration processed by the parser
236 <dd>If the
237 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
238 part of the general entity declaration contains a bare <code>U+003C</code>
239 <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
240 <em class=rfc2119>MUST</em> raise an
241 <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
242 "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
243 -->
244 <dt>For each element type declaration processed by the parser
245 <dd>If there is another element type declaration whose <code>Name</code>
246 is equal to the <code>Name</code> of the element type declaration, then
247 the parser <em class=rfc2119>MUST</em> raise an
248 <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
249 <!--
250 NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
251 will not be warned.
252 -->
253 <dt>For each empty-element tag
254 <dd>If the <code>Name</code> of the tag is not declared by a processed
255 element type declaration as <code>EMPTY</code> content, then the parser
256 <em class=rfc2119>MUST</em> raise an
257 <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
258 <dt>For each start-tag
259 <dd>If the <code>Name</code> of the tag is declared by a processed element
260 type declaration as <code>EMPTY</code> content, then the parser
261 <em class=rfc2119>MUST</em> raise an
262 <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
263 <!--
264
265 #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
266 #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset
267 #ExtSubset Well-formedness constraint: External Subset
268 #PE-between-Decls Well-formedness constraint: PE Between Declarations
269 #GIMatch Well-formedness constraint: Element Type Match
270 #uniqattspec Well-formedness constraint: Unique Att Spec
271 #NoExternalRefs Well-formedness constraint: No External Entity References
272 #CleanAttrVals Well-formedness constraint: No < in Attribute Values
273 #vc-PEinGroup Validity constraint: Proper Group/PE Nesting
274 "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."
275 "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."
276 #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting
277 #wf-Legalchar Well-formedness constraint: Legal Character
278 #textent Well-formedness constraint: Parsed Entity
279 #norecursion Well-formedness constraint: No Recursion
280 #indtd Well-formedness constraint: In DTD
281 "External parsed entities SHOULD each begin with a text declaration."
282 "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
283 #UniqueNotationName Validity constraint: Unique Notation Name
284
285 @@ Need detailed review, but maybe should be in parsing phase
286
287 #vc-check-rmd Validity constraint: Standalone Document Declaration
288
289 @@ Need dtailed review
290
291 #wf-entdeclared Well-formedness constraint: Entity Declared
292 #vc-entdeclared Validity constraint: Entity Declared
293 "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
294 "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
295
296 @@ flaged and then reported in DOM check phase
297
298 "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
299 "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
300 "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
301
302 @@ in parsing phase
303
304 "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
305
306 @@ We should phrase out what the parser should do where the XML specification
307 left undefined. For example: Comment must be converted to a Comment node,
308 illegal xml:space value must be preserved, so on.
309
310 -->
311 </dl>
312 </div>
313
314 <div class="section" id=checking-dom>
315 <h2>Checking <abbr>DOM</abbr></h2>
316
317 <p>The following algorithms and definitions are applied to
318 <abbr>XML</abbr> documents; especially, they are not applied
319 to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
320
321 <div class=section id=definitions>
322 <h3>Definitions</h3>
323
324 <p>The <dfn id=xml-version><abbr>XML</abbr> version</dfn> of a node is
325 the <abbr>XML</abbr> version of the document to which the node belongs.
326 For a <code>Documemt</code> node, the <abbr>XML</abbr> version
327 of the document is the value of the <code>xmlVersion</code>
328 attribute of the node. For a <code>DocumentType</code> node whose
329 <code>ownerDocument</code> attribute is set to <code>null</code>,
330 the <abbr>XML</abbr> version of the document is <code>1.0</code><!--
331 since createDocumentType [DOM3CORE] assumes XML 1.0 [XML10TE] -->.
332 For any other node, the <abbr>XML</abbr> version of the document
333 is that of the <code>Document</code> node contained in the
334 <code>ownerDocument</code> attribute of the node.</p>
335
336 </div>
337
338
339 <div class=section id=checking-components>
340 <h3>Conformance Checking Algorithms for Components</h3>
341
342 <p>To
343 <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
344 validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
345 the following algorithm <em class=rfc2119>MUST</em> be used:</p>
346 <ol>
347 <li>If <var>s</var> contains a character that is <em>not</em> in
348 the character class <a href="#class-Char10"><code>Char10</code></a>,
349 then raise an
350 <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
351 <li>If <var>s</var> contains a character that is in the character
352 class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
353 then raise an
354 <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
355 <li>If <var>s</var> contains a character that is in the character
356 class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
357 then raise an
358 <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
359 <li class=ed>@@ XML 1.1 support</li>
360 <li>If <var>s</var> contains a <code class=char>U+000D</code>
361 <code class=charname>CARRIAGE RETURN</code> character, then
362 raise a
363 <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
364 <span class=ed>@@ We should not raise duplicate errors for U+000D
365 in attribute values. In addition, we should support a mode where
366 U+000D will be serialized as &#x000D; (so that no round-trip-error
367 will be raised).</span></li>
368 </ol>
369
370 <p>To
371 <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
372 <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
373 algorithm <em class=rfc2119>MUST</em> be used:</p>
374 <ol>
375 <li>If <var>s</var> is an empty string, then raise an
376 <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
377 Abort these steps.</li>
378 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
379 <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
380 <li>If the first character in <var>s</var> is a character that is
381 <em>not</em> in the character class
382 <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
383 an
384 <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
385 <li>If a character other than the first character in <var>s</var> is a
386 character that is <em>not</em> in the character class
387 <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
388 <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
389 <li>If <var>s</var> begins with the string <code>xml</code> (in any
390 case combination), then raise an
391 <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
392 <span class=ed>@@ except for attribute names <code>xml:lang</code>,
393 <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
394 <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
395 pi name <code>xml-stylesheet</code>.</span><!--
396 "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
397 xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
398 --></li>
399 <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
400 [XML11]. -->
401 <li class=ed>@@ XML 1.1 support</li>
402 </ol>
403
404 <p>To
405 <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
406 an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
407 following algorithm <em class=rfc2119>MUST</em> be used:</p>
408 <ol>
409 <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
410 <var>s</var> as a <code>Name</code></a>.</li>
411 <li class=ed>@@</li>
412 </ol>
413
414 <p>To
415 <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
416 a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
417 following algorithm <em class=rfc2119>MUST</em> be used:</p>
418 <ol>
419 <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
420 <li>If <var>pid</var> contains a character that is <em>not</em> in the
421 character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
422 raise an
423 <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
424 <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
425 <code class=charname>CHARACTER TABULATION</code>,
426 <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
427 and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
428 characters, if the first character of <var>pid</var> is
429 <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
430 if the last character of <var>pid</var> is <code class=char>U+0020</code>
431 <code class=charname>SPACE</code> character, or if there is a
432 <code class=char>U+0020</code> <code class=charname>SPACE</code> character
433 immediately followed by another <code class=char>U+0020</code>
434 <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
435 <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
436 <span class=ed>Is this really a roundtripness problem? XML spec
437 does only define the way to match public identifiers in fact, no
438 canonical form.</span></li>
439 </ol>
440
441 <p>To
442 <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
443 a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
444 following algorithm <em class=rfc2119>MUST</em> be used:</p>
445 <ol>
446 <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
447 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
448 <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
449 <li>If <var>sid</var> contains both <code class=char>U+0022</code>
450 <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
451 <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
452 (<code class=char>'</code>) characters, raise an
453 <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
454 <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
455 <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
456 character, then raise an
457 <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
458 <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
459 reference, then raise a fact-level error (xml-misc-warning?).<!--
460 XML 1.0 has no conformance language for system identifier being
461 a URI reference. --></li>
462 </ol>
463
464 </div>
465
466 <div class=section id=checking-node>
467 <h3>Checking <code>Node</code></h3>
468
469 <p>The algorithm <dfn title="to check a node" id=algorithm-to-check-a-node>to
470 check a node (<dfn id=var-n><var>n</var></dfn>)</dfn> is defined as
471 following:</p>
472 <dl class=switch>
473 <dt>If <var>n</var> is an <code>Attr</code> node</dt>
474 <dd>
475 <ol>
476 <li><a href="#algorithm-to-validate-an-ncname">Validate the
477 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
478 <li>If the <code>prefix</code> attribute value is different from
479 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
480 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
481 <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
482 <code>childNodes</code> list of <var>n</var>,
483 <ol>
484 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
485 <code>Text</code> or <code>EntityReference</code> node, then it is an
486 <a href="#xml-well-formedness-error" id=wfe-attr-child><code>xml-well-formedness-error</code></a>.</li>
487 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
488 <code>EntityReference</code> node, then it is an
489 <a href="#entity-error" id=ee-attr-er><code>entity-error</code></a>.</li>
490 <li>Otherwise,
491 <a href="#algorithm-to-check-a-node" title="check a node">check
492 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
493 </ol></li>
494 <li>If <code>nodeName</code> attribute of <var>n</var> is
495 <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
496 and <code>value</code> attribute of <var>n</var> is neither
497 <code>default</code> nor <code>preserve</code>, then it is an
498 <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
499 Note that <!ATTLIST e xml:space (default|preserve) "a">...
500 <e xml:space="default"/> is conforming and valid. --></li>
501 <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
502 successor] or an empty string, then xml-misc-warning
503 (a "fact"-level error; not an XML error).</li>
504 <li class=ed>@@ <code>specified</code>,
505 <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
506 <!-- xml:space MUST be declared to be valid. -->
507 <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
508 attribute <code>value</code> of <var>n</var>.</li>
509 <li>Validate the <var>n</var> against the <span class=ed>declared
510 type</span> as following:
511 <dl class=switch>
512 <dt class=ed><code>ID_ATTR</code></dt>
513 <dd>
514 <ol>
515 <li><span class=ed>Validate <var>v</var> as an
516 <code>Name</code>.</span> If it fails, then raise an
517 <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
518 <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
519 then raise an
520 <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
521 </ol>
522 </dd>
523 <dt class=ed>IDREF_ATTR</dt>
524 <dd>
525 <ol>
526 <li><span class=ed>Validate <var>v</var> as an
527 <code>Name</code>.</span> If it fails, then raise an
528 <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
529 <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
530 defined,</span> then raise an
531 <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
532 </ol>
533 </dd>
534 <dt class=ed>IDREFS_ATTR</dt>
535 <dd class=ed>@@</dd>
536 <dt class=ed>ENTITY_ATTR</dt>
537 <dd>
538 <ol>
539 <li><span class=ed>Validate <var>v</var> as an
540 <code>Name</code>.</span> If it fails, then raise an
541 <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
542 <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
543 defined,</span> then raise an
544 <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
545 </ol>
546 </dd>
547 <dt class=ed>ENTITIES_ATTR</dt>
548 <dd class=ed>@@</dd>
549 <dt class=ed>NMTOKEN_ATTR</dt>
550 <dd>
551 <ol>
552 <li><span class=ed>Validate <var>v</var> as an
553 <code>Nmtoken</code>.</span> If it fails, then raise an
554 <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
555 </ol>
556 </dd>
557 <dt class=ed>NMTOKENS_ATTR</dt>
558 <dd class=ed>@@</dd>
559 <dt class=ed>NOTATION_ATTR</dt>
560 <dd class=ed><var>v</var> must be one of enumerated values.
561 If not, then raise an
562 <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
563 <dt class=ed>ENUMERATED_ATTR</dt>
564 <dd class=ed><var>v</var> must be one of enumerated values.
565 If not, then raise an
566 <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
567 <dt class=ed>@@</dt>
568 </dl>
569 </li>
570 <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
571 raise an
572 <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
573 <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
574 <li class=ed>@@ strict serialization error for U+000D, U+000A, and
575 U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
576 </ol>
577 </dd>
578 <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
579 <dd>
580 <ol>
581 <li class=ed></li>
582 <li>If <code>nodeName</code> attribute of <var>n</var> is
583 <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
584 and <span class=ed>its declared type is different from (default|preserve),
585 (preserve|default), (default), or (preserve)</span>, then raise an
586 <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
587 <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
588 <code>childNodes</code> list of <var>n</var>,
589 <ol>
590 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
591 <code>Text</code> or <code>EntityReference</code> node, then it is an
592 <a href="#xml-well-formedness-error" id=wfe-at-child><code>xml-well-formedness-error</code></a>.</li>
593 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
594 <code>EntityReference</code> node, then it is an
595 <a href="#entity-error" id=ee-at-er><code>entity-error</code></a>.</li>
596 <li>Otherwise,
597 <a href="#algorithm-to-check-a-node" title="check a node">check
598 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
599 </ol></li>
600 <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
601 be declared. If not, then raise an
602 <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
603 <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
604 values MUST all be distinct. If not, then raise an
605 <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
606 <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
607 element, then raise an
608 <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
609 <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
610 </ol>
611 </dd>
612 <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
613 <dd>
614 <ol>
615 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
616 <code>data</code> attribute value as an <abbr>XML</abbr> character
617 data</a>.</li>
618 <li>If the <code>data</code> attribute value contains
619 a string <code>]]></code>, then raise an
620 <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
621 <li>If the <code>childNodes</code> list of <var>n</var> contains
622 any nodes, they are in
623 <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
624 </ol>
625 </dd>
626 <dt>If <var>n</var> is a <code>Comment</code> node</dt>
627 <dd>
628 <ol>
629 <li>Raise an
630 <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
631 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
632 <code>data</code> attribute value as an <abbr>XML</abbr> character
633 data</a>.</li>
634 <li>If the <code>data</code> attribute value contains
635 a string <code>--</code>, or if it ends with a character
636 <code>-</code>, then raise an
637 <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
638 "for compatibility" --></li>
639 <li>If the <code>childNodes</code> list of <var>n</var> contains
640 any nodes, they are in
641 <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
642 </ol>
643 </dd>
644 <dt>If <var>n</var> is a <code>Document</code> node</dt>
645 <dd>
646 <ol>
647 <li>If <a href="#xml-version"><abbr>XML</abbr> version</a> of <var>n</var>
648 is different from <code>1.0</code> or <code>1.1</code>,
649 then it is an <code class=ed>unknown-error?</code>.</li>
650 <li>If the <code>xmlEncoding</code> attribute value does <em>not</em>
651 match to <code>[A-Za-z] ([A-Za-z0-9._] | '-')*</code>
652 <span class=ed>@@ formal def</span>, then it is an
653 <a href="#xml-well-formedness-error" id=wfe-encoding><code>xml-well-formedness-error</code></a>.</li>
654 <li>The <code>childNodes</code> list of <var>n</var> have to
655 consist of zero or more <code>Comment</code> and/or
656 <code>ProcessingInstruction</code> nodes, followed by
657 an optional <code>DocumentType</code> node, followed
658 by zero or more <code>Comment</code> and/or
659 <code>ProcessingInstruction</code> nodes, followed by
660 an <code>Element</code> node, followed
661 by zero or more <code>Comment</code> and/or
662 <code>ProcessingInstruction</code> nodes. Any violation to this is an
663 <a href="#xml-well-formedness-error" id=wfe-document-child><code>xml-well-formedness-error</code></a>.</li>
664 <li>For each node <dfn id=var-document-nc><var>n<sub><var>c</var></sub></var></dfn>
665 in the <code>childNodes</code> list of <var>n</var>,
666 <ol>
667 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
668 <code>EntityReference</code> node, then
669 <a href="#algorithm-to-check-a-node" title="check a node">check
670 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
671 </ol></li>
672 <li class=ed>@@ <code>allDeclarationsProcessed</code></li>
673 </ol>
674 </dd>
675 <dt>If <var>n</var> is a <code>DocumentFragment</code> node</dt>
676 <dd>
677 <ol>
678 <li>For each node <dfn id=var-df-nc><var>n<sub><var>c</var></sub></var></dfn>
679 in the <code>childNodes</code> list of <var>n</var>,
680 <ol>
681 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
682 <code>Element</code>, <code>Text</code>, <code>CDATASection</code>,
683 <code>Comment</code>, <code>ProcessingInstruction</code>, or
684 <code>EntityReference</code> node, then it is an
685 <a href="#xml-well-formedness-error" id=wfe-df-child><code>xml-well-formedness-error</code></a>.</li>
686 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
687 <code>EntityReference</code> node, then it is an
688 <a href="#entity-error" id=ee-df-er><code>entity-error</code></a>.</li>
689 <li>Otherwise,
690 <a href="#algorithm-to-check-a-node" title="check a node">check
691 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
692 </ol></li>
693 </ol>
694 </dd>
695 <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
696 <dd>
697 <ol>
698 <li><a href="#algorithm-to-validate-an-ncname">Validate the
699 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
700 <li>Follow the following substeps:
701 <ol>
702 <li>If <code>ownerDocument</code> attribute of <var>n</var> is
703 <code>null</code>, then abort these substeps.</li>
704 <li>If <code>documentElement</code> attribute of the node
705 set to <code>ownerDocument</code> attribute of <var>n</var> is
706 <code>null</code>, then abort these substeps.</li>
707 <li>If <code>nodeName</code> attribute of the node set to
708 <code>documentElement</code> attribute of the node set to
709 <code>ownerDocument</code> attribute of <var>n</var> is
710 different from <code>nodeName</code> of <var>n</var>,
711 then raise an
712 <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
713 </ol>
714 </li>
715 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
716 the <code>publicId</code> attribute value as a public identifier</a>.</li>
717 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
718 the <code>systemId</code> attribute value as a system identifier</a>.</li>
719 <li>If the <code>publicId</code> attribute value of <var>n</var> is
720 <em>not</em> <code>null</code> and the <code>systemId</code> attribute
721 value of <var>n</var> is <code>null</code>, then raise an
722 <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
723 <span class=ed>@@ publicId == null? Or, publicId == ""</span></li>
724 <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
725 in the <code>childNodes</code> list of <var>n</var>,
726 <ol>
727 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> a
728 <code>ProcessingInstruction</code> node, then it is an
729 <a href="#xml-well-formedness-error" id=wfe-dt-child><code>xml-well-formedness-error</code></a>.
730 <span class=ed>@@ ref to manakai's extensions</span></li>
731 <li>Otherwise, <!-- if n_c is not an EntityReference node, -->
732 <a href="#algorithm-to-check-a-node" title="check a node">check
733 <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
734 </ol></li>
735 <li>For each node in the <code>entities</code>, <code>notations</code>,
736 and <code>elementTypes</code> lists of <var>n</var>,
737 <a href="#algorithm-to-check-a-node" title="check a node">check the
738 node</a> recursively.</li>
739 <li class=ed>@@ externally declared?</li>
740 </ol>
741 </dd>
742 <dt>If <var>n</var> is an <code>Element</code> node</dt>
743 <dd>
744 <ol>
745 <li><a href="#algorithm-to-validate-an-ncname">Validate the
746 <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
747 <li>If the <code>prefix</code> attribute value is different from
748 <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
749 the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
750 <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
751 in the <code>childNodes</code> list of <var>n</var>,
752 <ol>
753 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
754 <code>Element</code>,
755 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
756 <code>ProcessingInstruction</code>, or
757 <code>EntityReference</code> node, then it is an
758 <a href="#xml-well-formedness-error" id=wfe-element-child><code>xml-well-formedness-error</code></a>.</li>
759 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
760 <code>EntityReference</code> node, then it is an
761 <a href="#entity-error" id=ee-element-er><code>entity-error</code></a>.</li>
762 <li>Otherwise,
763 <a href="#algorithm-to-check-a-node" title="check a node">check
764 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
765 </ol></li>
766 <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
767 <li>Let <var>attrs</var> be the value of the <code>attribute</code>
768 attribute of <var>n</var>. Check conformance of <var>attrs</var>
769 as following:
770 <ol>
771 <li>If <var>attrs</var> contains an <code>Attr</code> node whose
772 <code>nodeName</code> attribute value is equal to that of another
773 <code>Attr</code> node in <var>attrs</var>, then raise an
774 <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
775 #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
776 <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
777 </ol>
778 </li>
779 </ol>
780 </dd>
781 <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
782 <dd>
783 <ol>
784 <li class=ed></li>
785 <li>If the <code>childNodes</code> list of <var>n</var> contains
786 any nodes, they are in
787 <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
788 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
789 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
790 <li class=ed>@@ #vc-MixedChildrenUnique
791 Validity constraint: No Duplicate Types</li>
792 <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
793 <li>If there is more than one <code>AttributeDefinition</code> node
794 with <span class=ed>attribute type <code>ID</code></span> in the
795 <code>NamedNodeMap</code> list contained in the
796 <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
797 <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
798 <li>If there is more than one <code>AttributeDefinition</code> node
799 with <span class=ed>attribute type <code>NOTATION</code></span> in the
800 <code>NamedNodeMap</code> list contained in the
801 <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
802 <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
803 <li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
804 </ol>
805 </dd>
806 <dt>If <var>n</var> is an <code>Entity</code> node whose
807 <code>notationName</code> attribute value is <code>null</code> (i.e. a
808 parsed entity)</dt>
809 <dd>
810 <ol>
811 <li>Raise an
812 <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
813 <li><a href="#algorithm-to-validate-an-ncname">Validate the
814 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
815 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
816 the <code>publicId</code> attribute value as a public identifier</a>.</li>
817 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
818 the <code>systemId</code> attribute value as a system identifier</a>.</li>
819 <li>If the <code>publicId</code> attribute value of <var>n</var> is
820 <em>not</em> <code>null</code> and the <code>systemId</code> attribute
821 value of <var>n</var> is <code>null</code>, then raise an
822 <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
823 <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
824 in the <code>childNodes</code> list of <var>n</var>,
825 <ol>
826 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
827 <code>Element</code>,
828 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
829 <code>ProcessingInstruction</code>, or <code>EntityReference</code>
830 node, then it is an
831 <a href="#xml-well-formedness-error" id=wfe-entity-child><code>xml-well-formedness-error</code></a>.</li>
832 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is an
833 <code>EntityReference</code> node, then it is an
834 <a href="#entity-error" id=ee-er-er><code>entity-error</code></a>.</li>
835 <li>Otherwise,
836 <a href="#algorithm-to-check-a-node" title="check a node">check
837 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
838 </ol></li>
839 </ol>
840 </dd>
841 <dt>If <var>n</var> is an <code>Entity</code> node whose
842 <code>notationName</code> attribute value is <em>not</em> <code>null</code>
843 (i.e. an unparsed entity)</dt>
844 <dd>
845 <ol>
846 <li><a href="#algorithm-to-validate-an-ncname">Validate the
847 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
848 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
849 the <code>publicId</code> attribute value as a public identifier</a>.</li>
850 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
851 the <code>systemId</code> attribute value as a system identifier</a>.</li>
852 <li>If the <code>systemId</code> attribute value of <var>n</var> is
853 <code>null</code>, then raise an
854 <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
855 <li><a href="#algorithm-to-validate-an-ncname">Validate the
856 <code>notationName</code> attribute value of <var>n</var> as an
857 <code>NCName</code></a>.</li>
858 <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
859 <li>If the <code>childNodes</code> list of <var>n</var> contains
860 any nodes, they are in
861 <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
862 </ol>
863 </dd>
864 <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
865 <dd>
866 <ol>
867 <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
868 <li><a href="#algorithm-to-validate-an-ncname">Validate the
869 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
870 <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
871 in the <code>childNodes</code> list of <var>n</var>,
872 <ol>
873 <li>If <var>n<sub><var>c</var></sub></var> is <em>not</em> an
874 <code>Element</code>,
875 <code>Text</code>, <code>CDATASection</code>, <code>Comment</code>,
876 <code>ProcessingInstruction</code>, or <code>EntityReference</code>
877 node, then it is an
878 <a href="#xml-well-formedness-error" id=wfe-er-child><code>xml-well-formedness-error</code></a>.</li>
879 <li>Otherwise, if <var>n<sub><var>c</var></sub></var> is <em>not</em> an
880 <code>EntityReference</code> node, then it is an
881 <a href="#entity-error" id=ee-entity-er><code>entity-error</code></a>.</li>
882 <li>Otherwise,
883 <a href="#algorithm-to-check-a-node" title="check a node">check
884 <var>n<sub><var>c</var></sub></var></a> recursively.</li>
885 </ol></li>
886 </ol>
887 </dd>
888 <dt>If <var>n</var> is a <code>Notation</code> node</dt>
889 <dd>
890 <ol>
891 <li><a href="#algorithm-to-validate-an-ncname">Validate the
892 <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
893 <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
894 the <code>publicId</code> attribute value as a public identifier</a>.</li>
895 <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
896 the <code>systemId</code> attribute value as a system identifier</a>.</li>
897 <li>If the <code>childNodes</code> list of <var>n</var> contains
898 any nodes, they are in
899 <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
900 </ol>
901 </dd>
902 <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
903 <dd>
904 <ol>
905 <li>If the <code>target</code> attribute value matches to the string
906 <code>xml</code> in any case combination, then raise a
907 <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
908 <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
909 <code>target</code> attribute value as an <code>NCName</code></a>.</li>
910 <li>Then,
911 <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
912 the <code>data</code> attribute value as an <abbr>XML</abbr> character
913 data</a>.</li>
914 <li>If the <code>data</code> attribute value contains a string
915 <code>?></code>, then raise a
916 <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
917 <li>If the <code>data</code> attribute value starts with either
918 <code class=char>U+0009</code> <code class=charname>CHARACTER
919 TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
920 FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
921 RETURN</code>, or <code class=char>U+0020</code>
922 <code class=charname>SPACE</code> character, then raise a
923 <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
924 <li>If the <code>childNodes</code> list of <var>n</var> contains
925 any nodes, then raise an
926 <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
927 </ol>
928 </dd>
929 <dt>If <var>n</var> is a <code>Text</code> node</dt>
930 <dd>
931 <ol>
932 <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
933 <code>data</code> attribute value as an <abbr>XML</abbr> character
934 data</a>.</li>
935 <li>If the <code>childNodes</code> list of <var>n</var> contains
936 any nodes, they are in
937 <a href="#xml-well-formedness-error" id=wfe-text-child><code>xml-well-formedness-error</code></a>.</li>
938 </ol>
939 </dd>
940 <dt>Otherwise</dt>
941 <dd><span class=ed>xml-well-formedness-error? unknown-error?</span></dd>
942 </dl>
943
944 </div>
945
946 </div>
947
948 <div id=classes class=section>
949 <h2>Character Classes</h2>
950
951 <p>This section defines a couple of character classes.
952 These classes are referred to by algorithms specified above.</p>
953
954 <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
955 contains the following characters:</p>
956 <ul>
957 <li><code class=char>U+0009</code> <code class=charname>CHARACTER
958 TABULATION</code></li>
959 <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
960 <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
961 RETURN</code></li>
962 <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
963 .. <code class=char>U+D7FF</code></li>
964 <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
965 <code class=charname>REPLACEMENT CHARACTER</code></li>
966 <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
967 </ul>
968 <div class="note memo">
969 <p>This character class contains all characters allowed in the production rule
970 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
971 of <abbr>XML</abbr> 1.0
972 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
973 </div>
974
975 <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
976 contains the following characters:</p>
977 <ul>
978 <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
979 </ul>
980
981 <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
982 contains the following characters:</p>
983 <ul>
984 <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
985 <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
986 <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
987 AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
988 PROGRAM COMMAND</code></li>
989 <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
990 <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
991 <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
992 <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
993 <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
994 <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
995 <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
996 <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
997 <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
998 <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
999 <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
1000 <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
1001 <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
1002 <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
1003 <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
1004 <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
1005 <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
1006 </ul>
1007 <div class="note memo">
1008 <p>This character class contains the characters listed in the Note in
1009 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
1010 of <abbr>XML</abbr> 1.0
1011 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
1012 as amended by
1013 <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1014 </div>
1015
1016 <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1017 contains the following characters:</p>
1018 <ul class=ed>
1019 </ul>
1020 <div class="note memo">
1021 <p>This character class contains all characters allowed as the first character
1022 of a string matching to the production rule
1023 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1024 of <abbr>XML</abbr> 1.0
1025 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1026 </div>
1027
1028 <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1029 contains the following characters:</p>
1030 <ul>
1031 <li>The characters in the character class
1032 <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1033 <li class=ed></li>
1034 </ul>
1035 <div class="note memo">
1036 <p>This character class contains all characters allowed as the second
1037 character of a string matching to the production rule
1038 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1039 of <abbr>XML</abbr> 1.0
1040 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1041 </div>
1042
1043 <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1044 contains the following characters:</p>
1045 <ul>
1046 <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1047 TABULATION</code></li>
1048 <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1049 <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1050 RETURN</code></li>
1051 <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1052 <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1053 (<code class=char>!</code>)</li>
1054 <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1055 (<code class=char>$</code>)</li>
1056 <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1057 (<code class=char>#</code>)</li>
1058 <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1059 (<code class=char>%</code>)</li>
1060 <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1061 (<code class=char>'</code>)</li>
1062 <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1063 (<code class=char>(</code>)</li>
1064 <li><code class=char>U+0029</code> <code class=charname>RIGHT
1065 PARENTHESIS</code> (<code class=char>)</code>)</li>
1066 <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1067 (<code class=char>*</code>)</li>
1068 <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1069 (<code class=char>+</code>)</li>
1070 <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1071 (<code class=char>,</code>)</li>
1072 <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1073 (<code class=char>-</code>)</li>
1074 <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1075 (<code class=char>.</code>)</li>
1076 <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1077 (<code class=char>/</code>)</li>
1078 <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1079 (<code class=char>0</code>) .. <code class=char>U+0039</code>
1080 <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1081 <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1082 (<code class=char>:</code>)</li>
1083 <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1084 (<code class=char>;</code>)</li>
1085 <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1086 (<code class=char>=</code>)</li>
1087 <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1088 (<code class=char>?</code>)</li>
1089 <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1090 (<code class=char>@</code>)</li>
1091 <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1092 A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1093 <code class=charname>LATIN CAPITAL LETTER Z</code>
1094 (<code class=char>Z</code>)</li>
1095 <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1096 (<code class=char>_</code>)</li>
1097 <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1098 A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1099 <code class=charname>LATIN CAPITAL LETTER Z</code>
1100 (<code class=char>Z</code>)</li>
1101 </ul>
1102 <div class="note memo">
1103 <p>This character class contains all characters allowed in the production rule
1104 <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1105 of <abbr>XML</abbr> 1.0
1106 <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1107 </div>
1108
1109 </div>
1110
1111 <div id="references" class="section reference">
1112 <h2>References</h2>
1113
1114 <div id="normative-references" class="section normative reference">
1115 <h3>Normative References</h3>
1116
1117 <dl>
1118 <dt id=ref-DOM3CORE>DOM3CORE</dt>
1119 <dd class=ed>@@ W3C DOM Level 3 Core</dd>
1120 <dt id=ref-DOMDTDEF>DOMDTDEF</dt>
1121 <dd class=ed>@@ manakai's extension to DOM for document type definitions</dd>
1122 <dt id="ref-KEYWORDS">KEYWORDS</dt>
1123 <dd><cite><a href="urn:ietf:rfc:2119">Key words for use in
1124 <abbr>RFCs</abbr> to Indicate Requirement Levels</a></cite>,
1125 <abbr title="Internet Engineering Task Force">IETF</abbr>
1126 <a href="urn:ietf:bcp:14"><abbr title="Best Current Practice">BCP</abbr>
1127 14</a>, <abbr title="Request for Comments">RFC</abbr> 2119,
1128 <time datetime=1997-03>March 1997</time>.
1129 <em>This</em> version of the specification is referenced.</dd>
1130 <dt id=ref-INFOSET>INFOSET</dt>
1131 <dd class=ed>@@</dd>
1132 <dt id=ref-XML10">XML10</dt>
1133 <dd><cite><a href="http://www.w3.org/TR/2006/REC-xml-20060816">Extensible
1134 Markup Language (<abbr>XML</abbr>) 1.0 (Fourth Edition)</a></cite>,
1135 <abbr>W3C</abbr> Recommendation, <time datetime=2006-08-16>16 August
1136 2006</time>, edited in place <time datetime=2006-09-29>29 September
1137 2006</time>. Tje latest version of the specification is available at
1138 <code class=uri>&lt;<a href="http://www.w3.org/TR/xml">http://www.w3.org/TR/xml</a>&gt;</code>.
1139 <em>This</em> version of the specification is referenced.</dd>
1140 <dt id=ref-XML11>XML11</dt>
1141 <dd class=ed>@@</dd>
1142 </dl>
1143 </div>
1144
1145 <div id="informative-references" class="section informative reference">
1146 <h3>Non$B!>(Bnormative References</h3>
1147
1148 <dl class=ed>
1149 <dt id=ref-CSS>CSS</dt>
1150 <dd><cite><a href="http://www.w3.org/TR/2007/CR-CSS21-20070719">Cascading
1151 Style Sheets Level 2 Revision 1 (<abbr>CSS</abbr> 2.1)
1152 Specification</a></cite>,
1153 <abbr title="World Wide Web Consortium">W3C</abbr> Candidate
1154 Recommendation, <time datetime=2007-07-19>19 July 2007</time>.
1155 Work in progress. The latest version of the specification is
1156 available at
1157 <code class=URI>&lt;<a href="http://www.w3.org/TR/CSS21">http://www.w3.org/TR/CSS21</a>></code>.</dd>
1158 <dt id=ref-CSSOM>CSSOM</dt>
1159 <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/csswg/cssom/Overview.html?rev=1.55&amp;content-type=text/html;%20charset=utf-8">Cascading
1160 Style Sheets Object Model (<abbr>CSSOM</abbr>)</a></cite>,
1161 <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1162 <time datetime=2007-06-18>18 June 2007</time>. Work in progress.
1163 The latest Editor's Draft of the specification is available at
1164 <code class=URI>&lt;<a href="http://dev.w3.org/csswg/cssom/Overview.html">http://dev.w3.org/csswg/cssom/Overview.html</a>></code>.</dd>
1165 <dt id=ref-HTML5>HTML5</dt>
1166 <dd><cite><a href="http://whatwg.org/html5"><abbr>HTML</abbr> 5</a></cite>,
1167 <abbr title="Web Hypertext Application Technology Working Group">WHATWG</abbr>
1168 Working Draft. Work in progress.</dd>
1169 <dt id=ref-SAPI>SAPI</dt>
1170 <dd><cite><a href="http://dev.w3.org/cvsweb/~checkout~/2006/webapi/selectors-api/Overview.html?rev=1.28&amp;content-type=text/html;%20charset=utf-8">Selectors <abbr>API</abbr></a></cite>,
1171 <abbr title="World Wide Web Consortium">W3C</abbr> Editor's Draft,
1172 <time datetime=2007-08-29>29 August 2007</time>. Work in progress.
1173 The latest Editor's Draft of the specification is available at
1174 <code class=URI>&lt;<a href="http://dev.w3.org/2006/webapi/selectors-api/Overview.html">http://dev.w3.org/2006/webapi/selectors-api/Overview.html</a>></code>.
1175 The latest published version of the specification is available at
1176 <code class=URI>&lt;<a href="http://www.w3.org/TR/selectors-api/">http://www.w3.org/TR/selectors-api/</a>></code>.</dd>
1177 <dt id=XBL2>XBL2</dt>
1178 <dd><cite><a href="http://www.mozilla.org/projects/xbl/xbl2.html"><abbr>XBL</abbr>
1179 2.0</a></cite>, Mozilla.org, <time datetime=2007-03-15>15 Mar 2007</time>.
1180 Work in progress. The latest
1181 <abbr title="World Wide Web Consortium">W3C</abbr>$B!>(Bpublished
1182 version of the specification is available at
1183 <code class=URI>&lt;<a href="http://www.w3.org/TR/xbl/">http://www.w3.org/TR/xbl/</a>></code>.</dd>
1184 <dt id=ref-XML5>XML5</dt>
1185 <dd class=ed>@@</dd>
1186 </dl>
1187 </div>
1188
1189 </div>
1190
1191 </body>
1192 </html>

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24