/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Diff of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.9 by wakaba, Tue Nov 13 01:11:21 2007 UTC revision 1.24 by wakaba, Fri Mar 28 15:37:01 2008 UTC
# Line 17  Line 17 
17  <body class="has-abstract">  <body class="has-abstract">
18    
19  <div class="header">  <div class="header">
20  <h1>manakai's XML Conformance Checking</h1>  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21  <h2>Working Draft <time datetime=2007-11-13>13 November 2007</time></h2>  <h2>Working Draft <time datetime=2008-03-29>29 March 2008</time></h2>
22    
23  <dl class="versions-uri">  <dl class="versions-uri">
24  <dt>This Version</dt>  <dt>This Version</dt>
# Line 42  Line 42 
42        >w@suika.fam.cx</a>&gt;</code></dd>        >w@suika.fam.cx</a>&gt;</code></dd>
43  </dl>  </dl>
44    
45  <p class="copyright" lang="en">&#xA9; <time>2007</time> <a  <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47  Permission is granted to copy, distribute and/or modify this document  Permission is granted to copy, distribute and/or modify this document
48  under the terms of the <a rel="license"  under the terms of the <a rel="license"
# Line 118  else in this specification is normative. Line 118  else in this specification is normative.
118  <p><span class=ed>Algorithm is normative but non-normative</span>.  <p><span class=ed>Algorithm is normative but non-normative</span>.
119  In addition, the order in which <a href="#errors">errors</a> are  In addition, the order in which <a href="#errors">errors</a> are
120  raised is undefined.</p>  raised is undefined.</p>
 </div>  
121    
122  <div class="section" id=checking-dom>  <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123  <h2>Checking <abbr>DOM</abbr></h2>  by hyperlinks.  When the document being processed is an <abbr>XML</abbr> 1.1
124    document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125    specification should be consulted instead.</p>
126    </div>
127    
 <p>The following algorithms and definitions are applied to  
 <abbr>XML</abbr> documents; especially, they are not applied  
 to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>  
128    
129  <div class=section id=error-categories>  <div class=section id=error-categories>
130  <h3>Error Classification</h3>  <h2>Error Classification</h2>
131    
132  <p class=ed>If a <code>Document</code> node has no  <p class=ed>If a <code>Document</code> node has no
133  xml-well-formedness-error, entity-error, and unknown-error,  xml-well-formedness-error, entity-error, and unknown-error,
# Line 141  into a well$B!>(Bformed XML document. Line 140  into a well$B!>(Bformed XML document.
140  can be easily serialized into a valid XML document.</p>  can be easily serialized into a valid XML document.</p>
141  </div>  </div>
142    
143    <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
144    ...</p>
145    
146  <p>Errors are classified into these  <p>Errors are classified into these
147  <dfn id=error-category title="error category">error categories</dfn>:</p>  <dfn id=error-category title="error category">error categories</dfn>:</p>
148    
# Line 159  can be easily serialized into a valid XM Line 161  can be easily serialized into a valid XM
161    </dd>    </dd>
162  <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>  <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
163    <dd class=ed>@@</dd>    <dd class=ed>@@</dd>
164    <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
165      <dd>
166        <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
167        will be raised when a construct, which might not be restored to the same
168        construct when it is serialized and then re-parsed by a conforming
169        processor, is encountered.</p>
170        <div class="example memo">
171          <p>For a <code>Comment</code> node a
172          <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
173          be raised, since <abbr>XML</abbr> processors are not required to
174          report texts of comments for applications.</p>
175        </div>
176      </dd>
177  <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>  <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
178    <dd class=ed>@@</dd>    <dd class=ed>@@</dd>
179  <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>  <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
# Line 179  can be easily serialized into a valid XM Line 194  can be easily serialized into a valid XM
194    <a href="@@/#dt-fatal">fatal error</a>)    <a href="@@/#dt-fatal">fatal error</a>)
195    that is not classified to any other <a href="#error-category">error    that is not classified to any other <a href="#error-category">error
196    category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>    category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
197    <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
198      <dd>
199        <p>An
200        <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
201        will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
202        in <abbr>XML</abbr> specification is not met.</p>
203      </dd>
204  <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>  <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
205    <dd>A violation of validity constraint in XML document.</dd>    <dd>A violation of validity constraint in XML document.</dd>
206  <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>  <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
# Line 192  can be easily serialized into a valid XM Line 214  can be easily serialized into a valid XM
214    
215  <div class=ed><p>@@ TODO: #dt-atuseroption at user option  <div class=ed><p>@@ TODO: #dt-atuseroption at user option
216  (MAY or MUST), #dt-compat for compatibility,  (MAY or MUST), #dt-compat for compatibility,
217  #dt-interop for interoperability</p></div>  #dt-interop for interoperability</p>
218    
219    <p>TODO: XML 1.1, XML Namespace 1.0/1.1, xml:base, xml:id
220    </div>
221    
222  </div>  </div>
223    
224    <div class=section id=parsing-xml>
225    <h2>Parsing <abbr>XML</abbr> Document</h2>
226    
227    <p>When an <abbr>XML</abbr> document is parsed, the following clauses
228    are applied:</p>
229    <dl>
230    <dt>For each document
231      <dd>If the <abbr>XML</abbr> document does not begin with an
232      <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
233      raise an
234      <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
235    <dt>For each internal general entity declaration processed by the parser
236      <dd>If the
237      <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
238      part of the general entity declaration contains a bare <code>U+003C</code>
239      <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
240      <em class=rfc2119>MUST</em> raise an
241      <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
242    "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
243    -->
244    <dt>For each element type declaration processed by the parser
245      <dd>If there is another element type declaration whose <code>Name</code>
246      is equal to the <code>Name</code> of the element type declaration, then
247      the parser <em class=rfc2119>MUST</em> raise an
248      <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
249    <!--
250      NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
251      will not be warned.
252    -->
253    <dt>For each empty-element tag
254      <dd>If the <code>Name</code> of the tag is not declared by a processed
255      element type declaration as <code>EMPTY</code> content, then the parser
256      <em class=rfc2119>MUST</em> raise an
257      <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
258    <dt>For each start-tag
259      <dd>If the <code>Name</code> of the tag is declared by a processed element
260      type declaration as <code>EMPTY</code> content, then the parser
261      <em class=rfc2119>MUST</em> raise an
262      <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
263    </dl>
264    
265    <p>In addition, the parser <em class=rfc2119>MUST</em> check whether the
266    following constraints are met.
267    
268    <p><strong>Well-formedness constraints</strong>.  For each violation to
269    one of constraints below, an
270    <a href="#xml-well-formedness-error"><code>xml-well-formedness-error</code></a>
271    <em class=rfc2119>MUST</em> be raised.  The list of well-formedness
272    constraints is below:
273    <ul>
274    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wfc-PEinInternalSubset">Well-formedness constraint: PEs in Internal Subset</a>
275    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#ExtSubset">Well-formedness constraint: External Subset</a>
276    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#PE-between-Decls">Well-formedness constraint: PE Between Declarations</a>
277    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#GIMatch">Well-formedness constraint: Element Type Match</a>
278    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#uniqattspec">Well-formedness constraint: Unique Att Spec</a>
279    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NoExternalRefs">Well-formedness constraint: No External Entity References</a>
280    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#CleanAttrVals">Well-formedness constraint: No &lt; in Attribute Values</a>
281    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wf-Legalchar">Well-formedness constraint: Legal Character</a>
282    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#textent">Well-formedness constraint: Parsed Entity</a>
283    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#norecursion">Well-formedness constraint: No Recursion</a>
284    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#indtd">Well-formedness constraint: In DTD</a>
285    </ul>
286    
287    <p><strong>Validity constraints</strong>.  For each violation to
288    one of constraints below, an
289    <a href="#xml-validity-error"><code>xml-validity-error</code></a>.
290    <em class=rfc2119>MUST</em> be raised.  The list of validity
291    constraints is below:
292    <ul>
293    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinMarkupDecl">Validity constraint: Proper Declaration/PE Nesting</a>
294    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinGroup">Validity constraint: Proper Group/PE Nesting</a>
295    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#condsec-nesting">Validity constraint: Proper Conditional Section/PE Nesting</a>
296    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#UniqueNotationName">Validity constraint: Unique Notation Name</a>
297    </ul>
298    
299    <p><strong>Other creteria</strong>.  For each violation to
300    one of constraints below, an
301    <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
302    <em class=rfc2119>MUST</em> be raised.  The list of constraints is below:
303    <ul>
304    <li><q>For interoperability, if a parameter-entity reference appears in a
305    <code>choice</code>, <code>seq</code>, or <code>Mixed</code> construct, its
306    replacement text <em class=rfc2119>SHOULD</em> contain at least one non-blank
307    character, and neither the first nor last non-blank character of the
308    replacement text <em class=rfc2119>SHOULD</em> be a connector (<code>|</code>
309    or <code>,</code>).</q>
310    <li><q>External parsed entities <em class=rfc2119>SHOULD</em> each begin with a
311    text declaration.</q>
312    </ul>
313    
314    <!--
315    <li><q>For interoperability, an XML processor <em class=rfc2119>MAY</em> at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error.</q>
316    
317    <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.
318    <li>It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
319    -->
320    
321    <!--
322    
323    
324    @@ Need detailed review, but maybe should be in parsing phase
325    
326    #vc-check-rmd Validity constraint: Standalone Document Declaration
327    
328    @@ Need dtailed review
329    
330    #wf-entdeclared Well-formedness constraint: Entity Declared
331    #vc-entdeclared Validity constraint: Entity Declared
332    "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
333    "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
334    
335    @@ flaged and then reported in DOM check phase
336    
337    "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
338    "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
339    "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
340    
341    @@ in parsing phase
342    
343    "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
344    
345    @@ We should phrase out what the parser should do where the XML specification
346    left undefined.  For example: Comment must be converted to a Comment node,
347    illegal xml:space value must be preserved, so on.
348    
349    -->
350    </div>
351    
352    <div class="section" id=checking-dom>
353    <h2>Checking <abbr>DOM</abbr></h2>
354    
355    <p>The following algorithms and definitions are applied to
356    <abbr>XML</abbr> documents; especially, they are not applied
357    to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
358    
359  <div class=section id=definitions>  <div class=section id=definitions>
360  <h3>Definitions</h3>  <h3>Definitions</h3>
361    
# Line 213  is that of the <code>Document</code> nod Line 373  is that of the <code>Document</code> nod
373    
374  </div>  </div>
375    
376    
377    <div class=section id=checking-components>
378    <h3>Conformance Checking Algorithms for Components</h3>
379    
380    <p>To
381    <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
382    validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
383    the following algorithm <em class=rfc2119>MUST</em> be used:</p>
384    <ol>
385    <li>If <var>s</var> contains a character that is <em>not</em> in
386    the character class <a href="#class-Char10"><code>Char10</code></a>,
387    then raise an
388    <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
389    <li>If <var>s</var> contains a character that is in the character
390    class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
391    then raise an
392    <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
393    <li>If <var>s</var> contains a character that is in the character
394    class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
395    then raise an
396    <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
397    <li class=ed>@@ XML 1.1 support</li>
398    <li>If <var>s</var> contains a <code class=char>U+000D</code>
399    <code class=charname>CARRIAGE RETURN</code> character, then
400    raise a
401    <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
402    <span class=ed>@@ We should not raise duplicate errors for U+000D
403    in attribute values.  In addition, we should support a mode where
404    U+000D will be serialized as &#x000D; (so that no round-trip-error
405    will be raised).</span></li>
406    </ol>
407    
408    <p>To
409    <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
410    <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
411    algorithm <em class=rfc2119>MUST</em> be used:</p>
412    <ol>
413    <li>If <var>s</var> is an empty string, then raise an
414    <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
415    Abort these steps.</li>
416    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
417    <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
418    <li>If the first character in <var>s</var> is a character that is
419    <em>not</em> in the character class
420    <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
421    an
422    <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
423    <li>If a character other than the first character in <var>s</var> is a
424    character that is <em>not</em> in the character class
425    <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
426    <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
427    <li>If <var>s</var> begins with the string <code>xml</code> (in any
428    case combination), then raise an
429    <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
430    <span class=ed>@@ except for attribute names <code>xml:lang</code>,
431    <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
432    <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
433    pi name <code>xml-stylesheet</code>.</span><!--
434    "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
435    xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
436    --></li>
437    <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
438    [XML11]. -->
439    <li class=ed>@@ XML 1.1 support</li>
440    </ol>
441    
442    <p>To
443    <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
444    an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
445    following algorithm <em class=rfc2119>MUST</em> be used:</p>
446    <ol>
447    <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
448    <var>s</var> as a <code>Name</code></a>.</li>
449    <li class=ed>@@</li>
450    </ol>
451    
452    <p>To
453    <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
454    a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
455    following algorithm <em class=rfc2119>MUST</em> be used:</p>
456    <ol>
457    <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
458    <li>If <var>pid</var> contains a character that is <em>not</em> in the
459    character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
460    raise an
461    <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
462      <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
463      <code class=charname>CHARACTER TABULATION</code>,
464      <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
465      and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
466      characters, if the first character of <var>pid</var> is
467      <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
468      if the last character of <var>pid</var> is <code class=char>U+0020</code>
469      <code class=charname>SPACE</code> character, or if there is a
470      <code class=char>U+0020</code> <code class=charname>SPACE</code> character
471      immediately followed by another <code class=char>U+0020</code>
472      <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
473      <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
474      <span class=ed>Is this really a roundtripness problem?  XML spec
475      does only define the way to match public identifiers in fact, no
476      canonical form.</span></li>
477    </ol>
478    
479    <p>To
480    <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
481    a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
482    following algorithm <em class=rfc2119>MUST</em> be used:</p>
483    <ol>
484      <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
485    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
486    <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
487      <li>If <var>sid</var> contains both <code class=char>U+0022</code>
488      <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
489      <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
490      (<code class=char>'</code>) characters, raise an
491      <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
492      <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
493      <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
494      character, then raise an
495      <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
496      <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
497      reference, then raise a fact-level error (xml-misc-warning?).<!--
498      XML 1.0 has no conformance language for system identifier being
499      a URI reference. --></li>
500    </ol>
501    
502    </div>
503    
504  <div class=section id=checking-node>  <div class=section id=checking-node>
505  <h3>Checking <code>Node</code></h3>  <h3>Checking <code>Node</code></h3>
506    
# Line 223  following:</p> Line 511  following:</p>
511  <dt>If <var>n</var> is an <code>Attr</code> node</dt>  <dt>If <var>n</var> is an <code>Attr</code> node</dt>
512    <dd>    <dd>
513      <ol>      <ol>
514      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
515      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
516      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
517      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
518      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
519      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
520      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
# Line 241  following:</p> Line 529  following:</p>
529        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
530        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
531        </ol></li>        </ol></li>
532        <li>If <code>nodeName</code> attribute of <var>n</var> is
533        <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
534        and <code>value</code> attribute of <var>n</var> is neither
535        <code>default</code> nor <code>preserve</code>, then it is an
536        <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
537        Note that <!ATTLIST e xml:space (default|preserve) "a">...
538        <e xml:space="default"/> is conforming and valid. --></li>
539        <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
540        successor] or an empty string, then xml-misc-warning
541        (a "fact"-level error; not an XML error).</li>
542      <li class=ed>@@ <code>specified</code>,      <li class=ed>@@ <code>specified</code>,
543      <code>manakaiAttributeType</code></li>      <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
544        <!-- xml:space MUST be declared to be valid. -->
545        <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
546        attribute <code>value</code> of <var>n</var>.</li>
547        <li>Validate the <var>n</var> against the <span class=ed>declared
548        type</span> as following:
549          <dl class=switch>
550          <dt class=ed><code>ID_ATTR</code></dt>
551            <dd>
552              <ol>
553              <li><span class=ed>Validate <var>v</var> as an
554              <code>Name</code>.</span>  If it fails, then raise an
555              <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
556              <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
557              then raise an  
558              <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
559              </ol>
560            </dd>
561          <dt class=ed>IDREF_ATTR</dt>
562            <dd>
563              <ol>
564              <li><span class=ed>Validate <var>v</var> as an
565              <code>Name</code>.</span>  If it fails, then raise an
566              <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
567              <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
568              defined,</span> then raise an  
569              <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
570              </ol>
571            </dd>
572          <dt class=ed>IDREFS_ATTR</dt>
573            <dd class=ed>@@</dd>
574          <dt class=ed>ENTITY_ATTR</dt>
575            <dd>
576              <ol>
577              <li><span class=ed>Validate <var>v</var> as an
578              <code>Name</code>.</span>  If it fails, then raise an
579              <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
580              <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
581              defined,</span> then raise an  
582              <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
583              </ol>
584            </dd>
585          <dt class=ed>ENTITIES_ATTR</dt>
586            <dd class=ed>@@</dd>
587          <dt class=ed>NMTOKEN_ATTR</dt>
588            <dd>
589              <ol>
590              <li><span class=ed>Validate <var>v</var> as an
591              <code>Nmtoken</code>.</span>  If it fails, then raise an
592              <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
593              </ol>
594            </dd>
595          <dt class=ed>NMTOKENS_ATTR</dt>
596            <dd class=ed>@@</dd>
597          <dt class=ed>NOTATION_ATTR</dt>
598            <dd class=ed><var>v</var> must be one of enumerated values.
599            If not, then raise an
600            <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
601          <dt class=ed>ENUMERATED_ATTR</dt>
602            <dd class=ed><var>v</var> must be one of enumerated values.
603            If not, then raise an
604            <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
605          <dt class=ed>@@</dt>
606          </dl>
607        </li>
608        <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
609        raise an
610        <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
611        <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
612        <li class=ed>@@ strict serialization error for U+000D, U+000A, and
613        U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
614      </ol>      </ol>
615    </dd>    </dd>
616  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
617    <dd>    <dd>
618      <ol>      <ol>
619      <li class=ed></li>      <li class=ed></li>
620        <li>If <code>nodeName</code> attribute of <var>n</var> is
621        <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
622        and <span class=ed>its declared type is different from (default|preserve),
623        (preserve|default), (default), or (preserve)</span>, then raise an
624        <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
625      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
626      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
627        <ol>        <ol>
# Line 262  following:</p> Line 635  following:</p>
635        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
636        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
637        </ol></li>        </ol></li>
638        <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
639        be declared.  If not, then raise an
640        <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
641        <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
642        values MUST all be distinct.  If not, then raise an
643        <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
644        <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
645        element, then raise an
646        <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
647        <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
648      </ol>      </ol>
649    </dd>    </dd>
650  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
651    <dd>    <dd>
652      <ol>      <ol>
653      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
654      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
655      data</a>.</li>      data</a>.</li>
656      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
657      a string <code>]]></code>, then it is an      a string <code>]]></code>, then raise an
658      <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
659      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
660      any nodes, they are in      any nodes, they are in
661      <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
# Line 281  following:</p> Line 664  following:</p>
664  <dt>If <var>n</var> is a <code>Comment</code> node</dt>  <dt>If <var>n</var> is a <code>Comment</code> node</dt>
665    <dd>    <dd>
666      <ol>      <ol>
667      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li>Raise an
668        <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
669        <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
670      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
671      data</a>.</li>      data</a>.</li>
672      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
673      a string <code>--</code>, or if it ends with a character      a string <code>--</code>, or if it ends with a character
674      <code>-</code>, then it is an      <code>-</code>, then raise an
675      <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
676        "for compatibility" --></li>
677      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
678      any nodes, they are in      any nodes, they are in
679      <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
# Line 347  following:</p> Line 733  following:</p>
733  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
734    <dd>    <dd>
735      <ol>      <ol>
736      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
737      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
738        <li>Follow the following substeps:
739          <ol>
740          <li>If <code>ownerDocument</code> attribute of <var>n</var> is
741          <code>null</code>, then abort these substeps.</li>
742          <li>If <code>documentElement</code> attribute of the node
743          set to <code>ownerDocument</code> attribute of <var>n</var> is
744          <code>null</code>, then abort these substeps.</li>
745          <li>If <code>nodeName</code> attribute of the node set to
746          <code>documentElement</code> attribute of the node set to
747          <code>ownerDocument</code> attribute of <var>n</var> is
748          different from <code>nodeName</code> of <var>n</var>,
749          then raise an
750          <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
751          </ol>
752        </li>
753      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
754      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
755      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
756      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
757      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
758      <!-- publicId = non-null and systemId = null -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
759        value of <var>n</var> is <code>null</code>, then raise an
760        <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
761        <span class=ed>@@ publicId == null?  Or, publicId == ""</span></li>
762      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
763      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
764        <ol>        <ol>
# Line 366  following:</p> Line 770  following:</p>
770        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
771        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
772        </ol></li>        </ol></li>
773      <li class=ed>@@ <code>entities</code>, <code>notations</code>,      <li>For each node in the <code>entities</code>, <code>notations</code>,
774      <code>elementTypes</code>, externally declared?</li>      and <code>elementTypes</code> lists of <var>n</var>,
775        <a href="#algorithm-to-check-a-node" title="check a node">check the
776        node</a> recursively.</li>
777        <li class=ed>@@ externally declared?</li>
778      </ol>      </ol>
779    </dd>    </dd>
780  <dt>If <var>n</var> is an <code>Element</code> node</dt>  <dt>If <var>n</var> is an <code>Element</code> node</dt>
781    <dd>    <dd>
782      <ol>      <ol>
783      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
784      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
785      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
786      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
787      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
788      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
789      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 394  following:</p> Line 801  following:</p>
801        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
802        <var>n<sub><var>c</var></sub></var></a> recursively.</li>        <var>n<sub><var>c</var></sub></var></a> recursively.</li>
803        </ol></li>        </ol></li>
804        <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
805        <li>Let <var>attrs</var> be the value of the <code>attribute</code>
806        attribute of <var>n</var>.  Check conformance of <var>attrs</var>
807        as following:
808          <ol>
809          <li>If <var>attrs</var> contains an <code>Attr</code> node whose
810          <code>nodeName</code> attribute value is equal to that of another
811          <code>Attr</code> node in <var>attrs</var>, then raise an
812          <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
813          #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
814          <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
815          </ol>
816        </li>
817      </ol>      </ol>
818    </dd>    </dd>
819  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
# Line 403  following:</p> Line 823  following:</p>
823      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
824      any nodes, they are in      any nodes, they are in
825      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
826        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
827        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
828        <li class=ed>@@ #vc-MixedChildrenUnique
829        Validity constraint: No Duplicate Types</li>
830        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
831        <li>If there is more than one <code>AttributeDefinition</code> node
832        with <span class=ed>attribute type <code>ID</code></span> in the
833        <code>NamedNodeMap</code> list contained in the
834        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
835        <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
836        <li>If there is more than one <code>AttributeDefinition</code> node
837        with <span class=ed>attribute type <code>NOTATION</code></span> in the
838        <code>NamedNodeMap</code> list contained in the
839        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
840        <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
841        <li class=ed>"For interoperability, the same Nmtoken  SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
842      </ol>      </ol>
843    </dd>    </dd>
844  <dt>If <var>n</var> is an <code>Entity</code> node</dt>  <dt>If <var>n</var> is an <code>Entity</code> node whose
845    <code>notationName</code> attribute value is <code>null</code> (i.e. a
846    parsed entity)</dt>
847    <dd>    <dd>
848      <ol>      <ol>
849      <li>An <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a> <span class=ed>@@ if !notationName</span>.</li>      <li>Raise an
850      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
851        <li><a href="#algorithm-to-validate-an-ncname">Validate the
852      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
853      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
854      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
855      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
856      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
857      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
858      <!-- publicId = non-null and systemId = null -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
859      <li class=ed>@@ <code>notationName</code></li>      value of <var>n</var> is <code>null</code>, then raise an
860        <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
861      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
862      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
863        <ol>        <ol>
# Line 436  following:</p> Line 876  following:</p>
876        </ol></li>        </ol></li>
877      </ol>      </ol>
878    </dd>    </dd>
879    <dt>If <var>n</var> is an <code>Entity</code> node whose
880    <code>notationName</code> attribute value is <em>not</em> <code>null</code>
881    (i.e. an unparsed entity)</dt>
882      <dd>
883        <ol>
884        <li><a href="#algorithm-to-validate-an-ncname">Validate the
885        <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
886        <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
887        the <code>publicId</code> attribute value as a public identifier</a>.</li>
888        <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
889        the <code>systemId</code> attribute value as a system identifier</a>.</li>
890        <li>If the <code>systemId</code> attribute value of <var>n</var> is
891        <code>null</code>, then raise an
892        <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
893        <li><a href="#algorithm-to-validate-an-ncname">Validate the
894        <code>notationName</code> attribute value of <var>n</var> as an
895        <code>NCName</code></a>.</li>
896        <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
897        <li>If the <code>childNodes</code> list of <var>n</var> contains
898        any nodes, they are in
899        <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
900        </ol>
901      </dd>
902  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
903    <dd>    <dd>
904      <ol>      <ol>
905      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
906      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
907      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
908      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
909      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 463  following:</p> Line 926  following:</p>
926  <dt>If <var>n</var> is a <code>Notation</code> node</dt>  <dt>If <var>n</var> is a <code>Notation</code> node</dt>
927    <dd>    <dd>
928      <ol>      <ol>
929      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
930      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
931      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
932      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
933      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
934      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
     <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>  
935      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
936      any nodes, they are in      any nodes, they are in
937      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
# Line 478  following:</p> Line 940  following:</p>
940  <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>  <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
941    <dd>    <dd>
942      <ol>      <ol>
943      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li>If the <code>target</code> attribute value matches to the string
944      <code>target</code> attribute value as an <code>NCName</code></a>.</li>      <code>xml</code> in any case combination, then raise a
     <li>If the <code>target</code> attribute value matches to  
     <code>XML</code> case-insensitively, then it is a  
945      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
946      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
947      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>target</code> attribute value as an <code>NCName</code></a>.</li>
948        <li>Then,
949        <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
950        the <code>data</code> attribute value as an <abbr>XML</abbr> character
951      data</a>.</li>      data</a>.</li>
952      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains a string
953      a string <code>?></code>, or starts with either      <code>?></code>, then raise a
954      <code class=char>U+0009</code>, <code class=char>U+000A</code>,      <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
955      <code class=char>U+000D</code>, or <code class=char>U+0020</code>      <li>If the <code>data</code> attribute value starts with either
956      character, then it is a      <code class=char>U+0009</code> <code class=charname>CHARACTER
957      <a href="#round-trip-error" id=rte-pi-data><code>round-trip-error</code></a>.</li>      TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
958        FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
959        RETURN</code>, or <code class=char>U+0020</code>
960        <code class=charname>SPACE</code> character, then raise a
961        <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
962      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
963      any nodes, they are in      any nodes, then raise an
964      <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
965      </ol>      </ol>
966    </dd>    </dd>
967  <dt>If <var>n</var> is a <code>Text</code> node</dt>  <dt>If <var>n</var> is a <code>Text</code> node</dt>
968    <dd>    <dd>
969      <ol>      <ol>
970      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
971      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
972      data</a>.</li>      data</a>.</li>
973      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
# Line 514  following:</p> Line 981  following:</p>
981    
982  </div>  </div>
983    
984  <div class=section id=checking-components>  </div>
 <h3>Checking Components</h3>  
985    
986  <p>The algorithm  <div id=classes class=section>
987  <dfn id=algorithm-to-validate-an-xml-character-data title-"to validate an XML character data">to  <h2>Character Classes</h2>
 validate an <abbr>XML</abbr> character data  
 (<dfn id=var-s><var>s</var></dfn>)</dfn> is defined as following:</p>  
 <ol>  
 <li class=ed>@@</li>  
 </ol>  
988    
989  <p>The algorithm  <p>This section defines a couple of character classes.
990  <dfn id=algorithm-to-validate-a-name title="to validate a Name">to validate a  These classes are referred to by algorithms specified above.</p>
 <code>Name</code> (<dfn id=var-name><var>name</var></dfn>)</dfn> is defined  
 as following:</p>  
 <ol>  
 <li class=ed>@@</li>  
 </ol>  
991    
992  <p>The algorithm  <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
993  <dfn id=algorithm-to-validate-a-ncname title="to validate an NCName">to  contains the following characters:</p>
994  validate an <code>NCName</code> (<dfn id=var-name><var>name</var></dfn>)</dfn>  <ul>
995  is defined as following:</p>  <li><code class=char>U+0009</code> <code class=charname>CHARACTER
996  <ol>  TABULATION</code></li>
997  <li class=ed>@@</li>  <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
998  </ol>  <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
999    RETURN</code></li>
1000    <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
1001    .. <code class=char>U+D7FF</code></li>
1002    <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
1003    <code class=charname>REPLACEMENT CHARACTER</code></li>
1004    <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
1005    </ul>
1006    <div class="note memo">
1007    <p>This character class contains all characters allowed in the production rule
1008    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
1009    of <abbr>XML</abbr> 1.0
1010    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1011    </div>
1012    
1013  <p>To <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate  <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
1014  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the  contains the following characters:</p>
1015  algorithm below <em class=rfc2119>MUST</em> be used:</p>  <ul>
1016  <ol>  <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
1017    <li>If <var>pid</var> contains any character  </ul>
1018    that is outside of the range of <code>#x20 | #xD | #xA |  
1019    [a-zA-Z0-9] | [-'()+,./:=?;!*#@$_%]</code><!-- @@ TODO: formal def -->,  <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
1020    then it is an  contains the following characters:</p>
1021    <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>  <ul>
1022    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>  <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
1023    <code class=charname>CHARACTER TABULATION</code>,  <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
1024    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,  <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
1025    and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>  AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
1026    characters, if the first character of <var>pid</var> is  PROGRAM COMMAND</code></li>
1027    <code class=char>U+0020</code> <code class=charname>SPACE</code> character,  <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
1028    if the last character of <var>pid</var> is <code class=char>U+0020</code>  <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
1029    <code class=charname>SPACE</code> character, or if there is a  <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
1030    <code class=char>U+0020</code> <code class=charname>SPACE</code> character  <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
1031    immediately followed by another <code class=char>U+0020</code>  <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
1032    <code class=charname>SPACE</code> character in <var>pid</var>, then it is a  <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
1033    <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.</li>  <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
1034    <li class=ed>@@ Should we check formal-public-identifierness?</li>  <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
1035  </ol>  <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
1036    <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
1037    <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
1038    <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
1039    <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
1040    <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
1041    <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
1042    <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
1043    <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
1044    </ul>
1045    <div class="note memo">
1046    <p>This character class contains the characters listed in the Note in
1047    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
1048    of <abbr>XML</abbr> 1.0
1049    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
1050    as amended by
1051    <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1052    </div>
1053    
1054    <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1055    contains the following characters:</p>
1056    <ul class=ed>
1057    </ul>
1058    <div class="note memo">
1059    <p>This character class contains all characters allowed as the first character
1060    of a string matching to the production rule
1061    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1062    of <abbr>XML</abbr> 1.0
1063    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1064    </div>
1065    
1066    <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1067    contains the following characters:</p>
1068    <ul>
1069    <li>The characters in the character class
1070    <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1071    <li class=ed></li>
1072    </ul>
1073    <div class="note memo">
1074    <p>This character class contains all characters allowed as the second
1075    character of a string matching to the production rule
1076    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1077    of <abbr>XML</abbr> 1.0
1078    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1079    </div>
1080    
1081    <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1082    contains the following characters:</p>
1083    <ul>
1084    <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1085    TABULATION</code></li>
1086    <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1087    <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1088    RETURN</code></li>
1089    <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1090    <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1091    (<code class=char>!</code>)</li>
1092    <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1093    (<code class=char>$</code>)</li>
1094    <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1095    (<code class=char>#</code>)</li>
1096    <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1097    (<code class=char>%</code>)</li>
1098    <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1099    (<code class=char>'</code>)</li>
1100    <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1101    (<code class=char>(</code>)</li>
1102    <li><code class=char>U+0029</code> <code class=charname>RIGHT
1103    PARENTHESIS</code> (<code class=char>)</code>)</li>
1104    <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1105    (<code class=char>*</code>)</li>
1106    <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1107    (<code class=char>+</code>)</li>
1108    <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1109    (<code class=char>,</code>)</li>
1110    <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1111    (<code class=char>-</code>)</li>
1112    <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1113    (<code class=char>.</code>)</li>
1114    <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1115    (<code class=char>/</code>)</li>
1116    <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1117    (<code class=char>0</code>) .. <code class=char>U+0039</code>
1118    <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1119    <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1120    (<code class=char>:</code>)</li>
1121    <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1122    (<code class=char>;</code>)</li>
1123    <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1124    (<code class=char>=</code>)</li>
1125    <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1126    (<code class=char>?</code>)</li>
1127    <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1128    (<code class=char>@</code>)</li>
1129    <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1130    A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1131    <code class=charname>LATIN CAPITAL LETTER Z</code>
1132    (<code class=char>Z</code>)</li>
1133    <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1134    (<code class=char>_</code>)</li>
1135    <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1136    A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1137    <code class=charname>LATIN CAPITAL LETTER Z</code>
1138    (<code class=char>Z</code>)</li>
1139    </ul>
1140    <div class="note memo">
1141    <p>This character class contains all characters allowed in the production rule
1142    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1143    of <abbr>XML</abbr> 1.0
1144    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1145  </div>  </div>
1146    
1147  </div>  </div>

Legend:
Removed from v.1.9  
changed lines
  Added in v.1.24

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24