/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Diff of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.9 by wakaba, Tue Nov 13 01:11:21 2007 UTC revision 1.22 by wakaba, Sun Mar 16 12:33:53 2008 UTC
# Line 17  Line 17 
17  <body class="has-abstract">  <body class="has-abstract">
18    
19  <div class="header">  <div class="header">
20  <h1>manakai's XML Conformance Checking</h1>  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21  <h2>Working Draft <time datetime=2007-11-13>13 November 2007</time></h2>  <h2>Working Draft <time datetime=2008-03-16>16 March 2008</time></h2>
22    
23  <dl class="versions-uri">  <dl class="versions-uri">
24  <dt>This Version</dt>  <dt>This Version</dt>
# Line 42  Line 42 
42        >w@suika.fam.cx</a>&gt;</code></dd>        >w@suika.fam.cx</a>&gt;</code></dd>
43  </dl>  </dl>
44    
45  <p class="copyright" lang="en">&#xA9; <time>2007</time> <a  <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47  Permission is granted to copy, distribute and/or modify this document  Permission is granted to copy, distribute and/or modify this document
48  under the terms of the <a rel="license"  under the terms of the <a rel="license"
# Line 118  else in this specification is normative. Line 118  else in this specification is normative.
118  <p><span class=ed>Algorithm is normative but non-normative</span>.  <p><span class=ed>Algorithm is normative but non-normative</span>.
119  In addition, the order in which <a href="#errors">errors</a> are  In addition, the order in which <a href="#errors">errors</a> are
120  raised is undefined.</p>  raised is undefined.</p>
 </div>  
121    
122  <div class="section" id=checking-dom>  <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123  <h2>Checking <abbr>DOM</abbr></h2>  by hyperlinks.  When the document being processed is an <abbr>XML</abbr> 1.1
124    document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125    specification should be consulted instead.</p>
126    </div>
127    
 <p>The following algorithms and definitions are applied to  
 <abbr>XML</abbr> documents; especially, they are not applied  
 to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>  
128    
129  <div class=section id=error-categories>  <div class=section id=error-categories>
130  <h3>Error Classification</h3>  <h2>Error Classification</h2>
131    
132  <p class=ed>If a <code>Document</code> node has no  <p class=ed>If a <code>Document</code> node has no
133  xml-well-formedness-error, entity-error, and unknown-error,  xml-well-formedness-error, entity-error, and unknown-error,
# Line 141  into a well$B!>(Bformed XML document. Line 140  into a well$B!>(Bformed XML document.
140  can be easily serialized into a valid XML document.</p>  can be easily serialized into a valid XML document.</p>
141  </div>  </div>
142    
143    <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
144    ...</p>
145    
146  <p>Errors are classified into these  <p>Errors are classified into these
147  <dfn id=error-category title="error category">error categories</dfn>:</p>  <dfn id=error-category title="error category">error categories</dfn>:</p>
148    
# Line 159  can be easily serialized into a valid XM Line 161  can be easily serialized into a valid XM
161    </dd>    </dd>
162  <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>  <dt><dfn id=round-trip-error><code>round-trip-error</code></dfn></dt>
163    <dd class=ed>@@</dd>    <dd class=ed>@@</dd>
164    <dt><dfn id=round-trip-warning><code>round-trip-warning</code></dfn></dt>
165      <dd>
166        <p>A <a href="#round-trip-warning"><code>round-trip-warning</code></a>
167        will be raised when a construct, which might not be restored to the same
168        construct when it is serialized and then re-parsed by a conforming
169        processor, is encountered.</p>
170        <div class="example memo">
171          <p>For a <code>Comment</code> node a
172          <a href="#round-trip-warning"><code>round-trip-warning</code></a> will
173          be raised, since <abbr>XML</abbr> processors are not required to
174          report texts of comments for applications.</p>
175        </div>
176      </dd>
177  <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>  <dt class=ed><dfn id=unknown-error><code>unknown-error</code></dfn>?</dt>
178    <dd class=ed>@@</dd>    <dd class=ed>@@</dd>
179  <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>  <dt><dfn id=xml-misc-error><code>xml-misc-error</code></dt>
# Line 179  can be easily serialized into a valid XM Line 194  can be easily serialized into a valid XM
194    <a href="@@/#dt-fatal">fatal error</a>)    <a href="@@/#dt-fatal">fatal error</a>)
195    that is not classified to any other <a href="#error-category">error    that is not classified to any other <a href="#error-category">error
196    category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>    category</a>. <span class=ed>@@ What errors fall into this category?</span></dd>
197    <dt><dfn id=xml-misc-recommendation><code>xml-misc-recommendation</code></dfn></dt>
198      <dd>
199        <p>An
200        <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
201        will be raised if a <em class=rfc2119>SHOULD</em>$B!>(Blevel requirement
202        in <abbr>XML</abbr> specification is not met.</p>
203      </dd>
204  <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>  <dt><dfn id=xml-validity-error><code>xml-validity-error</code></dfn></dt>
205    <dd>A violation of validity constraint in XML document.</dd>    <dd>A violation of validity constraint in XML document.</dd>
206  <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>  <dt><dfn id=xml-well-formedness-error><code>xml-well-formedness-error</code></dfn></dt>
# Line 196  can be easily serialized into a valid XM Line 218  can be easily serialized into a valid XM
218    
219  </div>  </div>
220    
221    <div class=section id=parsing-xml>
222    <h2>Parsing <abbr>XML</abbr> Document</h2>
223    
224    <p>When an <abbr>XML</abbr> document is parsed, the following clauses
225    are applied:</p>
226    <dl>
227    <dt>For each document
228      <dd>If the <abbr>XML</abbr> document does not begin with an
229      <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
230      raise an
231      <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
232    <dt>For each internal general entity declaration processed by the parser
233      <dd>If the
234      <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
235      part of the general entity declaration contains a bare <code>U+003C</code>
236      <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
237      <em class=rfc2119>MUST</em> raise an
238      <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
239    "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
240    -->
241    <dt>For each element type declaration processed by the parser
242      <dd>If there is another element type declaration whose <code>Name</code>
243      is equal to the <code>Name</code> of the element type declaration, then
244      the parser <em class=rfc2119>MUST</em> raise an
245      <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
246    <!--
247      NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
248      will not be warned.
249    -->
250    <dt>For each empty-element tag
251      <dd>If the <code>Name</code> of the tag is not declared by a processed
252      element type declaration as <code>EMPTY</code> content, then the parser
253      <em class=rfc2119>MUST</em> raise an
254      <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
255    <dt>For each start-tag
256      <dd>If the <code>Name</code> of the tag is declared by a processed element
257      type declaration as <code>EMPTY</code> content, then the parser
258      <em class=rfc2119>MUST</em> raise an
259      <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
260    <!--
261    
262    #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
263    #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset
264    #ExtSubset Well-formedness constraint: External Subset
265    #PE-between-Decls Well-formedness constraint: PE Between Declarations
266    #GIMatch Well-formedness constraint: Element Type Match
267    #uniqattspec Well-formedness constraint: Unique Att Spec
268    #NoExternalRefs Well-formedness constraint: No External Entity References
269    #CleanAttrVals Well-formedness constraint: No < in Attribute Values
270    #vc-PEinGroup Validity constraint: Proper Group/PE Nesting
271    "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."
272    "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."
273    #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting
274    #wf-Legalchar Well-formedness constraint: Legal Character
275    #textent Well-formedness constraint: Parsed Entity
276    #norecursion Well-formedness constraint: No Recursion
277    #indtd Well-formedness constraint: In DTD
278    "External parsed entities SHOULD each begin with a text declaration."
279    "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
280    #UniqueNotationName Validity constraint: Unique Notation Name
281    
282    @@ Need detailed review, but maybe should be in parsing phase
283    
284    #vc-check-rmd Validity constraint: Standalone Document Declaration
285    
286    @@ Need dtailed review
287    
288    #wf-entdeclared Well-formedness constraint: Entity Declared
289    #vc-entdeclared Validity constraint: Entity Declared
290    "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
291    "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
292    
293    @@ flaged and then reported in DOM check phase
294    
295    "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
296    "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
297    "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
298    
299    @@ in parsing phase
300    
301    "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
302    
303    -->
304    </dl>
305    </div>
306    
307    <div class="section" id=checking-dom>
308    <h2>Checking <abbr>DOM</abbr></h2>
309    
310    <p>The following algorithms and definitions are applied to
311    <abbr>XML</abbr> documents; especially, they are not applied
312    to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
313    
314  <div class=section id=definitions>  <div class=section id=definitions>
315  <h3>Definitions</h3>  <h3>Definitions</h3>
316    
# Line 213  is that of the <code>Document</code> nod Line 328  is that of the <code>Document</code> nod
328    
329  </div>  </div>
330    
331    
332    <div class=section id=checking-components>
333    <h3>Conformance Checking Algorithms for Components</h3>
334    
335    <p>To
336    <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
337    validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
338    the following algorithm <em class=rfc2119>MUST</em> be used:</p>
339    <ol>
340    <li>If <var>s</var> contains a character that is <em>not</em> in
341    the character class <a href="#class-Char10"><code>Char10</code></a>,
342    then raise an
343    <a href="#xml-well-formedness-error" id=wfe-illegal-char><code>xml-well-formedness-error</code></a>.</li>
344    <li>If <var>s</var> contains a character that is in the character
345    class <a href="#class-CompatChar10"><code>CompatChar10</code></a>,
346    then raise an
347    <a href="#xml-misc-warning" id=xmw-compat-char><code>xml-misc-warning</code></a>.</li>
348    <li>If <var>s</var> contains a character that is in the character
349    class <a href="#class-ControlChar10"><code>ControlChar10</code></a>,
350    then raise an
351    <a href="#xml-misc-warning" id=xmw-control-char><code>xml-misc-warning</code></a>.</li>
352    <li class=ed>@@ XML 1.1 support</li>
353    <li>If <var>s</var> contains a <code class=char>U+000D</code>
354    <code class=charname>CARRIAGE RETURN</code> character, then
355    raise a
356    <a href="#round-trip-error" id=rte-cr><code>round-trip-error</code></a>.
357    <span class=ed>@@ We should not raise duplicate errors for U+000D
358    in attribute values.  In addition, we should support a mode where
359    U+000D will be serialized as &#x000D; (so that no round-trip-error
360    will be raised).</span></li>
361    </ol>
362    
363    <p>To
364    <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
365    <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
366    algorithm <em class=rfc2119>MUST</em> be used:</p>
367    <ol>
368    <li>If <var>s</var> is an empty string, then raise an
369    <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
370    Abort these steps.</li>
371    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
372    <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
373    <li>If the first character in <var>s</var> is a character that is
374    <em>not</em> in the character class
375    <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
376    an
377    <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
378    <li>If a character other than the first character in <var>s</var> is a
379    character that is <em>not</em> in the character class
380    <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
381    <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
382    <li>If <var>s</var> begins with the string <code>xml</code> (in any
383    case combination), then raise an
384    <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
385    <span class=ed>@@ except for attribute names <code>xml:lang</code>,
386    <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
387    <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
388    pi name <code>xml-stylesheet</code>.</span><!--
389    "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
390    xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
391    --></li>
392    <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
393    [XML11]. -->
394    <li class=ed>@@ XML 1.1 support</li>
395    </ol>
396    
397    <p>To
398    <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
399    an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
400    following algorithm <em class=rfc2119>MUST</em> be used:</p>
401    <ol>
402    <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
403    <var>s</var> as a <code>Name</code></a>.</li>
404    <li class=ed>@@</li>
405    </ol>
406    
407    <p>To
408    <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
409    a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
410    following algorithm <em class=rfc2119>MUST</em> be used:</p>
411    <ol>
412    <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
413    <li>If <var>pid</var> contains a character that is <em>not</em> in the
414    character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
415    raise an
416    <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
417      <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
418      <code class=charname>CHARACTER TABULATION</code>,
419      <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
420      and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>
421      characters, if the first character of <var>pid</var> is
422      <code class=char>U+0020</code> <code class=charname>SPACE</code> character,
423      if the last character of <var>pid</var> is <code class=char>U+0020</code>
424      <code class=charname>SPACE</code> character, or if there is a
425      <code class=char>U+0020</code> <code class=charname>SPACE</code> character
426      immediately followed by another <code class=char>U+0020</code>
427      <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
428      <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
429      <span class=ed>Is this really a roundtripness problem?  XML spec
430      does only define the way to match public identifiers in fact, no
431      canonical form.</span></li>
432    </ol>
433    
434    <p>To
435    <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
436    a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
437    following algorithm <em class=rfc2119>MUST</em> be used:</p>
438    <ol>
439      <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
440    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
441    <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
442      <li>If <var>sid</var> contains both <code class=char>U+0022</code>
443      <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
444      <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
445      (<code class=char>'</code>) characters, raise an
446      <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
447      <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
448      <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
449      character, then raise an
450      <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
451      <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
452      reference, then raise a fact-level error (xml-misc-warning?).<!--
453      XML 1.0 has no conformance language for system identifier being
454      a URI reference. --></li>
455    </ol>
456    
457    </div>
458    
459  <div class=section id=checking-node>  <div class=section id=checking-node>
460  <h3>Checking <code>Node</code></h3>  <h3>Checking <code>Node</code></h3>
461    
# Line 223  following:</p> Line 466  following:</p>
466  <dt>If <var>n</var> is an <code>Attr</code> node</dt>  <dt>If <var>n</var> is an <code>Attr</code> node</dt>
467    <dd>    <dd>
468      <ol>      <ol>
469      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
470      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
471      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
472      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
473      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
474      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
475      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
# Line 241  following:</p> Line 484  following:</p>
484        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
485        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
486        </ol></li>        </ol></li>
487        <li>If <code>nodeName</code> attribute of <var>n</var> is
488        <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
489        and <code>value</code> attribute of <var>n</var> is neither
490        <code>default</code> nor <code>preserve</code>, then it is an
491        <a href="#xml-misc-error" id=xme-attr-xml-space><code>xml-misc-error</code></a>.<!--
492        Note that <!ATTLIST e xml:space (default|preserve) "a">...
493        <e xml:space="default"/> is conforming and valid. --></li>
494        <li class=ed>@@ xml:lang value is not a language tag [RFC 3066 or its
495        successor] or an empty string, then xml-misc-warning
496        (a "fact"-level error; not an XML error).</li>
497      <li class=ed>@@ <code>specified</code>,      <li class=ed>@@ <code>specified</code>,
498      <code>manakaiAttributeType</code></li>      <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
499        <!-- xml:space MUST be declared to be valid. -->
500        <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
501        attribute <code>value</code> of <var>n</var>.</li>
502        <li>Validate the <var>n</var> against the <span class=ed>declared
503        type</span> as following:
504          <dl class=switch>
505          <dt class=ed><code>ID_ATTR</code></dt>
506            <dd>
507              <ol>
508              <li><span class=ed>Validate <var>v</var> as an
509              <code>Name</code>.</span>  If it fails, then raise an
510              <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
511              <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
512              then raise an  
513              <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
514              </ol>
515            </dd>
516          <dt class=ed>IDREF_ATTR</dt>
517            <dd>
518              <ol>
519              <li><span class=ed>Validate <var>v</var> as an
520              <code>Name</code>.</span>  If it fails, then raise an
521              <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
522              <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
523              defined,</span> then raise an  
524              <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
525              </ol>
526            </dd>
527          <dt class=ed>IDREFS_ATTR</dt>
528            <dd class=ed>@@</dd>
529          <dt class=ed>ENTITY_ATTR</dt>
530            <dd>
531              <ol>
532              <li><span class=ed>Validate <var>v</var> as an
533              <code>Name</code>.</span>  If it fails, then raise an
534              <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
535              <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
536              defined,</span> then raise an  
537              <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
538              </ol>
539            </dd>
540          <dt class=ed>ENTITIES_ATTR</dt>
541            <dd class=ed>@@</dd>
542          <dt class=ed>NMTOKEN_ATTR</dt>
543            <dd>
544              <ol>
545              <li><span class=ed>Validate <var>v</var> as an
546              <code>Nmtoken</code>.</span>  If it fails, then raise an
547              <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
548              </ol>
549            </dd>
550          <dt class=ed>NMTOKENS_ATTR</dt>
551            <dd class=ed>@@</dd>
552          <dt class=ed>NOTATION_ATTR</dt>
553            <dd class=ed><var>v</var> must be one of enumerated values.
554            If not, then raise an
555            <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
556          <dt class=ed>ENUMERATED_ATTR</dt>
557            <dd class=ed><var>v</var> must be one of enumerated values.
558            If not, then raise an
559            <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
560          <dt class=ed>@@</dt>
561          </dl>
562        </li>
563        <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
564        raise an
565        <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
566        <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
567        <li class=ed>@@ strict serialization error for U+000D, U+000A, and
568        U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
569      </ol>      </ol>
570    </dd>    </dd>
571  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
572    <dd>    <dd>
573      <ol>      <ol>
574      <li class=ed></li>      <li class=ed></li>
575        <li>If <code>nodeName</code> attribute of <var>n</var> is
576        <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
577        and <span class=ed>its declared type is different from (default|preserve),
578        (preserve|default), (default), or (preserve)</span>, then raise an
579        <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
580      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
581      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
582        <ol>        <ol>
# Line 262  following:</p> Line 590  following:</p>
590        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
591        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
592        </ol></li>        </ol></li>
593        <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
594        be declared.  If not, then raise an
595        <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
596        <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
597        values MUST all be distinct.  If not, then raise an
598        <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
599        <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
600        element, then raise an
601        <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
602        <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
603      </ol>      </ol>
604    </dd>    </dd>
605  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
606    <dd>    <dd>
607      <ol>      <ol>
608      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
609      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
610      data</a>.</li>      data</a>.</li>
611      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
612      a string <code>]]></code>, then it is an      a string <code>]]></code>, then raise an
613      <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-cs-mse><code>xml-well-formedness-error</code></a>.<!-- "for compatibility" --></li>
614      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
615      any nodes, they are in      any nodes, they are in
616      <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-cs-child><code>xml-well-formedness-error</code></a>.</li>
# Line 281  following:</p> Line 619  following:</p>
619  <dt>If <var>n</var> is a <code>Comment</code> node</dt>  <dt>If <var>n</var> is a <code>Comment</code> node</dt>
620    <dd>    <dd>
621      <ol>      <ol>
622      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li>Raise an
623        <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
624        <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
625      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
626      data</a>.</li>      data</a>.</li>
627      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
628      a string <code>--</code>, or if it ends with a character      a string <code>--</code>, or if it ends with a character
629      <code>-</code>, then it is an      <code>-</code>, then raise an
630      <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-comment-com><code>xml-well-formedness-error</code></a>.<!--
631        "for compatibility" --></li>
632      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
633      any nodes, they are in      any nodes, they are in
634      <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-comment-child><code>xml-well-formedness-error</code></a>.</li>
# Line 347  following:</p> Line 688  following:</p>
688  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
689    <dd>    <dd>
690      <ol>      <ol>
691      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
692      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
693        <li>Follow the following substeps:
694          <ol>
695          <li>If <code>ownerDocument</code> attribute of <var>n</var> is
696          <code>null</code>, then abort these substeps.</li>
697          <li>If <code>documentElement</code> attribute of the node
698          set to <code>ownerDocument</code> attribute of <var>n</var> is
699          <code>null</code>, then abort these substeps.</li>
700          <li>If <code>nodeName</code> attribute of the node set to
701          <code>documentElement</code> attribute of the node set to
702          <code>ownerDocument</code> attribute of <var>n</var> is
703          different from <code>nodeName</code> of <var>n</var>,
704          then raise an
705          <a href="#xml-validity-error" id=vc-roottype><code>xml-validity-error</code></a>.</li></li>
706          </ol>
707        </li>
708      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
709      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
710      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
711      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
712      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
713      <!-- publicId = non-null and systemId = null -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
714        value of <var>n</var> is <code>null</code>, then raise an
715        <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
716        <span class=ed>@@ publicId == null?  Or, publicId == ""</span></li>
717      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
718      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
719        <ol>        <ol>
# Line 366  following:</p> Line 725  following:</p>
725        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
726        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
727        </ol></li>        </ol></li>
728      <li class=ed>@@ <code>entities</code>, <code>notations</code>,      <li>For each node in the <code>entities</code>, <code>notations</code>,
729      <code>elementTypes</code>, externally declared?</li>      and <code>elementTypes</code> lists of <var>n</var>,
730        <a href="#algorithm-to-check-a-node" title="check a node">check the
731        node</a> recursively.</li>
732        <li class=ed>@@ externally declared?</li>
733      </ol>      </ol>
734    </dd>    </dd>
735  <dt>If <var>n</var> is an <code>Element</code> node</dt>  <dt>If <var>n</var> is an <code>Element</code> node</dt>
736    <dd>    <dd>
737      <ol>      <ol>
738      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
739      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
740      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
741      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
742      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
743      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
744      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 394  following:</p> Line 756  following:</p>
756        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
757        <var>n<sub><var>c</var></sub></var></a> recursively.</li>        <var>n<sub><var>c</var></sub></var></a> recursively.</li>
758        </ol></li>        </ol></li>
759        <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
760        <li>Let <var>attrs</var> be the value of the <code>attribute</code>
761        attribute of <var>n</var>.  Check conformance of <var>attrs</var>
762        as following:
763          <ol>
764          <li>If <var>attrs</var> contains an <code>Attr</code> node whose
765          <code>nodeName</code> attribute value is equal to that of another
766          <code>Attr</code> node in <var>attrs</var>, then raise an
767          <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
768          #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
769          <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
770          </ol>
771        </li>
772      </ol>      </ol>
773    </dd>    </dd>
774  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
# Line 403  following:</p> Line 778  following:</p>
778      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
779      any nodes, they are in      any nodes, they are in
780      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
781        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
782        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
783        <li class=ed>@@ #vc-MixedChildrenUnique
784        Validity constraint: No Duplicate Types</li>
785        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
786        <li>If there is more than one <code>AttributeDefinition</code> node
787        with <span class=ed>attribute type <code>ID</code></span> in the
788        <code>NamedNodeMap</code> list contained in the
789        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
790        <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
791        <li>If there is more than one <code>AttributeDefinition</code> node
792        with <span class=ed>attribute type <code>NOTATION</code></span> in the
793        <code>NamedNodeMap</code> list contained in the
794        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
795        <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
796        <li class=ed>"For interoperability, the same Nmtoken  SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
797      </ol>      </ol>
798    </dd>    </dd>
799  <dt>If <var>n</var> is an <code>Entity</code> node</dt>  <dt>If <var>n</var> is an <code>Entity</code> node whose
800    <code>notationName</code> attribute value is <code>null</code> (i.e. a
801    parsed entity)</dt>
802    <dd>    <dd>
803      <ol>      <ol>
804      <li>An <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a> <span class=ed>@@ if !notationName</span>.</li>      <li>Raise an
805      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
806        <li><a href="#algorithm-to-validate-an-ncname">Validate the
807      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
808      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
809      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
810      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
811      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
812      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
813      <!-- publicId = non-null and systemId = null -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
814      <li class=ed>@@ <code>notationName</code></li>      value of <var>n</var> is <code>null</code>, then raise an
815        <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
816      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
817      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
818        <ol>        <ol>
# Line 436  following:</p> Line 831  following:</p>
831        </ol></li>        </ol></li>
832      </ol>      </ol>
833    </dd>    </dd>
834    <dt>If <var>n</var> is an <code>Entity</code> node whose
835    <code>notationName</code> attribute value is <em>not</em> <code>null</code>
836    (i.e. an unparsed entity)</dt>
837      <dd>
838        <ol>
839        <li><a href="#algorithm-to-validate-an-ncname">Validate the
840        <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
841        <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
842        the <code>publicId</code> attribute value as a public identifier</a>.</li>
843        <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
844        the <code>systemId</code> attribute value as a system identifier</a>.</li>
845        <li>If the <code>systemId</code> attribute value of <var>n</var> is
846        <code>null</code>, then raise an
847        <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
848        <li><a href="#algorithm-to-validate-an-ncname">Validate the
849        <code>notationName</code> attribute value of <var>n</var> as an
850        <code>NCName</code></a>.</li>
851        <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
852        <li>If the <code>childNodes</code> list of <var>n</var> contains
853        any nodes, they are in
854        <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
855        </ol>
856      </dd>
857  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
858    <dd>    <dd>
859      <ol>      <ol>
860      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
861      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
862      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
863      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
864      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 463  following:</p> Line 881  following:</p>
881  <dt>If <var>n</var> is a <code>Notation</code> node</dt>  <dt>If <var>n</var> is a <code>Notation</code> node</dt>
882    <dd>    <dd>
883      <ol>      <ol>
884      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
885      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
886      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
887      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
888      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
889      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
     <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>  
890      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
891      any nodes, they are in      any nodes, they are in
892      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
# Line 478  following:</p> Line 895  following:</p>
895  <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>  <dt>If <var>n</var> is a <code>ProcessingInstruction</code> node</dt>
896    <dd>    <dd>
897      <ol>      <ol>
898      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li>If the <code>target</code> attribute value matches to the string
899      <code>target</code> attribute value as an <code>NCName</code></a>.</li>      <code>xml</code> in any case combination, then raise a
     <li>If the <code>target</code> attribute value matches to  
     <code>XML</code> case-insensitively, then it is a  
900      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
901      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
902      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>target</code> attribute value as an <code>NCName</code></a>.</li>
903        <li>Then,
904        <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
905        the <code>data</code> attribute value as an <abbr>XML</abbr> character
906      data</a>.</li>      data</a>.</li>
907      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains a string
908      a string <code>?></code>, or starts with either      <code>?></code>, then raise a
909      <code class=char>U+0009</code>, <code class=char>U+000A</code>,      <a href="#xml-well-formedness-error" id=wfe-pi-pic><code>xml-well-formedness-error</code></a>.</li>
910      <code class=char>U+000D</code>, or <code class=char>U+0020</code>      <li>If the <code>data</code> attribute value starts with either
911      character, then it is a      <code class=char>U+0009</code> <code class=charname>CHARACTER
912      <a href="#round-trip-error" id=rte-pi-data><code>round-trip-error</code></a>.</li>      TABULATION</code>, <code class=char>U+000A</code> <code class=charname>LINE
913        FEED</code>, <code class=char>U+000D</code> <code class=charname>CARRIAGE
914        RETURN</code>, or <code class=char>U+0020</code>
915        <code class=charname>SPACE</code> character, then raise a
916        <a href="#round-trip-error" id=rte-pi-s><code>round-trip-error</code></a>.</li>
917      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
918      any nodes, they are in      any nodes, then raise an
919      <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-pi-child><code>xml-well-formedness-error</code></a>.</li>
920      </ol>      </ol>
921    </dd>    </dd>
922  <dt>If <var>n</var> is a <code>Text</code> node</dt>  <dt>If <var>n</var> is a <code>Text</code> node</dt>
923    <dd>    <dd>
924      <ol>      <ol>
925      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
926      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
927      data</a>.</li>      data</a>.</li>
928      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
# Line 514  following:</p> Line 936  following:</p>
936    
937  </div>  </div>
938    
939  <div class=section id=checking-components>  </div>
 <h3>Checking Components</h3>  
940    
941  <p>The algorithm  <div id=classes class=section>
942  <dfn id=algorithm-to-validate-an-xml-character-data title-"to validate an XML character data">to  <h2>Character Classes</h2>
 validate an <abbr>XML</abbr> character data  
 (<dfn id=var-s><var>s</var></dfn>)</dfn> is defined as following:</p>  
 <ol>  
 <li class=ed>@@</li>  
 </ol>  
943    
944  <p>The algorithm  <p>This section defines a couple of character classes.
945  <dfn id=algorithm-to-validate-a-name title="to validate a Name">to validate a  These classes are referred to by algorithms specified above.</p>
 <code>Name</code> (<dfn id=var-name><var>name</var></dfn>)</dfn> is defined  
 as following:</p>  
 <ol>  
 <li class=ed>@@</li>  
 </ol>  
946    
947  <p>The algorithm  <p>Character class <dfn id=class-Char10><code>Char10</code></dfn>
948  <dfn id=algorithm-to-validate-a-ncname title="to validate an NCName">to  contains the following characters:</p>
949  validate an <code>NCName</code> (<dfn id=var-name><var>name</var></dfn>)</dfn>  <ul>
950  is defined as following:</p>  <li><code class=char>U+0009</code> <code class=charname>CHARACTER
951  <ol>  TABULATION</code></li>
952  <li class=ed>@@</li>  <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
953  </ol>  <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
954    RETURN</code></li>
955    <li><code class=char>U+0020</code> <code class=charname>SPACE</code>
956    .. <code class=char>U+D7FF</code></li>
957    <li><code class=char>U+E000</code> .. <code class=char>U+FFFD</code>
958    <code class=charname>REPLACEMENT CHARACTER</code></li>
959    <li><code class=char>U+10000</code> .. <code class=char>U+10FFFF</code></li>
960    </ul>
961    <div class="note memo">
962    <p>This character class contains all characters allowed in the production rule
963    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Char"><code>Char</code></a>
964    of <abbr>XML</abbr> 1.0
965    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
966    </div>
967    
968  <p>To <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate  <p>Character class <dfn id=class-CompatChar10><code>CompatChar10</code></dfn>
969  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the  contains the following characters:</p>
970  algorithm below <em class=rfc2119>MUST</em> be used:</p>  <ul>
971  <ol>  <li class=ed>@@ <q>Document authors are encouraged to avoid "compatibility characters", as defined in section 6.8 of [Unicode @@ Unicode 2.0 @@] (see also D21 in section 3.6 of [Unicode3]).</q></li>
972    <li>If <var>pid</var> contains any character  </ul>
973    that is outside of the range of <code>#x20 | #xD | #xA |  
974    [a-zA-Z0-9] | [-'()+,./:=?;!*#@$_%]</code><!-- @@ TODO: formal def -->,  <p>Character class <dfn id=class-ControlChar10><code>ControlChar10</code></dfn>
975    then it is an  contains the following characters:</p>
976    <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>  <ul>
977    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>  <li><code class=char>U+007F</code> <code class=charname>DELETE</code> ..
978    <code class=charname>CHARACTER TABULATION</code>,  <code class=char>U+0084</code> <code class=charname>INDEX</code></li>
979    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,  <li><code class=char>U+0086</code> <code class=charname>START OF SELECTED
980    and <code class=char>U+000D</code> <code class=charname>LINE FEED</code>  AREA</code> .. <code class=char>U+009F</code> <code class=charname>APPLICATION
981    characters, if the first character of <var>pid</var> is  PROGRAM COMMAND</code></li>
982    <code class=char>U+0020</code> <code class=charname>SPACE</code> character,  <li><code class=char>U+FDD0</code> .. <code class=char>U+FDEF</code></li>
983    if the last character of <var>pid</var> is <code class=char>U+0020</code>  <li><code class=char>U+1FFFE</code> .. <code class=char>U+1FFFF</code></li>
984    <code class=charname>SPACE</code> character, or if there is a  <li><code class=char>U+2FFFE</code> .. <code class=char>U+2FFFF</code></li>
985    <code class=char>U+0020</code> <code class=charname>SPACE</code> character  <li><code class=char>U+3FFFE</code> .. <code class=char>U+3FFFF</code></li>
986    immediately followed by another <code class=char>U+0020</code>  <li><code class=char>U+4FFFE</code> .. <code class=char>U+4FFFF</code></li>
987    <code class=charname>SPACE</code> character in <var>pid</var>, then it is a  <li><code class=char>U+5FFFE</code> .. <code class=char>U+5FFFF</code></li>
988    <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.</li>  <li><code class=char>U+6FFFE</code> .. <code class=char>U+6FFFF</code></li>
989    <li class=ed>@@ Should we check formal-public-identifierness?</li>  <li><code class=char>U+7FFFE</code> .. <code class=char>U+7FFFF</code></li>
990  </ol>  <li><code class=char>U+8FFFE</code> .. <code class=char>U+8FFFF</code></li>
991    <li><code class=char>U+9FFFE</code> .. <code class=char>U+9FFFF</code></li>
992    <li><code class=char>U+AFFFE</code> .. <code class=char>U+AFFFF</code></li>
993    <li><code class=char>U+BFFFE</code> .. <code class=char>U+BFFFF</code></li>
994    <li><code class=char>U+CFFFE</code> .. <code class=char>U+CFFFF</code></li>
995    <li><code class=char>U+DFFFE</code> .. <code class=char>U+DFFFF</code></li>
996    <li><code class=char>U+EFFFE</code> .. <code class=char>U+EFFFF</code></li>
997    <li><code class=char>U+FFFFE</code> .. <code class=char>U+FFFFF</code></li>
998    <li><code class=char>U+10FFFE</code> .. <code class=char>U+10FFFF</code></li>
999    </ul>
1000    <div class="note memo">
1001    <p>This character class contains the characters listed in the Note in
1002    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#charsets">Section 2.2</a>
1003    of <abbr>XML</abbr> 1.0
1004    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>,
1005    as amended by
1006    <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1007    </div>
1008    
1009    <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1010    contains the following characters:</p>
1011    <ul class=ed>
1012    </ul>
1013    <div class="note memo">
1014    <p>This character class contains all characters allowed as the first character
1015    of a string matching to the production rule
1016    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1017    of <abbr>XML</abbr> 1.0
1018    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1019    </div>
1020    
1021    <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1022    contains the following characters:</p>
1023    <ul>
1024    <li>The characters in the character class
1025    <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1026    <li class=ed></li>
1027    </ul>
1028    <div class="note memo">
1029    <p>This character class contains all characters allowed as the second
1030    character of a string matching to the production rule
1031    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1032    of <abbr>XML</abbr> 1.0
1033    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1034    </div>
1035    
1036    <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1037    contains the following characters:</p>
1038    <ul>
1039    <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1040    TABULATION</code></li>
1041    <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1042    <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1043    RETURN</code></li>
1044    <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1045    <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1046    (<code class=char>!</code>)</li>
1047    <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1048    (<code class=char>$</code>)</li>
1049    <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1050    (<code class=char>#</code>)</li>
1051    <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1052    (<code class=char>%</code>)</li>
1053    <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1054    (<code class=char>'</code>)</li>
1055    <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1056    (<code class=char>(</code>)</li>
1057    <li><code class=char>U+0029</code> <code class=charname>RIGHT
1058    PARENTHESIS</code> (<code class=char>)</code>)</li>
1059    <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1060    (<code class=char>*</code>)</li>
1061    <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1062    (<code class=char>+</code>)</li>
1063    <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1064    (<code class=char>,</code>)</li>
1065    <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1066    (<code class=char>-</code>)</li>
1067    <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1068    (<code class=char>.</code>)</li>
1069    <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1070    (<code class=char>/</code>)</li>
1071    <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1072    (<code class=char>0</code>) .. <code class=char>U+0039</code>
1073    <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1074    <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1075    (<code class=char>:</code>)</li>
1076    <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1077    (<code class=char>;</code>)</li>
1078    <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1079    (<code class=char>=</code>)</li>
1080    <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1081    (<code class=char>?</code>)</li>
1082    <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1083    (<code class=char>@</code>)</li>
1084    <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1085    A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1086    <code class=charname>LATIN CAPITAL LETTER Z</code>
1087    (<code class=char>Z</code>)</li>
1088    <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1089    (<code class=char>_</code>)</li>
1090    <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1091    A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1092    <code class=charname>LATIN CAPITAL LETTER Z</code>
1093    (<code class=char>Z</code>)</li>
1094    </ul>
1095    <div class="note memo">
1096    <p>This character class contains all characters allowed in the production rule
1097    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1098    of <abbr>XML</abbr> 1.0
1099    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1100  </div>  </div>
1101    
1102  </div>  </div>

Legend:
Removed from v.1.9  
changed lines
  Added in v.1.22

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24