/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Diff of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.17 by wakaba, Sat Dec 1 12:19:41 2007 UTC revision 1.26 by wakaba, Sat Mar 29 06:35:16 2008 UTC
# Line 18  Line 18 
18    
19  <div class="header">  <div class="header">
20  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21  <h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2>  <h2>Working Draft <time datetime=2008-03-29>29 March 2008</time></h2>
22    
23  <dl class="versions-uri">  <dl class="versions-uri">
24  <dt>This Version</dt>  <dt>This Version</dt>
# Line 42  Line 42 
42        >w@suika.fam.cx</a>&gt;</code></dd>        >w@suika.fam.cx</a>&gt;</code></dd>
43  </dl>  </dl>
44    
45  <p class="copyright" lang="en">&#xA9; <time>2007</time> <a  <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47  Permission is granted to copy, distribute and/or modify this document  Permission is granted to copy, distribute and/or modify this document
48  under the terms of the <a rel="license"  under the terms of the <a rel="license"
# Line 118  else in this specification is normative. Line 118  else in this specification is normative.
118  <p><span class=ed>Algorithm is normative but non-normative</span>.  <p><span class=ed>Algorithm is normative but non-normative</span>.
119  In addition, the order in which <a href="#errors">errors</a> are  In addition, the order in which <a href="#errors">errors</a> are
120  raised is undefined.</p>  raised is undefined.</p>
121    
122    <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123    by hyperlinks.  When the document being processed is an <abbr>XML</abbr> 1.1
124    document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125    specification should be consulted instead.</p>
126  </div>  </div>
127    
128    
# Line 205  can be easily serialized into a valid XM Line 210  can be easily serialized into a valid XM
210    violate to any well$B!>(Bformedness constraint in XML    violate to any well$B!>(Bformedness constraint in XML
211    specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,    specification <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>,
212    <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>    <a href="#ref-XML11">XML11</a>]</cite>.</p></dd>
213    <dt><dfn id=misc-info><code>misc-info</code></dfn>
214      <dd><p>A <code>misc-info</code> is raised when some status information
215      on parsing or checking process that are considered useful for debugging
216      and so on is available.  It by no means implies the non-conformance of
217      the document.
218  </dl>  </dl>
219    
220  <div class=ed><p>@@ TODO: #dt-atuseroption at user option  <div class=ed><p>@@ TODO: #dt-atuseroption at user option
221  (MAY or MUST), #dt-compat for compatibility,  (MAY or MUST), #dt-compat for compatibility,
222  #dt-interop for interoperability</p></div>  #dt-interop for interoperability</p>
223    
224    <p>TODO: XML 1.1, XML Namespace 1.0/1.1, xml:base, xml:id
225    </div>
226    
227  </div>  </div>
228    
229  <div class=section id=parsing-xml>  <div class=section id=parsing-xml>
230  <h2>Parsing <abbr>XML</abbr> Document</h2>  <h2>Parsing <abbr>XML</abbr> Document</h2>
231    
232  <ul>  <p>When an <abbr>XML</abbr> document is parsed, the following clauses
233  <li>If the <abbr>XML</abbr> document does not begin with an  are applied:</p>
234  <abbr>XML</abbr> declaration, then raise an  <dl class=switch>
235  <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>  <dt>For the document
236  <li>If the replacement text of an entity declaration is    <dd>If the <abbr>XML</abbr> document does not begin with an
237  <code>&lt;</code>, then raise an    <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
238  <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--    raise an
239      <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
240    <dt>For the document type declaration
241      <dd class=ed>@@ read external entity
242      <dd>The <code>entities</code> attribute of the <code>DocumentType</code>
243      node <em class=rfc2119>MUST</em> contain a <code>NamedNodeMap</code> object
244      whose first five items are as follows:
245        <ol start=0>
246        <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
247        is <code>amp</code>.  It contains a <code>Text</code> node whose
248        <code>data</code> attribute is set to <code>&amp;</code>.
249        <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
250        is <code>lt</code>.  It contains a <code>Text</code> node whose
251        <code>data</code> attribute is set to <code>&lt;</code>.
252        <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
253        is <code>gt</code>.  It contains a <code>Text</code> node whose
254        <code>data</code> attribute is set to <code>></code>.
255        <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
256        is <code>quot</code>.  It contains a <code>Text</code> node whose
257        <code>data</code> attribute is set to <code>"</code>.
258        <li>An <code>Entity</code> node whose <code>nodeName</code> attribute
259        is <code>apos</code>.  It contains a <code>Text</code> node whose
260        <code>data</code> attribute is set to <code>'</code>.
261        </ol>
262    <dt>For each internal general entity declaration being processed by the parser
263      <dd>If the
264      <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
265      part of the general entity declaration contains a bare <code>U+003C</code>
266      <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
267      <em class=rfc2119>MUST</em> raise an
268      <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
269  "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].  "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
270  --></li>  -->
271  <li>If there is an element type declaration whose <code>Name</code>  <dt>For each element type declaration being processed by the parser
272  value is already declared, then raise an    <dd>If there is another processed element type declaration whose
273  <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li>    <code>Name</code> is equal to the <code>Name</code> of the element type
274  <li>If attribute definition whose <code>Name</code> is    declaration, then the parser <em class=rfc2119>MUST</em> raise an
275  <code>xml:space</code> has <span class=ed>declared type different from    <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
276  (default|preserve), (default), or (preserve)</span>, then raise an  <dt>For each attribute definition list declaration being processed by the
277  <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.  parser
278  <span class=ed>@@ duplication with    <dd>If there is another processed attribute defintion list declaration whose
279  <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--    <code>Name</code> is equal to the <code>Name</code> of the attribute
280  <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>    definition list declaration, then the parser <em class=rfc2119>MUST</em>
281  <li>If an empty-element tag is used for an element which is <em>not</em>    raise an
282  declared <code>EMPTY</code>, then raise an    <a href="#xml-misc-warning" id=xme-attlist-unique><code>xml-misc-warning</code></a>.
283  <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li>    <dd>For each attribute definition in the attribute definition list
284  <li>If an empty-element tag is <em>not</em> used for an element which is    declaration, if there is another processed attribute definition whose
285  declared <code>EMPTY</code>, then raise an    <code>Name</code> is equal to the <code>Name</code> of the attribute
286  <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li>    definition (whether or not in the same attribute definition list
287      declaration), then the parser <em class=rfc2119>MUST</em> raise an
288      <a href="#xml-misc-warning" id=xme-attrdef-unique><code>xml-misc-warning</code></a>.
289      <!--
290        <q>For interoperability, an XML processor <em class=rfc2119>MAY</em> at
291        user option issue a warning when more than one attribute-list declaration
292        is provided for a given element type, or more than one attribute definition
293        is provided for a given attribute, but this is not an error.</q>
294      -->
295  <!--  <!--
296      NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
297      will not be warned.
298    -->
299    
300    <dt>For each entity declaration being processed by the parser
301      <dd>Handle as follows:
302        <ol>
303        <li><p>If the entity declaration declares a general entity, the following
304        is applied:
305          <dl>
306          <dt>If the <code>Name</code> is <code>lt</code> or <code>amp</code>
307            <dd><p>If the entity declaration does not declare an internal entity,
308            or if the replacement text of the entity is not the escaped form of
309            <code>&lt;</code> (if <code>lt</code>) or <code>&amp;</code> (if
310            <code>amp</code>), then the parser <em class=rfc2119>MUST</em> raise an
311            <a href="#xml-misc-error" id=xme-double-escape><code>xml-misc-error</code></a>.
312    
313              <div class="note memo informative">
314              <p>In other word, the character in the <code>EntityValue</code>
315              has to be double-escaped.
316              </div>
317          <dt>If the <code>Name</code> is <code>gt</code>, <code>quot</code>, or
318          <code>apos</code>
319            <dd><p>If the entity declaration does not declare an internal entity,
320            or if the replacement text of the entity is not equal to or not the
321            escaped form of <code>></code> (if <code>gt</code>), <code>"</code> (if
322            <code>quot</code>), or <code>'</code> (if <code>apos</code>), then the
323            parser <em class=rfc2119>MUST</em> raise an
324            <a href="#xml-misc-error" id=xme-single-escape><code>xml-misc-error</code></a>.
325    
326              <div class="note memo informative">
327              <p>In other word, the character in the <code>EntityValue</code>
328              has to be single- or double-escaped.
329              </div>
330          </dl>
331          <!--
332            <q>If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless).</q>
333          -->
334    
335        <li><p>If the entity declaration has to be ignored since there has already
336        been declared an entity with the same <code>Name</code> as the declaration,
337        then the parser <em class=rfc2119>MUST</em> raise a
338        <a href="#misc-info" id=mi-ent-unique><code>misc-info</code></a>
339        and abort these steps.
340    
341        <div class="informative note memo">
342        <p>Five predefined entities, i.e. <code>amp</code>, <code>lt</code>,
343        <code>gt</code>, <code>quot</code>, and <code>apos</code>, are always
344        declared implicitly and therefore any declaration for such an entity
345        always raises an
346        <a href="#misc-info" id=mi-ent-unique><code>misc-info</code></a>.
347        </div>  
348    
349        <li><p>If the entity declaration declares a parameter entity and the
350        <code>Name</code> of the entity begins with the string <code>xml</code>
351        (in any combination of upper- and lowercase letters), then the parser
352        <em class=rfc2119>MUST</em> raise an
353        <a href="#xml-misc-warning" id=xmw-reserved-pe-name><code>xml-misc-warning</code></a>.
354    
355        <li><p>If the entity declaration contains the <code>EntityValue</code>,
356        then for each occurence of any references to unparsed entities in the
357        <code>EntityValue</code>, the parser <em class=rfc2119>MUST</em> raise an
358        <a href="#xml-misc-error" id=xme-unparsed-in-ev><code>xml-misc-error</code></a>.
359        <!--
360          <q>It is an error for a reference to an unparsed entity to appear in the
361          EntityValue in an entity declaration.</q>
362        -->
363        <li><p>If the entity declaration declares a general entity, then an
364        <code>Entity</code> node <em class=rfc2119>MUST</em> be created and
365        appended to the <code>NamedNodeMap</code> object in the
366        <code>entities</code> attribute of the <code>DocumentType</code> node.
367        
368        <p class=ed>Read the external entity
369    
370        <p>If the replacement text of the entity is read, then parse the
371        replacement text as if it were referenced from the content of an
372        element (with no namespace bindings).  If no <span class=ed>@@ parse error</span>
373        is raised by the parsing process, then the nodes generated by the
374        parsing <em class=rfc2119>MUST</em> be appended to the <code>Entity</code>
375        node.  The parse error <em class=rfc2119>MUST NOT</em> be propagated to
376        the entire parsing process.  Other kinds of errors
377        <em class=rfc2119>MUST</em> be propagated.  The first parse error
378        <em class=rfc2119>MUST</em> abort the internal parsing process.
379        <span class=ed>@@ better wording</span>
380    
381        <p class=ed>@@ prop
382        
383        <p>Then, the <code>Entity</code> node and its descendant
384        <em class=rfc2119>MUST</em> be marked as read-only.
385      </ol>
386    
387    <dt>For each notation declaration being processed by the parser
388      <dd>If there is another processed notation declaration whose
389      <code>Name</code> is equal to the <code>Name</code> of the notation
390      declaration, then the parser <em class=rfc2119>MUST</em> raise an
391      <a href="#xml-validity-error" id=vc-uniquenotationname><code>xml-validity-error</code></a>.
392      <!-- <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#UniqueNotationName">Validity constraint: Unique Notation Name</a> -->
393    
394    <dt>For each empty-element tag
395      <dd>If the <code>Name</code> of the tag is not declared by a processed
396      element type declaration as <code>EMPTY</code> content, then the parser
397      <em class=rfc2119>MUST</em> raise an
398      <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
399    <dt>For each start-tag
400      <dd>If the <code>Name</code> of the tag is declared by a processed element
401      type declaration as <code>EMPTY</code> content, then the parser
402      <em class=rfc2119>MUST</em> raise an
403      <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
404    
405    <dt>For each parameter entity reference
406      <dd>If the declaration for the entity is not read (i.e. no declaration
407      for the entity is processed or the external entity referenced by the
408      declaration cannot be retrieved), then:
409        <ul>
410        <li>If the parameter entity is contained in a declaration, then the
411        declaration <em class=rfc2119>MUST</em> be ignored <em>except</em> that
412        any error before the parameter entity <em class=rfc2119>MUST</em> be
413        raised as usual.
414        <li>If the parameter entity is contained in the status portion of a
415        conditional section, then the conditional section
416        <em class=rfc2119>MUST</em> be processed as if it were an
417        <code>IGNORE</code>d section.
418        <li>The parser <em class=rfc2119>MUST NOT</em> process any entity or
419        attribute-list declaration after the parameter entity reference in the DTD
420        <em>except</em> when the <code>standalone</code> pseudo-attribute of the
421        XML declaration (if any) is set to <code>yes</code>.
422        <!-- This requirement is enforced for internal DTD subset case in
423        XML 1.0/1.1 specification (section 5.1) but not for any other cases. -->
424        <!-- According to this definition, element type declarations, notation
425        declarations, and PIs ARE processed. -->
426        <li>If the parameter entity reference is the first reference to an entity
427        that is not read, then the parser <em class=rfc2119>MUST</em> raise an
428        <a href="#entity-error" id=ee-unread-pe><code>entity-error</code></a>.
429        <li>The <code>allDeclarationsProcessed</code> <span class=ed>@@ ref</span>
430        attribute of the <code>Document</code> node <em class=rfc2119>MUST</em> be
431        set to <code>false</code>.
432        </ul>
433    <dt>For each general entity reference in an attribute value or in the content
434    of an element
435      <dd>If the declaration for the entity is not read (i.e. no declaration for
436      the entity is processed or the external entity referenced by the declaration
437      cannot be retrieved), then:
438        <ul>
439        <li>If the general entity reference is the first reference to an entity
440        that is not read, then the parser <em class=rfc2119>MUST</em> raise an
441        <a href="#entity-error" id=ee-unread-ge><code>entity-error</code></a>.
442        <span class=ed>@@ entity declared WFC?</span>
443        <li class=ed>An unexpended entity reference node <em class=rfc2119>MUST</em> be inserted to the current node.
444        </ul>
445    </dl>
446    
447    <p class=ed>@@ MUST try to read external entity
448    
449    <p>In addition, the parser has to check whether the
450    following constraints are met.
451    
452    <p><strong>Well-formedness constraints</strong>.  For each violation to
453    one of constraints below, an
454    <a href="#xml-well-formedness-error"><code>xml-well-formedness-error</code></a>
455    <em class=rfc2119>MUST</em> be raised.  The list of well-formedness
456    constraints is below:
457    <ul>
458    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wfc-PEinInternalSubset">Well-formedness constraint: PEs in Internal Subset</a>
459    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#GIMatch">Well-formedness constraint: Element Type Match</a>
460    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#uniqattspec">Well-formedness constraint: Unique Att Spec</a>
461    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NoExternalRefs">Well-formedness constraint: No External Entity References</a>
462    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#CleanAttrVals">Well-formedness constraint: No &lt; in Attribute Values</a>
463    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#wf-Legalchar">Well-formedness constraint: Legal Character</a>
464    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#textent">Well-formedness constraint: Parsed Entity</a>
465    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#norecursion">Well-formedness constraint: No Recursion</a>
466    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#indtd">Well-formedness constraint: In DTD</a>
467    </ul>
468    
469    <p><strong>Validity constraints</strong>.  For each violation to
470    one of constraints below, an
471    <a href="#xml-validity-error"><code>xml-validity-error</code></a>.
472    <em class=rfc2119>MUST</em> be raised.  The list of validity
473    constraints is below:
474    <ul>
475    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinMarkupDecl">Validity constraint: Proper Declaration/PE Nesting</a>
476    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#vc-PEinGroup">Validity constraint: Proper Group/PE Nesting</a>
477    <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#condsec-nesting">Validity constraint: Proper Conditional Section/PE Nesting</a>
478    </ul>
479    
480    <p><strong>Other creteria</strong>.  For each violation to
481    one of constraints below, an
482    <a href="#xml-misc-recommendation"><code>xml-misc-recommendation</code></a>
483    <em class=rfc2119>MUST</em> be raised.  The list of constraints is below:
484    <ul>
485    <li><q>For interoperability, if a parameter-entity reference appears in a
486    <code>choice</code>, <code>seq</code>, or <code>Mixed</code> construct, its
487    replacement text <em class=rfc2119>SHOULD</em> contain at least one non-blank
488    character, and neither the first nor last non-blank character of the
489    replacement text <em class=rfc2119>SHOULD</em> be a connector (<code>|</code>
490    or <code>,</code>).</q>
491    <li><q>External parsed entities <em class=rfc2119>SHOULD</em> each begin with a
492    text declaration.</q>
493    </ul>
494    
495    
496  #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting  <!--
 #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset  
 #ExtSubset Well-formedness constraint: External Subset  
 #PE-between-Decls Well-formedness constraint: PE Between Declarations  
 #GIMatch Well-formedness constraint: Element Type Match  
 #uniqattspec Well-formedness constraint: Unique Att Spec  
 #NoExternalRefs Well-formedness constraint: No External Entity References  
 #CleanAttrVals Well-formedness constraint: No < in Attribute Values  
 #vc-PEinGroup Validity constraint: Proper Group/PE Nesting  
 "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."  
 "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."  
 #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting  
 #wf-Legalchar Well-formedness constraint: Legal Character  
 #textent Well-formedness constraint: Parsed Entity  
 #norecursion Well-formedness constraint: No Recursion  
 #indtd Well-formedness constraint: In DTD  
 "External parsed entities SHOULD each begin with a text declaration."  
 "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."  
 #UniqueNotationName Validity constraint: Unique Notation Name  
497    
498  @@ Need detailed review, but maybe should be in parsing phase  @@ Need detailed review, but maybe should be in parsing phase
499    
# Line 272  declared <code>EMPTY</code>, then raise Line 504  declared <code>EMPTY</code>, then raise
504  #wf-entdeclared Well-formedness constraint: Entity Declared  #wf-entdeclared Well-formedness constraint: Entity Declared
505  #vc-entdeclared Validity constraint: Entity Declared  #vc-entdeclared Validity constraint: Entity Declared
506  "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."  "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
 "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."  
507    
508  @@ flaged and then reported in DOM check phase  @@ flaged and then reported in DOM check phase
509    
# Line 284  declared <code>EMPTY</code>, then raise Line 515  declared <code>EMPTY</code>, then raise
515    
516  "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."  "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
517    
518    @@ We should phrase out what the parser should do where the XML specification
519    left undefined.  For example: Comment must be converted to a Comment node,
520    illegal xml:space value must be preserved, so on.
521    
522    Warn <!ENTITY % xml... ...>
523    
524    -->
525    
526    <p>The parser <em class=rfc2119>MUST</em> raise an
527    <a href="#xml-well-formedness-error" id=wfe-syntax><code>xml-well-formedness-error</code></a>
528    for any failure to match to a production rule in the XML specification.
529    <!--
530      <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#ExtSubset">Well-formedness constraint: External Subset</a>
531      <li><a href="http://www.w3.org/TR/2006/REC-xml-20060816/#PE-between-Decls">Well-formedness constraint: PE Between Declarations</a>
532  -->  -->
 </ul>  
533  </div>  </div>
534    
535  <div class="section" id=checking-dom>  <div class="section" id=checking-dom>
# Line 314  is that of the <code>Document</code> nod Line 558  is that of the <code>Document</code> nod
558    
559    
560  <div class=section id=checking-components>  <div class=section id=checking-components>
561  <h3>Checking Components</h3>  <h3>Conformance Checking Algorithms for Components</h3>
562    
563  <p>The algorithm  <p>To
564  <dfn id=algorithm-to-validate-an-xml-character-data title-"to validate an XML character data">to  <dfn id=algorithm-to-validate-an-xml-character-data title="validate an XML string">to
565  validate an <abbr>XML</abbr> character data  validate an <abbr>XML</abbr> string (<dfn id=var-s><var>s</var></dfn>)</dfn>,
566  (<dfn id=var-s><var>s</var></dfn>)</dfn> is defined as following:</p>  the following algorithm <em class=rfc2119>MUST</em> be used:</p>
567  <ol>  <ol>
568  <li>If <var>s</var> contains a character that is <em>not</em> in  <li>If <var>s</var> contains a character that is <em>not</em> in
569  the character class <a href="#class-Char10"><code>Char10</code></a>,  the character class <a href="#class-Char10"><code>Char10</code></a>,
# Line 344  U+000D will be serialized as &#x000D; (s Line 588  U+000D will be serialized as &#x000D; (s
588  will be raised).</span></li>  will be raised).</span></li>
589  </ol>  </ol>
590    
591  <p>The algorithm  <p>To
592  <dfn id=algorithm-to-validate-a-name title="to validate a Name">to validate a  <dfn id=algorithm-to-validate-a-name title="validate a Name">validate a
593  <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn> is defined  <code>Name</code> (<dfn id=var-name-s><var>s</var></dfn>)</dfn>, the following
594  as following:</p>  algorithm <em class=rfc2119>MUST</em> be used:</p>
595  <ol>  <ol>
596    <li>If <var>s</var> is an empty string, then raise an
597    <a href="#xml-well-formedness-error" id=wfe-name-empty><code>xml-well-formedness-error</code></a>.
598    Abort these steps.</li>
599    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
600    <var>s</var> as an <abbr>XML</abbr> string</a>.</li>
601    <li>If the first character in <var>s</var> is a character that is
602    <em>not</em> in the character class
603    <a href="#class-NameStartChar10"><code>NameStartChar10</code></a>, then raise
604    an
605    <a href="#xml-well-formedness-error" id=wfe-name-startchar><code>xml-well-formedness-error</code></a>.</li>
606    <li>If a character other than the first character in <var>s</var> is a
607    character that is <em>not</em> in the character class
608    <a href="#class-NameChar10"><code>NameChar10</code></a>, then raise an
609    <a href="#xml-well-formedness-error" id=wfe-name-char><code>xml-well-formedness-error</code></a>.</li>
610  <li>If <var>s</var> begins with the string <code>xml</code> (in any  <li>If <var>s</var> begins with the string <code>xml</code> (in any
611  case combination), then raise an  case combination), then raise an
612  <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.  <a href="#xml-misc-warning" id=xmw-reserved-name><code>xml-misc-warning</code></a>.
613  <span class=ed>@@ except for attribute names <code>xml:lang</code>,  <span class=ed>@@ except for attribute names <code>xml:lang</code>,
614  <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,  <code>xml:space</code><!--, <code>xml:base</code>, <code>xml:id</code>,
615  <code>xmlns</code>, <code>xmlns:<var>*</var></code>,  <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
616  pi name <code>xml-stylesheet</code>.</span><!--  pi name <code>xml-stylesheet</code>-->.</span><!--
617  "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":  "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
618  xml:base and xml:id specifications violate to this sentense!  xmlns, xml-stylesheet, xml:base and xml:id specifications violate to this sentense!
619  --></li>  --></li>
620  <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],  <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
621  [XML11]. -->  [XML11]. -->
622  <li class=ed>@@</li>  <li class=ed>@@ XML 1.1 support</li>
623  </ol>  </ol>
624    
625  <p>The algorithm  <p>To
626  <dfn id=algorithm-to-validate-a-ncname title="to validate an NCName">to  <dfn id=algorithm-to-validate-an-ncname title="validate an NCName">validate
627  validate an <code>NCName</code> (<dfn id=var-name><var>name</var></dfn>)</dfn>  an <code>NCName</code> (<dfn id=var-ncname-s><var>s</var></dfn>)</dfn>, the
628  is defined as following:</p>  following algorithm <em class=rfc2119>MUST</em> be used:</p>
629  <ol>  <ol>
630    <li><a href="#algorithm-to-validate-a-name" title="validate a Name">Validate
631    <var>s</var> as a <code>Name</code></a>.</li>
632  <li class=ed>@@</li>  <li class=ed>@@</li>
633  </ol>  </ol>
634    
635  <p>To <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate  <p>To
636    <dfn id=algorithm-to-validate-a-public-identifier title="validate a public identifier">validate
637  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
638  algorithm below <em class=rfc2119>MUST</em> be used:</p>  following algorithm <em class=rfc2119>MUST</em> be used:</p>
639  <ol>  <ol>
640    <li>If <var>pid</var> contains any character  <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
641    that is outside of the range of <code>#x20 | #xD | #xA |  <li>If <var>pid</var> contains a character that is <em>not</em> in the
642    [a-zA-Z0-9] | [-'()+,./:=?;!*#@$_%]</code><!-- @@ TODO: formal def -->,  character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
643    then it is an  raise an
644    <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>  <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
645    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
646    <code class=charname>CHARACTER TABULATION</code>,    <code class=charname>CHARACTER TABULATION</code>,
647    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
# Line 392  algorithm below <em class=rfc2119>MUST</ Line 653  algorithm below <em class=rfc2119>MUST</
653    <code class=char>U+0020</code> <code class=charname>SPACE</code> character    <code class=char>U+0020</code> <code class=charname>SPACE</code> character
654    immediately followed by another <code class=char>U+0020</code>    immediately followed by another <code class=char>U+0020</code>
655    <code class=charname>SPACE</code> character in <var>pid</var>, then it is a    <code class=charname>SPACE</code> character in <var>pid</var>, then it is a
656    <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.</li>    <a href="#round-trip-error" id=rte-pubid-space><code>round-trip-error</code></a>.
657    <li class=ed>@@ Should we check formal-public-identifierness?</li>    <span class=ed>Is this really a roundtripness problem?  XML spec
658      does only define the way to match public identifiers in fact, no
659      canonical form.</span></li>
660    </ol>
661    
662    <p>To
663    <dfn id=algorithm-to-validate-a-system-identifier title="validate a system identifier">validate
664    a system identifier (<dfn id=var-sid><var>sid</var></dfn>)</dfn>, the
665    following algorithm <em class=rfc2119>MUST</em> be used:</p>
666    <ol>
667      <li>If <var>sid</var> is <code>null</code>, abort these steps.</li>
668    <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate
669    <var>sid</var> as an <abbr>XML</abbr> string</a>.</li>
670      <li>If <var>sid</var> contains both <code class=char>U+0022</code>
671      <code class=charname>QUOTATION MARK</code> (<code class=char>"</code>) and
672      <code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
673      (<code class=char>'</code>) characters, raise an
674      <a href="#xml-well-formedness-error" id=wfe-sid-lit><code>xml-well-formedness-error</code></a>.</li>
675      <li>If <var>sid</var> contains at least one <code class=char>U+0023</code>
676      <code class=charname>NUMBER SIGN</code> (<code class=char>#</code>)
677      character, then raise an
678      <a href="#xml-misc-error" id=xme-sid-fragment><code>xml-misc-error</code></a>.</li>
679      <li class=ed>@@ If <var>sid</var> cannot be converted to a URI
680      reference, then raise a fact-level error (xml-misc-warning?).<!--
681      XML 1.0 has no conformance language for system identifier being
682      a URI reference. --></li>
683  </ol>  </ol>
684    
685  </div>  </div>
# Line 408  following:</p> Line 694  following:</p>
694  <dt>If <var>n</var> is an <code>Attr</code> node</dt>  <dt>If <var>n</var> is an <code>Attr</code> node</dt>
695    <dd>    <dd>
696      <ol>      <ol>
697      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
698      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
699      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
700      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
701      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
702      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-attr-nc><var>n<sub><var>c</var></sub></var></dfn> in the
703      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
# Line 517  following:</p> Line 803  following:</p>
803      <li>If <code>nodeName</code> attribute of <var>n</var> is      <li>If <code>nodeName</code> attribute of <var>n</var> is
804      <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>      <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
805      and <span class=ed>its declared type is different from (default|preserve),      and <span class=ed>its declared type is different from (default|preserve),
806      (default), or (preserve)</span>, then raise an      (preserve|default), (default), or (preserve)</span>, then raise an
807      <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>      <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
808      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
809      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
# Line 547  following:</p> Line 833  following:</p>
833  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
834    <dd>    <dd>
835      <ol>      <ol>
836      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
837      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
838      data</a>.</li>      data</a>.</li>
839      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
# Line 563  following:</p> Line 849  following:</p>
849      <ol>      <ol>
850      <li>Raise an      <li>Raise an
851      <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>      <a href="#round-trip-warning" id=rtw-comment><code>round-trip-warning</code></a>.</li>
852      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
853      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
854      data</a>.</li>      data</a>.</li>
855      <li>If the <code>data</code> attribute value contains      <li>If the <code>data</code> attribute value contains
# Line 630  following:</p> Line 916  following:</p>
916  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>  <dt>If <var>n</var> is a <code>DocumentType</code> node</dt>
917    <dd>    <dd>
918      <ol>      <ol>
919      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
920      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
921      <li>Follow the following substeps:      <li>Follow the following substeps:
922        <ol>        <ol>
# Line 649  following:</p> Line 935  following:</p>
935      </li>      </li>
936      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
937      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
938      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
939      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
940      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
941      <!-- @@ "It is an error for a fragment identifier (beginning with a # character) to be part of a system identifier." -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
942      <!-- publicId = non-null and systemId = null -->      value of <var>n</var> is <code>null</code>, then raise an
943        <a href="#xml-well-formedness-error" id=wfe-dt-sid><code>xml-well-formedness-error</code></a>.
944        <span class=ed>@@ publicId == null?  Or, publicId == ""</span></li>
945      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
946      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
947        <ol>        <ol>
# Line 665  following:</p> Line 953  following:</p>
953        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
954        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
955        </ol></li>        </ol></li>
956      <li class=ed>@@ <code>entities</code>, <code>notations</code>,      <li>For each node in the <code>entities</code>, <code>notations</code>,
957      <code>elementTypes</code>, externally declared?</li>      and <code>elementTypes</code> lists of <var>n</var>,
958        <a href="#algorithm-to-check-a-node" title="check a node">check the
959        node</a> recursively.</li>
960        <li class=ed>@@ externally declared?</li>
961        <li>If the <code>NamedNodeMap</code> object in the <code>entities</code>
962        attribute of <var>n</var> does not contain <code>Entity</code> nodes
963        whose <code>nodeName</code> attribute are <code>amp</code>,
964        <code>lt</code>, <code>gt</code>, <code>apos</code>, and <code>quot</code>
965        then raise
966        <a href="#xml-misc-recommentation" id=xmr-predefined><code>xml-misc-recommendation</code></a>(s).
967      </ol>      </ol>
968    </dd>    </dd>
969  <dt>If <var>n</var> is an <code>Element</code> node</dt>  <dt>If <var>n</var> is an <code>Element</code> node</dt>
970    <dd>    <dd>
971      <ol>      <ol>
972      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
973      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>      <code>localName</code> attribute value as an <code>NCName</code></a>.</li>
974      <li>If the <code>prefix</code> attribute value is different from      <li>If the <code>prefix</code> attribute value is different from
975      <code>null</code>, then <a href="#algorithm-to-validate-a-ncname">validate      <code>null</code>, then <a href="#algorithm-to-validate-an-ncname">validate
976      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>      the <code>prefix</code> attribute value as an <code>NCName</code></a>.</li>
977      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-element-nc><var>n<sub><var>c</var></sub></var></dfn>
978      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 694  following:</p> Line 991  following:</p>
991        <var>n<sub><var>c</var></sub></var></a> recursively.</li>        <var>n<sub><var>c</var></sub></var></a> recursively.</li>
992        </ol></li>        </ol></li>
993      <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>      <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
994      <li class=ed>@@ #uniqattspec Well-formedness constraint: Unique Att Spec</li>      <li>Let <var>attrs</var> be the value of the <code>attribute</code>
995      <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>      attribute of <var>n</var>.  Check conformance of <var>attrs</var>
996        as following:
997          <ol>
998          <li>If <var>attrs</var> contains an <code>Attr</code> node whose
999          <code>nodeName</code> attribute value is equal to that of another
1000          <code>Attr</code> node in <var>attrs</var>, then raise an
1001          <a href="#xml-well-formedness-error" id=wfe-uniqattspec><code>xml-well-formedness-error</code></a>.<!--
1002          #uniqattspec Well-formedness constraint: Unique Att Spec --></li>
1003          <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
1004          </ol>
1005        </li>
1006      </ol>      </ol>
1007    </dd>    </dd>
1008  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
# Line 723  following:</p> Line 1030  following:</p>
1030      <li class=ed>"For interoperability, the same Nmtoken  SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>      <li class=ed>"For interoperability, the same Nmtoken  SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
1031      </ol>      </ol>
1032    </dd>    </dd>
1033  <dt>If <var>n</var> is an <code>Entity</code> node</dt>  <dt>If <var>n</var> is an <code>Entity</code> node whose
1034    <code>notationName</code> attribute value is <code>null</code> (i.e. a
1035    parsed entity)</dt>
1036    <dd>    <dd>
1037      <ol>      <ol>
1038      <li>An <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a> <span class=ed>@@ if !notationName</span>.</li>      <li>Raise an
1039      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <a href="#entity-error" id=ee-entity-node><code>entity-error</code></a>.</li>
1040        <li><a href="#algorithm-to-validate-an-ncname">Validate the
1041      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1042      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1043      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
1044      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1045      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
1046      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <li>If the <code>publicId</code> attribute value of <var>n</var> is
1047      <!-- publicId = non-null and systemId = null -->      <em>not</em> <code>null</code> and the <code>systemId</code> attribute
1048      <li class=ed>@@ <code>notationName</code></li>      value of <var>n</var> is <code>null</code>, then raise an
1049      <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>      <a href="#xml-well-formedness-error" id=wfe-entity-sid><code>xml-well-formedness-error</code></a>.</li>
1050      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
1051      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
1052        <ol>        <ol>
# Line 755  following:</p> Line 1065  following:</p>
1065        </ol></li>        </ol></li>
1066      </ol>      </ol>
1067    </dd>    </dd>
1068    <dt>If <var>n</var> is an <code>Entity</code> node whose
1069    <code>notationName</code> attribute value is <em>not</em> <code>null</code>
1070    (i.e. an unparsed entity)</dt>
1071      <dd>
1072        <ol>
1073        <li><a href="#algorithm-to-validate-an-ncname">Validate the
1074        <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1075        <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1076        the <code>publicId</code> attribute value as a public identifier</a>.</li>
1077        <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1078        the <code>systemId</code> attribute value as a system identifier</a>.</li>
1079        <li>If the <code>systemId</code> attribute value of <var>n</var> is
1080        <code>null</code>, then raise an
1081        <a href="#xml-well-formedness-error" id=wfe-uentity-sid><code>xml-well-formedness-error</code></a>.</li>
1082        <li><a href="#algorithm-to-validate-an-ncname">Validate the
1083        <code>notationName</code> attribute value of <var>n</var> as an
1084        <code>NCName</code></a>.</li>
1085        <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
1086        <li>If the <code>childNodes</code> list of <var>n</var> contains
1087        any nodes, they are in
1088        <a href="#xml-well-formedness-error" id=wfe-uentity-child><code>xml-well-formedness-error</code></a>.</li>
1089        </ol>
1090      </dd>
1091  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>  <dt>If <var>n</var> is an <code>EntityReference</code> node</dt>
1092    <dd>    <dd>
1093      <ol>      <ol>
1094      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>      <li>An <a href="#entity-error" id=ee-er-node><code>entity-error</code></a>.</li>
1095      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
1096      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1097      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-er-nc><var>n<sub><var>c</var></sub></var></dfn>
1098      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 782  following:</p> Line 1115  following:</p>
1115  <dt>If <var>n</var> is a <code>Notation</code> node</dt>  <dt>If <var>n</var> is a <code>Notation</code> node</dt>
1116    <dd>    <dd>
1117      <ol>      <ol>
1118      <li><a href="#algorithm-to-validate-a-ncname">Validate the      <li><a href="#algorithm-to-validate-an-ncname">Validate the
1119      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>      <code>nodeName</code> attribute value as an <code>NCName</code></a>.</li>
1120      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate      <li><a href="#algorithm-to-validate-a-public-identifier" title="validate a public identifier">Validate
1121      the <code>publicId</code> attribute value as a public identifier</a>.</li>      the <code>publicId</code> attribute value as a public identifier</a>.</li>
1122      <li>If the <code>systemId</code> attribute value contains      <li><a href="#algorithm-to-validate-a-system-identifier" title="validate a system identifier">Validate
1123      both <code>"</code> and <code>'</code> characters, it is a      the <code>systemId</code> attribute value as a system identifier</a>.</li>
     <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>  
1124      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
1125      any nodes, they are in      any nodes, they are in
1126      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-notation-child><code>xml-well-formedness-error</code></a>.</li>
# Line 800  following:</p> Line 1132  following:</p>
1132      <li>If the <code>target</code> attribute value matches to the string      <li>If the <code>target</code> attribute value matches to the string
1133      <code>xml</code> in any case combination, then raise a      <code>xml</code> in any case combination, then raise a
1134      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-pi-xml><code>xml-well-formedness-error</code></a>.</li>
1135      <li>Otherwise, <a href="#algorithm-to-validate-a-ncname">validate the      <li>Otherwise, <a href="#algorithm-to-validate-an-ncname">validate the
1136      <code>target</code> attribute value as an <code>NCName</code></a>.</li>      <code>target</code> attribute value as an <code>NCName</code></a>.</li>
1137      <li>Then,      <li>Then,
1138      <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML character data">validate      <a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">validate
1139      the <code>data</code> attribute value as an <abbr>XML</abbr> character      the <code>data</code> attribute value as an <abbr>XML</abbr> character
1140      data</a>.</li>      data</a>.</li>
1141      <li>If the <code>data</code> attribute value contains a string      <li>If the <code>data</code> attribute value contains a string
# Line 824  following:</p> Line 1156  following:</p>
1156  <dt>If <var>n</var> is a <code>Text</code> node</dt>  <dt>If <var>n</var> is a <code>Text</code> node</dt>
1157    <dd>    <dd>
1158      <ol>      <ol>
1159      <li><a href="#algorithm-to-validate-an-xml-character-data">Validate the      <li><a href="#algorithm-to-validate-an-xml-character-data" title="validate an XML string">Validate the
1160      <code>data</code> attribute value as an <abbr>XML</abbr> character      <code>data</code> attribute value as an <abbr>XML</abbr> character
1161      data</a>.</li>      data</a>.</li>
1162      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
# Line 908  as amended by Line 1240  as amended by
1240  <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>  <a href="http://www.w3.org/XML/xml-V10-4e-errata#E02">errata</a>.</p>
1241  </div>  </div>
1242    
1243    <p>The character class <dfn id=class-NameStartChar10><code>NameStartChar10</code></dfn>
1244    contains the following characters:</p>
1245    <ul class=ed>
1246    </ul>
1247    <div class="note memo">
1248    <p>This character class contains all characters allowed as the first character
1249    of a string matching to the production rule
1250    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1251    of <abbr>XML</abbr> 1.0
1252    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1253    </div>
1254    
1255    <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1256    contains the following characters:</p>
1257    <ul>
1258    <li>The characters in the character class
1259    <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1260    <li class=ed></li>
1261    </ul>
1262    <div class="note memo">
1263    <p>This character class contains all characters allowed as the second
1264    character of a string matching to the production rule
1265    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1266    of <abbr>XML</abbr> 1.0
1267    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1268    </div>
1269    
1270    <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1271    contains the following characters:</p>
1272    <ul>
1273    <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1274    TABULATION</code></li>
1275    <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1276    <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1277    RETURN</code></li>
1278    <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1279    <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1280    (<code class=char>!</code>)</li>
1281    <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1282    (<code class=char>$</code>)</li>
1283    <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1284    (<code class=char>#</code>)</li>
1285    <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1286    (<code class=char>%</code>)</li>
1287    <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1288    (<code class=char>'</code>)</li>
1289    <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1290    (<code class=char>(</code>)</li>
1291    <li><code class=char>U+0029</code> <code class=charname>RIGHT
1292    PARENTHESIS</code> (<code class=char>)</code>)</li>
1293    <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1294    (<code class=char>*</code>)</li>
1295    <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1296    (<code class=char>+</code>)</li>
1297    <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1298    (<code class=char>,</code>)</li>
1299    <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1300    (<code class=char>-</code>)</li>
1301    <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1302    (<code class=char>.</code>)</li>
1303    <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1304    (<code class=char>/</code>)</li>
1305    <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1306    (<code class=char>0</code>) .. <code class=char>U+0039</code>
1307    <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1308    <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1309    (<code class=char>:</code>)</li>
1310    <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1311    (<code class=char>;</code>)</li>
1312    <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1313    (<code class=char>=</code>)</li>
1314    <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1315    (<code class=char>?</code>)</li>
1316    <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1317    (<code class=char>@</code>)</li>
1318    <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1319    A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1320    <code class=charname>LATIN CAPITAL LETTER Z</code>
1321    (<code class=char>Z</code>)</li>
1322    <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1323    (<code class=char>_</code>)</li>
1324    <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1325    A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1326    <code class=charname>LATIN CAPITAL LETTER Z</code>
1327    (<code class=char>Z</code>)</li>
1328    </ul>
1329    <div class="note memo">
1330    <p>This character class contains all characters allowed in the production rule
1331    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1332    of <abbr>XML</abbr> 1.0
1333    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1334    </div>
1335    
1336  </div>  </div>
1337    
1338  <div id="references" class="section reference">  <div id="references" class="section reference">

Legend:
Removed from v.1.17  
changed lines
  Added in v.1.26

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24