/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Diff of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.13 by wakaba, Wed Nov 14 12:28:40 2007 UTC revision 1.17 by wakaba, Sat Dec 1 12:19:41 2007 UTC
# Line 18  Line 18 
18    
19  <div class="header">  <div class="header">
20  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21  <h2>Working Draft <time datetime=2007-11-14>14 November 2007</time></h2>  <h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2>
22    
23  <dl class="versions-uri">  <dl class="versions-uri">
24  <dt>This Version</dt>  <dt>This Version</dt>
# Line 120  In addition, the order in which <a href= Line 120  In addition, the order in which <a href=
120  raised is undefined.</p>  raised is undefined.</p>
121  </div>  </div>
122    
 <div class=section id=parsing-xml>  
 <h2>Parsing <abbr>XML</abbr> Document</h2>  
   
 <ul>  
 <li>If the <abbr>XML</abbr> document does not begin with an  
 <abbr>XML</abbr> declaration, then raise an  
 <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>  
 <li>If the replacement text of an entity declaration is  
 <code>&lt;</code>, then raise an  
 <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--  
 "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].  
 --></li>  
 <li>If attribute definition whose <code>Name</code> is  
 <code>xml:space</code> has <span class=ed>declared type different from  
 (default|preserve), (default), or (preserve)</span>, then raise an  
 <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.  
 <span class=ed>@@ duplication with  
 <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--  
 <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>  
 <!--  
   
 #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting  
 #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset  
 #ExtSubset Well-formedness constraint: External Subset  
 #PE-between-Decls Well-formedness constraint: PE Between Declarations  
   
 @@ Need detailed review, but maybe should be in parsing phase  
   
 #vc-check-rmd Validity constraint: Standalone Document Declaration  
   
 -->  
 </ul>  
 </div>  
   
 <div class="section" id=checking-dom>  
 <h2>Checking <abbr>DOM</abbr></h2>  
   
 <p>The following algorithms and definitions are applied to  
 <abbr>XML</abbr> documents; especially, they are not applied  
 to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>  
123    
124  <div class=section id=error-categories>  <div class=section id=error-categories>
125  <h3>Error Classification</h3>  <h2>Error Classification</h2>
126    
127  <p class=ed>If a <code>Document</code> node has no  <p class=ed>If a <code>Document</code> node has no
128  xml-well-formedness-error, entity-error, and unknown-error,  xml-well-formedness-error, entity-error, and unknown-error,
# Line 175  into a well$B!>(Bformed XML document. Line 135  into a well$B!>(Bformed XML document.
135  can be easily serialized into a valid XML document.</p>  can be easily serialized into a valid XML document.</p>
136  </div>  </div>
137    
138    <p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>,
139    ...</p>
140    
141  <p>Errors are classified into these  <p>Errors are classified into these
142  <dfn id=error-category title="error category">error categories</dfn>:</p>  <dfn id=error-category title="error category">error categories</dfn>:</p>
143    
# Line 250  can be easily serialized into a valid XM Line 213  can be easily serialized into a valid XM
213    
214  </div>  </div>
215    
216    <div class=section id=parsing-xml>
217    <h2>Parsing <abbr>XML</abbr> Document</h2>
218    
219    <ul>
220    <li>If the <abbr>XML</abbr> document does not begin with an
221    <abbr>XML</abbr> declaration, then raise an
222    <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>
223    <li>If the replacement text of an entity declaration is
224    <code>&lt;</code>, then raise an
225    <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
226    "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
227    --></li>
228    <li>If there is an element type declaration whose <code>Name</code>
229    value is already declared, then raise an
230    <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li>
231    <li>If attribute definition whose <code>Name</code> is
232    <code>xml:space</code> has <span class=ed>declared type different from
233    (default|preserve), (default), or (preserve)</span>, then raise an
234    <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.
235    <span class=ed>@@ duplication with
236    <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--
237    <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>
238    <li>If an empty-element tag is used for an element which is <em>not</em>
239    declared <code>EMPTY</code>, then raise an
240    <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li>
241    <li>If an empty-element tag is <em>not</em> used for an element which is
242    declared <code>EMPTY</code>, then raise an
243    <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li>
244    <!--
245    
246    #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
247    #wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset
248    #ExtSubset Well-formedness constraint: External Subset
249    #PE-between-Decls Well-formedness constraint: PE Between Declarations
250    #GIMatch Well-formedness constraint: Element Type Match
251    #uniqattspec Well-formedness constraint: Unique Att Spec
252    #NoExternalRefs Well-formedness constraint: No External Entity References
253    #CleanAttrVals Well-formedness constraint: No < in Attribute Values
254    #vc-PEinGroup Validity constraint: Proper Group/PE Nesting
255    "For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)."
256    "For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error."
257    #condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting
258    #wf-Legalchar Well-formedness constraint: Legal Character
259    #textent Well-formedness constraint: Parsed Entity
260    #norecursion Well-formedness constraint: No Recursion
261    #indtd Well-formedness constraint: In DTD
262    "External parsed entities SHOULD each begin with a text declaration."
263    "It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration."
264    #UniqueNotationName Validity constraint: Unique Notation Name
265    
266    @@ Need detailed review, but maybe should be in parsing phase
267    
268    #vc-check-rmd Validity constraint: Standalone Document Declaration
269    
270    @@ Need dtailed review
271    
272    #wf-entdeclared Well-formedness constraint: Entity Declared
273    #vc-entdeclared Validity constraint: Entity Declared
274    "For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities."
275    "If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)."
276    
277    @@ flaged and then reported in DOM check phase
278    
279    "Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark"
280    "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration"
281    "In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix."
282    
283    @@ in parsing phase
284    
285    "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
286    
287    -->
288    </ul>
289    </div>
290    
291    <div class="section" id=checking-dom>
292    <h2>Checking <abbr>DOM</abbr></h2>
293    
294    <p>The following algorithms and definitions are applied to
295    <abbr>XML</abbr> documents; especially, they are not applied
296    to <abbr>HTML</abbr> documents.<!-- TODO: ref to HTML5 --></p>
297    
298  <div class=section id=definitions>  <div class=section id=definitions>
299  <h3>Definitions</h3>  <h3>Definitions</h3>
300    
# Line 310  case combination), then raise an Line 355  case combination), then raise an
355  <span class=ed>@@ except for attribute names <code>xml:lang</code>,  <span class=ed>@@ except for attribute names <code>xml:lang</code>,
356  <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,  <code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>,
357  <code>xmlns</code>, <code>xmlns:<var>*</var></code>,  <code>xmlns</code>, <code>xmlns:<var>*</var></code>,
358  pi name <code>xml-stylesheet</code>.</span></li>  pi name <code>xml-stylesheet</code>.</span><!--
359    "names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.":
360    xml:base and xml:id specifications violate to this sentense!
361    --></li>
362  <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],  <!-- COLON in Name is a lowercase "should not" in a Note of [XML10],
363  [XML11]. -->  [XML11]. -->
364  <li class=ed>@@</li>  <li class=ed>@@</li>
# Line 389  following:</p> Line 437  following:</p>
437      successor] or an empty string, then xml-misc-warning      successor] or an empty string, then xml-misc-warning
438      (a "fact"-level error; not an XML error).</li>      (a "fact"-level error; not an XML error).</li>
439      <li class=ed>@@ <code>specified</code>,      <li class=ed>@@ <code>specified</code>,
440      <code>manakaiAttributeType</code></li>      <code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li>
441      <!-- xml:space MUST be declared to be valid. -->      <!-- xml:space MUST be declared to be valid. -->
442        <li>Let <dfn id=var-v><var>v</var></dfn> be the value of the
443        attribute <code>value</code> of <var>n</var>.</li>
444        <li>Validate the <var>n</var> against the <span class=ed>declared
445        type</span> as following:
446          <dl class=switch>
447          <dt class=ed><code>ID_ATTR</code></dt>
448            <dd>
449              <ol>
450              <li><span class=ed>Validate <var>v</var> as an
451              <code>Name</code>.</span>  If it fails, then raise an
452              <a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li>
453              <li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span>
454              then raise an  
455              <a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li>
456              </ol>
457            </dd>
458          <dt class=ed>IDREF_ATTR</dt>
459            <dd>
460              <ol>
461              <li><span class=ed>Validate <var>v</var> as an
462              <code>Name</code>.</span>  If it fails, then raise an
463              <a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li>
464              <li><span class=ed>If <code>ID</code> <var>v</var> is NOT
465              defined,</span> then raise an  
466              <a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li>
467              </ol>
468            </dd>
469          <dt class=ed>IDREFS_ATTR</dt>
470            <dd class=ed>@@</dd>
471          <dt class=ed>ENTITY_ATTR</dt>
472            <dd>
473              <ol>
474              <li><span class=ed>Validate <var>v</var> as an
475              <code>Name</code>.</span>  If it fails, then raise an
476              <a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li>
477              <li><span class=ed>If <code>Entity</code> <var>v</var> is NOT
478              defined,</span> then raise an  
479              <a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li>
480              </ol>
481            </dd>
482          <dt class=ed>ENTITIES_ATTR</dt>
483            <dd class=ed>@@</dd>
484          <dt class=ed>NMTOKEN_ATTR</dt>
485            <dd>
486              <ol>
487              <li><span class=ed>Validate <var>v</var> as an
488              <code>Nmtoken</code>.</span>  If it fails, then raise an
489              <a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li>
490              </ol>
491            </dd>
492          <dt class=ed>NMTOKENS_ATTR</dt>
493            <dd class=ed>@@</dd>
494          <dt class=ed>NOTATION_ATTR</dt>
495            <dd class=ed><var>v</var> must be one of enumerated values.
496            If not, then raise an
497            <a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd>
498          <dt class=ed>ENUMERATED_ATTR</dt>
499            <dd class=ed><var>v</var> must be one of enumerated values.
500            If not, then raise an
501            <a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd>
502          <dt class=ed>@@</dt>
503          </dl>
504        </li>
505        <li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then
506        raise an
507        <a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li>
508        <li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li>
509        <li class=ed>@@ strict serialization error for U+000D, U+000A, and
510        U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li>
511      </ol>      </ol>
512    </dd>    </dd>
513  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt>
# Line 415  following:</p> Line 532  following:</p>
532        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
533        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>        <var>n<sub><var>c</var></sub></var></a> recusrively.</li>
534        </ol></li>        </ol></li>
535        <li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST
536        be declared.  If not, then raise an
537        <a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li>
538        <li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>,
539        values MUST all be distinct.  If not, then raise an
540        <a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li>
541        <li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code>
542        element, then raise an
543        <a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li>
544        <li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li>
545      </ol>      </ol>
546    </dd>    </dd>
547  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>  <dt>If <var>n</var> is a <code>CDATASection</code> node</dt>
# Line 525  following:</p> Line 652  following:</p>
652      <li>If the <code>systemId</code> attribute value contains      <li>If the <code>systemId</code> attribute value contains
653      both <code>"</code> and <code>'</code> characters, it is a      both <code>"</code> and <code>'</code> characters, it is a
654      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>
655        <!-- @@ "It is an error for a fragment identifier (beginning with a # character) to be part of a system identifier." -->
656      <!-- publicId = non-null and systemId = null -->      <!-- publicId = non-null and systemId = null -->
657      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn>
658      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
# Line 565  following:</p> Line 693  following:</p>
693        <a href="#algorithm-to-check-a-node" title="check a node">check        <a href="#algorithm-to-check-a-node" title="check a node">check
694        <var>n<sub><var>c</var></sub></var></a> recursively.</li>        <var>n<sub><var>c</var></sub></var></a> recursively.</li>
695        </ol></li>        </ol></li>
696        <li class=ed>@@ #elementvalid Validity constraint: Element Valid</li>
697        <li class=ed>@@ #uniqattspec Well-formedness constraint: Unique Att Spec</li>
698        <li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li>
699      </ol>      </ol>
700    </dd>    </dd>
701  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>  <dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt>
# Line 574  following:</p> Line 705  following:</p>
705      <li>If the <code>childNodes</code> list of <var>n</var> contains      <li>If the <code>childNodes</code> list of <var>n</var> contains
706      any nodes, they are in      any nodes, they are in
707      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li>
708        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li>
709        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li>
710        <li class=ed>@@ #vc-MixedChildrenUnique
711        Validity constraint: No Duplicate Types</li>
712        <li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li>
713        <li>If there is more than one <code>AttributeDefinition</code> node
714        with <span class=ed>attribute type <code>ID</code></span> in the
715        <code>NamedNodeMap</code> list contained in the
716        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
717        <a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li>
718        <li>If there is more than one <code>AttributeDefinition</code> node
719        with <span class=ed>attribute type <code>NOTATION</code></span> in the
720        <code>NamedNodeMap</code> list contained in the
721        <code>attributeDefinitions</code> attribute of <var>n</var>, then raise an
722        <a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li>
723        <li class=ed>"For interoperability, the same Nmtoken  SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li>
724      </ol>      </ol>
725    </dd>    </dd>
726  <dt>If <var>n</var> is an <code>Entity</code> node</dt>  <dt>If <var>n</var> is an <code>Entity</code> node</dt>
# Line 589  following:</p> Line 736  following:</p>
736      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>      <a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li>
737      <!-- publicId = non-null and systemId = null -->      <!-- publicId = non-null and systemId = null -->
738      <li class=ed>@@ <code>notationName</code></li>      <li class=ed>@@ <code>notationName</code></li>
739        <li class=ed>@@ #not-declared Validity constraint: Notation Declared</li>
740      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>      <li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn>
741      in the <code>childNodes</code> list of <var>n</var>,      in the <code>childNodes</code> list of <var>n</var>,
742        <ol>        <ol>

Legend:
Removed from v.1.13  
changed lines
  Added in v.1.17

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24