18 |
|
|
19 |
<div class="header"> |
<div class="header"> |
20 |
<h1>manakai's <abbr>XML</abbr> Conformance Checking</h1> |
<h1>manakai's <abbr>XML</abbr> Conformance Checking</h1> |
21 |
<h2>Working Draft <time datetime=2007-11-14>14 November 2007</time></h2> |
<h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2> |
22 |
|
|
23 |
<dl class="versions-uri"> |
<dl class="versions-uri"> |
24 |
<dt>This Version</dt> |
<dt>This Version</dt> |
135 |
can be easily serialized into a valid XML document.</p> |
can be easily serialized into a valid XML document.</p> |
136 |
</div> |
</div> |
137 |
|
|
138 |
|
<p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>, |
139 |
|
...</p> |
140 |
|
|
141 |
<p>Errors are classified into these |
<p>Errors are classified into these |
142 |
<dfn id=error-category title="error category">error categories</dfn>:</p> |
<dfn id=error-category title="error category">error categories</dfn>:</p> |
143 |
|
|
225 |
<a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!-- |
<a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!-- |
226 |
"strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11]. |
"strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11]. |
227 |
--></li> |
--></li> |
228 |
|
<li>If there is an element type declaration whose <code>Name</code> |
229 |
|
value is already declared, then raise an |
230 |
|
<a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li> |
231 |
<li>If attribute definition whose <code>Name</code> is |
<li>If attribute definition whose <code>Name</code> is |
232 |
<code>xml:space</code> has <span class=ed>declared type different from |
<code>xml:space</code> has <span class=ed>declared type different from |
233 |
(default|preserve), (default), or (preserve)</span>, then raise an |
(default|preserve), (default), or (preserve)</span>, then raise an |
235 |
<span class=ed>@@ duplication with |
<span class=ed>@@ duplication with |
236 |
<a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!-- |
<a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!-- |
237 |
<!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li> |
<!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li> |
238 |
|
<li>If an empty-element tag is used for an element which is <em>not</em> |
239 |
|
declared <code>EMPTY</code>, then raise an |
240 |
|
<a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li> |
241 |
|
<li>If an empty-element tag is <em>not</em> used for an element which is |
242 |
|
declared <code>EMPTY</code>, then raise an |
243 |
|
<a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li> |
244 |
<!-- |
<!-- |
245 |
|
|
246 |
#vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting |
#vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting |
247 |
#wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset |
#wfc-PEinInternalSubset Well-formedness constraint: PEs in Internal Subset |
248 |
#ExtSubset Well-formedness constraint: External Subset |
#ExtSubset Well-formedness constraint: External Subset |
249 |
#PE-between-Decls Well-formedness constraint: PE Between Declarations |
#PE-between-Decls Well-formedness constraint: PE Between Declarations |
250 |
|
#GIMatch Well-formedness constraint: Element Type Match |
251 |
|
#uniqattspec Well-formedness constraint: Unique Att Spec |
252 |
|
#NoExternalRefs Well-formedness constraint: No External Entity References |
253 |
|
#CleanAttrVals Well-formedness constraint: No < in Attribute Values |
254 |
|
#vc-PEinGroup Validity constraint: Proper Group/PE Nesting |
255 |
|
"For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)." |
256 |
|
"For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error." |
257 |
|
#condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting |
258 |
|
#wf-Legalchar Well-formedness constraint: Legal Character |
259 |
|
#textent Well-formedness constraint: Parsed Entity |
260 |
|
#norecursion Well-formedness constraint: No Recursion |
261 |
|
#indtd Well-formedness constraint: In DTD |
262 |
|
"External parsed entities SHOULD each begin with a text declaration." |
263 |
|
"It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration." |
264 |
|
#UniqueNotationName Validity constraint: Unique Notation Name |
265 |
|
|
266 |
@@ Need detailed review, but maybe should be in parsing phase |
@@ Need detailed review, but maybe should be in parsing phase |
267 |
|
|
268 |
#vc-check-rmd Validity constraint: Standalone Document Declaration |
#vc-check-rmd Validity constraint: Standalone Document Declaration |
269 |
|
|
270 |
|
@@ Need dtailed review |
271 |
|
|
272 |
|
#wf-entdeclared Well-formedness constraint: Entity Declared |
273 |
|
#vc-entdeclared Validity constraint: Entity Declared |
274 |
|
"For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities." |
275 |
|
"If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)." |
276 |
|
|
277 |
|
@@ flaged and then reported in DOM check phase |
278 |
|
|
279 |
|
"Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark" |
280 |
|
"In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration" |
281 |
|
"In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix." |
282 |
|
|
283 |
|
@@ in parsing phase |
284 |
|
|
285 |
|
"It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding." |
286 |
|
|
287 |
--> |
--> |
288 |
</ul> |
</ul> |
289 |
</div> |
</div> |
355 |
<span class=ed>@@ except for attribute names <code>xml:lang</code>, |
<span class=ed>@@ except for attribute names <code>xml:lang</code>, |
356 |
<code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>, |
<code>xml:space</code>, <code>xml:base</code>, <code>xml:id</code>, |
357 |
<code>xmlns</code>, <code>xmlns:<var>*</var></code>, |
<code>xmlns</code>, <code>xmlns:<var>*</var></code>, |
358 |
pi name <code>xml-stylesheet</code>.</span></li> |
pi name <code>xml-stylesheet</code>.</span><!-- |
359 |
|
"names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.": |
360 |
|
xml:base and xml:id specifications violate to this sentense! |
361 |
|
--></li> |
362 |
<!-- COLON in Name is a lowercase "should not" in a Note of [XML10], |
<!-- COLON in Name is a lowercase "should not" in a Note of [XML10], |
363 |
[XML11]. --> |
[XML11]. --> |
364 |
<li class=ed>@@</li> |
<li class=ed>@@</li> |
437 |
successor] or an empty string, then xml-misc-warning |
successor] or an empty string, then xml-misc-warning |
438 |
(a "fact"-level error; not an XML error).</li> |
(a "fact"-level error; not an XML error).</li> |
439 |
<li class=ed>@@ <code>specified</code>, |
<li class=ed>@@ <code>specified</code>, |
440 |
<code>manakaiAttributeType</code></li> |
<code>manakaiAttributeType</code> (#ValueType Validity constraint: Attribute Value Type)</li> |
441 |
<!-- xml:space MUST be declared to be valid. --> |
<!-- xml:space MUST be declared to be valid. --> |
442 |
|
<li>Let <dfn id=var-v><var>v</var></dfn> be the value of the |
443 |
|
attribute <code>value</code> of <var>n</var>.</li> |
444 |
|
<li>Validate the <var>n</var> against the <span class=ed>declared |
445 |
|
type</span> as following: |
446 |
|
<dl class=switch> |
447 |
|
<dt class=ed><code>ID_ATTR</code></dt> |
448 |
|
<dd> |
449 |
|
<ol> |
450 |
|
<li><span class=ed>Validate <var>v</var> as an |
451 |
|
<code>Name</code>.</span> If it fails, then raise an |
452 |
|
<a href="#xml-validity-error" id=vc-id-name><code>xml-validity-error</code></a>.</li> |
453 |
|
<li><span class=ed>If <code>ID</code> <var>v</var> is defined,</span> |
454 |
|
then raise an |
455 |
|
<a href="#xml-validity-error" id=vc-id-duplication><code>xml-validity-error</code></a>.</li> |
456 |
|
</ol> |
457 |
|
</dd> |
458 |
|
<dt class=ed>IDREF_ATTR</dt> |
459 |
|
<dd> |
460 |
|
<ol> |
461 |
|
<li><span class=ed>Validate <var>v</var> as an |
462 |
|
<code>Name</code>.</span> If it fails, then raise an |
463 |
|
<a href="#xml-validity-error" id=vc-idref-name><code>xml-validity-error</code></a>.</li> |
464 |
|
<li><span class=ed>If <code>ID</code> <var>v</var> is NOT |
465 |
|
defined,</span> then raise an |
466 |
|
<a href="#xml-validity-error" id=vc-idref-match><code>xml-validity-error</code></a>.</li> |
467 |
|
</ol> |
468 |
|
</dd> |
469 |
|
<dt class=ed>IDREFS_ATTR</dt> |
470 |
|
<dd class=ed>@@</dd> |
471 |
|
<dt class=ed>ENTITY_ATTR</dt> |
472 |
|
<dd> |
473 |
|
<ol> |
474 |
|
<li><span class=ed>Validate <var>v</var> as an |
475 |
|
<code>Name</code>.</span> If it fails, then raise an |
476 |
|
<a href="#xml-validity-error" id=vc-entname-name><code>xml-validity-error</code></a>.</li> |
477 |
|
<li><span class=ed>If <code>Entity</code> <var>v</var> is NOT |
478 |
|
defined,</span> then raise an |
479 |
|
<a href="#xml-validity-error" id=vc-entname-match><code>xml-validity-error</code></a>.</li> |
480 |
|
</ol> |
481 |
|
</dd> |
482 |
|
<dt class=ed>ENTITIES_ATTR</dt> |
483 |
|
<dd class=ed>@@</dd> |
484 |
|
<dt class=ed>NMTOKEN_ATTR</dt> |
485 |
|
<dd> |
486 |
|
<ol> |
487 |
|
<li><span class=ed>Validate <var>v</var> as an |
488 |
|
<code>Nmtoken</code>.</span> If it fails, then raise an |
489 |
|
<a href="#xml-validity-error" id=vc-nmtok-name><code>xml-validity-error</code></a>.</li> |
490 |
|
</ol> |
491 |
|
</dd> |
492 |
|
<dt class=ed>NMTOKENS_ATTR</dt> |
493 |
|
<dd class=ed>@@</dd> |
494 |
|
<dt class=ed>NOTATION_ATTR</dt> |
495 |
|
<dd class=ed><var>v</var> must be one of enumerated values. |
496 |
|
If not, then raise an |
497 |
|
<a href="#xml-validity-error" id=vc-notatn-match><code>xml-validity-error</code></a>.</dd> |
498 |
|
<dt class=ed>ENUMERATED_ATTR</dt> |
499 |
|
<dd class=ed><var>v</var> must be one of enumerated values. |
500 |
|
If not, then raise an |
501 |
|
<a href="#xml-validity-error" id=vc-enum><code>xml-validity-error</code></a>.</dd> |
502 |
|
<dt class=ed>@@</dt> |
503 |
|
</dl> |
504 |
|
</li> |
505 |
|
<li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then |
506 |
|
raise an |
507 |
|
<a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li> |
508 |
|
<li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li> |
509 |
|
<li class=ed>@@ strict serialization error for U+000D, U+000A, and |
510 |
|
U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li> |
511 |
</ol> |
</ol> |
512 |
</dd> |
</dd> |
513 |
<dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt> |
<dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt> |
532 |
<a href="#algorithm-to-check-a-node" title="check a node">check |
<a href="#algorithm-to-check-a-node" title="check a node">check |
533 |
<var>n<sub><var>c</var></sub></var></a> recusrively.</li> |
<var>n<sub><var>c</var></sub></var></a> recusrively.</li> |
534 |
</ol></li> |
</ol></li> |
535 |
|
<li class=ed>If <code>NOTATION_ATTR</code>, enumerated values MUST |
536 |
|
be declared. If not, then raise an |
537 |
|
<a href="#xml-validity-error" id=vc-notatn-declared><code>xml-validity-error</code></a>.</li> |
538 |
|
<li class=ed>If <code>NOTATION_ATTR</code> or <code>ENUMERATED_ATTR</code>, |
539 |
|
values MUST all be distinct. If not, then raise an |
540 |
|
<a href="#xml-validity-error" id=vc-no-duplicate-tokens><code>xml-validity-error</code></a>.</li> |
541 |
|
<li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code> |
542 |
|
element, then raise an |
543 |
|
<a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li> |
544 |
|
<li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li> |
545 |
</ol> |
</ol> |
546 |
</dd> |
</dd> |
547 |
<dt>If <var>n</var> is a <code>CDATASection</code> node</dt> |
<dt>If <var>n</var> is a <code>CDATASection</code> node</dt> |
652 |
<li>If the <code>systemId</code> attribute value contains |
<li>If the <code>systemId</code> attribute value contains |
653 |
both <code>"</code> and <code>'</code> characters, it is a |
both <code>"</code> and <code>'</code> characters, it is a |
654 |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
655 |
|
<!-- @@ "It is an error for a fragment identifier (beginning with a # character) to be part of a system identifier." --> |
656 |
<!-- publicId = non-null and systemId = null --> |
<!-- publicId = non-null and systemId = null --> |
657 |
<li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn> |
<li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn> |
658 |
in the <code>childNodes</code> list of <var>n</var>, |
in the <code>childNodes</code> list of <var>n</var>, |
693 |
<a href="#algorithm-to-check-a-node" title="check a node">check |
<a href="#algorithm-to-check-a-node" title="check a node">check |
694 |
<var>n<sub><var>c</var></sub></var></a> recursively.</li> |
<var>n<sub><var>c</var></sub></var></a> recursively.</li> |
695 |
</ol></li> |
</ol></li> |
696 |
|
<li class=ed>@@ #elementvalid Validity constraint: Element Valid</li> |
697 |
|
<li class=ed>@@ #uniqattspec Well-formedness constraint: Unique Att Spec</li> |
698 |
|
<li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li> |
699 |
</ol> |
</ol> |
700 |
</dd> |
</dd> |
701 |
<dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt> |
<dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt> |
705 |
<li>If the <code>childNodes</code> list of <var>n</var> contains |
<li>If the <code>childNodes</code> list of <var>n</var> contains |
706 |
any nodes, they are in |
any nodes, they are in |
707 |
<a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li> |
<a href="#xml-well-formedness-error" id=wfe-et-child><code>xml-well-formedness-error</code></a>.</li> |
708 |
|
<li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#elemdecls">At user option, an XML processor MAY issue a warning when a declaration mentions an element type for which no declaration is provided, but this is not an error.</q></li> |
709 |
|
<li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#sec-element-content">For compatibility, it is an error if the content model allows an element to match more than one occurrence of an element type in the content model. </q></li> |
710 |
|
<li class=ed>@@ #vc-MixedChildrenUnique |
711 |
|
Validity constraint: No Duplicate Types</li> |
712 |
|
<li class=ed>@@ <q cite="http://www.w3.org/TR/2006/REC-xml-20060816/#attdecls">At user option, an XML processor MAY issue a warning if attributes are declared for an element type not itself declared, but this is not an error.</q></li> |
713 |
|
<li>If there is more than one <code>AttributeDefinition</code> node |
714 |
|
with <span class=ed>attribute type <code>ID</code></span> in the |
715 |
|
<code>NamedNodeMap</code> list contained in the |
716 |
|
<code>attributeDefinitions</code> attribute of <var>n</var>, then raise an |
717 |
|
<a href="#xml-validity-error" id=vc-one-id-per-el><code>xml-validity-error</code></a>.</li> |
718 |
|
<li>If there is more than one <code>AttributeDefinition</code> node |
719 |
|
with <span class=ed>attribute type <code>NOTATION</code></span> in the |
720 |
|
<code>NamedNodeMap</code> list contained in the |
721 |
|
<code>attributeDefinitions</code> attribute of <var>n</var>, then raise an |
722 |
|
<a href="#xml-validity-error" id=vc-one-notation-per><code>xml-validity-error</code></a>.</li> |
723 |
|
<li class=ed>"For interoperability, the same Nmtoken SHOULD NOT occur more than once in the enumerated attribute types of a single element type."</li> |
724 |
</ol> |
</ol> |
725 |
</dd> |
</dd> |
726 |
<dt>If <var>n</var> is an <code>Entity</code> node</dt> |
<dt>If <var>n</var> is an <code>Entity</code> node</dt> |
736 |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
737 |
<!-- publicId = non-null and systemId = null --> |
<!-- publicId = non-null and systemId = null --> |
738 |
<li class=ed>@@ <code>notationName</code></li> |
<li class=ed>@@ <code>notationName</code></li> |
739 |
|
<li class=ed>@@ #not-declared Validity constraint: Notation Declared</li> |
740 |
<li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn> |
<li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn> |
741 |
in the <code>childNodes</code> list of <var>n</var>, |
in the <code>childNodes</code> list of <var>n</var>, |
742 |
<ol> |
<ol> |