18 |
|
|
19 |
<div class="header"> |
<div class="header"> |
20 |
<h1>manakai's <abbr>XML</abbr> Conformance Checking</h1> |
<h1>manakai's <abbr>XML</abbr> Conformance Checking</h1> |
21 |
<h2>Working Draft <time datetime=2007-11-26>26 November 2007</time></h2> |
<h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2> |
22 |
|
|
23 |
<dl class="versions-uri"> |
<dl class="versions-uri"> |
24 |
<dt>This Version</dt> |
<dt>This Version</dt> |
135 |
can be easily serialized into a valid XML document.</p> |
can be easily serialized into a valid XML document.</p> |
136 |
</div> |
</div> |
137 |
|
|
138 |
|
<p class=ed>To be a conforming <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#dt-validating">validating XML processor</a>, |
139 |
|
...</p> |
140 |
|
|
141 |
<p>Errors are classified into these |
<p>Errors are classified into these |
142 |
<dfn id=error-category title="error category">error categories</dfn>:</p> |
<dfn id=error-category title="error category">error categories</dfn>:</p> |
143 |
|
|
254 |
#vc-PEinGroup Validity constraint: Proper Group/PE Nesting |
#vc-PEinGroup Validity constraint: Proper Group/PE Nesting |
255 |
"For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)." |
"For interoperability, if a parameter-entity reference appears in a choice, seq, or Mixed construct, its replacement text SHOULD contain at least one non-blank character, and neither the first nor last non-blank character of the replacement text SHOULD be a connector (| or ,)." |
256 |
"For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error." |
"For interoperability, an XML processor MAY at user option issue a warning when more than one attribute-list declaration is provided for a given element type, or more than one attribute definition is provided for a given attribute, but this is not an error." |
257 |
|
#condsec-nesting Validity constraint: Proper Conditional Section/PE Nesting |
258 |
|
#wf-Legalchar Well-formedness constraint: Legal Character |
259 |
|
#textent Well-formedness constraint: Parsed Entity |
260 |
|
#norecursion Well-formedness constraint: No Recursion |
261 |
|
#indtd Well-formedness constraint: In DTD |
262 |
|
"External parsed entities SHOULD each begin with a text declaration." |
263 |
|
"It is an error for a reference to an unparsed entity to appear in the EntityValue in an entity declaration." |
264 |
|
#UniqueNotationName Validity constraint: Unique Notation Name |
265 |
|
|
266 |
@@ Need detailed review, but maybe should be in parsing phase |
@@ Need detailed review, but maybe should be in parsing phase |
267 |
|
|
268 |
#vc-check-rmd Validity constraint: Standalone Document Declaration |
#vc-check-rmd Validity constraint: Standalone Document Declaration |
269 |
|
|
270 |
|
@@ Need detailed review |
271 |
|
|
272 |
|
#wf-entdeclared Well-formedness constraint: Entity Declared |
273 |
|
#vc-entdeclared Validity constraint: Entity Declared |
274 |
|
"For interoperability, valid documents SHOULD declare the entities amp, lt, gt, apos, quot, in the form specified in 4.6 Predefined Entities." |
275 |
|
"If the entities lt or amp are declared, they MUST be declared as internal entities whose replacement text is a character reference to the respective character (less-than sign or ampersand) being escaped; the double escaping is REQUIRED for these entities so that references to them produce a well-formed result. If the entities gt, apos, or quot are declared, they MUST be declared as internal entities whose replacement text is the single character being escaped (or a character reference to that character; the double escaping here is OPTIONAL but harmless)." |
276 |
|
|
277 |
|
@@ flagged and then reported in DOM check phase |
278 |
|
|
279 |
|
"Entities encoded in UTF-16 MUST and entities encoded in UTF-8 MAY begin with the Byte Order Mark" |
280 |
|
"In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 MUST begin with a text declaration" |
281 |
|
"In an encoding declaration, the values "UTF-8", "UTF-16", "ISO-10646-UCS-2", and "ISO-10646-UCS-4" SHOULD be used for the various encodings and transformations of Unicode / ISO/IEC 10646, the values "ISO-8859-1", "ISO-8859-2", ... "ISO-8859-n" (where n is the part number) SHOULD be used for the parts of ISO 8859, and the values "ISO-2022-JP", "Shift_JIS", and "EUC-JP" SHOULD be used for the various encoded forms of JIS X-0208-1997. It is RECOMMENDED that character encodings registered (as charsets) with the Internet Assigned Numbers Authority [IANA-CHARSETS], other than those just listed, be referred to using their registered names; other encodings SHOULD use names starting with an "x-" prefix." |
282 |
|
|
283 |
|
@@ in parsing phase |
284 |
|
|
285 |
|
"It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding." |
286 |
|
|
287 |
--> |
--> |
288 |
</ul> |
</ul> |
289 |
</div> |
</div> |
502 |
<dt class=ed>@@</dt> |
<dt class=ed>@@</dt> |
503 |
</dl> |
</dl> |
504 |
</li> |
</li> |
505 |
<li>If <span class=ed>default is NOT #IMPLIED or #REQUIRED</span>, then |
<li>If <span class=ed>type ID and default is NOT #IMPLIED or #REQUIRED</span>, then |
506 |
raise an |
raise an |
507 |
<a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li> |
<a href="#xml-validity-error" id=vc-id-default><code>xml-validity-error</code></a>.</li> |
508 |
|
<li class=ed>@@ #FixedAttr Validity constraint: Fixed Attribute Default</li> |
509 |
|
<li class=ed>@@ strict serialization error for U+000D, U+000A, and |
510 |
|
U+0009 characters, leading/trailing U+0020, and U+0020{2,} string?</li> |
511 |
</ol> |
</ol> |
512 |
</dd> |
</dd> |
513 |
<dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt> |
<dt>If <var>n</var> is an <code>AttributeDefinition</code> node</dt> |
541 |
<li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code> |
<li class=ed>If <code>NOTATION_ATTR</code> on an <code>EMPTY</code> |
542 |
element, then raise an |
element, then raise an |
543 |
<a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li> |
<a href="#xml-validity-error" id=vc-no-notation-empty><code>xml-validity-error</code></a>.</li> |
544 |
|
<li class=ed>@@ #defattrvalid Validity constraint: Attribute Default Value Syntactically Correct</li> |
545 |
</ol> |
</ol> |
546 |
</dd> |
</dd> |
547 |
<dt>If <var>n</var> is a <code>CDATASection</code> node</dt> |
<dt>If <var>n</var> is a <code>CDATASection</code> node</dt> |
652 |
<li>If the <code>systemId</code> attribute value contains |
<li>If the <code>systemId</code> attribute value contains |
653 |
both <code>"</code> and <code>'</code> characters, it is an |
both <code>"</code> and <code>'</code> characters, it is an |
654 |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
655 |
|
<!-- @@ "It is an error for a fragment identifier (beginning with a # character) to be part of a system identifier." --> |
656 |
<!-- publicId = non-null and systemId = null --> |
<!-- publicId = non-null and systemId = null --> |
657 |
<li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn> |
<li>For each node <dfn id=var-dt-nc><var>n<sub><var>c</var></sub></var></dfn> |
658 |
in the <code>childNodes</code> list of <var>n</var>, |
in the <code>childNodes</code> list of <var>n</var>, |
695 |
</ol></li> |
</ol></li> |
696 |
<li class=ed>@@ #elementvalid Validity constraint: Element Valid</li> |
<li class=ed>@@ #elementvalid Validity constraint: Element Valid</li> |
697 |
<li class=ed>@@ #uniqattspec Well-formedness constraint: Unique Att Spec</li> |
<li class=ed>@@ #uniqattspec Well-formedness constraint: Unique Att Spec</li> |
698 |
|
<li class=ed>@@ #RequiredAttr Validity constraint: Required Attribute</li> |
699 |
</ol> |
</ol> |
700 |
</dd> |
</dd> |
701 |
<dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt> |
<dt>If <var>n</var> is an <code>ElementTypeDefinition</code> node</dt> |
736 |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
<a href="#xml-well-formedness-error" id=wfe-dt-system-id><code>xml-well-formedness-error</code></a>.</li> |
737 |
<!-- publicId = non-null and systemId = null --> |
<!-- publicId = non-null and systemId = null --> |
738 |
<li class=ed>@@ <code>notationName</code></li> |
<li class=ed>@@ <code>notationName</code></li> |
739 |
|
<li class=ed>@@ #not-declared Validity constraint: Notation Declared</li> |
740 |
<li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn> |
<li>For each node <dfn id=var-entity-nc><var>n<sub><var>c</var></sub></var></dfn> |
741 |
in the <code>childNodes</code> list of <var>n</var>, |
in the <code>childNodes</code> list of <var>n</var>, |
742 |
<ol> |
<ol> |