/[suikacvs]/markup/xml/xmlcc/xmlcc-work.en.html
Suika

Diff of /markup/xml/xmlcc/xmlcc-work.en.html

Parent Directory Parent Directory | Revision Log Revision Log | View Patch Patch

revision 1.19 by wakaba, Sat Dec 1 14:51:45 2007 UTC revision 1.22 by wakaba, Sun Mar 16 12:33:53 2008 UTC
# Line 18  Line 18 
18    
19  <div class="header">  <div class="header">
20  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>  <h1>manakai's <abbr>XML</abbr> Conformance Checking</h1>
21  <h2>Working Draft <time datetime=2007-12-01>1 December 2007</time></h2>  <h2>Working Draft <time datetime=2008-03-16>16 March 2008</time></h2>
22    
23  <dl class="versions-uri">  <dl class="versions-uri">
24  <dt>This Version</dt>  <dt>This Version</dt>
# Line 42  Line 42 
42        >w@suika.fam.cx</a>&gt;</code></dd>        >w@suika.fam.cx</a>&gt;</code></dd>
43  </dl>  </dl>
44    
45  <p class="copyright" lang="en">&#xA9; <time>2007</time> <a  <p class="copyright" lang="en">&#xA9; <time>2007</time>$B!>(B<time>2008</time> <a
46      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.      href="http://suika.fam.cx/~wakaba/who?" rel="author">Wakaba</a>.
47  Permission is granted to copy, distribute and/or modify this document  Permission is granted to copy, distribute and/or modify this document
48  under the terms of the <a rel="license"  under the terms of the <a rel="license"
# Line 118  else in this specification is normative. Line 118  else in this specification is normative.
118  <p><span class=ed>Algorithm is normative but non-normative</span>.  <p><span class=ed>Algorithm is normative but non-normative</span>.
119  In addition, the order in which <a href="#errors">errors</a> are  In addition, the order in which <a href="#errors">errors</a> are
120  raised is undefined.</p>  raised is undefined.</p>
121    
122    <p>This document sometimes cites parts of <abbr>XML</abbr> 1.0 specification
123    by hyperlinks.  When the document being processed is an <abbr>XML</abbr> 1.1
124    document, however, corresponding parts of the <abbr>XML</abbr> 1.1
125    specification should be consulted instead.</p>
126  </div>  </div>
127    
128    
# Line 216  can be easily serialized into a valid XM Line 221  can be easily serialized into a valid XM
221  <div class=section id=parsing-xml>  <div class=section id=parsing-xml>
222  <h2>Parsing <abbr>XML</abbr> Document</h2>  <h2>Parsing <abbr>XML</abbr> Document</h2>
223    
224  <ul>  <p>When an <abbr>XML</abbr> document is parsed, the following clauses
225  <li>If the <abbr>XML</abbr> document does not begin with an  are applied:</p>
226  <abbr>XML</abbr> declaration, then raise an  <dl>
227  <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.</li>  <dt>For each document
228  <li>If the replacement text of an entity declaration is    <dd>If the <abbr>XML</abbr> document does not begin with an
229  <code>&lt;</code>, then raise an    <abbr>XML</abbr> declaration, then the parser <em class=rfc2119>MUST</em>
230  <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--    raise an
231      <a href="#xml-misc-recommentation" id=xmr-xml-decl><code>xml-misc-recommendation</code></a>.
232    <dt>For each internal general entity declaration processed by the parser
233      <dd>If the
234      <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-EntityValue"><code>EntityValue</code></a>
235      part of the general entity declaration contains a bare <code>U+003C</code>
236      <code>LESS-THAN SIGN</code> (<code>&lt;</code>) character, then the parser
237      <em class=rfc2119>MUST</em> raise an
238      <a href="#xml-misc-warning" id=xmw-entity-value-lt><code>xml-misc-warning</code></a>.<!--
239  "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].  "strongly advised to avoid" in a Note in Section 2.3 of [XML10], [XML11].
240  --></li>  -->
241  <li>If there is an element type declaration whose <code>Name</code>  <dt>For each element type declaration processed by the parser
242  value is already declared, then raise an    <dd>If there is another element type declaration whose <code>Name</code>
243  <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.</li>    is equal to the <code>Name</code> of the element type declaration, then
244  <li>If attribute definition whose <code>Name</code> is    the parser <em class=rfc2119>MUST</em> raise an
245  <code>xml:space</code> has <span class=ed>declared type different from    <a href="#xml-validity-error" id=vc-edunique><code>xml-validity-error</code></a>.
246  (default|preserve), (default), or (preserve)</span>, then raise an  <!--
247  <a href="#xml-misc-error" id=xme-ad-xml-space><code>xml-misc-error</code></a>.    NOTE: <!ATTLIST a xml:space (default) #IMPLIED xml:space CDATA #IMPLIED>
248  <span class=ed>@@ duplication with    will not be warned.
249  <a href="#xml-at-xml-space">#xml-at-xml-space</a>.<!--  -->
250  <!ATTLIST e xml:space CDATA #IMPLIED xml:space CDATA #IMPLIED> --></span></li>  <dt>For each empty-element tag
251  <li>If an empty-element tag is used for an element which is <em>not</em>    <dd>If the <code>Name</code> of the tag is not declared by a processed
252  declared <code>EMPTY</code>, then raise an    element type declaration as <code>EMPTY</code> content, then the parser
253  <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.</li>    <em class=rfc2119>MUST</em> raise an
254  <li>If an empty-element tag is <em>not</em> used for an element which is    <a href="#xml-misc-recommentation" id=xmr-emptyelemtag-not-empty><code>xml-misc-recommendation</code></a>.
255  declared <code>EMPTY</code>, then raise an  <dt>For each start-tag
256  <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.</li>    <dd>If the <code>Name</code> of the tag is declared by a processed element
257      type declaration as <code>EMPTY</code> content, then the parser
258      <em class=rfc2119>MUST</em> raise an
259      <a href="#xml-misc-recommentation" id=xmr-empty-not-emptyelemtag><code>xml-misc-recommendation</code></a>.
260  <!--  <!--
261    
262  #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting  #vc-PEinMarkupDecl Validity constraint: Proper Declaration/PE Nesting
# Line 285  declared <code>EMPTY</code>, then raise Line 301  declared <code>EMPTY</code>, then raise
301  "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."  "It is a fatal error when an XML processor encounters an entity with an encoding that it is unable to process. It is a fatal error if an XML entity is determined (via default, encoding declaration, or higher-level protocol) to be in a certain encoding but contains byte sequences that are not legal in that encoding."
302    
303  -->  -->
304  </ul>  </dl>
305  </div>  </div>
306    
307  <div class="section" id=checking-dom>  <div class="section" id=checking-dom>
# Line 393  following algorithm <em class=rfc2119>MU Line 409  following algorithm <em class=rfc2119>MU
409  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the  a public identifier (<dfn id=var-pid><var>pid</var></dfn>)</dfn>, the
410  following algorithm <em class=rfc2119>MUST</em> be used:</p>  following algorithm <em class=rfc2119>MUST</em> be used:</p>
411  <ol>  <ol>
412    <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>  <li>If <var>pid</var> is <code>null</code>, abort these steps.</li>
413    <li>If <var>pid</var> contains any character  <li>If <var>pid</var> contains a character that is <em>not</em> in the
414    that is outside of the range of <code>#x20 | #xD | #xA |  character class <a href="#class-PubidChar"><code>PubidChar</code></a>, then
415    [a-zA-Z0-9] | [-'()+,./:=?;!*#@$_%]</code><!-- @@ TODO: formal def -->,  raise an
416    then it is an  <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>
   <a href="#xml-well-formedness-error" id=wfe-pubid-char><code>xml-well-formedness-error</code></a>.</li>  
417    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>    <li>If <var>pid</var> contains one of <code class=char>U+0009</code>
418    <code class=charname>CHARACTER TABULATION</code>,    <code class=charname>CHARACTER TABULATION</code>,
419    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,    <code class=char>U+000A</code> <code class=charname>CARRIAGE RETURN</code>,
# Line 414  following algorithm <em class=rfc2119>MU Line 429  following algorithm <em class=rfc2119>MU
429    <span class=ed>Is this really a roundtripness problem?  XML spec    <span class=ed>Is this really a roundtripness problem?  XML spec
430    does only define the way to match public identifiers in fact, no    does only define the way to match public identifiers in fact, no
431    canonical form.</span></li>    canonical form.</span></li>
   <li class=ed>@@ Should we check formal-public-identifierness?</li>  
432  </ol>  </ol>
433    
434  <p>To  <p>To
# Line 561  following:</p> Line 575  following:</p>
575      <li>If <code>nodeName</code> attribute of <var>n</var> is      <li>If <code>nodeName</code> attribute of <var>n</var> is
576      <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>      <code>xml:space</code> <span class=ed>@@ or {xml namespace}:space ?</span>
577      and <span class=ed>its declared type is different from (default|preserve),      and <span class=ed>its declared type is different from (default|preserve),
578      (default), or (preserve)</span>, then raise an      (preserve|default), (default), or (preserve)</span>, then raise an
579      <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>      <a href="#xml-misc-error" id=xme-at-xml-space><code>xml-misc-error</code></a>.</li>
580      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the      <li>For each node <dfn id=var-ad-nc><var>n<sub><var>c</var></sub></var></dfn> in the
581      <code>childNodes</code> list of <var>n</var>,      <code>childNodes</code> list of <var>n</var>,
# Line 996  as amended by Line 1010  as amended by
1010  contains the following characters:</p>  contains the following characters:</p>
1011  <ul class=ed>  <ul class=ed>
1012  </ul>  </ul>
1013    <div class="note memo">
1014    <p>This character class contains all characters allowed as the first character
1015    of a string matching to the production rule
1016    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1017    of <abbr>XML</abbr> 1.0
1018    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1019    </div>
1020    
1021  <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>  <p>The character class <dfn id=class-NameChar10><code>NameChar10</code></dfn>
1022  contains the following characters:</p>  contains the following characters:</p>
# Line 1004  contains the following characters:</p> Line 1025  contains the following characters:</p>
1025  <a href="#class-NameStartChar10">NameStartChar10</a>.</li>  <a href="#class-NameStartChar10">NameStartChar10</a>.</li>
1026  <li class=ed></li>  <li class=ed></li>
1027  </ul>  </ul>
1028    <div class="note memo">
1029    <p>This character class contains all characters allowed as the second
1030    character of a string matching to the production rule
1031    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-Name"><code>Name</code></a>
1032    of <abbr>XML</abbr> 1.0
1033    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1034    </div>
1035    
1036    <p>The character class <dfn id=class-PubidChar><code>PubidChar</code></dfn>
1037    contains the following characters:</p>
1038    <ul>
1039    <li><code class=char>U+0009</code> <code class=charname>CHARACTER
1040    TABULATION</code></li>
1041    <li><code class=char>U+000A</code> <code class=charname>LINE FEED</code></li>
1042    <li><code class=char>U+000D</code> <code class=charname>CARRIAGE
1043    RETURN</code></li>
1044    <li><code class=char>U+0020</code> <code class=charname>SPACE</code></li>
1045    <li><code class=char>U+0021</code> <code class=charname>EXCLAMATION MARK</code>
1046    (<code class=char>!</code>)</li>
1047    <li><code class=char>U+0023</code> <code class=charname>DOLLAR SIGN</code>
1048    (<code class=char>$</code>)</li>
1049    <li><code class=char>U+0024</code> <code class=charname>NUMBER SIGN</code>
1050    (<code class=char>#</code>)</li>
1051    <li><code class=char>U+0025</code> <code class=charname>PERCENT SIGN</code>
1052    (<code class=char>%</code>)</li>
1053    <li><code class=char>U+0027</code> <code class=charname>APOSTROPHE</code>
1054    (<code class=char>'</code>)</li>
1055    <li><code class=char>U+0028</code> <code class=charname>LEFT PARENTHESIS</code>
1056    (<code class=char>(</code>)</li>
1057    <li><code class=char>U+0029</code> <code class=charname>RIGHT
1058    PARENTHESIS</code> (<code class=char>)</code>)</li>
1059    <li><code class=char>U+002A</code> <code class=charname>ASTERISK</code>
1060    (<code class=char>*</code>)</li>
1061    <li><code class=char>U+002B</code> <code class=charname>PLUS SIGN</code>
1062    (<code class=char>+</code>)</li>
1063    <li><code class=char>U+002C</code> <code class=charname>COMMA</code>
1064    (<code class=char>,</code>)</li>
1065    <li><code class=char>U+002D</code> <code class=charname>HYPHEN-MINUS</code>
1066    (<code class=char>-</code>)</li>
1067    <li><code class=char>U+002E</code> <code class=charname>FULL STOP</code>
1068    (<code class=char>.</code>)</li>
1069    <li><code class=char>U+002F</code> <code class=charname>SOLIDUS</code>
1070    (<code class=char>/</code>)</li>
1071    <li><code class=char>U+0030</code> <code class=charname>DIGIT ZERO</code>
1072    (<code class=char>0</code>) .. <code class=char>U+0039</code>
1073    <code class=charname>DIGIT NINE</code> (<code class=char>9</code>)</li>
1074    <li><code class=char>U+003A</code> <code class=charname>COLON</code>
1075    (<code class=char>:</code>)</li>
1076    <li><code class=char>U+003B</code> <code class=charname>SEMICOLON</code>
1077    (<code class=char>;</code>)</li>
1078    <li><code class=char>U+003D</code> <code class=charname>EQUAL SIGN</code>
1079    (<code class=char>=</code>)</li>
1080    <li><code class=char>U+003F</code> <code class=charname>QUESTION MARK</code>
1081    (<code class=char>?</code>)</li>
1082    <li><code class=char>U+0040</code> <code class=charname>COMMERCIAL AT</code>
1083    (<code class=char>@</code>)</li>
1084    <li><code class=char>U+0041</code> <code class=charname>LATIN CAPITAL LETTER
1085    A</code> (<code class=char>A</code>) .. <code class=char>U+005A</code>
1086    <code class=charname>LATIN CAPITAL LETTER Z</code>
1087    (<code class=char>Z</code>)</li>
1088    <li><code class=char>U+005F</code> <code class=charname>LOW LINE</code>
1089    (<code class=char>_</code>)</li>
1090    <li><code class=char>U+0061</code> <code class=charname>LATIN CAPITAL LETTER
1091    A</code> (<code class=char>A</code>) .. <code class=char>U+007A</code>
1092    <code class=charname>LATIN CAPITAL LETTER Z</code>
1093    (<code class=char>Z</code>)</li>
1094    </ul>
1095    <div class="note memo">
1096    <p>This character class contains all characters allowed in the production rule
1097    <a href="http://www.w3.org/TR/2006/REC-xml-20060816/#NT-PubidChar"><code>PubidChar</code></a>
1098    of <abbr>XML</abbr> 1.0
1099    <cite class="bibref normative">[<a href="#ref-XML10">XML10</a>]</cite>.</p>
1100    </div>
1101    
1102  </div>  </div>
1103    

Legend:
Removed from v.1.19  
changed lines
  Added in v.1.22

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24