43 |
</d:desc> |
</d:desc> |
44 |
</d:item> |
</d:item> |
45 |
|
|
46 |
|
<d:item name="character encoding" class="format-charset unsupported" |
47 |
|
level="unsupported"> |
48 |
|
<d:message xml:lang="en">Conformance for character encoding requirements |
49 |
|
cannot be checked.</d:message> |
50 |
|
<d:desc xml:lang="en"> |
51 |
|
<p>The conformance checker cannot detect whether the input document |
52 |
|
met the requirements on character encoding, since the document |
53 |
|
is not inputed as a serialized byte sequence. The document is |
54 |
|
not conforming if it is not encoded in an appropriate character |
55 |
|
encoding with appropriate labeling.</p> |
56 |
|
</d:desc> |
57 |
|
</d:item> |
58 |
|
|
59 |
<d:item name="no character encoding declaration" class="format-charset error" |
<d:item name="no character encoding declaration" class="format-charset error" |
60 |
level="m"> |
level="m"> |
61 |
<d:message xml:lang="en">There is no character encoding |
<d:message xml:lang="en">There is no character encoding |
63 |
<d:desc xml:lang="en"> |
<d:desc xml:lang="en"> |
64 |
<p>The document does not contain a character encoding |
<p>The document does not contain a character encoding |
65 |
declaration. Unless the character encoding is explicitly |
declaration. Unless the character encoding is explicitly |
66 |
specified in upper‐level protocol, e.g. in <abbr>HTTP</abbr>, |
specified in lower‐level protocol, e.g. in <abbr>HTTP</abbr>, |
67 |
or is implied by <abbr>BOM</abbr>, there must be a character |
or is implied by <abbr>BOM</abbr>, there must be a character |
68 |
encoding declaration. The document is non‐conforming.</p> |
encoding declaration. The document is non‐conforming.</p> |
69 |
|
|
80 |
<d:item name="non ascii superset" class="format-charset error" |
<d:item name="non ascii superset" class="format-charset error" |
81 |
level="m"> |
level="m"> |
82 |
<d:message xml:lang="en">No character encoding metadata is found |
<d:message xml:lang="en">No character encoding metadata is found |
83 |
in upper‐level protocol nor is there <abbr>BOM</abbr>, while |
in lower‐level protocol nor is there <abbr>BOM</abbr>, while |
84 |
character encoding <code><var>$0</var></code> |
character encoding <code><var>$0</var></code> |
85 |
is not a superset of <abbr>ASCII</abbr>.</d:message> |
is not a superset of <abbr>ASCII</abbr>.</d:message> |
86 |
<d:desc xml:lang="en"> |
<d:desc xml:lang="en"> |
87 |
<p>The document is not labeled with character encoding name |
<p>The document is not labeled with character encoding name |
88 |
in upper‐level protocol, e.g. in <abbr>HTTP</abbr>, and |
in lower‐level protocol, e.g. in <abbr>HTTP</abbr>, and |
89 |
the document is not begin with <abbr>BOM</abbr>. In addition, |
the document is not begin with <abbr>BOM</abbr>. In addition, |
90 |
the character encoding of the document is not a superset of |
the character encoding of the document is not a superset of |
91 |
<abbr>ASCII</abbr>. The document is non‐conforming.</p> |
<abbr>ASCII</abbr>. The document is non‐conforming.</p> |
1017 |
<section id="attribute-value-errors"> |
<section id="attribute-value-errors"> |
1018 |
<h2>Attribute Value Errors</h2> |
<h2>Attribute Value Errors</h2> |
1019 |
|
|
1020 |
|
<d:item name="charset:not registered" class="attribute-value-warning warning" |
1021 |
|
level="w"> |
1022 |
|
<d:message xml:lang="en">Character encoding name <code><var>$0</var></code> |
1023 |
|
is not registered.</d:message> |
1024 |
|
<d:desc xml:lang="en"> |
1025 |
|
<p>The specified character encoding name is not registered to |
1026 |
|
<abbr>IANA</abbr>. Use of registered character encoding name |
1027 |
|
is a good practice to facilitate interoperability.</p> |
1028 |
|
|
1029 |
|
<dl class="switch"> |
1030 |
|
<dt><code>EUC-TW</code></dt> |
1031 |
|
<dd><code>EUC-TW</code> is not registered. Unfortunately, there |
1032 |
|
is no registered name for that character encoding. Use |
1033 |
|
Big5 encoding with character encoding name <code>Big5</code> |
1034 |
|
if it is enough to represent the document.</dd> |
1035 |
|
<dt><code>ISO-2022-JP-1</code></dt> |
1036 |
|
<dd><code>ISO-2022-JP-1</code> is not registered, nevertheless |
1037 |
|
this character encoding name is documented in |
1038 |
|
<a href="urn:ietf:rfc:2237"><abbr>RFC</abbr> 2237</a>. Use |
1039 |
|
<code>ISO-2022-JP-2</code> instead, since that character encoding |
1040 |
|
is a superset of ISO-2022-JP-1.</dd> |
1041 |
|
<dt><code>ISO-2022-JP-3</code>, <code>ISO-2022-JP-3-plane1</code></dt> |
1042 |
|
<dd>These names are not registered and obsoleted in favor of |
1043 |
|
<code>ISO-2022-JP-2004</code> and |
1044 |
|
<code>ISO-2022-JP-2004-plane1</code>.</dd> |
1045 |
|
<dt><code>ISO-2022-JP-2003</code>, |
1046 |
|
<code>ISO-2022-JP-2003-plane1</code></dt> |
1047 |
|
<dd>These names are not registered and corrected to |
1048 |
|
<code>ISO-2022-JP-2004</code> and |
1049 |
|
<code>ISO-2022-JP-2004-plane1</code>.</dd> |
1050 |
|
<dt><code>ISO-2022-JP-2004</code>, |
1051 |
|
<code>ISO-2022-JP-2004-plane1</code></dt> |
1052 |
|
<dd>These names are not registered. Unfortunately, there is |
1053 |
|
no registered name for these character encodings.</dd> |
1054 |
|
<dt><code>UTF-8N</code></dt> |
1055 |
|
<dd><code>UTF-8N</code> is not registered. Character encoding |
1056 |
|
name <code>UTF-8</code> represents UTF-8 encoding with or |
1057 |
|
without <abbr>BOM</abbr>.</dd> |
1058 |
|
</dl> |
1059 |
|
|
1060 |
|
<p><strong>WARNING</strong>: This error might be raised for |
1061 |
|
a registered character encoding name, since the character encoding |
1062 |
|
name database of the conformance checker is not complete yet.</p> |
1063 |
|
</d:desc> |
1064 |
|
</d:item> |
1065 |
|
|
1066 |
|
<d:item name="charset:private" class="attribute-value-warning warning" |
1067 |
|
level="w"> |
1068 |
|
<d:message xml:lang="en"><code><var>$0</var></code> is a private |
1069 |
|
character encoding name.</d:message> |
1070 |
|
<d:desc xml:lang="en"> |
1071 |
|
<p>The specified character encoding name is a private name and |
1072 |
|
not registered to <abbr>IANA</abbr>. Use of registered character |
1073 |
|
encoding name is a good practice to facilitate interoperability.</p> |
1074 |
|
|
1075 |
|
<dl class="switch"> |
1076 |
|
<dt><code>x-euc-jp</code></dt> |
1077 |
|
<dd>Use <code>EUC-JP</code> for the Japanese <abbr>EUC</abbr> |
1078 |
|
character encoding.</dd> |
1079 |
|
<dt><code>x-sjis</code></dt> |
1080 |
|
<dd>Use <code>Shift_JIS</code> for standard Shift encoding scheme of |
1081 |
|
<abbr>JIS</abbr> coded character set, or <code>Windows-31J</code> |
1082 |
|
for Microsoft standard character set as implemented by |
1083 |
|
Microsoft Windows.</dd> |
1084 |
|
</dl> |
1085 |
|
</d:desc> |
1086 |
|
</d:item> |
1087 |
|
|
1088 |
|
<d:item name="charset:syntax error" class="attribute-value-error" |
1089 |
|
level="m"> |
1090 |
|
<d:message xml:lang="en">The specified value is syntactically not a |
1091 |
|
character encoding name.</d:message> |
1092 |
|
<d:desc xml:lang="en"> |
1093 |
|
<p>The attribute value must be a character encoding name. However, |
1094 |
|
the specified value is not a character encoding name syntactically. |
1095 |
|
The document is non‐conforming.</p> |
1096 |
|
<p>Character encoding name is a string of <abbr>ASCII</abbr> |
1097 |
|
printable characters, up to 40 characters.</p> |
1098 |
|
</d:desc> |
1099 |
|
</d:item> |
1100 |
|
|
1101 |
<d:item name="enumerated:invalid" |
<d:item name="enumerated:invalid" |
1102 |
class="attribute-value-error"> |
class="attribute-value-error"> |
1103 |
<d:message xml:lang="en">This attribute only allow a limited set of |
<d:message xml:lang="en">This attribute only allow a limited set of |
1194 |
</d:desc> |
</d:desc> |
1195 |
</d:item> |
</d:item> |
1196 |
|
|
1197 |
|
<d:item name="mismatched charset name" class="attribute-value-error" |
1198 |
|
level="m"> |
1199 |
|
<d:message xml:lang="en">Character encoding name <code><var>$1</var></code> |
1200 |
|
is different from document character encoding |
1201 |
|
<code><var>$0</var></code>.</d:message> |
1202 |
|
<d:desc xml:lang="en"> |
1203 |
|
<p>The specified character encoding name is different from |
1204 |
|
the character encoding of the document. The document |
1205 |
|
is non‐conforming.</p> |
1206 |
|
</d:desc> |
1207 |
|
</d:item> |
1208 |
|
|
1209 |
<d:item name="reserved browsing context name" |
<d:item name="reserved browsing context name" |
1210 |
class="attribute-value-error"> |
class="attribute-value-error"> |
1211 |
<d:message xml:lang="en">Browsing context name |
<d:message xml:lang="en">Browsing context name |