--- test/html-webhacc/error-description.en.html.u8 2007/11/07 12:20:44 1.14 +++ test/html-webhacc/error-description.en.html.u8 2007/11/18 11:05:12 1.15 @@ -8,6 +8,101 @@
$0
+ is not allowed for HTML document.The character encoding used for the document is not allowed + for HTML document. The document is non‐conforming.
+$0
+ should not be used for HTML document.The character encoding used for the document is not recommended + for HTML document. The document is non‐conforming + unless there is any good reason to use that encoding.
+Use of UTF-8 as the character encoding of the document is encouraged, + though the use of another character encoding is conforming.
+The document does not contain a character encoding + declaration. Unless the character encoding is explicitly + specified in upper‐level protocol, e.g. in HTTP, + or is implied by BOM, there must be a character + encoding declaration. The document is non‐conforming.
+ +The long character encoding declaration syntax
+ <meta http-equiv="Content-Type" content="text/html; charset=charset-name">
+ is obsolete. The new syntax is:
+<meta charset="charset-name">
+
+ Note that the encoding
declaration in XML
+ declaration has no effect for HTML document.
$0
+ is not a superset of ASCII.The document is not labeled with character encoding name + in upper‐level protocol, e.g. in HTTP, and + the document is not begin with BOM. In addition, + the character encoding of the document is not a superset of + ASCII. The document is non‐conforming.
+ +Unless there is a BOM, the character encoding + for the document must be specified in e.g. HTTP‐level, + as:
+
+Content-Type: text/html; charset=charset-name
+
+ Existence of HTML character encoding declaration, i.e.
+ <meta charset="charset-name">
,
+ does not allow to omit charset
parameter
+ for HTML document encoded in non‐ASCII
+ compatible encoding.
Character encodings Shift_JIS
, Windows-31J
,
+ and ISO-2022-JP
are not a superset of
+ ASCII for the purpose of HTML conformance.
$0
, a character encoding declaration specifying
+ character encoding as $1
is found. The document
+ is reparsed.While parsing a document in a character encoding, + a character encoding declaration which declares the character + encoding of the document as another character encoding is found. + The occurence of this warning itself does not make the document + non‐conforming. However, the failure of the first attempt to + to detect the character encoding might be the result of non‐conformance + of the document.
+ +The document will be reparsed from the beginning. Some error + or warning might be reported again.
+ +These are suggestions to avoid this warning:
+charset
parameter in the Content-Type
+ field in the HTTP header, as:
+
+Content-Type: text/html; charset="charset-name"
<meta charset="charset-name">
)
+ just after <head>
start tag.UTF-8
.<meta http-equiv="Content-Type" content="text/html; charset=charset-name">
is obsolete.Old long charset declaration syntax +
Old long character encoding declaration syntax
<meta http-equiv="Content-Type" content="text/html; charset=charset-name">
is in use. The document is non‐conforming.
The new charset declaration syntax is: +
The new character encoding declaration syntax is:
<meta charset="charset-name">
@@ -1114,6 +1209,6 @@
and/or modify it under the same terms as Perl itself.
-
+