/[suikacvs]/webroot/www/2004/id/draft-ietf-hmtl-spec-04.txt
Suika

Contents of /webroot/www/2004/id/draft-ietf-hmtl-spec-04.txt

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.1 - (show annotations) (download)
Tue Jun 15 08:04:04 2004 UTC (19 years, 11 months ago) by wakaba
Branch: MAIN
CVS Tags: HEAD
File MIME type: text/plain
New

1
2 HTML Working Group T. Berners-Lee
3 INTERNET-DRAFT MIT/W3C
4 <draft-ietf-html-spec-04.txt> D. Connolly
5 Expires: In six months June 16, 1995
6
7
8 Hypertext Markup Language - 2.0
9
10
11 Status of this Memo
12
13 This document is an Internet-Draft. Internet-Drafts are working
14 documents of the Internet Engineering Task Force (IETF), its areas, and
15 its working groups. Note that other groups may also distribute working
16 documents as Internet-Drafts.
17
18 Internet-Drafts are draft documents valid for a maximum of six months
19 and may be updated, replaced, or obsoleted by other documents at any
20 time. It is inappropriate to use Internet-Drafts as reference material
21 or to cite them other than as ``work in progress.''
22
23 To learn the current status of any Internet-Draft, please check the
24 1id-abstracts.txt listing contained in the Internet-Drafts Shadow
25 Directories on ftp.is.co.za (Africa), nic.nordu.net (Europe),
26 munnari.oz.au (Pacific Rim), ds.internic.net (US East Coast), or
27 ftp.isi.edu (US West Coast).
28
29 Distribution of this document is unlimited. Please send comments to the
30 HTML working group (HTML-WG) of the Internet Engineering Task Force
31 (IETF) at <html-wg@oclc.org>. Discussions of the group are archived at
32 <URL:http://www.acl.lanl.gov/HTML_WG/archives.html>.
33
34
35 ABSTRACT
36
37 The Hypertext Markup Language (HTML) is a simple markup language
38 used to create hypertext documents that are platform
39 independent. HTML documents are SGML documents with generic
40 semantics that are appropriate for representing information from
41 a wide range of domains. HTML markup can represent hypertext
42 news, mail, documentation, and hypermedia; menus of options;
43 database query results; simple structured documents with
44 in-lined graphics; and hypertext views of existing bodies of
45 information.
46
47 HTML has been in use by the World Wide Web (WWW) global
48 information initiative since 1990. This specification roughly
49 corresponds to the capabilities of HTML in common use prior to
50 June 1994. HTML is an application of ISO Standard 8879:1986
51 Information Processing Text and Office Systems; Standard
52 Generalized Markup Language (SGML).
53
54 The `text/html' Internet Media Type (RFC 1590) and MIME Content
55 Type (RFC 1521) is defined by this specification.
56
57
58
59 Berners-Lee, Connolly [Page 1]
60
61 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
62
63 CONTENTS
64
65
66 1 Introduction .......................................... 3
67 1.1 Scope ................................................. 3
68 1.2 Conformance ........................................... 3
69 2 Terms ................................................. 5
70 3 HTML as an Application of SGML ........................ 9
71 3.1 SGML Documents ........................................ 9
72 3.2 HTML Lexical Syntax .................................. 11
73 3.3 HTML Public Text Identifiers ......................... 15
74 3.4 Example HTML Document ................................ 16
75 4 HTML as an Internet Media Type ....................... 16
76 4.1 text/html media type ................................. 16
77 4.2 HTML Document Representation ......................... 17
78 5 Document Structure ................................... 18
79 5.1 Document Element: HTML ............................... 19
80 5.2 Head: HEAD ........................................... 19
81 5.3 Body: BODY ........................................... 22
82 5.4 Headings: H1 ... H6 .................................. 22
83 5.5 Block Structuring Elements ........................... 23
84 5.6 List Elements ........................................ 25
85 5.7 Phrase Markup ........................................ 28
86 5.8 Line Break: BR ....................................... 31
87 5.9 Horizontal Rule: HR .................................. 31
88 5.10 Image: IMG ........................................... 31
89 6 Characters, Words, and Paragraphs .................... 33
90 6.1 The HTML Document Character Set ...................... 33
91 7 Hyperlinks ........................................... 34
92 7.1 Accessing Resources .................................. 34
93 7.2 Activation of Hyperlinks ............................. 34
94 7.3 Simultaneous Presentation of Image Resources ......... 35
95 7.4 Fragment Identifiers ................................. 35
96 7.5 Queries and Indexes .................................. 35
97 7.6 Image Maps ........................................... 36
98 8 Forms ................................................ 36
99 8.1 Form Elements ........................................ 37
100 8.2 Form Submission ...................................... 42
101 9 HTML Public Text ..................................... 45
102 9.1 HTML DTD ............................................. 45
103 9.2 Strict HTML DTD ...................................... 56
104 9.3 Level 1 HTML DTD ..................................... 57
105 9.4 Strict Level 1 HTML DTD .............................. 58
106 9.5 SGML Declaration for HTML ............................ 58
107 9.6 Sample SGML Open Entity Catalog for HTML ............. 60
108 9.7 Character Entity Sets ................................ 61
109 10 Security Considerations .............................. 63
110 11 References ........................................... 64
111 12 Acknowledgments ...................................... 65
112 12.1 Authors' Addresses ................................... 66
113 13 The HTML Coded Character Set ......................... 66
114 14 Proposed Entities .................................... 69
115
116
117
118 Berners-Lee, Connolly [Page 2]
119
120 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
121
122 1. Introduction
123
124 The HyperText Markup Language (HTML) is a simple data format
125 used to create hypertext documents that are portable from one
126 platform to another. HTML documents are SGML documents with
127 generic semantics that are appropriate for representing
128 information from a wide range of domains.
129
130 As HTML is an application of SGML, this specification assumes a
131 working knowledge of [SGML].
132
133
134 1.1. Scope
135
136 HTML has been in use by the World-Wide Web (WWW) global
137 information initiative since 1990. This specification
138 corresponds to the capabilities of HTML in common use prior to
139 June 1994 and referred to as ``HTML 2.0''.
140
141 HTML is an application of ISO Standard 8879:1986 _Information
142 Processing Text and Office Systems; Standard Generalized Markup
143 Language_ (SGML). The HTML Document Type Definition (DTD) is a
144 formal definition of the HTML syntax in terms of SGML.
145
146 This specification also defines HTML as an Internet Media
147 Type[IMEDIA] and MIME Content Type[MIME] called `text/html'. As
148 such, it defines the semantics of the HTML syntax and how that
149 syntax should be interpreted by user agents.
150
151
152 1.2. Conformance
153
154 This specification governs the syntax of HTML documents and
155 aspects of the behavior of HTML user agents.
156
157
158 1.2.1. Documents
159
160 A document is a conforming HTML document if:
161
162 * It is a conforming SGML document, and it conforms to the
163 HTML DTD (see 9.1, "HTML DTD").
164
165 NOTE - There are a number of syntactic idioms that
166 are not supported or are supported inconsistently in
167 some historical user agent implementations. These
168 idioms are identified in notes like this throughout
169 this specification.
170
171 * It conforms to the application conventions in this
172 specification. For example, the value of the HREF attribute
173 of the <A> element must conform to the URI syntax.
174
175 * Its document character set includes [ISO-8859-1] and
176
177 Berners-Lee, Connolly [Page 3]
178
179 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
180
181 agrees with [ISO-10646]; that is, each code position listed
182 in 13, "The HTML Coded Character Set" is included, and each
183 code position in the document character set is mapped to the
184 same character as [ISO-10646] designates for that code
185 position.
186
187 NOTE - The document character set is somewhat
188 independent of the character encoding scheme used to
189 represent a document. For example, the `ISO-2022-JP'
190 character encoding scheme can be used for HTML
191 documents, since its repertoire is a subset of the
192 [ISO-10646] repertoire. The critical distinction is
193 that numeric character references agree with
194 [ISO-10646] regardless of how the document is
195 encoded.
196
197
198 1.2.2. Feature Test Entities
199
200 The HTML DTD defines a standard HTML document type and several
201 variations, by way of feature test entities. Feature test
202 entities are declarations in the HTML DTD that control the
203 inclusion or exclusion of portions of the DTD.
204
205 HTML.Recommended
206 Certain features of the language are necessary for
207 compatibility with widespread usage, but they may
208 compromise the structural integrity of a document. This
209 feature test entity selects a more prescriptive document
210 type definition that eliminates those features. It is
211 set to `IGNORE' by default.
212
213 For example, in order to preserve the structure of a
214 document, an editing user agent may translate HTML
215 documents to the recommended subset, or it may require
216 that the documents be in the recommended subset for
217 import.
218
219 HTML.Deprecated
220 Certain features of the language are necessary for
221 compatibility with earlier versions of the
222 specification, but they tend to be used and implemented
223 inconsistently, and their use is deprecated. This
224 feature test entity enables a document type definition
225 that allows these features. It is set to `INCLUDE' by
226 default.
227
228 Documents generated by translation software or editing
229 software should not contain deprecated idioms.
230
231
232 1.2.3. User Agents
233
234 An HTML user agent conforms to this specification if:
235
236 Berners-Lee, Connolly [Page 4]
237
238 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
239
240
241 * It parses the characters of an HTML document into data
242 characters and markup according to [SGML].
243
244 NOTE - In the interest of robustness and
245 extensibility, there are a number of widely deployed
246 conventions for handling non-conforming documents.
247 See 4.2.1, "Undeclared Markup Error Handling" for
248 details.
249
250 * It supports the `ISO-8859-1' character encoding scheme and
251 processes each character in the ISO Latin Alphabet No. 1 as
252 specified in 6.1, "The HTML Document Character Set".
253
254 NOTE - To support non-western writing systems, HTML
255 user agents are encouraged to support
256 `ISO-10646-UCS-2' or similar character encoding
257 schemes and as much of the character repertoire of
258 [ISO-10646] as is practical.
259
260 * It behaves identically for documents whose parsed token
261 sequences are identical.
262
263 For example, comments and the whitespace in tags disappear
264 during tokenization, and hence they do not influence the
265 behavior of conforming user agents.
266
267 * It allows the user to traverse (or at least attempt to
268 traverse, resources permitting) all hyperlinks from <A>
269 elements in an HTML document.
270
271 An HTML user agent is a level 2 user agent if, additionally:
272
273 * It allows the user to express all form field values
274 specified in an HTML document and to (attempt to) submit the
275 values as requests to information services.
276
277
278 2. Terms
279
280 absolute URI
281 a URI in absolute form, as per [URL]
282
283 anchor
284 one of two ends of a hyperlink; typically, a phrase
285 marked as an <A> element.
286
287 base URI
288 URI used as the base of an HTML document for the purpose
289 of resolving hyperlink destinations.
290
291 character
292 An atom of information, for example a letter or a digit.
293 Graphic characters have associated glyphs, where as
294
295 Berners-Lee, Connolly [Page 5]
296
297 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
298
299 control characters have associated processing semantics.
300
301 character encoding
302 scheme
303 A function whose domain is the set of sequences of
304 octets, and whose range is the set of sequences of
305 characters from a character repertoire; that is, a
306 sequence of octets and a character encoding scheme
307 determines a sequence of characters.
308
309 character repertoire
310 A finite set of characters; e.g. the range of a coded
311 character set.
312
313 code position
314 An integer. A coded character set and a code position
315 from its domain determine a character.
316
317 coded character set
318 A function whose domain is a subset of the integers and
319 whose range is a character repertoire. That is, for some
320 set of integers (usually of the form {0, 1, 2, ..., N}
321 ), a coded character set and an integer in that set
322 determine a character. Conversely, a character and a
323 coded character set determine the character's code
324 position (or, in rare cases, a few code positions).
325
326 conforming HTML user
327 agent
328 A user agent that conforms to this specification in its
329 processing of the Internet Media Type `text/html'.
330
331 data character
332 Characters other than markup, which make up the content
333 of elements.
334
335 document character set
336 a coded character set whose range includes all
337 characters used in a document. Every SGML document has
338 exactly one document character set. Numeric character
339 references are resolved via the document character set.
340
341 DTD
342 document type definition. Rules that apply SGML to the
343 markup of documents of a particular type, including a
344 set of element and entity declarations. [SGML]
345
346 element
347 A component of the hierarchical structure defined by a
348 document type definition; it is identified in a document
349 instance by descriptive markup, usually a start-tag and
350 end-tag. [SGML]
351
352 end-tag
353
354 Berners-Lee, Connolly [Page 6]
355
356 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
357
358 Descriptive markup that identifies the end of an
359 element. [SGML]
360
361 entity
362 data with an associated notation or interpretation; for
363 example, a sequence of octets associated with an
364 Internet Media Type. [SGML]
365
366 fragment identifier
367 the portion of an HREF attribute value following the `#'
368 character which modifies the presentation of the
369 destination of a hyperlink.
370
371 form data set
372 a sequence of name/value pairs; the names are given by
373 an HTML document and the values are given by a user.
374
375 HTML document
376 An SGML document conforming to this document type
377 definition.
378
379 hyperlink
380 a relationship between two anchors, called the tail and
381 the head.
382
383 markup
384 Syntactically delimited characters added to the data of
385 a document to represent its structure. There are four
386 different kinds of markup: descriptive markup (tags),
387 references, markup declarations, and processing
388 instructions. [SGML]
389
390 may
391 A document or user interface is conforming whether this
392 statement applies or not.
393
394 media type
395 an Internet Media Type, as per [IMEDIA].
396
397 message entity
398 a head and body. The head is a collection of name/value
399 fields, and the body is a sequence of octets. The head
400 defines the content type and content transfer encoding
401 of the body. [MIME]
402
403 minimally conforming
404 HTML user agent
405 A user agent that conforms to this specification except
406 for form processing. It may only process level 1 HTML
407 documents.
408
409 must
410 Documents or user agents in conflict with this statement
411 are not conforming.
412
413 Berners-Lee, Connolly [Page 7]
414
415 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
416
417
418 numeric character
419 reference
420 markup that refers to a character by its code position
421 in the document character set.
422
423 SGML document
424 A sequence of characters organized physically as a set
425 of entities and logically into a hierarchy of elements.
426 An SGML document consists of data characters and markup;
427 the markup describes the structure of the information
428 and an instance of that structure. [SGML]
429
430 shall
431 If a document or user agent conflicts with this
432 statement, it does not conform to this specification.
433
434 should
435 If a document or user agent conflicts with this
436 statement, undesirable results may occur in practice
437 even though it conforms to this specification.
438
439 start-tag
440 Descriptive markup that identifies the start of an
441 element and specifies its generic identifier and
442 attributes. [SGML]
443
444 syntax-reference
445 character set
446 A coded character set whose range includes all
447 characters used for markup; e.g. name characters and
448 delimiter characters.
449
450 tag
451 Markup that delimits an element. A tag includes a name
452 which refers to an element declaration in the DTD, and
453 may include attributes. [SGML]
454
455 text entity
456 A finite sequence of characters. A text entity typically
457 takes the form of a sequence of octets with some
458 associated character encoding scheme, transmitted over
459 the network or stored in a file. [SGML]
460
461 typical
462 Typical processing is described for many elements. This
463 is not a mandatory part of the specification but is
464 given as guidance for designers and to help explain the
465 uses for which the elements were intended.
466
467 URI
468 A Universal Resource Identifier is a formatted string
469 that serves as an identifier for a resource, typically
470 on the Internet. URIs are used in HTML to identify the
471
472 Berners-Lee, Connolly [Page 8]
473
474 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
475
476 destination of hyperlinks. URIs in common practice
477 include Uniform Resource Locators (URLs)[URL] and
478 Relative URLs [RELURL].
479
480 user agent
481 A component of a distributed system that presents an
482 interface and processes requests on behalf of a user;
483 for example, a www browser or a mail user agent.
484
485 WWW
486 The World-Wide Web is a hypertext-based, distributed
487 information system created by researchers at CERN in
488 Switzerland. <URL:http://www.w3.org/>
489
490
491 3. HTML as an Application of SGML
492
493 HTML is an application of ISO 8879:1986 -- Standard Generalized
494 Markup Language (SGML). SGML is a system for defining structured
495 document types and markup languages to represent instances of
496 those document types[SGML]. The public text -- DTD and SGML
497 declaration -- of the HTML document type definition are provided
498 in 9, "HTML Public Text".
499
500 The term _HTML_ refers to both the document type defined here
501 and the markup language for representing instances of this
502 document type.
503
504
505 3.1. SGML Documents
506
507 An HTML document is an SGML document; that is, a sequence of
508 characters organized physically into a set of entities, and
509 logically as a hierarchy of elements.
510
511 In the SGML specification, the first production of the SGML
512 syntax grammar separates an SGML document into three parts: an
513 SGML declaration, a prologue, and an instance. For the purposes
514 of this specification, the prologue is a DTD. This DTD describes
515 another grammar: the start symbol is given in the doctype
516 declaration, the terminals are data characters and tags, and the
517 productions are determined by the element declarations. The
518 instance must conform to the DTD, that is, it must be in the
519 language defined by this grammar.
520
521 The SGML declaration determines the lexicon of the grammar. It
522 specifies the document character set, which determines a
523 character repertoire that contains all characters that occur in
524 all text entities in the document, and the code positions
525 associated with those characters.
526
527 The SGML declaration also specifies the syntax-reference
528 character set of the document, and a few other parameters that
529 bind the abstract syntax of SGML to a concrete syntax. This
530
531 Berners-Lee, Connolly [Page 9]
532
533 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
534
535 concrete syntax determines how the sequence of characters of the
536 document is mapped to a sequence of terminals in the grammar of
537 the prologue.
538
539 For example, consider the following document:
540
541 <!DOCTYPE html PUBLIC "-//IETF//DTD HTML 2.0//EN">
542 <title>Parsing Example</title>
543 <p>Some text. <em>&#42;wow&#42;</em></p>
544
545 An HTML user agent should use the SGML declaration that is given
546 in 9.5, "SGML Declaration for HTML". According to its document
547 character set, `&#42;' refers to an asterisk character, `*'.
548
549 The instance above is regarded as the following sequence of
550 terminals:
551
552 1. start-tag: TITLE
553
554 2. data characters: ``Parsing Example''
555
556 3. end-tag: TITLE
557
558 4. start-tag: P
559
560 5. data characters ``Some text. ''
561
562 6. start-tag: EM
563
564 7. data characters: ``*wow*''
565
566 8. end-tag: EM
567
568 9. end-tag: P
569
570 The start symbol of the DTD grammar is HTML, and the productions
571 are given in the public text identified by `-//IETF//DTD HTML
572 2.0//EN' (9.1, "HTML DTD"). The terminals above parse as:
573
574 HTML
575 |
576 \-HEAD
577 | |
578 | \-TITLE
579 | |
580 | \-<TITLE>
581 | |
582 | \-"Parsing Example"
583 | |
584 | \-</TITLE>
585 |
586 \-BODY
587 |
588 \-P
589
590 Berners-Lee, Connolly [Page 10]
591
592 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
593
594 |
595 \-<P>
596 |
597 \-"Some text. "
598 |
599 \-EM
600 | |
601 | \-<EM>
602 | |
603 | \-"*wow*"
604 | |
605 | \-</EM>
606 |
607 \-</P>
608
609 Some of the elements are delimited explicity by tags, while the
610 boundaries of others are inferred. The <HTML> element contains a
611 <HEAD> element and a <BODY> element. The <HEAD> contains
612 <TITLE>, which is explicitly delimited by start- and end-tags.
613
614
615 3.2. HTML Lexical Syntax
616
617 SGML specifies an abstract syntax and a reference concrete
618 syntax. Aside from certain quantities and capacities (e.g. the
619 limit on the length of a name), all HTML documents use the
620 reference concrete syntax. In particular, all markup characters
621 are in the repertoire of [ISO-646]. Data characters are drawn
622 from the document character set (see 6, "Characters, Words, and
623 Paragraphs").
624
625 A complete discussion of SGML parsing, e.g. the mapping of a
626 sequence of characters to a sequence of tags and data, is left
627 to the SGML standard[SGML]. This section is only a summary.
628
629
630 3.2.1. Data Characters
631
632 Any sequence of characters that do not constitute markup (see
633 9.6 ``Delimiter Recognition'' of [SGML]) are mapped directly to
634 strings of data characters. Some markup also maps to data
635 character strings. Numeric character references map to
636 single-character strings, via the document character set. Each
637 reference to one of the general entities defined in the HTML DTD
638 maps to a single-character string.
639
640 For example,
641
642 abc&lt;def => "abc","<","def"
643 abc&#60;def => "abc","<","def"
644
645 The terminating semicolon on entity or numeric character
646 references is only necessary when the character following the
647 reference would otherwise be recognized as part of the name (see
648
649 Berners-Lee, Connolly [Page 11]
650
651 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
652
653 9.4.5 ``Reference End'' in [SGML]).
654
655 abc &lt def => "abc ","<"," def"
656 abc &#60 def => "abc ","<"," def"
657
658 An ampersand is only recognized as markup when it is followed by
659 a letter or a `#' and a digit:
660
661 abc & lt def => "abc & lt def"
662 abc &# 60 def => "abc &# 60 def"
663
664 A useful technique for translating plain text to HTML is to
665 replace each '<', '&', and '>' by an entity reference or numeric
666 character reference as follows:
667
668 ENTITY NUMERIC
669 CHARACTER REFERENCE CHAR REF CHARACTER DESCRIPTION
670 --------- ---------- ----------- ---------------------
671 & &amp; &#38; Ampersand
672 < &lt; &#60; Less than
673 > &gt; &#62; Greater than
674
675 NOTE - There are SGML mechanisms, CDATA and RCDATA
676 declared content, that allow most `<', `>', and `&'
677 characters to be entered without the use of entity
678 references. Because these mechanisms tend to be used and
679 implemented inconsistently, and because they conflict
680 with techniques for reducing HTML to 7 bit ASCII for
681 transport, they are deprecated in this version of HTML.
682 See 5.5.2.1, "Example and Listing: XMP, LISTING".
683
684
685 3.2.2. Tags
686
687 Tags delimit elements such as headings, paragraphs, lists,
688 character highlighting, and links. Most HTML elements are
689 identified in a document as a start-tag, which gives the element
690 name and attributes, followed by the content, followed by the
691 end tag. Start-tags are delimited by `<' and `>'; end tags are
692 delimited by `</' and `>'. An example is:
693
694 <H1>This is a Heading</H1>
695
696 Some elements only have a start-tag without an end-tag. For
697 example, to create a line break, you use the `<BR>' tag.
698 Additionally, the end tags of some other elements, such as
699 Paragraph (`</P>'), List Item (`</LI>'), Definition Term
700 (`</DT>'), and Definition Description (`<DD>') elements, may be
701 omitted.
702
703 The content of an element is a sequence of data character
704 strings and nested elements. Some elements, such as anchors,
705 cannot be nested. Anchors and character highlighting may be put
706 inside other constructs. See the HTML DTD, 9.1, "HTML DTD" for
707
708 Berners-Lee, Connolly [Page 12]
709
710 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
711
712 full details.
713
714 NOTE - The SGML declaration for HTML specifies SHORTTAG
715 YES, which means that there are other valid syntaxes for
716 tags, such as NET tags, `<EM/.../'; empty start tags,
717 `<>'; and empty end-tags, `</>'. Until support for these
718 idioms is widely deployed, their use is strongly
719 discouraged.
720
721
722 3.2.3. Names
723
724 A name consists of a letter followed by letters, digits,
725 periods, or hyphens. The length of a name is limited to 72
726 characters by the `NAMELEN' parameter in the SGML delcaration
727 for HTML, 9.5, "SGML Declaration for HTML". Element and
728 attribute names are not case sensitive, but entity names are.
729 For example, `<BLOCKQUOTE>', `<BlockQuote>', and `<blockquote>'
730 are equivalent, whereas `&amp;' is different from `&AMP;'.
731
732 In a start-tag, the element name must immediately follow the tag
733 open delimiter `<'.
734
735
736 3.2.4. Attributes
737
738 In a start-tag, white space and attributes are allowed between
739 the element name and the closing delimiter. An attribute
740 specification typically consists of an attribute name, an equal
741 sign, and a value, though some attribute specifications may be
742 just a name token. White space is allowed around the equal sign.
743
744 The value of the attribute may be either:
745
746 * A string literal, delimited by single quotes or double
747 quotes and not containing any occurrences of the delimiting
748 character.
749
750 NOTE - Some historical implementations consider any
751 occurrence of the `>' character to signal the end of
752 a tag. For compatibility with such implementations,
753 when `>' appears in an attribute value, it should be
754 represented with a numeric character reference. For
755 example, `<IMG SRC="eq1.jpg" alt="a>b">' should be
756 written `<IMG SRC="eq1.jpg" alt="a&#62;b">' or `<IMG
757 SRC="eq1.jpg" alt="a&gt;b">'.
758
759 * A name token (a sequence of letters, digits, periods, or
760 hyphens). Name tokens are not case sensitive.
761
762 NOTE - Some historical implementations allow any
763 character except space or `>' in a name token.
764
765 In this example, <img> is the element name, src is the attribute
766
767 Berners-Lee, Connolly [Page 13]
768
769 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
770
771 name, and `http://host/dir/file.gif' is the attribute value:
772
773 <img src='http://host/dir/file.gif'>
774
775 A useful technique for computing an attribute value literal for
776 a given string is to replace each quote and white space
777 character by an entity reference or numeric character reference
778 as follows:
779
780 ENTITY NUMERIC
781 CHARACTER REFERENCE CHAR REF CHARACTER DESCRIPTION
782 --------- ---------- ----------- ---------------------
783 HT &#9; Tab
784 LF &#10; Line Feed
785 CR &#13; Carriage Return
786 SP &#32; Space
787 " &quot; &#34; Quotation mark
788 & &amp; &#38; Ampersand
789
790 For example:
791
792 <IMG SRC="image.jpg" alt="First &quot;real&quot; example">
793
794 The `NAMELEN' parameter in the SGML declaration (9.5, "SGML
795 Declaration for HTML") limits the length of an attribute value
796 to 1024 characters.
797
798 Attributes such as ISMAP and COMPACT may be written using a
799 minimized syntax (see 7.9.1.2 ``Omitted Attribute Name'' in
800 [SGML]). The markup:
801
802 <UL COMPACT="compact">
803
804 can be written using a minimized syntax:
805
806 <UL COMPACT>
807
808 NOTE - Some historical implementations only understand
809 the minimized syntax.
810
811
812 3.2.5. Comments
813
814 To include comments in an HTML document, use a comment
815 declaration. A comment declaration consists of `<!' followed by
816 zero or more comments followed by `>'. Each comment starts with
817 `--' and includes all text up to and including the next
818 occurrence of `--'. In a comment declaration, white space is
819 allowed after each comment, but not before the first comment.
820 The entire comment declaration is ignored.
821
822 NOTE - Some historical HTML implementations incorrectly
823 consider any `>' character to be the termination of a
824 comment.
825
826 Berners-Lee, Connolly [Page 14]
827
828 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
829
830
831 For example:
832
833 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
834 <HEAD>
835 <TITLE>HTML Comment Example</TITLE>
836 <!-- Id: html-sgml.sgm,v 1.5 1995/05/26 21:29:50 connolly Exp -->
837 <!-- another -- -- comment -->
838 <!>
839 </HEAD>
840 <BODY>
841 <p> <!- not a comment, just regular old data characters ->
842
843
844 3.3. HTML Public Text Identifiers
845
846 To identify information as an HTML document conforming to this
847 specification, each document should start with one of the
848 following document type declarations.
849
850 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
851
852 This document type declaration refers to the HTML DTD in 9.1,
853 "HTML DTD".
854
855 NOTE - If the body of a `text/html' message entity does
856 not begin with a document type declaration, an HTML user
857 agent should infer the above document type declaration.
858
859 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0 Level 2//EN">
860
861 This document type declaration also refers to the HTML DTD which
862 appears in 9.1, "HTML DTD".
863
864 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0 Level 1//EN">
865
866 This document type declaration refers to the level 1 HTML DTD in
867 9.3, "Level 1 HTML DTD". Form elements must not occur in level 1
868 documents.
869
870 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0 Strict//EN">
871 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0 Strict Level 1//EN">
872
873 These two document type declarations refer to the HTML DTD in
874 9.2, "Strict HTML DTD" and 9.4, "Strict Level 1 HTML DTD". They
875 refer to the more structurally rigid definition of HTML.
876
877 HTML user agents may support other document types. In
878 particular, they may support other formal public identifiers, or
879 other document types altogether. They may support an internal
880 declaration subset with supplemental entity, element, and other
881 markup declarations.
882
883
884
885 Berners-Lee, Connolly [Page 15]
886
887 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
888
889 3.4. Example HTML Document
890
891 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
892 <HTML>
893 <!-- Here's a good place to put a comment. -->
894 <HEAD>
895 <TITLE>Structural Example</TITLE>
896 </HEAD><BODY>
897 <H1>First Header</H1>
898 <P>This is a paragraph in the example HTML file. Keep in mind
899 that the title does not appear in the document text, but that
900 the header (defined by H1) does.</P>
901 <OL>
902 <LI>First item in an ordered list.
903 <LI>Second item in an ordered list.
904 <UL COMPACT>
905 <LI> Note that lists can be nested;
906 <LI> Whitespace may be used to assist in reading the
907 HTML source.
908 </UL>
909 <LI>Third item in an ordered list.
910 </OL>
911 <P>This is an additional paragraph. Technically, end tags are
912 not required for paragraphs, although they are allowed. You can
913 include character highlighting in a paragraph. <EM>This sentence
914 of the paragraph is emphasized.</EM> Note that the &lt;/P&gt;
915 end tag has been omitted.
916 <P>
917 <IMG SRC ="triangle.xbm" alt="Warning: ">
918 Be sure to read these <b>bold instructions</b>.
919 </BODY></HTML>
920
921
922 4. HTML as an Internet Media Type
923
924 An HTML user agent allows users to interact with resources which
925 have HTML representations. At a minimum, it must allow users to
926 examine and navigate the content of HTML level 1 documents. HTML
927 user agents should be able to preserve all formatting
928 distinctions represented in an HTML document, and be able to
929 simultaneously present resources referred to by IMG elements
930 (they may ignore some formatting distinctions or IMG resources
931 at the request of the user). Level 2 HTML user agents should
932 support form entry and submission.
933
934
935 4.1. text/html media type
936
937 This specification defines the Internet Media Type[IMEDIA]
938 (formerly referred to as the Content Type[MIME]) called
939 `text/html'. The following is to be registered with [IANA].
940
941 Media Type name
942 text
943
944 Berners-Lee, Connolly [Page 16]
945
946 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
947
948
949 Media subtype name
950 html
951
952 Required parameters
953 none
954
955 Optional parameters
956 level, charset
957
958 Encoding considerations
959 any encoding is allowed
960
961 Security considerations
962 see 10, "Security Considerations"
963
964 The optional parameters are defined as follows:
965
966 Level
967 The level parameter specifies the feature set used in
968 the document. The level is an integer number, implying
969 that any features of same or lower level may be present
970 in the document. Level 1 is all features defined in this
971 specification except those that require the <FORM>
972 element. Level 2 includes form processing. Level 2 is
973 the default.
974
975 Charset
976 The charset parameter (as defined in section 7.1.1 of
977 RFC 1521[MIME]) may be given to specify the character
978 encoding scheme used to represent the HTML document as a
979 sequence of octets. The default value is outside the
980 scope of this specification; but for example, the
981 default is `US-ASCII' in the context of MIME mail, and
982 `ISO-8859-1' in the context of HTTP.
983
984
985 4.2. HTML Document Representation
986
987 A message entity with a content type of `text/html' represents
988 an HTML document, consisting of a single text entity. The
989 `charset' parameter (whether implicit or explicit) identifies a
990 character encoding scheme. The text entity consists of the
991 characters determined by this character encoding scheme and the
992 octets of the body of the message entity.
993
994
995 4.2.1. Undeclared Markup Error Handling
996
997 To facilitate experimentation and interoperability between
998 implementations of various versions of HTML, the installed base
999 of HTML user agents supports a superset of the HTML 2.0 language
1000 by reducing it to HTML 2.0: markup in the form of a start-tag or
1001 end-tag, whose generic identifier is not declared is mapped to
1002
1003 Berners-Lee, Connolly [Page 17]
1004
1005 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1006
1007 nothing during tokenization. Undeclared attributes are treated
1008 similarly. The entire attribute specification of an unknown
1009 attribute (i.e., the unknown attribute and its value, if any)
1010 should be ignored. On the other hand, references to undeclared
1011 entities should be treated as data characters.
1012
1013 For example:
1014
1015 <div class=chapter><h1>foo</h1><p>...</div>
1016 => <H1>,"foo",</H1>,<P>,"..."
1017 xxx <P ID=z23> yyy
1018 => "xxx ",<P>," yyy
1019 Let &alpha; &amp; &beta; be finite sets.
1020 => "Let &alpha; & &beta; be finite sets."
1021
1022 Support for notifying the user of such errors is encouraged.
1023
1024 Information providers are warned that this convention is not
1025 binding: unspecified behavior may result, as such markup does
1026 not conform to this specification.
1027
1028
1029 4.2.2. Conventional Representation of Newlines
1030
1031 SGML specifies that a text entity is a sequence of records, each
1032 beginning with a record start character and ending with a record
1033 end character (code positions 10 and 13 respectively) (section
1034 7.6.1, ``Record Boundaries'' in [SGML]).
1035
1036 [MIME] specifies that a body of type `text/*' is a sequence of
1037 lines, each terminated by CRLF, that is, octets 13, 10.
1038
1039 In practice, HTML documents are frequently represented and
1040 transmitted using an end of line convention that depends on the
1041 conventions of the source of the document; frequently, that
1042 representation consists of CR only, LF only, or a CR LF
1043 sequence. Hence the decoding of the octets will often result in
1044 a text entity with some missing record start and record end
1045 characters.
1046
1047 Since there is no ambiguity, HTML user agents are encouraged to
1048 infer the missing record start and end characters.
1049
1050 An HTML user agent should treat end of line in any of its
1051 variations as a word space in all contexts except preformatted
1052 text. Within preformatted text, an HTML user agent should treat
1053 any of the three common representations of end-of-line as
1054 starting a new line.
1055
1056
1057 5. Document Structure
1058
1059 An HTML document is a tree of elements, including a head and
1060 body, headings, paragraphs, lists, etc. Form elements are
1061
1062 Berners-Lee, Connolly [Page 18]
1063
1064 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1065
1066 discussed in 8, "Forms".
1067
1068
1069 5.1. Document Element: HTML
1070
1071 The HTML document element consists of a head and a body, much
1072 like a memo or a mail message. The head contains the title and
1073 optional elements. The body is a text flow consisting of
1074 paragraphs, lists, and other elements.
1075
1076
1077 5.2. Head: HEAD
1078
1079 The head of an HTML document is an unordered collection of
1080 information about the document. For example:
1081
1082 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
1083 <HEAD>
1084 <TITLE>Introduction to HTML</TITLE>
1085 </HEAD>
1086 ...
1087
1088
1089 5.2.1. Title: TITLE
1090
1091 Every HTML document must contain a <TITLE> element.
1092
1093 The title should identify the contents of the document in a
1094 global context. A short title, such as ``Introduction'' may be
1095 meaningless out of context. A title such as ``Introduction to
1096 HTML Elements'' is more appropriate.
1097
1098 NOTE - The length of a title is not limited; however,
1099 long titles may be truncated in some applications. To
1100 minimize this possibility, titles should be fewer than
1101 64 characters.
1102
1103 A user agent may display the title of a document in a history
1104 list or as a label for the window displaying the document. This
1105 differs from headings (5.4, "Headings: H1 ... H6"), which are
1106 typically displayed within the body text flow.
1107
1108
1109 5.2.2. Base Address: BASE
1110
1111 The optional <BASE> element specifies the base address for
1112 resolving relative links from the document, overriding any
1113 context otherwise known to the user agent. The required HREF
1114 attribute specifies the URI for navigating the document (see 7,
1115 "Hyperlinks"). The value of the HREF attribute must be an
1116 absolute URI.
1117
1118
1119
1120
1121 Berners-Lee, Connolly [Page 19]
1122
1123 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1124
1125 5.2.3. Keyword Index: ISINDEX
1126
1127 The <ISINDEX> element indicates that the user agent should allow
1128 the user to search an index by giving keywords. See 7.5,
1129 "Queries and Indexes" for details.
1130
1131
1132 5.2.4. Link: LINK
1133
1134 The <LINK> element represents a hyperlink (see 7, "Hyperlinks").
1135 It has the same attributes as the <A> element (see 5.7.3,
1136 "Anchor: A").
1137
1138 The <LINK> element is typically used to indicate authorship,
1139 related indexes and glossaries, older or more recent versions,
1140 style sheets, document hierarchy etc.
1141
1142
1143 5.2.5. Associated Meta-information: META
1144
1145 The <META> element is an extensible container for use in
1146 identifying specialized document meta-information.
1147 Meta-information has two main functions:
1148
1149 * to provide a means to discover that the data set exists
1150 and how it might be obtained or accessed; and
1151
1152 * to document the content, quality, and features of a data
1153 set, indicating its fitness for use.
1154
1155 Each <META> element specifies a name/value pair. If multiple
1156 META elements are provided with the same name, their combined
1157 contents--concatenated as a comma-separated list--is the value
1158 associated with that name.
1159
1160 NOTE - The <META> element should not be used where a
1161 specific element, such as <TITLE>, would be more
1162 appropriate.
1163
1164 HTTP servers may read the content of the document <HEAD> to
1165 generate header fields corresponding to any elements defining a
1166 value for the attribute HTTP-EQUIV.
1167
1168 NOTE - The method by which the server extracts document
1169 meta-information is unspecified and not mandatory. The
1170 <META> element only provides an extensible mechanism for
1171 identifying and embedding document meta-information --
1172 how it may be used is up to the individual server
1173 implementation and the HTML user agent.
1174
1175 Attributes of the META element:
1176
1177 HTTP-EQUIV
1178 binds the element to an HTTP header field. An HTTP
1179
1180 Berners-Lee, Connolly [Page 20]
1181
1182 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1183
1184 server may use this information to process the document.
1185 In particular, it may include a header field in the
1186 responses to requests for this document: the header name
1187 is taken from the HTTP-EQUIV attribute value, and the
1188 header value is taken from the value of the CONTENT
1189 attribute. HTTP header names are not case sensitive.
1190
1191 NAME
1192 specifies the name of the name/value pair. If not
1193 present, HTTP-EQUIV gives the name.
1194
1195 CONTENT
1196 specifies the value of the name/value pair.
1197
1198 Examples
1199
1200 If the document contains:
1201
1202 <META HTTP-EQUIV="Expires"
1203 CONTENT="Tue, 04 Dec 1993 21:29:02 GMT">
1204 <meta http-equiv="Keywords" CONTENT="Fred">
1205 <META HTTP-EQUIV="Reply-to"
1206 content="fielding@ics.uci.edu (Roy Fielding)">
1207 <Meta Http-equiv="Keywords" CONTENT="Barney">
1208
1209 then the server may include the following header fields:
1210
1211 Expires: Tue, 04 Dec 1993 21:29:02 GMT
1212 Keywords: Fred, Barney
1213 Reply-to: fielding@ics.uci.edu (Roy Fielding)
1214
1215 as part of the HTTP response to a `GET' or `HEAD' request for
1216 that document.
1217
1218 An HTTP server must not use the <META> element to form an HTTP
1219 response header unless the HTTP-EQUIV attribute is present.
1220
1221 An HTTP server may disregard any <META> elements that specify
1222 information controlled by the HTTP server, for example `Server',
1223 `Date', and `Last-modified'.
1224
1225
1226 5.2.6. Next Id: NEXTID
1227
1228 The <NEXTID> element is included for historical reasons only.
1229 HTML document should not contain <NEXTID> elements.
1230
1231 The <NEXTID> element gives a hint for the name to use for a new
1232 <A> element when editing an HTML document. It should be distinct
1233 from all NAME attribute values on <A> elements. For example:
1234
1235 <NEXTID N=Z27>
1236
1237
1238
1239 Berners-Lee, Connolly [Page 21]
1240
1241 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1242
1243 5.3. Body: BODY
1244
1245 The <BODY> element contains the text flow of the document,
1246 including headings, paragraphs, lists, etc.
1247
1248 For example:
1249
1250 <BODY>
1251 <h1>Important Stuff</h1>
1252 <p>Explanation about important stuff...
1253 </BODY>
1254
1255
1256 5.4. Headings: H1 ... H6
1257
1258 The six heading elements, <H1> through <H6>, denote section
1259 headings. Although the order and occurrence of headings is not
1260 constrained by the HTML DTD, documents should not skip levels
1261 (for example, from H1 to H3), as converting such documents to
1262 other representations is often problematic.
1263
1264 Example of use:
1265
1266 <H1>This is a heading</H1>
1267 Here is some text
1268 <H2>Second level heading</H2>
1269 Here is some more text.
1270
1271 Typical renderings are:
1272
1273 H1
1274 Bold, very-large font, centered. One or two blank lines
1275 above and below.
1276
1277 H2
1278 Bold, large font, flush-left. One or two blank lines
1279 above and below.
1280
1281 H3
1282 Italic, large font, slightly indented from the left
1283 margin. One or two blank lines above and below.
1284
1285 H4
1286 Bold, normal font, indented more than H3. One blank line
1287 above and below.
1288
1289 H5
1290 Italic, normal font, indented as H4. One blank line
1291 above.
1292
1293 H6
1294 Bold, indented same as normal text, more than H5. One
1295 blank line above.
1296
1297
1298 Berners-Lee, Connolly [Page 22]
1299
1300 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1301
1302 5.5. Block Structuring Elements
1303
1304 Block structuring elements include paragraphs, lists, and block
1305 quotes. They must not contain heading elements, but they may
1306 contain phrase markup, and in some cases, they may be nested.
1307
1308
1309 5.5.1. Paragraph: P
1310
1311 The <P> element indicates a paragraph. The exact indentation,
1312 leading space, etc. of a paragraph is not specified and may be a
1313 function of other tags, style sheets, etc.
1314
1315 Typically, paragraphs are surrounded by a vertical space of one
1316 line or half a line. The first line in a paragraph is indented
1317 in some cases.
1318
1319 Example of use:
1320
1321 <H1>This Heading Precedes the Paragraph</H1>
1322 <P>This is the text of the first paragraph.
1323 <P>This is the text of the second paragraph. Although you do not
1324 need to start paragraphs on new lines, maintaining this
1325 convention facilitates document maintenance.</P>
1326 <P>This is the text of a third paragraph.</P>
1327
1328
1329 5.5.2. Preformatted Text: PRE
1330
1331 The <PRE> element represents a character cell block of text and
1332 is suitable for text that has been formatted for a monospaced
1333 font.
1334
1335 The <PRE> tag may be used with the optional WIDTH attribute. The
1336 WIDTH attribute specifies the maximum number of characters for a
1337 line and allows the HTML user agent to select a suitable font
1338 and indentation.
1339
1340 Within preformatted text:
1341
1342 * Line breaks within the text are rendered as a move to the
1343 beginning of the next line.
1344
1345 NOTE - References to the ``beginning of a new line''
1346 do not imply that the renderer is forbidden from
1347 using a constant left indent for rendering
1348 preformatted text. The left indent may be
1349 constrained by the width required.
1350
1351 * Anchor elements and phrase markup may be used.
1352
1353 NOTE - Constraints on the processing of <PRE>
1354 content may may limit or prevent the ability of the
1355 HTML user agent to faithfully render phrase markup.
1356
1357 Berners-Lee, Connolly [Page 23]
1358
1359 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1360
1361
1362 * Elements that define paragraph formatting (headings,
1363 address, etc.) must not be used.
1364
1365 NOTE - Some historical documents contain <P> tags in
1366 <PRE> elements. User agents are encouraged to treat
1367 this as a line break. A <P> tag followed by a
1368 newline character should produce only one line
1369 break, not a line break plus a blank line.
1370
1371 * The horizontal tab character (code position 9 in the HTML
1372 document character set) must be interpreted as the smallest
1373 positive nonzero number of spaces which will leave the
1374 number of characters so far on the line as a multiple of 8.
1375 Documents should not contain tab characters, as they are not
1376 supported consistently.
1377
1378 Example of use:
1379
1380 <PRE>
1381 Line 1.
1382 Line 2 is to the right of line 1. <a href="abc">abc</a>
1383 Line 3 aligns with line 2. <a href="def">def</a>
1384 </PRE>
1385
1386
1387 5.5.2.1. Example and Listing: XMP, LISTING
1388
1389 The <XMP> and <LISTING> elements are similar to the <PRE>
1390 element, but they have a different syntax. Their content is
1391 declared as CDATA, which means that no markup except the end-tag
1392 open delimiter-in-context is recognized (see 9.6 ``Delimiter
1393 Recognition'' of [SGML]).
1394
1395 NOTE - In a previous draft of the HTML specification,
1396 the syntax of <XMP> and <LISTING> elements allowed
1397 closing tags to be treated as data characters, as long
1398 as the tag name was not <XMP> or <LISTING>,
1399 respectively.
1400
1401 Since CDATA declared content has a number of unfortunate
1402 interactions with processing techniques and tends to be used and
1403 implemented inconsistently, HTML documents should not contain
1404 <XMP> nor <LISTING> elements -- the <PRE> tag is more expressive
1405 and more consistently supported.
1406
1407 The <LISTING> element should be rendered so that at least 132
1408 characters fit on a line. The <XMP> element should be rendered
1409 so that at least 80 characters fit on a line but is otherwise
1410 identical to the <LISTING> element.
1411
1412 NOTE - In a previous draft, HTML included a <PLAINTEXT>
1413 element that is similar to the <LISTING> element, except
1414 that there is no closing tag: all characters after the
1415
1416 Berners-Lee, Connolly [Page 24]
1417
1418 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1419
1420 <PLAINTEXT> start-tag are data.
1421
1422
1423 5.5.3. Address: ADDRESS
1424
1425 The <ADDRESS> element contains such information as address,
1426 signature and authorship, often at the beginning or end of the
1427 body of a document.
1428
1429 Typically, the <ADDRESS> element is rendered in an italic
1430 typeface and may be indented.
1431
1432 Example of use:
1433
1434 <ADDRESS>
1435 Newsletter editor<BR>
1436 J.R. Brown<BR>
1437 JimquickPost News, Jimquick, CT 01234<BR>
1438 Tel (123) 456 7890
1439 </ADDRESS>
1440
1441
1442 5.5.4. Block Quote: BLOCKQUOTE
1443
1444 The <BLOCKQUOTE> element contains text quoted from another
1445 source.
1446
1447 A typical rendering might be a slight extra left and right
1448 indent, and/or italic font. The <BLOCKQUOTE> typically provides
1449 space above and below the quote.
1450
1451 Single-font rendition may reflect the quotation style of
1452 Internet mail by putting a vertical line of graphic characters,
1453 such as the greater than symbol (>), in the left margin.
1454
1455 Example of use:
1456
1457 I think the poem ends
1458 <BLOCKQUOTE>
1459 <P>Soft you now, the fair Ophelia. Nymph, in thy orisons, be all
1460 my sins remembered.
1461 </BLOCKQUOTE>
1462 but I am not sure.
1463
1464
1465 5.6. List Elements
1466
1467 HTML includes a number of list elements. They may be used in
1468 combination; for example, a <OL> may be nested in an <LI>
1469 element of a <UL>.
1470
1471 The COMPACT attribute suggests that a compact rendering be used.
1472
1473
1474
1475 Berners-Lee, Connolly [Page 25]
1476
1477 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1478
1479 5.6.1. Unordered List: UL, LI
1480
1481 The <UL> represents a list of items -- typically a bulleted
1482 list.
1483
1484 The content of a <UL> element is a sequence of <LI> elements.
1485 For example:
1486
1487 <UL>
1488 <LI>First list item
1489 <LI>Second list item
1490 <p>second paragraph of second item
1491 <LI>Third list item
1492 </UL>
1493
1494
1495 5.6.2. Ordered List: OL
1496
1497 The <OL> element represents an ordered list of items, sorted by
1498 sequence or order of importance. It is typically rendered as a
1499 numbered list.
1500
1501 The content of a <OL> element is a sequence of <LI> elements.
1502 For example:
1503
1504 <OL>
1505 <LI>Click the Web button to open URI window.
1506 <LI>Enter the URI number in the text field of the Open URI
1507 window. The Web document you specified is displayed.
1508 <ol>
1509 <li>substep 1
1510 <li>substep 2
1511 </ol>
1512 <LI>Click highlighted text to move from one link to another.
1513 </OL>
1514
1515
1516 5.6.3. Directory List: DIR
1517
1518 The <DIR> element is similar to the <UL> element. It represents
1519 a list of short items, typically up to 20 characters each. Items
1520 in a directory list may be arranged in columns, typically 24
1521 characters wide.
1522
1523 The content of a <DIR> element is a sequence of <LI> elements.
1524 Nested block elements are not allowed in the content of <DIR>
1525 elements. For example:
1526
1527 <DIR>
1528 <LI>A-H<LI>I-M
1529 <LI>M-R<LI>S-Z
1530 </DIR>
1531
1532
1533
1534 Berners-Lee, Connolly [Page 26]
1535
1536 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1537
1538 5.6.4. Menu List: MENU
1539
1540 The <MENU> element is a list of items with typically one line
1541 per item. The menu list style is typically more compact than the
1542 style of an unordered list.
1543
1544 The content of a <MENU> element is a sequence of <LI> elements.
1545 Nested block elements are not allowed in the content of <MENU>
1546 elements. For example:
1547
1548 <MENU>
1549 <LI>First item in the list.
1550 <LI>Second item in the list.
1551 <LI>Third item in the list.
1552 </MENU>
1553
1554
1555 5.6.5. Definition List: DL, DT, DD
1556
1557 A definition list is a list of terms and corresponding
1558 definitions. Definition lists are typically formatted with the
1559 term flush-left and the definition, formatted paragraph style,
1560 indented after the term.
1561
1562 The content of a <DL> element is a sequence of <DT> elements
1563 and/or <DD> elements, usually in pairs. Multiple <DT> may be
1564 paired with a single <DD> element. Documents should not contain
1565 multiple consecutive <DD> elements.
1566
1567 Example of use:
1568
1569 <DL>
1570 <DT>Term<DD>This is the definition of the first term.
1571 <DT>Term<DD>This is the definition of the second term.
1572 </DL>
1573
1574 If the DT term does not fit in the DT column (typically one
1575 third of the display area), it may be extended across the page
1576 with the DD section moved to the next line, or it may be wrapped
1577 onto successive lines of the left hand column.
1578
1579 The optional COMPACT attribute suggests that a compact rendering
1580 be used, because the list items are small and/or the entire list
1581 is large.
1582
1583 Unless the COMPACT attribute is present, an HTML user agent may
1584 leave white space between successive DT, DD pairs. The COMPACT
1585 attribute may also reduce the width of the left-hand (DT)
1586 column.
1587
1588 <DL COMPACT>
1589 <DT>Term<DD>This is the first definition in compact format.
1590 <DT>Term<DD>This is the second definition in compact format.
1591 </DL>
1592
1593 Berners-Lee, Connolly [Page 27]
1594
1595 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1596
1597
1598
1599 5.7. Phrase Markup
1600
1601 Phrases may be marked up according to idiomatic usage,
1602 typographic appearance, or for use as hyperlink anchors.
1603
1604 User agents must render highlighted phrases distinctly from
1605 plain text. Additionally, <EM> content must be rendered as
1606 distinct from <STRONG> content, and <B> content must rendered as
1607 distinct from <I> content.
1608
1609 Phrase elements may be nested within the content of other phrase
1610 elements; however, HTML user agents may render nested phrase
1611 elements indistinctly from non-nested elements:
1612
1613 plain <B>bold <I>italic</I></B> may be rendered
1614 the same as plain <B>bold </B><I>italic</I>
1615
1616
1617 5.7.1. Idiomatic Elements
1618
1619 Phrases may be marked up to indicate certain idioms.
1620
1621 NOTE - User agents may support the <DFN> element, not
1622 included in this specification, as it has been deployed
1623 to some extent. It is used to indicate the defining
1624 instance of a term, and it is typically rendered in
1625 italic or bold italic.
1626
1627
1628 5.7.1.1. Citation: CITE
1629
1630 The <CITE> element is used to indicate the title of a book or
1631 other citation. It is typically rendered as italics. For
1632 example:
1633
1634 He just couldn't get enough of <cite>The Grapes of Wrath</cite>.
1635
1636
1637 5.7.1.2. Code: CODE
1638
1639 The <CODE> element indicates an example of code, typically
1640 rendered in a mono-spaced font. The <CODE> element is intended
1641 for short words or phrases of code; the <PRE> block structuring
1642 element (5.5.2, "Preformatted Text: PRE") is more apropriate for
1643 multiple-line listings. For example:
1644
1645 The expression <code>x += 1</code>
1646 is short for <code>x = x + 1</code>.
1647
1648
1649
1650
1651
1652 Berners-Lee, Connolly [Page 28]
1653
1654 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1655
1656 5.7.1.3. Emphasis: EM
1657
1658 The <EM> element indicates an emphasized phrase, typically
1659 rendered as italics. For example:
1660
1661 A singular subject <em>always</em> takes a singular verb.
1662
1663
1664 5.7.1.4. Keyboard: KBD
1665
1666 The <KBD> element indicates text typed by a user, typically
1667 rendered in a mono-spaced font. This is commonly used in
1668 instruction manuals. For example:
1669
1670 Enter <kbd>FIND IT</kbd> to search the database.
1671
1672
1673 5.7.1.5. Sample: SAMP
1674
1675 The <SAMP> element indicates a sequence of literal characters,
1676 typically rendered in a mono-spaced font. For example:
1677
1678 The only word containing the letters <samp>mt</samp> is dreamt.
1679
1680
1681 5.7.1.6. Strong Emphasis: STRONG
1682
1683 The <STRONG> element indicates strong emphasis, typically
1684 rendered in bold. For example:
1685
1686 <strong>STOP</strong>, or I'll say "<strong>STOP</strong>" again!.
1687
1688
1689 5.7.1.7. Variable: VAR
1690
1691 The <VAR> element indicates a placeholder variable, typically
1692 rendered as italic. For example:
1693
1694 Type <SAMP>html-check <VAR>file</VAR> | more</SAMP>
1695 to check <VAR>file</VAR> for markup errors.
1696
1697
1698 5.7.2. Typographic Elements
1699
1700 Typographic elements are used to specify the format of marked
1701 text.
1702
1703 Typical renderings for idiomatic elements may vary between user
1704 agents. If a specific rendering is necessary -- for example,
1705 when referring to a specific text attribute as in ``The italic
1706 parts are mandatory'' -- a typographic element can be used to
1707 ensure that the intended typography is used where possible.
1708
1709 NOTE - User agents may support some typographic elements
1710
1711 Berners-Lee, Connolly [Page 29]
1712
1713 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1714
1715 not included in this specification, as they have been
1716 deployed to some extent. The <STRIKE> element indicates
1717 horizontal line through the characters, and the <U>
1718 element indicates an underline.
1719
1720
1721 5.7.2.1. Bold: B
1722
1723 The <B> element indicates bold text. Where bold typography is
1724 unavailable, an alternative representation may be used.
1725
1726
1727 5.7.2.2. Italic: I
1728
1729 The <I> element indicates italic text. Where italic typography
1730 is unavailable, an alternative representation may be used.
1731
1732
1733 5.7.2.3. Teletype: TT
1734
1735 The <TT> element indicates teletype (monospaced )text. Where a
1736 teletype font is unavailable, an alternative representation may
1737 be used.
1738
1739
1740 5.7.3. Anchor: A
1741
1742 The <A> element indicates a hyperlink anchor (see 7,
1743 "Hyperlinks"). At least one of the NAME and HREF attributes
1744 should be present. Attributes of the <A> element:
1745
1746 HREF
1747 gives the URI of the head anchor of a hyperlink.
1748
1749 NAME
1750 gives the name of the anchor, and makes it available as
1751 a head of a hyperlink.
1752
1753 TITLE
1754 suggests a title for the destination resource --
1755 advisory only. The TITLE attribute may be used:
1756
1757 * for display prior to accessing the destination
1758 resource, for example, as a margin note or on a
1759 small box while the mouse is over the anchor, or
1760 while the document is being loaded;
1761
1762 * for resources that do not include a title, such as
1763 graphics, plain text and Gopher menus, for use as a
1764 window title.
1765
1766 REL
1767 The REL attribute gives the relationship(s) described by
1768 the hyperlink. The value is a whitespace separated list
1769
1770 Berners-Lee, Connolly [Page 30]
1771
1772 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1773
1774 of relationship names.
1775
1776 REV
1777 same as the REL attribute, but the semantics of the
1778 relationship are in the reverse direction. A link from A
1779 to B with REL=``X'' expresses the same relationship as a
1780 link from B to A with REV=``X''. An anchor may have both
1781 REL and REV attributes.
1782
1783 URN
1784 specifies a preferred, more persistent identifier for
1785 the head anchor of the hyperlink. The syntax and
1786 semantics of the URN attribute are not yet specified.
1787
1788 METHODS
1789 specifies methods to be used in accessing the
1790 destination, as a whitespace-separated list of names.
1791 The set of applicable names is a function of the scheme
1792 of the URI in the HREF attribute. For similar reasons as
1793 for the TITLE attribute, it may be useful to include the
1794 information in advance in the link. For example, the
1795 HTML user agent may chose a different rendering as a
1796 function of the methods allowed; for example, something
1797 that is searchable may get a different icon.
1798
1799
1800 5.8. Line Break: BR
1801
1802 The <BR> element specifies a line break between words (see 6,
1803 "Characters, Words, and Paragraphs"). For example:
1804
1805 <P> Pease porridge hot<BR>
1806 Pease porridge cold<BR>
1807 Pease porridge in the pot<BR>
1808 Nine days old.
1809
1810
1811 5.9. Horizontal Rule: HR
1812
1813 The <HR> element is a divider between sections of text;
1814 typically a full width horizontal rule or equivalent graphic.
1815 For example:
1816
1817 <HR>
1818 <ADDRESS>February 8, 1995, CERN</ADDRESS>
1819 </BODY>
1820
1821
1822 5.10. Image: IMG
1823
1824 The <IMG> element refers to an image or icon via a hyperlink
1825 (see 7.3, "Simultaneous Presentation of Image Resources").
1826
1827 HTML user agents may process the value of the ALT attribute as
1828
1829 Berners-Lee, Connolly [Page 31]
1830
1831 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1832
1833 an alternative to processing the image resource indicated by the
1834 SRC attribute.
1835
1836 NOTE - Some HTML user agents can process graphics linked
1837 via anchors, but not <IMG> graphics. If a graphic is
1838 essential, it should be referenced from an <A> element
1839 rather than an <IMG> element. If the graphic is not
1840 essential, then the <IMG> element is appropriate.
1841
1842 Attributes of the <IMG> element:
1843
1844 ALIGN
1845 alignment of the image with respect to the text
1846 baseline.
1847
1848 * `TOP' specifies that the top of the image aligns
1849 with the tallest item on the line containing the
1850 image.
1851
1852 * `MIDDLE' specifies that the center of the image
1853 aligns with the baseline of the line containing the
1854 image.
1855
1856 * `BOTTOM' specifies that the bottom of the image
1857 aligns with the baseline of the line containing the
1858 image.
1859
1860 ALT
1861 text to use in place of the referenced image resource,
1862 for example due to processing constraints or user
1863 preference.
1864
1865 ISMAP
1866 indicates an image map (see 7.6, "Image Maps").
1867
1868 SRC
1869 specifies the URI of the image resource.
1870
1871 NOTE - In practice, the media types of image
1872 resources are limited to a few raster graphic
1873 formats: typically `image/gif', `image/jpeg'. In
1874 particular, `text/html' resources are not
1875 intended to be used as image resources.
1876
1877 Examples of use:
1878
1879 <IMG SRC="triangle.xbm" ALT="Warning:"> Be sure
1880 to read these instructions.
1881
1882 <a href="http://machine/htbin/imagemap/sample">
1883 <IMG SRC="sample.xbm" ISMAP>
1884 </a>
1885
1886
1887
1888 Berners-Lee, Connolly [Page 32]
1889
1890 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1891
1892 6. Characters, Words, and Paragraphs
1893
1894 An HTML user agent should present the body of an HTML document
1895 as a collection of typeset paragraphs and preformatted text.
1896 Except for preformatted elements (<PRE>, <XMP>, <LISTING>,
1897 <TEXTAREA>), each block structuring element is regarded as a
1898 paragraph by taking the data characters in its content and the
1899 content of its descendant elements, concatenating them, and
1900 splitting the result into words, separated by space, tab, or
1901 record end characters (and perhaps hyphen characters). The
1902 sequence of words is typeset as a paragraph by breaking it into
1903 lines.
1904
1905
1906 6.1. The HTML Document Character Set
1907
1908 The document character set specified in 9.5, "SGML Declaration
1909 for HTML" must be supported by HTML user agents. It includes the
1910 graphic characters of Latin Alphabet No. 1, or simply Latin-1.
1911 Latin-1 comprises 191 graphic characters, including the
1912 alphabets of most Western European languages.
1913
1914 NOTE - Use the non-breaking space and soft hyphen
1915 indicator characters is discouraged because support for
1916 them is not widely deployed.
1917
1918 NOTE - To support non-western writing systems, a larger
1919 character repertoire will be specified in a future
1920 version of HTML. The document character set will be
1921 [ISO-10646], or some subset that agrees with
1922 [ISO-10646]; in particular, all numeric character
1923 references must use code positions assigned by
1924 [ISO-10646].
1925
1926 In SGML applications, the use of control characters is limited
1927 in order to maximize the chance of successful interchange over
1928 heterogeneous networks and operating systems. In the HTML
1929 document character set only three control characters are
1930 allowed: Horizontal Tab, Carriage Return, and Line Feed (code
1931 positions 9, 13, and 10).
1932
1933 The HTML DTD references the Added Latin 1 entity set, to allow
1934 mnemonic representation of selected Latin 1 characters using
1935 only the widely supported ASCII character repertoire. For
1936 example:
1937
1938 Kurt G&ouml;del was a famous logician and mathematician.
1939
1940 See 9.7.2, "ISO Latin 1 Character Entity Set" for a table of the
1941 ``Added Latin 1'' entities, and 13, "The HTML Coded Character
1942 Set" for a table of the code positions of [ISO 8859-1] and the
1943 control characters in the HTML document character set.
1944
1945
1946
1947 Berners-Lee, Connolly [Page 33]
1948
1949 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
1950
1951 7. Hyperlinks
1952
1953 In addition to general purpose elements such as paragraphs and
1954 lists, HTML documents can express hyperlinks. A hyperlink is a
1955 relationship between two anchors, called the head and the tail
1956 of the hyperlink[DEXTER]. An anchor is a resource such as an
1957 HTML document, or some fragment of, i.e. view on or portion of a
1958 resource. Typically, the user activates a link by indicating the
1959 tail of the link; the head of the link is presented as a result.
1960
1961 Anchors are addressed by Uniform Resource Identifiers (URI).
1962 URIs either refer directly to an anchor in absolute form for
1963 example as in [URL], or they refer to an anchor relative to a
1964 base URI which is absolute, as in [RELURL].
1965
1966 Each of the following markup constructs indicates the tail
1967 anchor of a hyperlink or set of hyperlinks:
1968
1969 * <A> elements with HREF present.
1970
1971 * <LINK> elements.
1972
1973 * <IMG> elements.
1974
1975 * <INPUT> elements with the SRC attribute present.
1976
1977 * <ISINDEX> elements.
1978
1979 * <FORM> elements with `METHOD=GET'.
1980
1981
1982 7.1. Accessing Resources
1983
1984 To access the head anchor of a hyperlink, the user agent
1985 determines its URI from the URI given in the tail anchor, using
1986 the base URI of the document containing the tail anchor if
1987 necessary. Any fragment identifier is discarded, and the result
1988 is used to access a resource, for example as in [URL].
1989
1990 For example, if a document identified as `http://host/x/y.html'
1991 contains:
1992
1993 <img src="../icons/abc.gif">
1994
1995 then the user agent must use the URI `http://host/icons/abc.gif'
1996 to access the resource linked from the <IMG> element.
1997
1998
1999 7.2. Activation of Hyperlinks
2000
2001 An HTML user agent allows the user to navigate the content of
2002 the document and request activation of <A> element hyperlinks. A
2003 request to activate a link is essentially a request to process
2004 the resource indicated by the head anchor of the link, for
2005
2006 Berners-Lee, Connolly [Page 34]
2007
2008 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2009
2010 example to display the indicated HTML document. HTML user agents
2011 should also allow activation of <LINK> element hyperlinks.
2012
2013 The base URI for navigating the head anchor may be different
2014 from the URI used to access it. For example, it may be replaced
2015 by a <BASE> tag in the destination document or by an HTTP
2016 redirection transaction.
2017
2018
2019 7.3. Simultaneous Presentation of Image Resources
2020
2021 An HTML user agent may activate hyperlinks indicated by <IMG>
2022 and <INPUT> elements concurrently with processing the document;
2023 that is, image hyperlinks may be processed without explicit
2024 request by the user. Image resources should be embedded in the
2025 presentation at the point of the tail anchor, that is the <IMG>
2026 or <INPUT> element.
2027
2028 <LINK> hyperlinks may also be processed without explicit user
2029 request; for example, style sheet resources may be processed
2030 before or during the processing of the document.
2031
2032
2033 7.4. Fragment Identifiers
2034
2035 Any characters following a `#' character in a URI constitute a
2036 fragment identifier. As a degenerate case, a URI of the form
2037 `#fragment' refers to an anchor in the same document.
2038
2039 The meaning of fragment identifiers depends on the media type of
2040 the resource containing the head anchor. For `text/html'
2041 resources, it refers to the <A> element with a NAME attribute
2042 whose value is the same as the fragment identifier. The matching
2043 is case sensitive. The document should have exactly one such
2044 element. The user agent should indicate the anchor element, for
2045 example by scrolling to and/or highlighting the phrase.
2046
2047 For example, if a user agent was processing a document
2048 identified as `http://host/x/y.html' and the user indicated the
2049 following anchor:
2050
2051 <p> See: <a href="app1.html#bananas">appendix 1</a>
2052 for more detail on bananas.
2053
2054 then the user agent URI must access the resource
2055 `http://host/x/app1.html'. Assuming the resource is represented
2056 using the `text/html' media type, the user agent must locate the
2057 anchor named `bananas' and begin navigation there.
2058
2059
2060 7.5. Queries and Indexes
2061
2062 The <ISINDEX> element represents a set of hyperlinks. The user
2063 can choose from the set by providing keywords to the user agent.
2064
2065 Berners-Lee, Connolly [Page 35]
2066
2067 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2068
2069 The user agent computes the head URI by appending `?' and the
2070 keywords to the base URI. The keywords are escaped according to
2071 [URL] and joined by `+'. For example, if a document contains:
2072
2073 <BASE HREF="http://host/index">
2074 <ISINDEX>
2075
2076 and the user provides the keywords `apple' and `berry', then the
2077 user agent must access the resource
2078 `http://host/index?apple+berry'.
2079
2080 <FORM> elements with `METHOD=GET' also represent sets of
2081 hyperlinks. See 8.2.2, "Query Forms: METHOD=GET" for details.
2082
2083
2084 7.6. Image Maps
2085
2086 If the ISMAP attribute is present on an <IMG> element, the <IMG>
2087 element must be contained in an <A> element with an HREF
2088 present. This construct represents a set of hyperlinks. The user
2089 can choose from the set by choosing a pixel of the image. The
2090 user agent computes the head URI by appending `?' and the x and
2091 y coordinates of the pixel to the URI given in the <A> element.
2092 For example, if a document contains:
2093
2094 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
2095 <head><title>ImageMap Example</title>
2096 <BASE HREF="http://host/index"></head>
2097 <body>
2098 <p> Choose any of these icons:<br>
2099 <a href="/cgi-bin/imagemap"><img ismap src="icons.gif"></a>
2100
2101 and the user chooses the upper-leftmost pixel, the chosen
2102 hyperlink is the one with the URI
2103 `http://host/cgi-bin/imagemap?0,0'.
2104
2105
2106 8. Forms
2107
2108 A form is a template for a form data set and an associated
2109 method and action URI. A form data set is a sequence of
2110 name/value pair fields. The names are specified on the NAME
2111 attributes of form input elements, and the values are given
2112 initial values by various forms of markup and edited by the
2113 user. The resulting form data set is used to access an
2114 information service as a function of the action and method.
2115
2116 Forms elements can be mixed in with document structuring
2117 elements. For example, a <PRE> element may contain a <FORM>
2118 element, or a <FORM> element may contain lists which contain
2119 <INPUT> elements. This gives considerable flexibility in
2120 designing the layout of forms.
2121
2122 Form processing is a level 2 feature.
2123
2124 Berners-Lee, Connolly [Page 36]
2125
2126 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2127
2128
2129
2130 8.1. Form Elements
2131
2132
2133 8.1.1. Form: FORM
2134
2135 The <FORM> element contains a sequence of input elements, along
2136 with document structuring elements. The attributes are:
2137
2138 ACTION
2139 specifies the action URI for the form. The action URI of
2140 a form defaults to the base URI of the document (see 7,
2141 "Hyperlinks").
2142
2143 METHOD
2144 selects a method of accessing the action URI. The set of
2145 applicable methods is a function of the scheme of the
2146 action URI of the form. See 8.2.2, "Query Forms:
2147 METHOD=GET" and 8.2.3, "Forms with Side-Effects:
2148 METHOD=POST".
2149
2150 ENCTYPE
2151 specifies the media type used to encode the name/value
2152 pairs for transport, in case the protocol does not
2153 itself impose a format. See 8.2.1, "The form-urlencoded
2154 Media Type".
2155
2156
2157 8.1.2. Input Field: INPUT
2158
2159 The <INPUT> element represents a field for user input. The TYPE
2160 attribute discriminates between several variations of fields.
2161
2162 The <INPUT> element has a number of attributes. The set of
2163 applicable attributes depends on the value of the TYPE
2164 attribute.
2165
2166
2167 8.1.2.1. Text Field: INPUT TYPE=TEXT
2168
2169 The default vaule of the TYPE attribute is `TEXT', indicating a
2170 single line text entry fields. (Use the <TEXTAREA> element for
2171 multi-line text fields.)
2172
2173 Required attributes are:
2174
2175 NAME
2176 name for the form field corresponding to this element.
2177
2178 The optional attriubtes are:
2179
2180 MAXLENGTH
2181 constrains the number of characters that can be entered
2182
2183 Berners-Lee, Connolly [Page 37]
2184
2185 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2186
2187 into a text input field. If the value of MAXLENGTH is
2188 greater the the value of the SIZE attribute, the field
2189 should scroll appropriately. The default number of
2190 characters is unlimited.
2191
2192 SIZE
2193 specifies the amount of display space allocated to this
2194 input field according to its type. The default depends
2195 on the user agent.
2196
2197 VALUE
2198 The initial value of the field.
2199
2200 For example:
2201
2202 <p>Street Address: <input name=street><br>
2203 Postal City code: <input name=city size=16 maxlength=16><br>
2204 Zip Code: <input name=zip size=10 maxlength=10 value="99999-9999"><br>
2205
2206
2207 8.1.2.2. Password Field: INPUT TYPE=PASSWORD
2208
2209 An <INPUT> element with `TYPE=PASSWORD' is a text field as
2210 above, except that the value is obscured as it is entered. (see
2211 also: 10, "Security Considerations").
2212
2213 For example:
2214
2215 <p>Name: <input name=login> Password: <input type=password name=passwd>
2216
2217
2218 8.1.2.3. Check Box: INPUT TYPE=CHECKBOX
2219
2220 An <INPUT> element with `TYPE=CHECKBOX' represents a boolean
2221 choice. A set of such elements with the same name represents an
2222 n-of-many choice field. Required attributes are:
2223
2224 NAME
2225 symbolic name for the form field corresponding to this
2226 element or group of elements.
2227
2228 VALUE
2229 The portion of the value of the field contributed by
2230 this element.
2231
2232 Optional attributes are:
2233
2234 CHECKED
2235 indicates that the initial state is on.
2236
2237 For example:
2238
2239
2240
2241
2242 Berners-Lee, Connolly [Page 38]
2243
2244 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2245
2246 <p>What flavors do you like?
2247 <input type=checkbox name=flavor value=vanilla>Vanilla<br>
2248 <input type=checkbox name=flavor value=strawberry>Strawberry<br>
2249 <input type=checkbox name=flavor value=chocolate checked>Chocolate<br>
2250
2251
2252 8.1.2.4. Radio Button: INPUT TYPE=RADIO
2253
2254 An <INPUT> element with `TYPE=RADIO' represents a boolean
2255 choice. A set of such elements with the same name represents a
2256 1-of-many choice field. The NAME and VALUE attributes are
2257 required as for check boxes. Optional attributes are:
2258
2259 CHECKED
2260 indicates that the initial state is on.
2261
2262 At all times, exactly one of the radio buttons in a set is
2263 checked. If none of the <INPUT> elements of a set of radio
2264 buttons specifies `CHECKED', then the user agent must check the
2265 first radio button of the set initially.
2266
2267 For example:
2268
2269 <p>Which is your favorite?
2270 <input type=radio name=flavor value=vanilla>Vanilla<br>
2271 <input type=radio name=flavor value=strawberry>Strawberry<br>
2272 <input type=radio name=flavor value=chocolate>Chocolate<br>
2273
2274
2275 8.1.2.5. Image Pixel: INPUT TYPE=IMAGE
2276
2277 An <INPUT> element with `TYPE=IMAGE' specifies an image resource
2278 to display, and allows input of two form fields: the x and y
2279 coordinate of a pixel chosen from the image. The names of the
2280 fields are the name of the field with `.x' and `.y' appended.
2281 `TYPE=IMAGE' implies `TYPE=SUBMIT' processing; that is, when a
2282 pixel is chosen, the form as a whole is submitted.
2283
2284 The NAME attribute is required as for other input fields. The
2285 SRC attribute is required and the ALIGN is optional as for the
2286 <IMG> element (see 5.10, "Image: IMG").
2287
2288 For example:
2289
2290 <p>Choose a point on the map:
2291 <input type=image name=point src="map.gif">
2292
2293
2294 8.1.2.6. Hidden Field: INPUT TYPE=HIDDEN
2295
2296 An <INPUT> element with `TYPE=HIDDEN' represents a hidden
2297 field.The user does not interact with this field; instead, the
2298 VALUE attribute specifies the value of the field. The NAME and
2299 VALUE attributes are required.
2300
2301 Berners-Lee, Connolly [Page 39]
2302
2303 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2304
2305
2306 For example:
2307
2308 <input type=hidden name=context value="l2k3j4l2k3j4l2k3j4lk23">
2309
2310
2311 8.1.2.7. Submit Button: INPUT TYPE=SUBMIT
2312
2313 An <INPUT> element with `TYPE=SUBMIT' represents an input
2314 option, typically a button, that instructs the user agent to
2315 submit the form. Optional attributes are:
2316
2317 NAME
2318 indicates that this element contributes a form field
2319 whose value is given by the VALUE attribute. If the NAME
2320 attribute is not present, this element does not
2321 contribute a form field.
2322
2323 VALUE
2324 indicates a label for the input (button).
2325
2326 You may submit this request internally:
2327 <input type=submit name=recipient value=internal><br>
2328 or to the external world:
2329 <input type=submit name=recipient value=world>
2330
2331
2332 8.1.2.8. Reset Button: INPUT TYPE=RESET
2333
2334 An <INPUT> element with `TYPE=RESET' represents an input option,
2335 typically a button, that instructs the user agent to reset the
2336 form's fields to their initial states. The VALUE attribute, if
2337 present, indicates a label for the input (button).
2338
2339 When you are finished, you may submit this request:
2340 <input type=submit><br>
2341 You may clear the form and start over at any time: <input type=reset>
2342
2343
2344 8.1.3. Selection: SELECT
2345
2346 The <SELECT> element constrains the form field to an enumerated
2347 list of values. The values are given in <OPTION> elements.
2348 Attributes are:
2349
2350 MULTIPLE
2351 indicates that more than one option may be included in
2352 the value.
2353
2354 NAME
2355 specifies the name of the form field.
2356
2357 SIZE
2358 specifies the number of visible items. Select fields of
2359
2360 Berners-Lee, Connolly [Page 40]
2361
2362 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2363
2364 size one are typically pop-down menus, whereas select
2365 fields with size greater than one are typically lists.
2366
2367 For example:
2368
2369 <SELECT NAME="flavor">
2370 <OPTION>Vanilla
2371 <OPTION>Strawberry
2372 <OPTION value="RumRasin">Rum and Raisin
2373 <OPTION selected>Peach and Orange
2374 </SELECT>
2375
2376 The initial state has the first option selected, unless a
2377 SELECTED attribute is present on any of the <OPTION> elements.
2378
2379
2380 8.1.3.1. Option: OPTION
2381
2382 The Option element can only occur within a Select element. It
2383 represents one choice, and has the following attributes:
2384
2385 SELECTED
2386 Indicates that this option is initially selected.
2387
2388 VALUE
2389 indicates the value to be returned if this option is
2390 chosen. The field value defaults to the content of the
2391 <OPTION> element.
2392
2393 The content of the <OPTION> element is presented to the user to
2394 represent the option. It is used as a returned value if the
2395 VALUE attribute is not present.
2396
2397
2398 8.1.4. Text Area: TEXTAREA
2399
2400 The <TEXTAREA> element represents a multi-line text field.
2401 Attributes are:
2402
2403 COLS
2404 the number of visible columns to display for the text
2405 area, in characters.
2406
2407 NAME
2408 Specifies the name of the form field.
2409
2410 ROWS
2411 The number of visible rows to display for the text area,
2412 in characters.
2413
2414 For example:
2415
2416
2417
2418
2419 Berners-Lee, Connolly [Page 41]
2420
2421 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2422
2423 <TEXTAREA NAME="address" ROWS=6 COLS=64>
2424 HaL Computer Systems
2425 1315 Dell Avenue
2426 Campbell, California 95008
2427 </TEXTAREA>
2428
2429 The content of the <TEXTAREA> element is the field's initial
2430 value.
2431
2432 Typically, the ROWS and COLS attributes determine the visible
2433 dimension of the field in characters. The field is typically
2434 rendered in a fixed-width font. HTML user agents should allow
2435 text to extend beyond these limits by scrolling as needed.
2436
2437
2438 8.2. Form Submission
2439
2440 An HTML user agent begins processing a form by presenting the
2441 document with the fields in their initial state. The user is
2442 allowed to modify the fields, constrained by the field type etc.
2443 When the user indicates that the form should be submitted (using
2444 a submit button or image input), the form data set is processed
2445 according to its method, action URI and enctype.
2446
2447 When there is only one single-line text input field in a form,
2448 the user agent should accept Enter in that field as a request to
2449 submit the form.
2450
2451
2452 8.2.1. The form-urlencoded Media Type
2453
2454 The default encoding for all forms is
2455 `application/x-www-form-urlencoded'. A form data set is
2456 represented in this media type as follows:
2457
2458 1. The form field names and values are escaped: space
2459 characters are replaced by `+', and then reserved characters
2460 are escaped as per [URL]; that is, non-alphanumeric
2461 characters are replaced by `%HH', a percent sign and two
2462 hexadecimal digits representing the ASCII code of the
2463 character. Line breaks, as in multi-line text field values,
2464 are represented as CR LF pairs, i.e. `%0D%0A'.
2465
2466 2. The fields are listed in the order they appear in the
2467 document with the name separated from the value by `=' and
2468 the pairs separated from each other by `&'. Fields with null
2469 values may be omitted. In particular, unselected radio
2470 buttons and checkboxes should not appear in the encoded
2471 data, but hidden fields with VALUE attributes present
2472 should.
2473
2474 NOTE - The URI from a query form submission can be
2475 used in a normal anchor style hyperlink.
2476 Unfortunately, the use of the `&' character to
2477
2478 Berners-Lee, Connolly [Page 42]
2479
2480 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2481
2482 separate form fields interacts with its use in SGML
2483 attribute values as an entity reference delimiter.
2484 For example, the URI `http://host/?x=1&y=2' must be
2485 written `<a href="http://host/?x=1&#38;y=2"' or `<a
2486 href="http://host/?x=1&#amp;y=2">'.
2487
2488 HTTP server implementors, and in particular, CGI
2489 implementors are encouraged to support the use of
2490 `;' in place of `&' to save users the trouble of
2491 escaping `&' characters this way.
2492
2493
2494 8.2.2. Query Forms: METHOD=GET
2495
2496 If the processing of a form is idempotent (i.e. it has no
2497 lasting observable effect on the state of the world), then the
2498 form method should be `GET'. Many database searches have no
2499 visible side-effects and make ideal applications of query forms.
2500
2501 To process a form whose action URL is an HTTP URL and whose
2502 method is `GET', the user agent starts with the action URI and
2503 appends a `?' and the form data set, in
2504 `application/x-www-form-urlencoded' format as above. The user
2505 agent then traverses the link to this URI just as if it were an
2506 anchor (see 7.2, "Activation of Hyperlinks").
2507
2508 NOTE - The URL encoding may result in very long URIs,
2509 which cause some historical HTTP server implementations
2510 to exhibit defective behavior. As a result, some HTML
2511 forms are written using `METHOD=POST' even though the
2512 form submission has no side-effects.
2513
2514
2515 8.2.3. Forms with Side-Effects: METHOD=POST
2516
2517 If the service associated with the processing of a form has side
2518 effects (for example, modification of a database or subscription
2519 to a service), the method should be `POST'.
2520
2521 To process a form whose action URL is an HTTP URL and whose
2522 method is `POST', the user agent conducts an HTTP POST
2523 transaction using the action URI, and a message body of type
2524 `application/x-www-form-urlencoded' format as above. The user
2525 agent should display the response from the HTTP POST interaction
2526 just as it would display the response from an HTTP GET above.
2527
2528
2529 8.2.4. Example Form Submission: Questionnaire Form
2530
2531 Consider the following document:
2532
2533 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
2534 <title>Sample of HTML Form Submission</title>
2535 <H1>Sample Questionnaire</H1>
2536
2537 Berners-Lee, Connolly [Page 43]
2538
2539 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2540
2541 <P>Please fill out this questionnaire:
2542 <FORM METHOD="POST" ACTION="http://www.w3.org/sample">
2543 <P>Your name: <INPUT NAME="name" size="48">
2544 <P>Male <INPUT NAME="gender" TYPE=RADIO VALUE="male">
2545 <P>Female <INPUT NAME="gender" TYPE=RADIO VALUE="female">
2546 <P>Number in family: <INPUT NAME="family" TYPE=text>
2547 <P>Cities in which you maintain a residence:
2548 <UL>
2549 <LI>Kent <INPUT NAME="city" TYPE=checkbox VALUE="kent">
2550 <LI>Miami <INPUT NAME="city" TYPE=checkbox VALUE="miami">
2551 <LI>Other <TEXTAREA NAME="other" cols=48 rows=4></textarea>
2552 </UL>
2553 Nickname: <INPUT NAME="nickname" SIZE="42">
2554 <P>Thank you for responding to this questionnaire.
2555 <P><INPUT TYPE=SUBMIT> <INPUT TYPE=RESET>
2556 </FORM>
2557
2558 The initial state of the form data set is:
2559
2560 name
2561 ``''
2562
2563 gender
2564 ``male''
2565
2566 family
2567 ``''
2568
2569 other
2570 ``''
2571
2572 nickname
2573 ``''
2574
2575 Note that the radio input has an initial value, while the
2576 checkbox has none.
2577
2578 The user might edit the fields and request that the form be
2579 submitted. At that point, suppose the values are:
2580
2581 name
2582 ``John Doe''
2583
2584 gender
2585 ``male''
2586
2587 family
2588 ``5''
2589
2590 city
2591 ``kent,miami''
2592
2593 other
2594 ``abc\ndef''
2595
2596 Berners-Lee, Connolly [Page 44]
2597
2598 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2599
2600
2601 nickname
2602 ``J&D''
2603
2604 The user agent then conducts an HTTP POST transaction using the
2605 URI `http://www.w3.org/sample'. The message body would be
2606 (ignore the line break):
2607
2608 name=John+Doe&gender=male&family=5&city=kent%2Cmiami&
2609 other=abc%0D%0Adef&nickname=J%26D
2610
2611
2612 9. HTML Public Text
2613
2614
2615 9.1. HTML DTD
2616
2617 This is the Document Type Definition for the HyperText Markup
2618 Language, level 2.
2619
2620 <!-- html.dtd
2621
2622 Document Type Definition for the HyperText Markup Language
2623 (HTML DTD)
2624
2625 $Id: html.dtd,v 1.28 1995/06/16 18:54:22 connolly Exp $
2626
2627 Author: Daniel W. Connolly <connolly@w3.org>
2628 See Also: html.decl, html-1.dtd
2629 http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html
2630 - -->
2631
2632 <!ENTITY % HTML.Version
2633 "-//IETF//DTD HTML 2.0//EN"
2634
2635 -- Typical usage:
2636
2637 <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
2638 <html>
2639 ...
2640 </html>
2641 --
2642 >
2643
2644
2645 <!--============ Feature Test Entities ========================-->
2646
2647 <!ENTITY % HTML.Recommended "IGNORE"
2648 -- Certain features of the language are necessary for
2649 compatibility with widespread usage, but they may
2650 compromise the structural integrity of a document.
2651 This feature test entity enables a more prescriptive
2652 document type definition that eliminates
2653 those features.
2654
2655 Berners-Lee, Connolly [Page 45]
2656
2657 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2658
2659 -->
2660
2661 <![ %HTML.Recommended [
2662 <!ENTITY % HTML.Deprecated "IGNORE">
2663 ]]>
2664
2665 <!ENTITY % HTML.Deprecated "INCLUDE"
2666 -- Certain features of the language are necessary for
2667 compatibility with earlier versions of the specification,
2668 but they tend to be used an implemented inconsistently,
2669 and their use is deprecated. This feature test entity
2670 enables a document type definition that eliminates
2671 these features.
2672 -->
2673
2674 <!ENTITY % HTML.Highlighting "INCLUDE"
2675 -- Use this feature test entity to validate that a
2676 document uses no highlighting tags, which may be
2677 ignored on minimal implementations.
2678 -->
2679
2680 <!ENTITY % HTML.Forms "INCLUDE"
2681 -- Use this feature test entity to validate that a document
2682 contains no forms, which may not be supported in minimal
2683 implementations
2684 -->
2685
2686 <!--============== Imported Names ==============================-->
2687
2688 <!ENTITY % Content-Type "CDATA"
2689 -- meaning an internet media type
2690 (aka MIME content type, as per RFC1521)
2691 -->
2692
2693 <!ENTITY % HTTP-Method "GET | POST"
2694 -- as per HTTP specification, in progress
2695 -->
2696
2697 <!ENTITY % URI "CDATA"
2698 -- The term URI means a CDATA attribute
2699 whose value is a Uniform Resource Identifier,
2700 as defined by
2701 "Universal Resource Identifiers" by Tim Berners-Lee
2702 aka RFC 1630
2703
2704 Note that CDATA attributes are limited by the LITLEN
2705 capacity (1024 in the current version of html.decl),
2706 so that URIs in HTML have a bounded length.
2707
2708 -->
2709
2710
2711 <!--========= DTD "Macros" =====================-->
2712
2713
2714 Berners-Lee, Connolly [Page 46]
2715
2716 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2717
2718 <!ENTITY % heading "H1|H2|H3|H4|H5|H6">
2719
2720 <!ENTITY % list " UL | OL | DIR | MENU " >
2721
2722
2723 <!--======= Character mnemonic entities =================-->
2724
2725 <!ENTITY % ISOlat1 PUBLIC
2726 "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML">
2727 %ISOlat1;
2728
2729 <!ENTITY amp CDATA "&#38;" -- ampersand -->
2730 <!ENTITY gt CDATA "&#62;" -- greater than -->
2731 <!ENTITY lt CDATA "&#60;" -- less than -->
2732 <!ENTITY quot CDATA "&#34;" -- double quote -->
2733
2734
2735 <!--========= SGML Document Access (SDA) Parameter Entities =====-->
2736
2737 <!-- HTML 2.0 contains SGML Document Access (SDA) fixed attributes
2738 in support of easy transformation to the International Committee
2739 for Accessible Document Design (ICADD) DTD
2740 "-//EC-USA-CDA/ICADD//DTD ICADD22//EN".
2741 ICADD applications are designed to support usable access to
2742 structured information by print-impaired individuals through
2743 Braille, large print and voice synthesis. For more information on
2744 SDA & ICADD:
2745 - ISO 12083:1993, Annex A.8, Facilities for Braille,
2746 large print and computer voice
2747 - ICADD ListServ
2748 <ICADD%ASUACAD.BITNET@ARIZVM1.ccit.arizona.edu>
2749 - Usenet news group bit.listserv.easi
2750 - Recording for the Blind, +1 800 221 4792
2751 - -->
2752
2753 <!ENTITY % SDAFORM "SDAFORM CDATA #FIXED"
2754 -- one to one mapping -->
2755 <!ENTITY % SDARULE "SDARULE CDATA #FIXED"
2756 -- context-sensitive mapping -->
2757 <!ENTITY % SDAPREF "SDAPREF CDATA #FIXED"
2758 -- generated text prefix -->
2759 <!ENTITY % SDASUFF "SDASUFF CDATA #FIXED"
2760 -- generated text suffix -->
2761 <!ENTITY % SDASUSP "SDASUSP NAME #FIXED"
2762 -- suspend transform process -->
2763
2764
2765 <!--========== Text Markup =====================-->
2766
2767 <![ %HTML.Highlighting [
2768
2769 <!ENTITY % font " TT | B | I ">
2770
2771 <!ENTITY % phrase "EM | STRONG | CODE | SAMP | KBD | VAR | CITE ">
2772
2773 Berners-Lee, Connolly [Page 47]
2774
2775 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2776
2777
2778 <!ENTITY % text "#PCDATA | A | IMG | BR | %phrase | %font">
2779
2780 <!ELEMENT (%font;|%phrase) - - (%text)*>
2781 <!ATTLIST ( TT | CODE | SAMP | KBD | VAR )
2782 %SDAFORM; "Lit"
2783 >
2784 <!ATTLIST ( B | STRONG )
2785 %SDAFORM; "B"
2786 >
2787 <!ATTLIST ( I | EM | CITE )
2788 %SDAFORM; "It"
2789 >
2790
2791 <!-- <TT> Typewriter text -->
2792 <!-- <B> Bold text -->
2793 <!-- <I> Italic text -->
2794
2795 <!-- <EM> Emphasized phrase -->
2796 <!-- <STRONG> Strong emphais -->
2797 <!-- <CODE> Source code phrase -->
2798 <!-- <SAMP> Sample text or characters -->
2799 <!-- <KBD> Keyboard phrase, e.g. user input -->
2800 <!-- <VAR> Variable phrase or substituable -->
2801 <!-- <CITE> Name or title of cited work -->
2802
2803 <!ENTITY % pre.content "#PCDATA | A | HR | BR | %font | %phrase">
2804
2805 ]]>
2806
2807 <!ENTITY % text "#PCDATA | A | IMG | BR">
2808
2809 <!ELEMENT BR - O EMPTY>
2810 <!ATTLIST BR
2811 %SDAPREF; "&#RE;"
2812 >
2813
2814 <!-- <BR> Line break -->
2815
2816
2817 <!--========= Link Markup ======================-->
2818
2819 <!ENTITY % linkType "NAME">
2820
2821 <!ENTITY % linkExtraAttributes
2822 "REL %linkType #IMPLIED
2823 REV %linkType #IMPLIED
2824 URN CDATA #IMPLIED
2825 TITLE CDATA #IMPLIED
2826 METHODS NAMES #IMPLIED
2827 ">
2828
2829 <![ %HTML.Recommended [
2830 <!ENTITY % A.content "(%text)*"
2831
2832 Berners-Lee, Connolly [Page 48]
2833
2834 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2835
2836 -- <H1><a name="xxx">Heading</a></H1>
2837 is preferred to
2838 <a name="xxx"><H1>Heading</H1></a>
2839 -->
2840 ]]>
2841
2842 <!ENTITY % A.content "(%heading|%text)*">
2843
2844 <!ELEMENT A - - %A.content -(A)>
2845 <!ATTLIST A
2846 HREF %URI #IMPLIED
2847 NAME CDATA #IMPLIED
2848 %linkExtraAttributes;
2849 %SDAPREF; "<Anchor: #AttList>"
2850 >
2851 <!-- <A> Anchor; source/destination of link -->
2852 <!-- <A NAME="..."> Name of this anchor -->
2853 <!-- <A HREF="..."> Address of link destination -->
2854 <!-- <A URN="..."> Permanent address of destination -->
2855 <!-- <A REL=...> Relationship to destination -->
2856 <!-- <A REV=...> Relationship of destination to this -->
2857 <!-- <A TITLE="..."> Title of destination (advisory) -->
2858 <!-- <A METHODS="..."> Operations on destination (advisory) -->
2859
2860
2861 <!--========== Images ==========================-->
2862
2863 <!ELEMENT IMG - O EMPTY>
2864 <!ATTLIST IMG
2865 SRC %URI; #REQUIRED
2866 ALT CDATA #IMPLIED
2867 ALIGN (top|middle|bottom) #IMPLIED
2868 ISMAP (ISMAP) #IMPLIED
2869 %SDAPREF; "<Fig><?SDATrans Img: #AttList>#AttVal(Alt)</Fig>"
2870 >
2871
2872 <!-- <IMG> Image; icon, glyph or illustration -->
2873 <!-- <IMG SRC="..."> Address of image object -->
2874 <!-- <IMG ALT="..."> Textual alternative -->
2875 <!-- <IMG ALIGN=...> Position relative to text -->
2876 <!-- <IMG ISMAP> Each pixel can be a link -->
2877
2878 <!--========== Paragraphs=======================-->
2879
2880 <!ELEMENT P - O (%text)*>
2881 <!ATTLIST P
2882 %SDAFORM; "Para"
2883 >
2884
2885 <!-- <P> Paragraph -->
2886
2887
2888 <!--========== Headings, Titles, Sections ===============-->
2889
2890
2891 Berners-Lee, Connolly [Page 49]
2892
2893 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2894
2895 <!ELEMENT HR - O EMPTY>
2896 <!ATTLIST HR
2897 %SDAPREF; "&#RE;&#RE;"
2898 >
2899
2900 <!-- <HR> Horizontal rule -->
2901
2902 <!ELEMENT ( %heading ) - - (%text;)*>
2903 <!ATTLIST H1
2904 %SDAFORM; "H1"
2905 >
2906 <!ATTLIST H2
2907 %SDAFORM; "H2"
2908 >
2909 <!ATTLIST H3
2910 %SDAFORM; "H3"
2911 >
2912 <!ATTLIST H4
2913 %SDAFORM; "H4"
2914 >
2915 <!ATTLIST H5
2916 %SDAFORM; "H5"
2917 >
2918 <!ATTLIST H6
2919 %SDAFORM; "H6"
2920 >
2921
2922 <!-- <H1> Heading, level 1 -->
2923 <!-- <H2> Heading, level 2 -->
2924 <!-- <H3> Heading, level 3 -->
2925 <!-- <H4> Heading, level 4 -->
2926 <!-- <H5> Heading, level 5 -->
2927 <!-- <H6> Heading, level 6 -->
2928
2929
2930 <!--========== Text Flows ======================-->
2931
2932 <![ %HTML.Forms [
2933 <!ENTITY % block.forms "BLOCKQUOTE | FORM | ISINDEX">
2934 ]]>
2935
2936 <!ENTITY % block.forms "BLOCKQUOTE">
2937
2938 <![ %HTML.Deprecated [
2939 <!ENTITY % preformatted "PRE | XMP | LISTING">
2940 ]]>
2941
2942 <!ENTITY % preformatted "PRE">
2943
2944 <!ENTITY % block "P | %list | DL
2945 | %preformatted
2946 | %block.forms">
2947
2948 <!ENTITY % flow "(%text|%block)*">
2949
2950 Berners-Lee, Connolly [Page 50]
2951
2952 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
2953
2954
2955 <!ENTITY % pre.content "#PCDATA | A | HR | BR">
2956 <!ELEMENT PRE - - (%pre.content)*>
2957 <!ATTLIST PRE
2958 WIDTH NUMBER #implied
2959 %SDAFORM; "Lit"
2960 >
2961
2962 <!-- <PRE> Preformatted text -->
2963 <!-- <PRE WIDTH=...> Maximum characters per line -->
2964
2965 <![ %HTML.Deprecated [
2966
2967 <!ENTITY % literal "CDATA"
2968 -- historical, non-conforming parsing mode where
2969 the only markup signal is the end tag
2970 in full
2971 -->
2972
2973 <!ELEMENT (XMP|LISTING) - - %literal>
2974 <!ATTLIST XMP
2975 %SDAFORM; "Lit"
2976 %SDAPREF; "Example:&#RE;"
2977 >
2978 <!ATTLIST LISTING
2979 %SDAFORM; "Lit"
2980 %SDAPREF; "Listing:&#RE;"
2981 >
2982
2983 <!-- <XMP> Example section -->
2984 <!-- <LISTING> Computer listing -->
2985
2986 <!ELEMENT PLAINTEXT - O %literal>
2987 <!-- <PLAINTEXT> Plain text passage -->
2988
2989 <!ATTLIST PLAINTEXT
2990 %SDAFORM; "Lit"
2991 >
2992 ]]>
2993
2994
2995 <!--========== Lists ==================-->
2996
2997 <!ELEMENT DL - - (DT | DD)+>
2998 <!ATTLIST DL
2999 COMPACT (COMPACT) #IMPLIED
3000 %SDAFORM; "List"
3001 %SDAPREF; "Definition List:"
3002 >
3003
3004 <!ELEMENT DT - O (%text)*>
3005 <!ATTLIST DT
3006 %SDAFORM; "Term"
3007 >
3008
3009 Berners-Lee, Connolly [Page 51]
3010
3011 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3012
3013
3014 <!ELEMENT DD - O %flow>
3015 <!ATTLIST DD
3016 %SDAFORM; "LItem"
3017 >
3018
3019 <!-- <DL> Definition list, or glossary -->
3020 <!-- <DL COMPACT> Compact style list -->
3021 <!-- <DT> Term in definition list -->
3022 <!-- <DD> Definition of term -->
3023
3024 <!ELEMENT (OL|UL) - - (LI)+>
3025 <!ATTLIST OL
3026 COMPACT (COMPACT) #IMPLIED
3027 %SDAFORM; "List"
3028 >
3029 <!ATTLIST UL
3030 COMPACT (COMPACT) #IMPLIED
3031 %SDAFORM; "List"
3032 >
3033 <!-- <UL> Unordered list -->
3034 <!-- <UL COMPACT> Compact list style -->
3035 <!-- <OL> Ordered, or numbered list -->
3036 <!-- <OL COMPACT> Compact list style -->
3037
3038
3039 <!ELEMENT (DIR|MENU) - - (LI)+ -(%block)>
3040 <!ATTLIST DIR
3041 COMPACT (COMPACT) #IMPLIED
3042 %SDAFORM; "List"
3043 %SDAPREF; "<LHead>Directory</LHead>"
3044 >
3045 <!ATTLIST MENU
3046 COMPACT (COMPACT) #IMPLIED
3047 %SDAFORM; "List"
3048 %SDAPREF; "<LHead>Menu</LHead>"
3049 >
3050
3051 <!-- <DIR> Directory list -->
3052 <!-- <DIR COMPACT> Compact list style -->
3053 <!-- <MENU> Menu list -->
3054 <!-- <MENU COMPACT> Compact list style -->
3055
3056 <!ELEMENT LI - O %flow>
3057 <!ATTLIST LI
3058 %SDAFORM; "LItem"
3059 >
3060
3061 <!-- <LI> List item -->
3062
3063 <!--========== Document Body ===================-->
3064
3065 <![ %HTML.Recommended [
3066 <!ENTITY % body.content "(%heading|%block|HR|ADDRESS|IMG)*"
3067
3068 Berners-Lee, Connolly [Page 52]
3069
3070 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3071
3072 -- <h1>Heading</h1>
3073 <p>Text ...
3074 is preferred to
3075 <h1>Heading</h1>
3076 Text ...
3077 -->
3078 ]]>
3079
3080 <!ENTITY % body.content "(%heading | %text | %block |
3081 HR | ADDRESS)*">
3082
3083 <!ELEMENT BODY O O %body.content>
3084
3085 <!-- <BODY> Document body -->
3086
3087 <!ELEMENT BLOCKQUOTE - - %body.content>
3088 <!ATTLIST BLOCKQUOTE
3089 %SDAFORM; "BQ"
3090 >
3091
3092 <!-- <BLOCKQUOTE> Quoted passage -->
3093
3094 <!ELEMENT ADDRESS - - (%text|P)*>
3095 <!ATTLIST ADDRESS
3096 %SDAFORM; "Lit"
3097 %SDAPREF; "Address:&#RE;"
3098 >
3099
3100 <!-- <ADDRESS> Address, signature, or byline -->
3101
3102
3103 <!--======= Forms ====================-->
3104
3105 <![ %HTML.Forms [
3106
3107 <!ELEMENT FORM - - %body.content -(FORM) +(INPUT|SELECT|TEXTAREA)>
3108 <!ATTLIST FORM
3109 ACTION %URI #IMPLIED
3110 METHOD (%HTTP-Method) GET
3111 ENCTYPE %Content-Type; "application/x-www-form-urlencoded"
3112 %SDAPREF; "<Para>Form:</Para>"
3113 %SDASUFF; "<Para>Form End.</Para>"
3114 >
3115
3116 <!-- <FORM> Fill-out or data-entry form -->
3117 <!-- <FORM ACTION="..."> Address for completed form -->
3118 <!-- <FORM METHOD=...> Method of submitting form -->
3119 <!-- <FORM ENCTYPE="..."> Representation of form data -->
3120
3121 <!ENTITY % InputType "(TEXT | PASSWORD | CHECKBOX |
3122 RADIO | SUBMIT | RESET |
3123 IMAGE | HIDDEN )">
3124 <!ELEMENT INPUT - O EMPTY>
3125 <!ATTLIST INPUT
3126
3127 Berners-Lee, Connolly [Page 53]
3128
3129 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3130
3131 TYPE %InputType TEXT
3132 NAME CDATA #IMPLIED
3133 VALUE CDATA #IMPLIED
3134 SRC %URI #IMPLIED
3135 CHECKED (CHECKED) #IMPLIED
3136 SIZE CDATA #IMPLIED
3137 MAXLENGTH NUMBER #IMPLIED
3138 ALIGN (top|middle|bottom) #IMPLIED
3139 %SDAPREF; "Input: "
3140 >
3141
3142 <!-- <INPUT> Form input datum -->
3143 <!-- <INPUT TYPE=...> Type of input interaction -->
3144 <!-- <INPUT NAME=...> Name of form datum -->
3145 <!-- <INPUT VALUE="..."> Default/initial/selected value -->
3146 <!-- <INPUT SRC="..."> Address of image -->
3147 <!-- <INPUT CHECKED> Initial state is "on" -->
3148 <!-- <INPUT SIZE=...> Field size hint -->
3149 <!-- <INPUT MAXLENGTH=...> Data length maximum -->
3150 <!-- <INPUT ALIGN=...> Image alignment -->
3151
3152 <!ELEMENT SELECT - - (OPTION+) -(INPUT|SELECT|TEXTAREA)>
3153 <!ATTLIST SELECT
3154 NAME CDATA #REQUIRED
3155 SIZE NUMBER #IMPLIED
3156 MULTIPLE (MULTIPLE) #IMPLIED
3157 %SDAFORM; "List"
3158 %SDAPREF;
3159 "<LHead>Select #AttVal(Multiple)</LHead>"
3160 >
3161
3162 <!-- <SELECT> Selection of option(s) -->
3163 <!-- <SELECT NAME=...> Name of form datum -->
3164 <!-- <SELECT SIZE=...> Options displayed at a time -->
3165 <!-- <SELECT MULTIPLE> Multiple selections allowed -->
3166
3167 <!ELEMENT OPTION - O (#PCDATA)*>
3168 <!ATTLIST OPTION
3169 SELECTED (SELECTED) #IMPLIED
3170 VALUE CDATA #IMPLIED
3171 %SDAFORM; "LItem"
3172 %SDAPREF;
3173 "Option: #AttVal(Value) #AttVal(Selected)"
3174 >
3175
3176 <!-- <OPTION> A selection option -->
3177 <!-- <OPTION SELECTED> Initial state -->
3178 <!-- <OPTION VALUE="..."> Form datum value for this option-->
3179
3180 <!ELEMENT TEXTAREA - - (#PCDATA)* -(INPUT|SELECT|TEXTAREA)>
3181 <!ATTLIST TEXTAREA
3182 NAME CDATA #REQUIRED
3183 ROWS NUMBER #REQUIRED
3184 COLS NUMBER #REQUIRED
3185
3186 Berners-Lee, Connolly [Page 54]
3187
3188 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3189
3190 %SDAFORM; "Para"
3191 %SDAPREF; "Input Text -- #AttVal(Name): "
3192 >
3193
3194 <!-- <TEXTAREA> An area for text input -->
3195 <!-- <TEXTAREA NAME=...> Name of form datum -->
3196 <!-- <TEXTAREA ROWS=...> Height of area -->
3197 <!-- <TEXTAREA COLS=...> Width of area -->
3198
3199 ]]>
3200
3201
3202 <!--======= Document Head ======================-->
3203
3204 <![ %HTML.Recommended [
3205 <!ENTITY % head.extra "">
3206 ]]>
3207 <!ENTITY % head.extra "& NEXTID?">
3208
3209 <!ENTITY % head.content "TITLE & ISINDEX? & BASE? %head.extra">
3210
3211 <!ELEMENT HEAD O O (%head.content) +(META|LINK)>
3212
3213 <!-- <HEAD> Document head -->
3214
3215 <!ELEMENT TITLE - - (#PCDATA)*>
3216 <!ATTLIST TITLE
3217 %SDAFORM; "Ti" >
3218
3219 <!-- <TITLE> Title of document -->
3220
3221 <!ELEMENT LINK - O EMPTY>
3222 <!ATTLIST LINK
3223 HREF %URI #REQUIRED
3224 %linkExtraAttributes;
3225 %SDAPREF; "Linked to : #AttVal (TITLE) (URN) (HREF)>" >
3226
3227 <!-- <LINK> Link from this document -->
3228 <!-- <LINK HREF="..."> Address of link destination -->
3229 <!-- <LINK URN="..."> Lasting name of destination -->
3230 <!-- <LINK REL=...> Relationship to destination -->
3231 <!-- <LINK REV=...> Relationship of destination to this -->
3232 <!-- <LINK TITLE="..."> Title of destination (advisory) -->
3233 <!-- <LINK METHODS="..."> Operations allowed (advisory) -->
3234
3235 <!ELEMENT ISINDEX - O EMPTY>
3236 <!ATTLIST ISINDEX
3237 %SDAPREF;
3238 "<Para>[Document is indexed/searchable.]</Para>">
3239
3240 <!-- <ISINDEX> Document is a searchable index -->
3241
3242 <!ELEMENT BASE - O EMPTY>
3243 <!ATTLIST BASE
3244
3245 Berners-Lee, Connolly [Page 55]
3246
3247 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3248
3249 HREF %URI; #REQUIRED >
3250
3251 <!-- <BASE> Base context document -->
3252 <!-- <BASE HREF="..."> Address for this document -->
3253
3254 <!ELEMENT NEXTID - O EMPTY>
3255 <!ATTLIST NEXTID
3256 N CDATA #REQUIRED >
3257
3258 <!-- <NEXTID> Next ID to use for link name -->
3259 <!-- <NEXTID N=...> Next ID to use for link name -->
3260
3261 <!ELEMENT META - O EMPTY>
3262 <!ATTLIST META
3263 HTTP-EQUIV NAME #IMPLIED
3264 NAME NAME #IMPLIED
3265 CONTENT CDATA #REQUIRED >
3266
3267 <!-- <META> Generic Metainformation -->
3268 <!-- <META HTTP-EQUIV=...> HTTP response header name -->
3269 <!-- <META NAME=...> Metainformation name -->
3270 <!-- <META CONTENT="..."> Associated information -->
3271
3272 <!--======= Document Structure =================-->
3273
3274 <![ %HTML.Deprecated [
3275 <!ENTITY % html.content "HEAD, BODY, PLAINTEXT?">
3276 ]]>
3277 <!ENTITY % html.content "HEAD, BODY">
3278
3279 <!ELEMENT HTML O O (%html.content)>
3280 <!ENTITY % version.attr "VERSION CDATA #FIXED '%HTML.Version;'">
3281
3282 <!ATTLIST HTML
3283 %version.attr;
3284 %SDAFORM; "Book"
3285 >
3286
3287 <!-- <HTML> HTML Document -->
3288
3289
3290 9.2. Strict HTML DTD
3291
3292 This document type declaration refers to the HTML DTD with the
3293 `HTML.Recommended' entity defined as `INCLUDE' rather than
3294 IGNORE; that is, it refers to the more structurally rigid
3295 definition of HTML.
3296
3297 <!-- html-s.dtd
3298
3299 Document Type Definition for the HyperText Markup Language
3300 with strict validation (HTML Strict DTD).
3301
3302 $Id: html-s.dtd,v 1.3 1995/06/02 18:55:46 connolly Exp $
3303
3304 Berners-Lee, Connolly [Page 56]
3305
3306 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3307
3308
3309 Author: Daniel W. Connolly <connolly@w3.org>
3310 See Also: http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html
3311 - -->
3312
3313 <!ENTITY % HTML.Version
3314 "-//IETF//DTD HTML 2.0 Strict//EN"
3315
3316 -- Typical usage:
3317
3318 <!DOCTYPE HTML PUBLIC
3319 "-//IETF//DTD HTML Strict//EN">
3320 <html>
3321 ...
3322 </html>
3323 --
3324 >
3325
3326 <!-- Feature Test Entities -->
3327 <!ENTITY % HTML.Recommended "INCLUDE">
3328
3329 <!ENTITY % html PUBLIC "-//IETF//DTD HTML 2.0//EN">
3330 %html;
3331
3332
3333 9.3. Level 1 HTML DTD
3334
3335 This document type declaration refers to the HTML DTD with the
3336 `HTML.Forms' entity defined as `IGNORE' rather than `INCLUDE'.
3337 Documents which contain <FORM> elements do not conform to this
3338 DTD, and must use the level 2 DTD.
3339
3340 <!-- html-1.dtd
3341
3342 Document Type Definition for the HyperText Markup Language
3343 with Level 1 Extensions (HTML Level 1 DTD).
3344
3345 $Id: html-1.dtd,v 1.2 1995/03/29 18:53:10 connolly Exp $
3346
3347 Author: Daniel W. Connolly <connolly@w3.org>
3348 See Also: http://info.cern.ch/hypertext/WWW/MarkUp/MarkUp.html
3349 - -->
3350
3351 <!ENTITY % HTML.Version
3352 "-//IETF//DTD HTML 2.0 Level 1//EN"
3353
3354 -- Typical usage:
3355
3356 <!DOCTYPE HTML PUBLIC
3357 "-//IETF//DTD HTML Level 1//EN">
3358 <html>
3359 ...
3360 </html>
3361 --
3362
3363 Berners-Lee, Connolly [Page 57]
3364
3365 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3366
3367 >
3368
3369 <!-- Feature Test Entities -->
3370 <!ENTITY % HTML.Forms "IGNORE">
3371
3372 <!ENTITY % html PUBLIC "-//IETF//DTD HTML 2.0//EN">
3373 %html;
3374
3375
3376 9.4. Strict Level 1 HTML DTD
3377
3378 This document type declaration refers to the level 1 HTML DTD
3379 with the `HTML.Recommended' entity defined as `INCLUDE' rather
3380 than IGNORE; that is, it refers to the more structurally rigid
3381 definition of HTML.
3382
3383 <!-- html-1s.dtd
3384
3385 Document Type Definition for the HyperText Markup Language
3386 Struct Level 1
3387
3388 $Id: html-1s.dtd,v 1.3 1995/06/02 18:55:43 connolly Exp $
3389
3390 Author: Daniel W. Connolly <connolly@w3.org>
3391 See Also: http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html
3392 - -->
3393
3394 <!ENTITY % HTML.Version
3395 "-//IETF//DTD HTML 2.0 Strict Level 1//EN"
3396
3397 -- Typical usage:
3398
3399 <!DOCTYPE HTML PUBLIC
3400 "-//IETF//DTD HTML Strict Level 1//EN">
3401 <html>
3402 ...
3403 </html>
3404 --
3405 >
3406
3407 <!-- Feature Test Entities -->
3408 <!ENTITY % HTML.Recommended "INCLUDE">
3409
3410 <!ENTITY % html-1 PUBLIC "-//IETF//DTD HTML 2.0 Level 1//EN">
3411 %html-1;
3412
3413
3414 9.5. SGML Declaration for HTML
3415
3416 This is the SGML Declaration for HyperText Markup Language.
3417
3418 <!SGML "ISO 8879:1986"
3419 - --
3420 SGML Declaration for HyperText Markup Language (HTML).
3421
3422 Berners-Lee, Connolly [Page 58]
3423
3424 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3425
3426
3427 - --
3428
3429 CHARSET
3430 BASESET "ISO 646:1983//CHARSET
3431 International Reference Version
3432 (IRV)//ESC 2/5 4/0"
3433 DESCSET 0 9 UNUSED
3434 9 2 9
3435 11 2 UNUSED
3436 13 1 13
3437 14 18 UNUSED
3438 32 95 32
3439 127 1 UNUSED
3440 BASESET "ISO Registration Number 100//CHARSET
3441 ECMA-94 Right Part of
3442 Latin Alphabet Nr. 1//ESC 2/13 4/1"
3443
3444 DESCSET 128 32 UNUSED
3445 160 96 32
3446
3447 CAPACITY SGMLREF
3448 TOTALCAP 150000
3449 GRPCAP 150000
3450 ENTCAP 150000
3451
3452 SCOPE DOCUMENT
3453 SYNTAX
3454 SHUNCHAR CONTROLS 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
3455 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 127
3456 BASESET "ISO 646:1983//CHARSET
3457 International Reference Version
3458 (IRV)//ESC 2/5 4/0"
3459 DESCSET 0 128 0
3460 FUNCTION
3461 RE 13
3462 RS 10
3463 SPACE 32
3464 TAB SEPCHAR 9
3465
3466
3467 NAMING LCNMSTRT ""
3468 UCNMSTRT ""
3469 LCNMCHAR ".-"
3470 UCNMCHAR ".-"
3471 NAMECASE GENERAL YES
3472 ENTITY NO
3473 DELIM GENERAL SGMLREF
3474 SHORTREF SGMLREF
3475 NAMES SGMLREF
3476 QUANTITY SGMLREF
3477 ATTSPLEN 2100
3478 LITLEN 1024
3479 NAMELEN 72 -- somewhat arbitrary; taken from
3480
3481 Berners-Lee, Connolly [Page 59]
3482
3483 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3484
3485 internet line length conventions --
3486 PILEN 1024
3487 TAGLVL 100
3488 TAGLEN 2100
3489 GRPGTCNT 150
3490 GRPCNT 64
3491
3492 FEATURES
3493 MINIMIZE
3494 DATATAG NO
3495 OMITTAG YES
3496 RANK NO
3497 SHORTTAG YES
3498 LINK
3499 SIMPLE NO
3500 IMPLICIT NO
3501 EXPLICIT NO
3502 OTHER
3503 CONCUR NO
3504 SUBDOC NO
3505 FORMAL YES
3506 APPINFO "SDA" -- conforming SGML Document Access application
3507 --
3508 >
3509 <!--
3510 $Id: html.decl,v 1.17 1995/06/08 14:59:32 connolly Exp $
3511
3512 Author: Daniel W. Connolly <connolly@w3.org>
3513
3514 See also: http://www.w3.org/hypertext/WWW/MarkUp/MarkUp.html
3515 -->
3516
3517
3518 9.6. Sample SGML Open Entity Catalog for HTML
3519
3520 The SGML standard describes an ``entity manager'' as the portion
3521 or component of an SGML system that maps SGML entities into the
3522 actual storage model (e.g., the file system). The standard
3523 itself does not define a particular mapping methodology or
3524 notation.
3525
3526 To assist the interoperability among various SGML tools and
3527 systems, the SGML Open consortium has passed a technical
3528 resolution that defines a format for an application- independent
3529 entity catalog that maps external identifiers and/or entity
3530 names to file names.
3531
3532 Each entry in the catalog associates a storage object identifier
3533 (such as a file name) with information about the external entity
3534 that appears in the SGML document. In addition to entries that
3535 associate public identifiers, a catalog entry can associate an
3536 entity name with a storage object identifier. For example, the
3537 following are possible catalog entries:
3538
3539
3540 Berners-Lee, Connolly [Page 60]
3541
3542 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3543
3544 -- catalog: SGML Open style entity catalog for HTML --
3545 -- $Id: catalog,v 1.2 1994/11/30 23:45:18 connolly Exp $ --
3546
3547 -- Ways to refer to Level 2: most general to most specific --
3548 PUBLIC "-//IETF//DTD HTML//EN" html.dtd
3549 PUBLIC "-//IETF//DTD HTML 2.0//EN" html.dtd
3550 PUBLIC "-//IETF//DTD HTML Level 2//EN" html.dtd
3551 PUBLIC "-//IETF//DTD HTML 2.0 Level 2//EN" html.dtd
3552
3553 -- Ways to refer to Level 1: most general to most specific --
3554 PUBLIC "-//IETF//DTD HTML Level 1//EN" html-1.dtd
3555 PUBLIC "-//IETF//DTD HTML 2.0 Level 1//EN" html-1.dtd
3556
3557 -- Ways to refer to Level 0: most general to most specific --
3558 PUBLIC "-//IETF//DTD HTML Level 0//EN" html-0.dtd
3559 PUBLIC "-//IETF//DTD HTML 2.0 Level 0//EN" html-0.dtd
3560
3561
3562 -- Ways to refer to Strict Level 2: most general to most specif\
3563 c --
3564 PUBLIC "-//IETF//DTD HTML Strict//EN" html-s.dtd
3565 PUBLIC "-//IETF//DTD HTML 2.0 Strict//EN" html-s.dtd
3566 PUBLIC "-//IETF//DTD HTML Strict Level 2//EN" html-s.dtd
3567 PUBLIC "-//IETF//DTD HTML 2.0 Strict Level 2//EN" html-s.dtd
3568
3569 -- Ways to refer to Strict Level 1: most general to most specif\
3570 c --
3571 PUBLIC "-//IETF//DTD HTML Strict Level 1//EN" html-1s.dtd
3572 PUBLIC "-//IETF//DTD HTML 2.0 Strict Level 1//EN" html-1s.dtd
3573
3574 -- Ways to refer to Strict Level 0: most general to most specif\
3575 c --
3576 PUBLIC "-//IETF//DTD HTML Strict Level 0//EN" html-0s.dtd
3577 PUBLIC "-//IETF//DTD HTML 2.0 Strict Level 0//EN" html-0s.dtd
3578
3579 -- ISO latin 1 entity set for HTML --
3580 PUBLIC "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML" ISOlat1\
3581 sgml
3582
3583
3584 9.7. Character Entity Sets
3585
3586 The HTML DTD defines the following entities. They represent
3587 particular graphic characters which have special meanings in
3588 places in the markup, or may not be part of the character set
3589 available to the writer.
3590
3591
3592 9.7.1. Numeric and Special Graphic Entity Set
3593
3594 The following table lists each of the characters included from
3595 the Numeric and Special Graphic entity set, along with its name,
3596 syntax for use, and description. This list is derived from `ISO
3597 Standard 8879:1986//ENTITIES Numeric and Special Graphic//EN'.
3598
3599 Berners-Lee, Connolly [Page 61]
3600
3601 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3602
3603 However, HTML does not include for the entire entity set -- only
3604 the entities listed below are included.
3605
3606 GLYPH NAME SYNTAX DESCRIPTION
3607 < lt &lt; Less than sign
3608 > gt &gt; Greater than sign
3609 & amp &amp; Ampersand
3610 " quot &quot; Double quote sign
3611
3612
3613 9.7.2. ISO Latin 1 Character Entity Set
3614
3615 The following public text lists each of the characters specified
3616 in the Added Latin 1 entity set, along with its name, syntax for
3617 use, and description. This list is derived from ISO Standard
3618 8879:1986//ENTITIES Added Latin 1//EN. HTML includes the entire
3619 entity set.
3620
3621 <!-- (C) International Organization for Standardization 1986
3622 Permission to copy in any form is granted for use with
3623 conforming SGML systems and applications as defined in
3624 ISO 8879, provided this notice is included in all copies.
3625 - -->
3626 <!-- Character entity set. Typical invocation:
3627 <!ENTITY % ISOlat1 PUBLIC
3628 "ISO 8879-1986//ENTITIES Added Latin 1//EN//HTML">
3629 %ISOlat1;
3630 - -->
3631 <!-- Modified for use in HTML
3632 $Id: ISOlat1.sgml,v 1.2 1994/11/30 23:45:12 connolly Exp $ -->
3633 <!ENTITY AElig CDATA "&#198;" -- capital AE diphthong (ligature) -->
3634 <!ENTITY Aacute CDATA "&#193;" -- capital A, acute accent -->
3635 <!ENTITY Acirc CDATA "&#194;" -- capital A, circumflex accent -->
3636 <!ENTITY Agrave CDATA "&#192;" -- capital A, grave accent -->
3637 <!ENTITY Aring CDATA "&#197;" -- capital A, ring -->
3638 <!ENTITY Atilde CDATA "&#195;" -- capital A, tilde -->
3639 <!ENTITY Auml CDATA "&#196;" -- capital A, dieresis or umlaut mark -->
3640 <!ENTITY Ccedil CDATA "&#199;" -- capital C, cedilla -->
3641 <!ENTITY ETH CDATA "&#208;" -- capital Eth, Icelandic -->
3642 <!ENTITY Eacute CDATA "&#201;" -- capital E, acute accent -->
3643 <!ENTITY Ecirc CDATA "&#202;" -- capital E, circumflex accent -->
3644 <!ENTITY Egrave CDATA "&#200;" -- capital E, grave accent -->
3645 <!ENTITY Euml CDATA "&#203;" -- capital E, dieresis or umlaut mark -->
3646 <!ENTITY Iacute CDATA "&#205;" -- capital I, acute accent -->
3647 <!ENTITY Icirc CDATA "&#206;" -- capital I, circumflex accent -->
3648 <!ENTITY Igrave CDATA "&#204;" -- capital I, grave accent -->
3649 <!ENTITY Iuml CDATA "&#207;" -- capital I, dieresis or umlaut mark -->
3650 <!ENTITY Ntilde CDATA "&#209;" -- capital N, tilde -->
3651 <!ENTITY Oacute CDATA "&#211;" -- capital O, acute accent -->
3652 <!ENTITY Ocirc CDATA "&#212;" -- capital O, circumflex accent -->
3653 <!ENTITY Ograve CDATA "&#210;" -- capital O, grave accent -->
3654 <!ENTITY Oslash CDATA "&#216;" -- capital O, slash -->
3655 <!ENTITY Otilde CDATA "&#213;" -- capital O, tilde -->
3656 <!ENTITY Ouml CDATA "&#214;" -- capital O, dieresis or umlaut mark -->
3657
3658 Berners-Lee, Connolly [Page 62]
3659
3660 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3661
3662 <!ENTITY THORN CDATA "&#222;" -- capital THORN, Icelandic -->
3663 <!ENTITY Uacute CDATA "&#218;" -- capital U, acute accent -->
3664 <!ENTITY Ucirc CDATA "&#219;" -- capital U, circumflex accent -->
3665 <!ENTITY Ugrave CDATA "&#217;" -- capital U, grave accent -->
3666 <!ENTITY Uuml CDATA "&#220;" -- capital U, dieresis or umlaut mark -->
3667 <!ENTITY Yacute CDATA "&#221;" -- capital Y, acute accent -->
3668 <!ENTITY aacute CDATA "&#225;" -- small a, acute accent -->
3669 <!ENTITY acirc CDATA "&#226;" -- small a, circumflex accent -->
3670 <!ENTITY aelig CDATA "&#230;" -- small ae diphthong (ligature) -->
3671 <!ENTITY agrave CDATA "&#224;" -- small a, grave accent -->
3672 <!ENTITY aring CDATA "&#229;" -- small a, ring -->
3673 <!ENTITY atilde CDATA "&#227;" -- small a, tilde -->
3674 <!ENTITY auml CDATA "&#228;" -- small a, dieresis or umlaut mark -->
3675 <!ENTITY ccedil CDATA "&#231;" -- small c, cedilla -->
3676 <!ENTITY eacute CDATA "&#233;" -- small e, acute accent -->
3677 <!ENTITY ecirc CDATA "&#234;" -- small e, circumflex accent -->
3678 <!ENTITY egrave CDATA "&#232;" -- small e, grave accent -->
3679 <!ENTITY eth CDATA "&#240;" -- small eth, Icelandic -->
3680 <!ENTITY euml CDATA "&#235;" -- small e, dieresis or umlaut mark -->
3681 <!ENTITY iacute CDATA "&#237;" -- small i, acute accent -->
3682 <!ENTITY icirc CDATA "&#238;" -- small i, circumflex accent -->
3683 <!ENTITY igrave CDATA "&#236;" -- small i, grave accent -->
3684 <!ENTITY iuml CDATA "&#239;" -- small i, dieresis or umlaut mark -->
3685 <!ENTITY ntilde CDATA "&#241;" -- small n, tilde -->
3686 <!ENTITY oacute CDATA "&#243;" -- small o, acute accent -->
3687 <!ENTITY ocirc CDATA "&#244;" -- small o, circumflex accent -->
3688 <!ENTITY ograve CDATA "&#242;" -- small o, grave accent -->
3689 <!ENTITY oslash CDATA "&#248;" -- small o, slash -->
3690 <!ENTITY otilde CDATA "&#245;" -- small o, tilde -->
3691 <!ENTITY ouml CDATA "&#246;" -- small o, dieresis or umlaut mark -->
3692 <!ENTITY szlig CDATA "&#223;" -- small sharp s, German (sz ligature) -\
3693 >
3694 <!ENTITY thorn CDATA "&#254;" -- small thorn, Icelandic -->
3695 <!ENTITY uacute CDATA "&#250;" -- small u, acute accent -->
3696 <!ENTITY ucirc CDATA "&#251;" -- small u, circumflex accent -->
3697 <!ENTITY ugrave CDATA "&#249;" -- small u, grave accent -->
3698 <!ENTITY uuml CDATA "&#252;" -- small u, dieresis or umlaut mark -->
3699 <!ENTITY yacute CDATA "&#253;" -- small y, acute accent -->
3700 <!ENTITY yuml CDATA "&#255;" -- small y, dieresis or umlaut mark -->
3701
3702
3703 10. Security Considerations
3704
3705 Anchors, embedded images, and all other elements which contain
3706 URIs as parameters may cause the URI to be dereferenced in
3707 response to user input. In this case, the security
3708 considerations of [URL] apply.
3709
3710 The widely deployed methods for submitting forms requests --
3711 HTTP and SMTP -- provide little assurance of confidentiality.
3712 Information providers who request sensitive information via
3713 forms -- especially by way of the `PASSWORD' type input field
3714 (see 8.1.2, "Input Field: INPUT") -- should be aware and make
3715 their users aware of the lack of confidentiality.
3716
3717 Berners-Lee, Connolly [Page 63]
3718
3719 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3720
3721
3722
3723 11. References
3724
3725 [URI]
3726 T. Berners-Lee. ``Universal Resource Identifiers in WWW:
3727 A Unifying Syntax for the Expression of Names and
3728 Addresses of Objects on the Network as used in the
3729 World- Wide Web.'' RFC 1630, CERN, June 1994.
3730 <URL:ftp://ds.internic.net/rfc/rfc1630.txt>
3731
3732 [URL]
3733 T. Berners-Lee, L. Masinter, and M. McCahill. ``Uniform
3734 Resource Locators (URL).'' RFC 1738, CERN, Xerox PARC,
3735 University of Minnesota, October 1994.
3736 <URL:ftp://ds.internic.net/rfc/rfc1738.txt>
3737
3738 [HTTP]
3739 T. Berners-Lee, R. T. Fielding, and H. Frystyk Nielsen.
3740 ``Hypertext Transfer Protocol - HTTP/1.0.'' Work in
3741 Progress, MIT, UC Irvine, CERN, March 1995.
3742 <URL:ftp://ds.internic.net/internet-drafts/draft-ietf-http-v10-spec-00.ps>
3743
3744 [MIME]
3745 N. Borenstein and N. Freed. ``MIME (Multipurpose
3746 Internet Mail Extensions) Part One: Mechanisms for
3747 Specifying and Describing the Format of Internet Message
3748 Bodies.'' RFC 1521, Bellcore, Innosoft, September 1993.
3749 <URL:ftp://ds.internic.net/rfc/rfc1521.txt>
3750
3751 [RELURL]
3752 R. T. Fielding. ``Relative Uniform Resource Locators.''
3753 Work in Progress, UC Irvine, March 1995.
3754 <URL:ftp://ds.internic.net/internet-drafts/draft-ietf-uri-relative-url-06.txt>
3755
3756 [GOLD90]
3757 C. F. Goldfarb. ``The SGML Handbook.'' Y. Rubinsky, Ed.,
3758 Oxford University Press, 1990. <URL:>
3759
3760 [DEXTER]
3761 Frank Halasz and Mayer Schwartz, ``The Dexter Hypertext
3762 Reference Model'', ``Communications of the ACM'', pp.
3763 30-39, vol. 37 no. 2, Feb 1994, <URL:>
3764
3765 [IMEDIA]
3766 J. Postel. ``Media Type Registration Procedure.'',
3767 USC/ISI, March 1994.
3768 <URL:ftp://ds.internic.net/rfc/rfc1590.txt>
3769
3770 [IANA]
3771 J. Reynolds and J. Postel. ``Assigned Numbers.'' STD 2,
3772 RFC 1700, USC/ISI, October 1994.
3773 <URL:ftp://ds.internic.net/rfc/rfc1700.txt>
3774
3775
3776 Berners-Lee, Connolly [Page 64]
3777
3778 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3779
3780 [SQ91]
3781 SoftQuad. ``The SGML Primer.'' 3rd ed., SoftQuad Inc.,
3782 1991. <URL:http://www.sq.com/>
3783
3784 [ISO-646]
3785 ISO/IEC 646:1991 Information technology -- ISO 7-bit
3786 coded character set for information interchange
3787 <URL:http://www.iso.ch/cate/d4777.html>
3788
3789 [ISO-10646]
3790 ISO/IEC 10646-1:1993 Information technology -- Universal
3791 Multiple-Octet Coded Character Set (UCS) -- Part 1:
3792 Architecture and Basic Multilingual Plane
3793 <URL:http://www.iso.ch/cate/d18741.html>
3794
3795 [ISO-8859-1]
3796 ISO 8859. International Standard -- Information
3797 Processing -- 8-bit Single-Byte Coded Graphic Character
3798 Sets -- Part 1: Latin Alphabet No. 1, ISO 8859-1:1987.
3799 <URL:http://www.iso.ch/cate/d16338.html>
3800
3801 [SGML]
3802 ISO 8879. Information Processing -- Text and Office
3803 Systems - Standard Generalized Markup Language (SGML),
3804 1986. <URL:http://www.iso.ch/cate/d16387.html>
3805
3806
3807 12. Acknowledgments
3808
3809 The HTML document type was designed by Tim Berners-Lee at CERN
3810 as part of the 1990 World Wide Web project. In 1992, Dan
3811 Connolly wrote the HTML Document Type Definition (DTD) and a
3812 brief HTML specification.
3813
3814 Since 1993, a wide variety of Internet participants have
3815 contributed to the evolution of HTML, which has included the
3816 addition of in-line images introduced by the NCSA Mosaic
3817 software for WWW. Dave Raggett played an important role in
3818 deriving the FORMS material from the HTML+ specification.
3819
3820 Dan Connolly and Karen Olson Muldrow rewrote the HTML
3821 Specification in 1994. The document was then edited by the HTML
3822 working group as a whole, with updates being made by Eric
3823 Schieler, Mike Knezovich, and Eric W. Sink at Spyglass, Inc.
3824 Finally, Roy Fielding restructured the entire draft into its
3825 current form.
3826
3827 Special thanks to the many active participants in the HTML
3828 working group, too numerous to list individually, without whom
3829 there would be no standards process and no standard. That this
3830 document approaches its objective of carefully converging a
3831 description of current practice and formalization of HTML's
3832 relationship to SGML is a tribute to their effort.
3833
3834
3835 Berners-Lee, Connolly [Page 65]
3836
3837 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3838
3839 12.1. Authors' Addresses
3840
3841 Tim Berners-Lee
3842
3843 Director, W3 Consortium
3844 MIT Laboratory for Computer Science
3845 545 Technology Square
3846 Cambridge, MA 02139, U.S.A.
3847 Tel: +1 (617) 253 9670
3848 Fax: +1 (617) 258 8682
3849 Email: timbl@w3.org
3850
3851 Daniel W. Connolly
3852
3853 Research Technical Staff, W3 Consortium
3854 MIT Laboratory for Computer Science
3855 545 Technology Square
3856 Cambridge, MA 02139, U.S.A.
3857 Fax: +1 (617) 258 8682
3858 Email: connolly@w3.org
3859 URI: http://www.w3.org/hypertext/WWW/People/Connolly/
3860
3861
3862 13. The HTML Coded Character Set
3863
3864 This list details the code positions and characters of the HTML
3865 document character set, specified in 9.5, "SGML Declaration for
3866 HTML". This coded character set is based on [ISO-8859-1].
3867
3868 REFERENCE DESCRIPTION
3869 -------------- -----------
3870 &#00; - &#08; Unused
3871 &#09; Horizontal tab
3872 &#10; Line feed
3873 &#11; - &#12; Unused
3874 &#13; Carriage Return
3875 &#14; - &#31; Unused
3876 &#32; Space
3877 &#33; Exclamation mark
3878 &#34; Quotation mark
3879 &#35; Number sign
3880 &#36; Dollar sign
3881 &#37; Percent sign
3882 &#38; Ampersand
3883 &#39; Apostrophe
3884 &#40; Left parenthesis
3885 &#41; Right parenthesis
3886 &#42; Asterisk
3887 &#43; Plus sign
3888 &#44; Comma
3889 &#45; Hyphen
3890 &#46; Period (fullstop)
3891 &#47; Solidus (slash)
3892 &#48; - &#57; Digits 0-9
3893
3894 Berners-Lee, Connolly [Page 66]
3895
3896 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3897
3898 &#58; Colon
3899 &#59; Semi-colon
3900 &#60; Less than
3901 &#61; Equals sign
3902 &#62; Greater than
3903 &#63; Question mark
3904 &#64; Commercial at
3905 &#65; - &#90; Letters A-Z
3906 &#91; Left square bracket
3907 &#92; Reverse solidus (backslash)
3908 &#93; Right square bracket
3909 &#94; Caret
3910 &#95; Horizontal bar (underscore)
3911 &#96; Acute accent
3912 &#97; - &#122; Letters a-z
3913 &#123; Left curly brace
3914 &#124; Vertical bar
3915 &#125; Right curly brace
3916 &#126; Tilde
3917 &#127; - &#159; Unused
3918 &#160; Non-breaking Space
3919 &#161; Inverted exclamation
3920 &#162; Cent sign
3921 &#163; Pound sterling
3922 &#164; General currency sign
3923 &#165; Yen sign
3924 &#166; Broken vertical bar
3925 &#167; Section sign
3926 &#168; Umlaut (dieresis)
3927 &#169; Copyright
3928 &#170; Feminine ordinal
3929 &#171; Left angle quote, guillemotleft
3930 &#172; Not sign
3931 &#173; Soft hyphen
3932 &#174; Registered trademark
3933 &#175; Macron accent
3934 &#176; Degree sign
3935 &#177; Plus or minus
3936 &#178; Superscript two
3937 &#179; Superscript three
3938 &#180; Acute accent
3939 &#181; Micro sign
3940 &#182; Paragraph sign
3941 &#183; Middle dot
3942 &#184; Cedilla
3943 &#185; Superscript one
3944 &#186; Masculine ordinal
3945 &#187; Right angle quote, guillemotright
3946 &#188; Fraction one-fourth
3947 &#189; Fraction one-half
3948 &#190; Fraction three-fourths
3949 &#191; Inverted question mark
3950 &#192; Capital A, grave accent
3951 &#193; Capital A, acute accent
3952
3953 Berners-Lee, Connolly [Page 67]
3954
3955 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
3956
3957 &#194; Capital A, circumflex accent
3958 &#195; Capital A, tilde
3959 &#196; Capital A, dieresis or umlaut mark
3960 &#197; Capital A, ring
3961 &#198; Capital AE dipthong (ligature)
3962 &#199; Capital C, cedilla
3963 &#200; Capital E, grave accent
3964 &#201; Capital E, acute accent
3965 &#202; Capital E, circumflex accent
3966 &#203; Capital E, dieresis or umlaut mark
3967 &#204; Capital I, grave accent
3968 &#205; Capital I, acute accent
3969 &#206; Capital I, circumflex accent
3970 &#207; Capital I, dieresis or umlaut mark
3971 &#208; Capital Eth, Icelandic
3972 &#209; Capital N, tilde
3973 &#210; Capital O, grave accent
3974 &#211; Capital O, acute accent
3975 &#212; Capital O, circumflex accent
3976 &#213; Capital O, tilde
3977 &#214; Capital O, dieresis or umlaut mark
3978 &#215; Multiply sign
3979 &#216; Capital O, slash
3980 &#217; Capital U, grave accent
3981 &#218; Capital U, acute accent
3982 &#219; Capital U, circumflex accent
3983 &#220; Capital U, dieresis or umlaut mark
3984 &#221; Capital Y, acute accent
3985 &#222; Capital THORN, Icelandic
3986 &#223; Small sharp s, German (sz ligature)
3987 &#224; Small a, grave accent
3988 &#225; Small a, acute accent
3989 &#226; Small a, circumflex accent
3990 &#227; Small a, tilde
3991 &#228; Small a, dieresis or umlaut mark
3992 &#229; Small a, ring
3993 &#230; Small ae dipthong (ligature)
3994 &#231; Small c, cedilla
3995 &#232; Small e, grave accent
3996 &#233; Small e, acute accent
3997 &#234; Small e, circumflex accent
3998 &#235; Small e, dieresis or umlaut mark
3999 &#236; Small i, grave accent
4000 &#237; Small i, acute accent
4001 &#238; Small i, circumflex accent
4002 &#239; Small i, dieresis or umlaut mark
4003 &#240; Small eth, Icelandic
4004 &#241; Small n, tilde
4005 &#242; Small o, grave accent
4006 &#243; Small o, acute accent
4007 &#244; Small o, circumflex accent
4008 &#245; Small o, tilde
4009 &#246; Small o, dieresis or umlaut mark
4010 &#247; Division sign
4011
4012 Berners-Lee, Connolly [Page 68]
4013
4014 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
4015
4016 &#248; Small o, slash
4017 &#249; Small u, grave accent
4018 &#250; Small u, acute accent
4019 &#251; Small u, circumflex accent
4020 &#252; Small u, dieresis or umlaut mark
4021 &#253; Small y, acute accent
4022 &#254; Small thorn, Icelandic
4023 &#255; Small y, dieresis or umlaut mark
4024
4025
4026 14. Proposed Entities
4027
4028 The HTML DTD references the ``Added Latin 1'' entity set, which
4029 only supplies named entities for a subset of the non-ASCII
4030 characters in [ISO-8859-1], namely the accented characters. The
4031 following entities should be supported so that all ISO 8859-1
4032 characters may only be referenced symbolically. The names for
4033 these entities are taken from the appendixes of [SGML].
4034
4035 <!ENTITY nbsp CDATA "&#160;" -- no-break space -->
4036 <!ENTITY iexcl CDATA "&#161;" -- inverted exclamation mark -->
4037 <!ENTITY cent CDATA "&#162;" -- cent sign -->
4038 <!ENTITY pound CDATA "&#163;" -- pound sterling sign -->
4039 <!ENTITY curren CDATA "&#164;" -- general currency sign -->
4040 <!ENTITY yen CDATA "&#165;" -- yen sign -->
4041 <!ENTITY brvbar CDATA "&#166;" -- broken (vertical) bar -->
4042 <!ENTITY sect CDATA "&#167;" -- section sign -->
4043 <!ENTITY uml CDATA "&#168;" -- umlaut (dieresis) -->
4044 <!ENTITY copy CDATA "&#169;" -- copyright sign -->
4045 <!ENTITY ordf CDATA "&#170;" -- ordinal indicator, feminine -->
4046 <!ENTITY laquo CDATA "&#171;" -- angle quotation mark, left -->
4047 <!ENTITY not CDATA "&#172;" -- not sign -->
4048 <!ENTITY shy CDATA "&#173;" -- soft hyphen -->
4049 <!ENTITY reg CDATA "&#174;" -- registered sign -->
4050 <!ENTITY macr CDATA "&#175;" -- macron -->
4051 <!ENTITY deg CDATA "&#176;" -- degree sign -->
4052 <!ENTITY plusmn CDATA "&#177;" -- plus-or-minus sign -->
4053 <!ENTITY sup2 CDATA "&#178;" -- superscript two -->
4054 <!ENTITY sup3 CDATA "&#179;" -- superscript three -->
4055 <!ENTITY acute CDATA "&#180;" -- acute accent -->
4056 <!ENTITY micro CDATA "&#181;" -- micro sign -->
4057 <!ENTITY para CDATA "&#182;" -- pilcrow (paragraph sign) -->
4058 <!ENTITY middot CDATA "&#183;" -- middle dot -->
4059 <!ENTITY cedil CDATA "&#184;" -- cedilla -->
4060 <!ENTITY sup1 CDATA "&#185;" -- superscript one -->
4061 <!ENTITY ordm CDATA "&#186;" -- ordinal indicator, masculine -->
4062 <!ENTITY raquo CDATA "&#187;" -- angle quotation mark, right -->
4063 <!ENTITY frac14 CDATA "&#188;" -- fraction one-quarter -->
4064 <!ENTITY frac12 CDATA "&#189;" -- fraction one-half -->
4065 <!ENTITY frac34 CDATA "&#190;" -- fraction three-quarters -->
4066 <!ENTITY iquest CDATA "&#191;" -- inverted question mark -->
4067 <!ENTITY Agrave CDATA "&#192;" -- capital A, grave accent -->
4068 <!ENTITY Aacute CDATA "&#193;" -- capital A, acute accent -->
4069 <!ENTITY Acirc CDATA "&#194;" -- capital A, circumflex accent -->
4070
4071 Berners-Lee, Connolly [Page 69]
4072
4073 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
4074
4075 <!ENTITY Atilde CDATA "&#195;" -- capital A, tilde -->
4076 <!ENTITY Auml CDATA "&#196;" -- capital A, dieresis or umlaut mark -->
4077 <!ENTITY Aring CDATA "&#197;" -- capital A, ring -->
4078 <!ENTITY AElig CDATA "&#198;" -- capital AE diphthong (ligature) -->
4079 <!ENTITY Ccedil CDATA "&#199;" -- capital C, cedilla -->
4080 <!ENTITY Egrave CDATA "&#200;" -- capital E, grave accent -->
4081 <!ENTITY Eacute CDATA "&#201;" -- capital E, acute accent -->
4082 <!ENTITY Ecirc CDATA "&#202;" -- capital E, circumflex accent -->
4083 <!ENTITY Euml CDATA "&#203;" -- capital E, dieresis or umlaut mark -->
4084 <!ENTITY Igrave CDATA "&#204;" -- capital I, grave accent -->
4085 <!ENTITY Iacute CDATA "&#205;" -- capital I, acute accent -->
4086 <!ENTITY Icirc CDATA "&#206;" -- capital I, circumflex accent -->
4087 <!ENTITY Iuml CDATA "&#207;" -- capital I, dieresis or umlaut mark -->
4088 <!ENTITY ETH CDATA "&#208;" -- capital Eth, Icelandic -->
4089 <!ENTITY Ntilde CDATA "&#209;" -- capital N, tilde -->
4090 <!ENTITY Ograve CDATA "&#210;" -- capital O, grave accent -->
4091 <!ENTITY Oacute CDATA "&#211;" -- capital O, acute accent -->
4092 <!ENTITY Ocirc CDATA "&#212;" -- capital O, circumflex accent -->
4093 <!ENTITY Otilde CDATA "&#213;" -- capital O, tilde -->
4094 <!ENTITY Ouml CDATA "&#214;" -- capital O, dieresis or umlaut mark -->
4095 <!ENTITY times CDATA "&#215;" -- multiply sign -->
4096 <!ENTITY Oslash CDATA "&#216;" -- capital O, slash -->
4097 <!ENTITY Ugrave CDATA "&#217;" -- capital U, grave accent -->
4098 <!ENTITY Uacute CDATA "&#218;" -- capital U, acute accent -->
4099 <!ENTITY Ucirc CDATA "&#219;" -- capital U, circumflex accent -->
4100 <!ENTITY Uuml CDATA "&#220;" -- capital U, dieresis or umlaut mark -->
4101 <!ENTITY Yacute CDATA "&#221;" -- capital Y, acute accent -->
4102 <!ENTITY THORN CDATA "&#222;" -- capital THORN, Icelandic -->
4103 <!ENTITY szlig CDATA "&#223;" -- small sharp s, German (sz ligature) -->
4104 <!ENTITY agrave CDATA "&#224;" -- small a, grave accent -->
4105 <!ENTITY aacute CDATA "&#225;" -- small a, acute accent -->
4106 <!ENTITY acirc CDATA "&#226;" -- small a, circumflex accent -->
4107 <!ENTITY atilde CDATA "&#227;" -- small a, tilde -->
4108 <!ENTITY auml CDATA "&#228;" -- small a, dieresis or umlaut mark -->
4109 <!ENTITY aring CDATA "&#229;" -- small a, ring -->
4110 <!ENTITY aelig CDATA "&#230;" -- small ae diphthong (ligature) -->
4111 <!ENTITY ccedil CDATA "&#231;" -- small c, cedilla -->
4112 <!ENTITY egrave CDATA "&#232;" -- small e, grave accent -->
4113 <!ENTITY eacute CDATA "&#233;" -- small e, acute accent -->
4114 <!ENTITY ecirc CDATA "&#234;" -- small e, circumflex accent -->
4115 <!ENTITY euml CDATA "&#235;" -- small e, dieresis or umlaut mark -->
4116 <!ENTITY igrave CDATA "&#236;" -- small i, grave accent -->
4117 <!ENTITY iacute CDATA "&#237;" -- small i, acute accent -->
4118 <!ENTITY icirc CDATA "&#238;" -- small i, circumflex accent -->
4119 <!ENTITY iuml CDATA "&#239;" -- small i, dieresis or umlaut mark -->
4120 <!ENTITY eth CDATA "&#240;" -- small eth, Icelandic -->
4121 <!ENTITY ntilde CDATA "&#241;" -- small n, tilde -->
4122 <!ENTITY ograve CDATA "&#242;" -- small o, grave accent -->
4123 <!ENTITY oacute CDATA "&#243;" -- small o, acute accent -->
4124 <!ENTITY ocirc CDATA "&#244;" -- small o, circumflex accent -->
4125 <!ENTITY otilde CDATA "&#245;" -- small o, tilde -->
4126 <!ENTITY ouml CDATA "&#246;" -- small o, dieresis or umlaut mark -->
4127 <!ENTITY divide CDATA "&#247;" -- divide sign -->
4128 <!ENTITY oslash CDATA "&#248;" -- small o, slash -->
4129
4130 Berners-Lee, Connolly [Page 70]
4131
4132 INTERNET-DRAFT Hypertext Markup Language - 2.0 June 16, 1995
4133
4134 <!ENTITY ugrave CDATA "&#249;" -- small u, grave accent -->
4135 <!ENTITY uacute CDATA "&#250;" -- small u, acute accent -->
4136 <!ENTITY ucirc CDATA "&#251;" -- small u, circumflex accent -->
4137 <!ENTITY uuml CDATA "&#252;" -- small u, dieresis or umlaut mark -->
4138 <!ENTITY yacute CDATA "&#253;" -- small y, acute accent -->
4139 <!ENTITY thorn CDATA "&#254;" -- small thorn, Icelandic -->
4140 <!ENTITY yuml CDATA "&#255;" -- small y, dieresis or umlaut mark -->
4141
4142
4143
4144
4145
4146
4147
4148
4149
4150
4151
4152
4153
4154
4155
4156
4157
4158
4159
4160
4161
4162
4163
4164
4165
4166
4167
4168
4169
4170
4171
4172
4173
4174
4175
4176
4177
4178
4179
4180
4181
4182
4183
4184
4185
4186
4187
4188
4189 Berners-Lee, Connolly [Page 71]
4190
4191
4192

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24