/[suikacvs]/webroot/www/2004/id/draft-ietf-urn-biblio-01.txt
Suika

Contents of /webroot/www/2004/id/draft-ietf-urn-biblio-01.txt

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.1 - (show annotations) (download)
Tue Jun 15 08:04:06 2004 UTC (19 years, 11 months ago) by wakaba
Branch: MAIN
CVS Tags: HEAD
File MIME type: text/plain
New

1
2 Internet Draft Clifford Lynch
3 September 8. 1997 Coalition for Networked Information
4 draft-ietf-urn-biblio-01.txt Cecilia Preston
5 Preston & Lynch
6 Ron Daniel Jr.
7 Los Alamos National Laboratory
8
9
10 Using Existing Bibliographic Identifiers
11 as
12 Uniform Resource Names
13
14
15 Status of this Document
16
17 This document is an Internet-Draft. Internet-Drafts are
18 working documents of the Internet Engineering Task Force
19 (IETF), its areas, and its working groups. Note that other
20 groups may also distribute working documents as Internet-
21 Drafts.
22
23 Internet-Drafts are draft documents valid for a maximum of
24 six months and may be updated, replaced or made obsolete by
25 other documents at any time. It is inappropriate to use
26 Internet-Drafts as reference material or to cite them other
27 than as works in progress.
28
29 Distribution of this document is unlimited.
30
31 This document does not specify a standard; it is purely
32 informational.
33
34
35 0. Abstract
36
37 A system for Uniform Resource Names (URNs) must be capable
38 of supporting identifiers from existing widely-used naming
39 systems. This document discusses how three major
40 bibliographic identifiers (the ISBN, ISSN and SICI) can be
41 supported within the URN framework and the currently
42 proposed syntax for URNs.
43
44 [Page 1]
45
46 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
47
48
49 1. Introduction
50
51 The ongoing work of several IETF working groups, most
52 recently in the Uniform Resource Names working group, has
53 culminated the development of a syntax for Uniform Resource
54 Names (URNs). The functional requirements and overall
55 framework for Uniform Resource Names are specified in RFC
56 1737 [Sollins & Masinter] and the specification for the
57 URN syntax is RFC 2141 [Moats].
58
59 As part of the validation process for the development of
60 URNs the IETF working group has agreed that it is important
61 to demonstrate that the current URN syntax proposal can
62 accommodate existing identifiers from well established
63 namespaces. One such infrastructure for assigning and
64 managing names comes from the bibliographic community.
65 Bibliographic identifiers function as names for objects that
66 exist both in print and, increasingly, in electronic formats.
67 This Internet draft demonstrates the feasibility of
68 supporting three representative bibliographic identifiers
69 within the currently proposed URN framework and syntax.
70
71 Note that this document does not purport to define the
72 "official" standard way ofmoving these bibliographic
73 identifiers into URNs; it merely demonstrates feasibility.
74 It has not been developed in consultation with these
75 standards bodies and maintenance agencies that oversee the
76 existing bibliographic identifiers. Any actual Internet
77 standard for encoding these bibliographic identifiers as
78 URNs will need to be developed in consultation with the
79 responsible standards bodies and maintenance agencies.
80
81 In addition, there are several open questions with regard to
82 the management and registry of Namespace Identifiers (NIDs)
83 for URNs. For purposes of illustration, we have used the
84 three NIDs "ISBN", "ISSN" and "SICI" for the three
85 corresponding bibliographic identifiers discussed in this
86 document. While we believe this to be the most appropriate
87 choice, it is not the only one. The NIDs could be based on
88
89 [Page 2]
90
91 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
92
93
94 the standards body and standard number (e.g. "US-ANSI-NISO-
95 Z39.56-1997" rather than "SICI"). Alternatively, one could
96 lump all bibliographic identifiers into a single
97 "BIBLIOGRAPHIC" name space, and structure the namespace-
98 specific string to specify which identifier is being used.
99 Any final resolution of this must wait for the outcome of
100 namespace management discussions in the working group and
101 the broader IETF community.
102
103 For the purposes of this document, we have selected three
104 major bibliographic identifiers (national and international)
105 to fit within the URN framework. These are the International
106 Standard Book Number (ISBN) [ISO1], the International
107 Standard Serials Number (ISSN) [NISO1,ISO2, ISO3], and the
108 Serial Item and Contribution Identifier (SICI) [NISO2]. An
109 ISBN is used to identify a monograph (book). An ISSN is used
110 to identify serial publications (journals, newspapers) as a
111 whole. A SICI augments the ISSN in order to identify
112 individual issues of serial publications, or components
113 within those issues (such as an individual article, or the
114 table of contents of a given issue). The ISBN and ISSN are
115 defined in the United States by standards issued by the
116 National Information Standards Organization (NISO) and also
117 by parallel international standards issued under the auspices
118 of the International Organization for Standardization (ISO).
119 NISO is the ANSI-accredited standards body serving libraries,
120 publishers and information services. The SICI code is
121 defined by a NISO document in the United States and does not
122 have a parallel international standards document at present.
123
124 Many other bibliographic identifiers are in common use (for
125 example, CODEN, numbers assigned by major bibliographic
126 utilities such as OCLC and RLG, national library numbers such
127 as the Library of Congress Control Number) or are under
128 development. While we do not discuss them in this document,
129 many of these will also need to be supported within the URN
130 framework as it moves to large scale implementation. The
131 issues involved in supporting those additional identifiers
132 are anticipated to be broadly similar to those involved in
133 supporting ISBNs, ISSNs, and SICIs.
134
135 [Page 3]
136
137 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
138
139
140 2. Identification vs. Resolution
141
142 It is important to distinguish between the resource
143 identified by a URN and the resources a URN resolver that can
144 reasonably return when attempting to resolve an identifier.
145 For example, the ISSN 0040-781X identifies the popular
146 magazine "Time" -- all of it, every issue for from the start
147 of publication to present. Resolving such an identifier
148 should not result in the equivalent of hundreds of thousands
149 of pages of text and photos being dumped to the user's
150 machine. It is more reasonable for ISSNs to resolve to a
151 navigational system, such as an HTML-based search form, so
152 the user may select issues or articles of interest. ISBNs
153 and SICIs, on the other hand, do identify finite, manageably-
154 sized objects, but these objects may still be large enough
155 that resolution to a hierarchical system is appropriate.
156
157 In addition, the materials identified by an ISSN, ISBN or
158 SICI may exist only in printed or other physical form, not
159 electronically. The best that a resolver may be able to
160 offer is information about where to get the physical
161 resource, such as library holdings or a bookstore or
162 publisher order form. The URN Framework provides resolution
163 services that may be used to describe any differences
164 between the resource identified by a URN and the resource
165 that would be returned as a result of resolving that URN.
166
167
168 3. International Standard Book Numbers
169
170 3.1 Overview
171
172 An International Standard Book Number (ISBN) identifies an
173 edition of a monographic work. The ISBN is defined by the
174 standard NISO/ANSI/ISO 2108:1992 [ISO1]
175
176 Basically, an ISBN is a ten-digit number (actually, the last
177 digit can be the letter "X" as well, as described below)
178
179 [Page 4]
180
181 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
182
183
184 which is divided into four variable length parts usually
185 separated by hyphens when printed. The parts are as follows
186 (in this order):
187
188 * a group identifier which specifies a group of publishers,
189 based on national, geographic or some other criteria,
190
191 * the publisher identifier,
192
193 * the title identifier,
194
195 * and a modulus 11 check digit, using X instead of 10.
196
197 The group and publisher number assignments are managed in
198 such a way that the hyphens are not needed to parse the ISBN
199 unambiguously into its constituent parts. However, the ISBN
200 is normally transmitted and displayed with hyphens to make
201 it easy for human beings to recognize these parts without
202 having to make reference to or have knowledge of the number
203 assignments for group and publisher identifiers.
204
205 3.2 Encoding Considerations and Lexical Equivalance
206
207 Embedding ISBNs within the URN framework presents no
208 particular encoding problems, since all of the characters
209 that can appear in an ISBN are valid in the identifier
210 segment of the URN. %-encoding, as described in [MOATS] is
211 never needed.
212
213 Example: URN:ISBN:0-395-36341-1
214
215 For the ISBN namespace, some additional equivalence rules
216 are appropriate. Prior to comparing two ISBN URNs for
217 equivalence, it is appropriate to remove all hyphens, and to
218 convert any occurrences of the letter X to upper case.
219
220 3.3 Additional considerations
221
222 The ISBN standard and related community implementation
223 guidelines define when different versions of a work should
224
225 [Page 5]
226
227 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
228
229
230 be assigned the same or differing ISBNs. In actuality,
231 however, practice varies somewhat depending on publisher as
232 to whether different ISBNs are assigned for paperbound vs.
233 hardbound versions of the same work, electronic vs. printed
234
235 versions of the same work, or versions of the same work
236 distinguished in some other way (e.g.published for example in
237 the US and in Europe). The choice of whether to assign a new
238 ISBN or to reuse an existing one when publishing a revised
239 printing of an existing edition of a work or even a revised
240 edition of a work is somewhat subjective. Practice varies
241 from publisher to publisher (indeed, the distinction between
242 a revised printing and a new edition is itself somewhat
243 subjective). The use of ISBNs within the URN framework
244 simply reflects these existing practices. Note that it is
245 likely that an ISBN URN will often resolve to many instances
246 of the work (many URLs).
247
248 4. International Standard Serials Numbers
249
250 4.1 Overview
251
252 International Standard Serials Numbers (ISSN) identify a
253 work that is published on a continued basis in issues; they
254 identify the entire (often open-ended, in the case of an
255 actively published) work. ISSNs are defined by the
256 international standards ISO 3297:1986 [ISO2] and ISO/DIS
257 3297 [ISO3] and within the United States by NISO Z39.9-1992
258 [NISO1]. The ISSN International Centre is located in Paris and coordinates
259 a network of regional centers. The National
260 Serials Data Program within the Library of Congress is the US
261 Center of this network.
262
263 ISSNs have the form NNNN-NNNN where N is a digit, the last
264 digit may be an upper case X as the result of the check
265 character calculation. Unlike the ISBN the ISSN components
266 do not have much structure; blocks of numbers are passed out
267 to the regional centers and publishers.
268
269 [Page 6]
270
271
272 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
273
274
275 4.2 Encoding Considerations and Lexical Equivalance
276
277 Again, there is no problem representing ISSNs in the
278 namespace-specific string of URNs since all characters valid
279 in the ISSN are valid in the namespace-specific URN string,
280 and %-encoding is never required.
281
282 Example: URN:ISSN:1046-8188
283
284 Supplementary comparison rules are also appropriate for the
285 ISSN namespace. Just as for ISBNs, hyphens should be
286 dropped prior to comparison and occurrences of 'x'
287 normalized to uppercase.
288
289 4.3 Additional Considerations
290
291 The ISSN standard and related community implementation
292 guidelines specify when new ISSNs should be assigned vs.
293 continuing to use an existing one. There are some
294 publications where practice within the bibliographic
295 community varies from institution to insitution, such as
296 annuals or annual conference proceedings. In some cases
297 these are treated as serials and ISSNs are used, and in some
298 cases they are treated as monographs and ISBNs are used. For
299 example SIGMOD Record volume 24 number 2 June 1995 contains
300 the Proceedings of the 1995 ACM SIGMOD International
301 Conference on Management of Data. If you subscribe to the
302 journal (ISSN 0163-5808) this is simply the June issue. On
303 the other hand you may have acquired this volume as the
304 conference proceedings (a monograph) and as such would use
305 the ISBN 0-89791-731-6 to identify the work. There are also
306 varying practices within the publishing community as to when
307 new ISSNs are assigned due to the change in the name of a
308 periodical (e.g. Atlantic becomes Atlantic Monthly); or when
309 a periodical is published both in printed and electronic
310 versions (e.g. The New York Times). The use of ISSNs in URNs
311 will reflect these judgments and practices.
312
313
314 [Page 7]
315
316
317 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
318
319
320 5. Serial Item and Contribution Identifiers
321
322 5.1 Overview
323
324 The standard for Serial Item and Contribution Identifiers
325 (SICI) codes, which has recently been extensively revised,
326 is defined by NISO/ANSI Z39.56-1997 [NISO2]. The maintenance
327 agency for the SICI code is the UnCover Corporation.
328
329 SICI codes can be used to identify an issue of a serial, or
330 a specific contribution (e.g., an article, or the table of
331 contents) within an issue of a serial. SICI codes are not
332 assigned, they are constructed based on information about
333 the issue or issue component in question.
334
335 The complete syntax for the SICI code will not be discussed
336 here; see NISO/ANSI Z39.56-1997 [NISO2] for details.
337 However, an example and brief review of the major components
338 is needed to understand the relationship with the ISSN and
339 how this identifier differs from an ISSN. An example of a
340 SICI code is: 0015-6914(19960101)157:1<62:KTSW>2.0.TX;2-F
341
342 The first nine characters are the ISSN identifying the
343 serial title. The second component, in parentheses, is the
344 chronology information giving the date the particular serial
345 issue was published. In this example that date was January
346 1, 1996. The third component, 157:1, is enumeration
347 information (volume, number) for the particular issue of the
348 serial. These three components comprise the "item segment"
349 of a SICI code. By augmenting the ISSN with the chronology
350 and/or enumeration information, specific issues of the
351 serial can be identified. The next segment, <62:KTSW>,
352 identifies a particular contribution within the issue. In
353 this example we provide the starting page number and a title
354 code constructed from the initial characters of the title.
355 Identifiers assigned to a contribution can be used in the
356 contribution segment if page numbers are inappropriate. The
357 rest of the identifier is the control segment, which
358 includes a check character. Interested readers are
359
360 [Page 8]
361
362 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
363
364
365 encouraged to consult the standard for an explanation of the
366 fields in that segment.
367
368 5.2 Encoding Considerations and Lexical Equivalance
369
370 The character set for SICIs is intended to be email-
371 transport-transparent, so it does not present major problems.
372 However, all printable excluded and reserved characters from
373 the URN syntax are valid in the SICI character set and must
374 be %-encoded.
375
376 Example of a SICI for an issue of a journal:
377
378 URN:SICI:1046-8188(199501)13:1%3C%3E1.0.TX;2-F
379
380 For an article contained within that issue:
381
382 URN:SICI:1046-8188(199501)13:1%3C69:FTTHBI%3E2.0.TX;2-4
383
384 Equivalence rules for SICIs are not appropriate for
385 definition as part of the namespace and incorporation in
386 areas such as cache management algorithms. It is best left
387 to resolver systems which try to determine if two SICIs refer
388 to the same content. Consequently, we do not propose any
389 specific rules for equivalence testing through lexical manipulation.
390
391 5.3 Additional Considerations
392
393 Since the serial is identified by an ISSN, some of the
394 ambiguity currently found in the assignment of ISSNs carries
395 over into SICI codes. In cases where an ISSN may refer to a
396 serial that exists in multiple formats, the SICI contains a
397 qualifier that specifies the format type (for example,
398 print, microform, or electronic). SICI codes may be
399 constructed from a variety of sources (the actual issue of
400 the serial, a citation or a record from an abstracting
401 service) and, as such are based on the principle of using
402 all available information, so there may be multiple SICI
403
404 [Page 9]
405 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
406
407 codes representing the same article [NISO2, Appenidx D].
408 For example, one code might be constructed with access to
409 both chronology and enumeration (that is, date of issue and
410 volume, issue and page number), another code might be
411 constructed based only on enumeration information and
412 without benefit of chronology. Systems that use SICI codes
413 employ complex matching algorithms to try to match SICI
414 codes constructed from incomplete information to SICI codes
415 constructed with the benefit of all relevant information.
416
417 6. Security Considerations
418
419 This document proposes means of encoding several existing
420 bibliographic identifiers within the URN framework. This
421 documentent does not discuss resolution; thus questions of
422 secure or authenticated resolution mechanisms are out of
423 scope. It does not address means of validating the integrity
424 or authenticating the source or provenance of URNs that
425 contain bibliographic identifiers. Issues regarding
426 intellectual property rights associated with objects
427 identified by the various bibliographic identifiers are also
428 beyond the scope of this document, as are questions about
429 rights to the databases that might be used to construct
430 resolvers.
431
432 7. References
433
434 [ISO1] NISO/ANSI/ISO 2108:1992 Information and documentation
435 -- International standard book number (ISBN)
436
437 [ISO2] ISO 3297:1986 Documentation -- International standard
438 serial numbering (ISSN)
439
440 [ISO3] ISO/DIS 3297 Information and documentation --
441 International standard serial numbering (ISSN)
442 (Revision of ISO 3297:1986)
443
444 [Moats] R. Moats, URN Syntax RFC 2141 May 1997.
445
446 [NISO 1] NISO/ANSI Z39.9-1992 International standard serial
447 numbering (ISSN)
448
449 [Page 10]
450
451 INTERNET DRAFT: Bibliographic Identifiers as URNs 9/1997
452
453
454 [NISO 2] NISO/ANSI Z39.56-1997 Serial Item and Contribution
455 Identifier
456 [Sollins & Masinter] K. Sollins and L. Masinter, "Functional
457 Requirements for Uniform Resource Names", RFC 1737
458 December 1994.
459
460
461
462 8. Author's Addresses
463
464 Clifford Lynch
465 Executive Director
466 Coalition for Networked Information
467 21 Dupont Circle
468 Washington, DC 20036
469 cliff@cni.org
470
471 Cecilia Preston
472 Preston & Lynch
473 PO Box 8310
474 Emeryville, CA 94662
475 cecilia@well.com
476
477 Ron Daniel Jr.
478 Advanced Computing Lab, MS B287
479 Los Alamos National Laboratory
480 Los Alamos, NM, 87545
481 rdaniel@acl.lanl.gov
482
483
484
485
486
487
488
489
490
491
492
493 [Page 11]
494
495
496
497

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24