/[suikacvs]/webroot/www/2004/id/draft-ietf-urn-biblio-00.txt
Suika

Contents of /webroot/www/2004/id/draft-ietf-urn-biblio-00.txt

Parent Directory Parent Directory | Revision Log Revision Log


Revision 1.1 - (hide annotations) (download)
Tue Jun 15 08:04:06 2004 UTC (20 years, 10 months ago) by wakaba
Branch: MAIN
CVS Tags: HEAD
File MIME type: text/plain
New

1 wakaba 1.1
2     Internet Draft Clifford Lynch
3     draft-ietf-urn-biblio-00.txt University of California
4     22 March 1997 Cecilia Preston
5     Expires in six months Preston & Lynch
6     Ron Daniel Jr.
7     Los Alamos National Laboratory
8    
9    
10     Using Existing Bibliographic Identifiers
11     as
12     Uniform Resource Names
13    
14    
15     Status of this Document
16    
17     This document is an Internet-Draft. Internet-Drafts are
18     working documents of the Internet Engineering Task Force
19     (IETF), its areas, and its working groups. Note that other
20     groups may also distribute working documents as Internet-
21     Drafts.
22    
23     Internet-Drafts are draft documents valid for a maximum of
24     six months and may be updated, replaced or made obsolete by
25     other documents at any time. It is inappropriate to use
26     Internet-Drafts as reference material or to cite them other
27     than as works in progress.
28    
29     Distribution of this document is unlimited. Please send
30     comments to clifford.lynch@ucop.edu and cecilia@well.com.
31    
32     This document does not specify a standard; it is purely
33     informational.
34    
35    
36     0. Abstract
37    
38     A system for Uniform Resource Names (URNs) must be capable
39     of supporting identifiers from existing widely-used naming
40     systems. This document discusses how three major
41     bibliographic identifiers (the ISBN, ISSN and SICI) can be
42     supported within the URN framework and the currently
43     proposed syntax for URNs.
44    
45    
46    
47     [Page 1]
48    
49     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
50    
51    
52     1. Introduction
53    
54     The ongoing work of several IETF working groups, most
55     recently in the Uniform Resource Names working group, has
56     culminated the development of a syntax for Uniform Resource
57     Names (URNs). The functional requirements and overall
58     framework for Uniform Resource Names are specified in RFC
59     1737 [Sollins & Masinter] and the current proposal for the
60     URN syntax is draft-ietf-urn-syntax-04.txt [Moats].
61    
62     As part of the validation process for the development of
63     URNs the IETF working group has agreed that it is important
64     to demonstrate that the current URN syntax proposal can
65     accommodate existing identifiers from well managed
66     namespaces. One such well-established infrastructure for
67     assigning and managing names comes from the bibliographic
68     community. Bibliographic identifiers function as names for
69     objects that exist both in print and, increasingly, in
70     electronic formats. This Internet draft demonstrates the
71     feasibility of supporting three representative bibliographic
72     identifiers within the currently proposed URN framework and
73     syntax.
74    
75     Note that this document does not purport to define the
76     "official" standard way of doing so; it merely demonstrates
77     feasibility. It has not been developed in consultation with
78     the standards bodies and maintenance agencies that oversee
79     the existing bibliographic identifiers. Any actual Internet
80     standard for encoding these bibliographic identifiers as
81     URNs will need to be developed in consultation with the
82     responsible standards bodies and maintenance agencies.
83    
84     In addition, there are several open questions with regard to
85     the management and registry of Namespace Identifiers (NIDs)
86     for URNs. For purposes of illustration, we have used the
87     three NIDs "ISBN", "ISSN" and "SICI" for the three
88     corresponding bibliographic identifiers discussed in this
89     document. While we believe this to be the most appropriate
90     choice, it is not the only one. The NIDs could be based on
91     the standards body and standard number (e.g. "US-ANSI-NISO-
92     Z39.56-1997" rather than "SICI"). Alternatively, one could
93     lump all bibliographic identifiers into a single
94     "BIBLIOGRAPHIC" name space, and structure the namespace-
95    
96     [Page 2]
97    
98     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
99    
100     specific string to specify which identifier is being used.
101     We do not believe that these are advantageous approaches,
102     but must wait for the outcome of namespace management
103     discussions in the working group.
104    
105     For the purposes of this document, we have selected three
106     major bibliographic identifiers (national and international)
107     to fit within the URN framework. These are the
108     International Standard Book Number (ISBN) [ISO1], the
109     International Standard Serials Number (ISSN) [NISO1,ISO2,
110     ISO3], and the Serial Item and Contribution Identifier
111     (SICI) [NISO2]. ISBNs are used to identify monographs
112     (books). ISSNs are used to identify serial publications
113     (journals, newspapers) as a whole. SICIs augment the ISSN
114     in order to identify individual issues of serial
115     publications, or components within those issues (such as an
116     individual article, or the table of contents of a given
117     issue). The ISBN and ISSN are defined in the United States
118     by standards issued by the National Information Standards
119     Organization (NISO) and also by parallel international
120     standards issued under the auspices of the International
121     Organization for Standardization (ISO). NISO is the ANSI-
122     accredited standards body serving libraries, publishers and
123     information services. The SICI code is defined by a NISO
124     document in the United States and does not have a parallel
125     international standards document at present.
126    
127     Many other bibliographic identifiers are in common use (for
128     example, the CODEN, numbers assigned by major bibliographic
129     utilities such as OCLC and RLG, national library numbers
130     such as the Library of Congress Control Number) or are under
131     development. While we do not discuss them in this document,
132     many of these will also need to be supported within the URN
133     framework as it moves to large scale implementation. The
134     issues involved in supporting those additional identifiers
135     are anticipated to be broadly similar to those involved in
136     supporting ISBNs, ISSNs, and SICIs.
137    
138    
139     2. Identification vs. Resolution
140    
141     It is important to distinguish between the resource
142     identified by a URN and the resources that can reasonably be
143     provided when attempting to resolve an identifier. For
144    
145     [Page 3]
146    
147     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
148    
149    
150     example, the ISSN 0040-781X identifies the popular
151     "Time". All of it, every issue for from the start of
152     publication to present. Resolving such an identifier should
153     not result in the equivalent of hundreds of thousands of
154     pages of text and photos being dumped to the user's machine.
155     It is more reasonable for ISSNs to resolve to a navigational
156     system, such as an HTML-based search form, so the user may
157     select issues or articles of interest. ISBNs and SICIs, on
158     the other hand, do identify finite, manageably-sized
159     objects, but they may still be large enough that resolution
160     to a hierarchical system is appropriate.
161    
162     In addition, the materials identified by an ISSN, ISBN or
163     SICI may exist only in printed or other physical form, not
164     electronically. The best that a resolver may be able to
165     offer is information about where to get the physical
166     resource, such as library holdings or a bookstore or
167     publisher order form. The URN Framework provides resolution
168     services that may be used to describe any differences
169     between the resource identified by a URN and the resource
170     that would be returned as a result of resolving that URN.
171    
172    
173     3. International Standard Book Numbers
174    
175     3.1 Overview
176    
177     An International Standard Book Number (ISBN) identifies an
178     edition of a monographic work. The ISBN is defined by the
179     standard NISO/ANSI/ISO 2108:1992 [ISO 1]
180    
181     Basically, an ISBN is a ten-digit number (actually, the last
182     digit can be the letter "X" as well, as described below)
183     which is divided into four variable length parts usually
184     separated by hyphens when printed. The parts are as follows
185     (in this order):
186    
187     * a group identifier which specifies a group of publishers,
188     based on national, geographic or some other criteria,
189    
190     * the publisher identifier,
191    
192     [Page 4]
193    
194     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
195    
196    
197     * the title identifier,
198    
199     * and a modulus 11 check digit, using X in lieu of 10.
200    
201     The group and publisher number assignments are managed in
202     such a way that the hyphens are not needed to parse the ISBN
203     unambiguously into its constituent parts. However, the ISBN
204     is normally transmitted and displayed with hyphens to make
205     it easy for human beings to recognize these parts without
206     having to make reference to or have knowledge of the number
207     assignments for group and publisher identifiers.
208    
209     3.2 Encoding Considerations
210    
211     Embedding ISBNs within the URN framework presents no
212     particular coding problems, since all of the characters that
213     can appear in an ISBN are valid in the identifier segment of
214     the URN. %-encoding is never needed.
215    
216     Example: URN:ISBN:0-395-36341-1
217    
218     For the ISBN namespace, some additional equivalence rules
219     are appropriate. Prior to comparing two ISBN URNs for
220     equivalence, it is appropriate to remove all hyphens, and to
221     convert any occurrences of the letter X to upper case.
222    
223     3.3 Additional considerations
224    
225     The ISBN standard and related community implementation
226     guidelines define when different versions of a work should
227     be assigned the same or differing ISBNs. In actuality,
228     however, practice varies somewhat depending on publisher as
229     to whether different ISBNs are assigned for paperbound vs.
230     hardbound versions of the same work, electronic vs. printed
231     versions of the same work, or versions of the same work
232     published for example in the US and in Europe. The choice
233     of whether to assign a new ISBN or to reuse an existing one
234     when publishing a revised printing of an existing edition of
235     a work or even a revised edition of a work is somewhat
236     subjective. Practice varies from publisher to publisher
237     (indeed, the distinction between a revised printing and a
238     new edition is itself somewhat subjective). The use of
239    
240     [Page 5]
241    
242     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
243    
244    
245     ISBNs within the URN framework simply reflects these
246     existing practices. Note that it is likely that an ISBN URN
247     will often resolve to many instances of the work (many
248     URLs).
249    
250    
251     4. International Standard Serials Numbers
252    
253     4.1 Overview
254    
255     International Standard Serials Numbers (ISSN) identify a
256     work that is being published on a continued basis in issues;
257     they identify the entire (often open-ended, in the case of
258     an actively published) work. ISSNs are defined by the
259     standards ISO 3297:1986 [ISO 2] and ISO/DIS 3297 [ISO 3] and
260     within the United States by NISO Z39.9-1992 [NISO 1]. The
261     ISSN International Centre is located in Paris and
262     coordinates a network of regional centers. The National
263     Serials Data Program within the Library of Congress is the
264     US Center of this network.
265    
266     ISSNs have the form NNNN-NNNN where N is a digit, the last
267     digit may be an upper case X as the result of the check
268     character calculation. Unlike the ISBN the ISSN components
269     do not have much structure; blocks of numbers are passed out
270     to the regional centers and publishers.
271    
272     4.2 Encoding Considerations
273    
274     Again, there is no problem representing ISSNs in the
275     namespace-specific string of URNs since all characters valid
276     in the ISSN are valid in the namespace-specific URN string,
277     and %-encoding is never required.
278    
279     Example: URN:ISSN:1046-8188
280    
281     Supplementary comparison rules are also appropriate for the
282     ISSN namespace. Just as for ISBNs, hyphens should be
283     dropped prior to comparison and occurrences of 'x'
284     normalized to uppercase.
285    
286    
287     [Page 6]
288    
289    
290     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
291    
292    
293     4.3 Additional Considerations
294    
295     The ISSN standard and related community implementation
296     guidelines specify when new ISSNs should be assigned vs.
297     continuing to use an existing one. There are some
298     publications where practice within the bibliographic
299     community varies from site to site, such as annuals or
300     annual conference proceedings. In some cases these are
301     treated as serials and ISSNs are used, and in some cases
302     they are treated as monographs and ISBNs are used. For
303     example SIGMOD Record volume 24 number 2 June 1995 contains
304     the Proceedings of the 1995 ACM SIGMOD International
305     Conference on Management of Data. If you subscribe to the
306     journal (ISSN 0163-5808) this is simply the June issue. On
307     the other hand you may have acquired this volume as the
308     conference proceedings (a monograph) and as such would use
309     the ISBN 0-89791-731-6 to identify the work. There are also
310     varying practices within the publishing community as to when
311     new ISSNs are assigned due to the change in the name of a
312     periodical (Atlantic becomes Atlantic Monthly); or when a
313     periodical is published both in printed and electronic
314     versions (The New York Times). The use of ISSNs as URNs
315     will reflect these judgments and practices.
316    
317    
318     5. Serial Item and Contribution Identifiers
319    
320     5.1 Overview
321    
322     The standard for Serial Item and Contribution Identifiers
323     (SICI) has recently been extensively revised and is defined
324     by NISO/ANSI Z39.56-1997 [NISO 2]. The maintenance agency
325     for the SICI code is the UnCover Corporation.
326    
327     SICI codes can be used to identify an issue of a serial, or
328     a specific contribution (i.e., an article, or the table of
329     contents) within an issue of a serial. SICI codes are not
330     assigned, they are constructed based on information about
331     the issue or issue component in question.
332    
333     The complete syntax for the SICI code will not be discussed
334     here; see NISO/ANSI Z39.56-1997 for details. However an
335     example and brief review of the major components is needed
336    
337     [Page 7]
338    
339     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
340    
341    
342     to understand the relationship with the ISSN and how this
343     identifier differs. An example of a SICI code is:
344    
345     0015-6914(19960101)157:1<62:KTSW>2.0.TX;2-F
346    
347     The first nine characters are the ISSN identifying the
348     serial title. The second component, in parentheses, is the
349     chronology information giving the date the particular serial
350     issue was published. In this example that date was January
351     1, 1996. The third component, 157:1, is enumeration
352     information (volume, number) on the particular issue of the
353     serial. These three components comprise the "item segment"
354     of a SICI code. By augmenting the ISSN with the chronology
355     and/or enumeration information, specific issues of the
356     serial can be identified. The next segment, <62:KTSW>,
357     identifies a particular contribution within the issue. In
358     this example we provide the starting page number and a title
359     code constructed from the initial characters of the title.
360     Identifiers assigned to a contribution can be used in the
361     contribution segment if page numbers are inappropriate. The
362     rest of the identifier is the control segment, which
363     includes a check character. Interested readers are
364     encouraged to consult the standard for an explanation of the
365     fields in that segment.
366    
367     5.2 Encoding Considerations
368    
369     The character set for SICIs is intended to be email-
370     transport-transparent, so it does not present major
371     problems. However, all printable excluded and reserved
372     characters from the URN syntax draft are valid in the SICI
373     character set and must be %-encoded.
374    
375     Example of a SICI for an issue of a journal
376    
377     URN:SICI:1046-8188(199501)13:1%3C%3E1.0.TX;2-F
378    
379     For an article contained within that issue
380    
381     URN:SICI:1046-8188(199501)13:1%3C69:FTTHBI%3E2.0.TX;2-4
382    
383    
384     [Page 8]
385    
386     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
387    
388    
389     Special equivalence rules for SICIs are not appropriate for
390     definition as part of the namespace and incorporation in
391     areas such as cache management algorithms. These are best
392     left to resolver systems which try to determine if two SICIs
393     refer to the same content. Consequently, we do not propose
394     any specific rules for equivalence testing through lexical
395     manipulation.
396    
397     5.3 Additional Considerations
398    
399     Since the serial is identified by an ISSN, some of the
400     ambiguity currently found in the assignment of ISSNs carries
401     over into SICI codes. In cases where an ISSN may refer to a
402     serial that exists in multiple formats, the SICI contains a
403     qualifier that specifies the format type (for example,
404     print, microform, or electronic). SICI codes may be
405     constructed from a variety of sources (the actual issue of
406     the serial, a citation or a record from an abstracting
407     service) and, as such are based on the principle of using
408     all available information, so there may be multiple SICI
409     codes representing the same article [NISO2, Appenidx D].
410     For example, one code might be constructed with access to
411     both chronology and enumeration (that is, date of issue and
412     volume, issue and page number), another code might be
413     constructed based only on enumeration information and
414     without benefit of chronology. Systems that use SICI codes
415     employ complex matching algorithms to try to match SICI
416     codes constructed from incomplete information to SICI codes
417     constructed with the benefit of all relevant information.
418    
419     6. Security Considerations
420    
421     This document proposes means of encoding several existing
422     bibliographic identifiers within the URN framework. It does
423     not discuss resolution; thus questions of secure or
424     authenticated resolution mechanisms are out of scope. It
425     does not address means of validating the integrity or
426     authenticating the source or provenance of URNs that contain
427     bibliographic identifiers. Issues regarding intellectual
428     property rights associated with objects identified by the
429     various bibliographic identifiers are also beyond the scope
430     of this document, as are questions about rights to the
431     databases that might be used to construct resolvers.
432    
433     [Page 9]
434    
435     INTERNET DRAFT:Bibliographic Identifiers as URNs 3/1997
436    
437     7. References
438    
439     [ISO1] NISO/ANSI/ISO 2108:1992 Information and documentation
440     -- International standard book number (ISBN)
441     [ISO2] ISO 3297:1986 Documentation -- International standard
442     serial numbering (ISSN)
443     [ISO3] ISO/DIS 3297 Information and documentation --
444     International standard serial numbering (ISSN)
445     (Revision of ISO 3297:1986)
446     [Moats] R. Moats, "URN Syntax" draft-ietf-urn-syntax-
447     04.text. March 1997
448     [NISO 1] NISO/ANSI Z39.9-1992 International standard serial
449     numbering (ISSN)
450     [NISO 2] NISO/ANSI Z39.56-1997 Serial Item and Contribution
451     Identifier
452     [Sollins & Masinter] K. Sollins and L. Masinter, "Functional
453     Requirements for Uniform Resource Names", RFC 1737
454     December 1994.
455    
456     8. Author's Addresses
457    
458     Clifford Lynch
459     University of California Office of the President
460     300 Lakeside Drive, 8th floor
461     Oakland CA 94612-3550
462     clifford.lynch@ucop.edu
463    
464     Cecilia Preston
465     Preston & Lynch
466     PO Box 8310
467     Emeryville, CA 94662
468     cecilia@well.com
469    
470     Ron Daniel Jr.
471     Advanced Computing Lab, MS B287
472     Los Alamos National Laboratory
473     Los Alamos, NM, 87545
474     voice: +1 505 665 0597
475     fax: +1 505 665 4939
476     http://www.acl.lanl.gov/~rdaniel
477    
478    
479    
480    
481     [Page 10]
482    
483    
484    

admin@suikawiki.org
ViewVC Help
Powered by ViewVC 1.1.24