[ I will be trying to provide some structure to
discussions in this mailing list by raising specific topics and looking for
comments. Please keep the Topic number in responses ]
Topic 1: What do we mean by GUID?
The most fundamental thing that we need to establish
as we consider a GUID implementation is a definition for “GUID” in
this context. We have been using a number of terms to describe the
identifiers we need (unique, resolvable, persistent, etc.).
I’ve been spending some time following up on
Rod Page’s recommendation that we consider the use of Archival Resource
Keys (ARK) from the California Digital Library (see http://wiki.gbif.org/guidwiki/wikka.php?wakka=ARK).
The CDL web site includes an excellent overview of this GUID model, which
also serves as an excellent introduction to the issues involved. I would
urge you all to read this document – it’s only nine pages long!):
http://www.cdlib.org/inside/diglib/ark/arkcdl.pdf
This document arrives at the following problem
definition for persistent, actionable identifiers:
This document clearly identifies issues around provider
service commitments as the key problem that needs solving. The
construction of ARKs seeks to address this in a couple of ways. It
separates the role of Name Assigning Authority (i.e. who initially assigns the
identifier) from that of the Name Mapping Authority (i.e. who is able to map
the identifier to the data object at any particular time). It also
defines a simple standard relationship between three things: the data object,
the metadata for the object, and a commitment statement from the provider as to
what aspects of persistence are guaranteed.
Please provide your thoughts.
Donald
---------------------------------------------------------------
Donald Hobern (dhobern@gbif.org)
Programme Officer for Data Access and Database Interoperability
Global Biodiversity Information Facility Secretariat
Universitetsparken 15, DK-2100
Tel: +45-35321483
---------------------------------------------------------------