What do we mean by GUID?

Donald Hobern dhobern at GBIF.ORG
Thu Oct 13 14:23:52 CEST 2005


James Ytow wrote:

> Here we see a tipical trouble with identifier and identity.  Do you
> mean identity of an object (a unique thing, so we don't need
> identifier because it is the thing) or equivalence of data (there can
> be multiple data objects having the same value)?  Where we need GUID
> we can't rely on identity, in my understanding.

The DOI Handbook (http://dx.doi.org/10.1000/186) addresses this issue.  The
DOI approach centres on the use of Application Profiles.  Each DOI can be
associated with a profile which defines the metadata elements with which it
is to be associated (and therefore by implication which pieces of
information must remain stable for two records to be considered copies of
the same object).

On page 17, the Handbook states:

  A common question is: if I identify entity A with a DOI, and then I
  adapt it in some way to create entity B, should I assign a new DOI to
  entity B?

  The answer is: there can be no general rule which applies to all cases
  and each must be treated in context. If a registrant finds it useful to
  do so, they may. The rules of Application profiles, and business rules
  of Registration Agencies, will help in deciding for DOIs registered in
  Application Profiles. The key point is that one should precisely specify
  what A is and what B is; two digital entities are never the same in any
  absolute sense and can be considered copies of each other only in the
  context of some defined purpose.

  For a more detailed explanation of this fundamental topic, see the
  article "On Making and Identifying a Copy"
  http://dx.doi.org/10.1045/january2003-paskin

This is a very important point, and one that we will need to bear in mind as
we consider the application of GUIDs to digital representations of real
world objects such as specimen records.  Two completely differently
formatted records may be considered as copies of the same object if they
serve the same purpose for some user.  This is one area which the DOI
infrastructure particularly tries to address.

Donald

---------------------------------------------------------------
Donald Hobern (dhobern at gbif.org)
Programme Officer for Data Access and Database Interoperability
Global Biodiversity Information Facility Secretariat
Universitetsparken 15, DK-2100 Copenhagen, Denmark
Tel: +45-35321483   Mobile: +45-28751483   Fax: +45-35321480
---------------------------------------------------------------


-----Original Message-----
From: Taxonomic Databases Working Group GUID Project
[mailto:TDWG-GUID at LISTSERV.NHM.KU.EDU] On Behalf Of Nozomi Ytow
Sent: 12 October 2005 18:40
To: TDWG-GUID at LISTSERV.NHM.KU.EDU
Subject: Re: What do we mean by GUID?

Matt Jones wrote:

> The term GUID is one we started using in SEEK when looking for a
> solution to the identity and resolution problems that we saw looming for
> the Taxonomic COncept Standard.  Dave Thau's presentation on this
> (linked on the GUID wiki) defines this pretty well and explores the
issues.

Here we see a tipical trouble with identifier and identity.  Do you
mean identity of an object (a unique thing, so we don't need
identifier because it is the thing) or equivalence of data (there can
be multiple data objects having the same value)?  Where we need GUID
we can't rely on identity, in my understanding.


> "globally unique" means simply that an identifier that is issued can
> only have one valid interpretation across all possible systems.

What do you mean by valid?  Suppose a data object in data provider's
database.  A GBIF portal has its copy when last a user accessed to the
data object.  The data provider changes its contents for some reason
afther the last access through the GBIF portal.  What is the
valid interpretation of these data objects?  Tha provider's one?

> Regardless of the mechanism used to resolve the identifier, the object
> that the id 'identifies' will be bit-for-bit identical.

So you mean equivalence, not identity.  If it is bit-for-bit
equivalence, why do you need GUID?  The contents IS the GUID
you defined.


> There are some tricky issues
> dealing with granularity of the identifier for digital data (does the
> identifier point at a tuple in an entity, or at a whole entity, or at
> multiple entities).

Do you mean your bit-for-bit GUID requires scope disamibugater also?
Isn't it assigned to a data object, i.e. unit to be handled as a
chunk?


It may be better to use other words such as globally disambiguateor
or distinguisher, because we do not mean identity by identifier.

Cheers,
James




More information about the tdwg-tag mailing list