<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

<html>

<head>

  <meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">

  <title></title>

</head>

<body bgcolor="#ffffff" text="#000000">

Comments regarding several emails inline:<br>

<br>

Richard Pyle wrote:

<blockquote cite="mid:00c001cc2222$9d664720$d832d560$@bishopmuseum.org"

 type="cite">

  <pre wrap="">By contrast, the core object in GNUB is a taxon name usage instance -- which

is a purely abstract notion of the usage of a taxon name within some

documentation source (like a publication).  In this case, the text-string

name is merely a property of the GUID-identified object, and would be an

extremely BAD choice to use as a unique identifier.  </pre>

</blockquote>

It is possible that I'm not understanding what you are saying here, but

if you are saying that the only name-related property of your GNUB

taxon instances will be one which has a name string literal as its

object, then I think that is a big mistake.&nbsp; That will require any

client using your taxon instance metadata to re-process the literal

name string to cross reference it with lexical variants, parse it into

its pieces, etc.&nbsp; That should only need to be done once and then

referenced via a GUID for the name (i.e. in the sense of

tn:TaxonName).&nbsp; <br>

<blockquote cite="mid:00c001cc2222$9d664720$d832d560$@bishopmuseum.org"

 type="cite">

  <pre wrap="">This is why GNUB needs

to generate a unique identifier to represent this core data object.  The

form that identifier takes (UUID, LSID, integer, DOI, whatever) from the

perspective of the end user should be completely irrelevant, because the

user should rarely (if ever) see it, and should certainly *never* be in a

position to type it on a keyboard (we can discuss the appearance of ZooBank

LSIDs on printed pages separately). </pre>

</blockquote>

OK, again maybe I'm not understanding what you are saying here, but if

you are saying that you don't intend to expose your unique GNUB

identifiers to the public, then as far as I'm concerned you are setting

up GNUB to be irrelevant from the start.&nbsp; You mention a number of cool

taxonomist-geek type things that you hope to accomplish with GNUB.&nbsp; But

from my perspective as a non-taxonomist-geek, the main purpose I have

for GNUB is as a place to anchor dwc:Identification instances so that I

can indicate whether my identified resource is a representative of the

same taxon that is being referred to by somebody else (or at least to

make it possible for somebody to figure that out via computery

cleverness, Semantic Web or otherwise).&nbsp; How am I going to do that if

you don't provide me with a good (i.e. meeting the 8 criteria of my

last email) GUID to use as the object of my dwc:Identification

properties?&nbsp; For over a year, I've heard you lament that the whole

problem is that people make identifications and don't indicate the

sensu/sec. reference for the names they use.&nbsp; You are now creating a

system that would allow people to unambiguously make it clear what

taxon they mean but you aren't giving them any way to say what it is?&nbsp;

Again, I may just be misunderstanding what you wrote here.<br>

<br>

Kevin Richards wrote:

<blockquote

 cite="mid:03894928E052DD47A5ED29BC0822619B15F7EFBD2E@donkey.landcare.ad.landcareresearch.co.nz"

 type="cite">

  <pre wrap="">Oh, now that I have read Rich's email here, it seems we are in agreement, of sorts.  I think there is obviously a need for both of these "identifier" approaches - ie a record based ID that no one should really ever need to interact with directly, and a human friendly "ID" that allows people to discuss the same semantic "thing".

  </pre>

</blockquote>

Yes.&nbsp; This "record based ID" can be anything you want.&nbsp; I really

don't and shouldn't have to care about that.&nbsp; The "human friendly ID

that allows people to discuss the same semantic thing" is precisely

what the TDWG GUID Applicability Statement (a ratified TDWG standard,

thanks to Kevin) is talking about.&nbsp; As I read that standard, I don't

see any requirement that a GUID be "human friendly", but I would

consider "human friendliness" to be a desirable "best practice"

(influenced somewhat by <a class="moz-txt-link-freetext" href="http://www.w3.org/Provider/Style/URI">http://www.w3.org/Provider/Style/URI</a> and

<a class="moz-txt-link-freetext" href="http://www.w3.org/TR/cooluris/">http://www.w3.org/TR/cooluris/</a>) - if we have a choice of creating

externally exposed GUIDs that are either human-friendly or not

human-friendly, and if either works equally well, why not choose ones

that are human-friendly?<br>

<blockquote

 cite="mid:03894928E052DD47A5ED29BC0822619B15F7EFBD2E@donkey.landcare.ad.landcareresearch.co.nz"

 type="cite">

  <pre wrap="">

It is interesting all this discussion of identifiers when in the end it doesn&#8217;t matter too much what the identifier is, just that you have an identifier at all.  The important thing is the semantics, the "are we talking about the same thing" question - so this is where I believe RDF/semantic web comes in - I might see if I can come up with some RDF/sem web example for TDWG that could demonstrate this, hmmm...

  </pre>

</blockquote>

Already done in the context of tc:Taxon and tn:TaxonName and posted on

this list in January:

<a class="moz-txt-link-freetext" href="http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002204.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002204.html</a> .<br>

<a class="moz-txt-link-freetext" href="http://biodiversity.org.au/apni.taxon/118883">http://biodiversity.org.au/apni.taxon/118883</a><br>

is an identifier that is friendly to both humans and computers.&nbsp; Through

content negotiation a computer gets <br>

<a class="moz-txt-link-freetext" href="http://biodiversity.org.au/apni.taxon/118883.rdf">http://biodiversity.org.au/apni.taxon/118883.rdf</a><br>

and the human gets<br>

<a class="moz-txt-link-freetext" href="http://biodiversity.org.au/apni.taxon/118883.html">http://biodiversity.org.au/apni.taxon/118883.html</a><br>

The resource itself has rdf:type tc:TaxonConcept (defined in the

ontology to be equivalent to tc:Taxon), well-known because it is part

of the TDWG ontology.&nbsp; In these examples, the approach for referring to

name strings through tc:hasName, the subsequent reference to a name

record (<a class="moz-txt-link-freetext" href="http://biodiversity.org.au/apni.name/36530">http://biodiversity.org.au/apni.name/36530</a>), and the structure

of that name record in RDF

(<a class="moz-txt-link-freetext" href="http://biodiversity.org.au/apni.name/36530.rdf">http://biodiversity.org.au/apni.name/36530.rdf</a>) follow the approach of

the TCS standard (as incarnated in the TDWG ontology) very precisely.&nbsp;

I can't see anything in these examples that doesn't follow TDWG

standards and what I know as "best practices".&nbsp; Thank you, Paul...&nbsp;

Also we have many examples of appropriate HTTP URI GUID use from Pete,

although not involving tc:Taxon and tn:TaxonName specifically.&nbsp; <br>
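<br>

As a rough sketch of the content negotiation described above (my own illustration, not the actual server configuration behind these URIs), the server-side decision amounts to something like the following.&nbsp; The function name and the simple matching rule are assumptions; only the .rdf/.html suffix convention is taken from the example URIs.<br>

<pre>
```python
# Hypothetical sketch of content negotiation for an HTTP URI GUID.
# The suffix convention (.rdf / .html) mirrors the example URIs above;
# the function name and the simple matching rule are assumptions.

RDF_TYPES = ("application/rdf+xml",)


def representation_for(guid_uri, accept_header):
    """Return the document URL a server might redirect to for this client."""
    # Split "type;q=0.9, other" into bare media types, ignoring q-values.
    offered = [part.split(";")[0].strip().lower()
               for part in accept_header.split(",")]
    wants_rdf = any(media_type in RDF_TYPES for media_type in offered)
    return guid_uri + (".rdf" if wants_rdf else ".html")


guid = "http://biodiversity.org.au/apni.taxon/118883"
print(representation_for(guid, "application/rdf+xml"))
print(representation_for(guid, "text/html,application/xhtml+xml"))
```
</pre>

The point is that the client only ever needs to know the one GUID; which document it receives is decided by the Accept header it sends.<br>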

<br>

Richard Pyle wrote:

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

I like your list of what we want "GUIDs" (see below) to do, and I think it's

an excellent starting point for a bar we should all strive for.  I'm

particularly grateful to learn that the existing ZooBank service fails so

many of them.  I've forwarded your post to Rob Whitton, who will be working

on Gen-2 of ZooBank in the coming weeks, and asked him if we can use your 8

tests as a metric to adhere to.  </pre>

</blockquote>

Better yet, read the TDWG GUID Applicability Statement

<a class="moz-txt-link-freetext" href="http://www.tdwg.org/standards/150/">http://www.tdwg.org/standards/150/</a> and <a class="moz-txt-link-freetext" href="http://www.w3.org/TR/cooluris/">http://www.w3.org/TR/cooluris/</a>

.&nbsp; My 8 points are just a paraphrase out of my head.&nbsp; Striving is not

good enough.&nbsp; Follow the standard.<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

  </pre>

  <blockquote type="cite">

    <pre wrap="">"But really, from the perspective of the end-user, does it matter

if it's an identifier or a service?  Ultimately, they ask the questions,

and the answers appear on their computer screens."

I would answer this question by saying "yes, it does matter!" -

it is important that a well-designed GUID do more than just throw

something up onto a human user's web browser.

    </pre>

  </blockquote>

  <pre wrap=""><!---->

I absolutely agree with you, but that's not the distinction I was making in

my quoted text.  I was only talking about whether we call something an

"identifier" (not GUID, which has more specific implications), or a

"service", in the context of human-machine conversations.  I think your

enumeration of things we want GUIDs to do is a very good framework for

discussion.  I would only caution that "GUID" means different things to

different people (some people use it synonymously with UUID, for example),

and also that GUID does not imply "actionable".  </pre>

</blockquote>

Again I would say read <a class="moz-txt-link-freetext" href="http://www.tdwg.org/standards/150/">http://www.tdwg.org/standards/150/</a> .&nbsp; When I say

"GUID" I am not throwing around a colloquial term.&nbsp; I intend for it to

have the exact technical meaning that it is given in the TDWG

standard.&nbsp; At this point in time (i.e. after we finally have a ratified

standard on GUIDs), nobody in our community has any business designing

and exposing GUIDs without having read this document and completely

understanding its requirements and recommendations.&nbsp; I should not have

to be "explaining" any of this to anybody on the list.&nbsp; It is explained

clearly and concisely in the standard.&nbsp; I really am somewhat

flabbergasted about how participants in TDWG, which I think is supposed

to be a biodiversity standards organization, generally don't seem to

read and follow the ratified standards.&nbsp; I think the process could be

helped somewhat if the TDWG website were cleaned up a bit to make the

obsolete stuff less easy to find and the important, current stuff

easier to find.&nbsp; Also, I don't understand why all important documents

aren't linked from the permanent URI page (e.g.

<a class="moz-txt-link-freetext" href="http://www.tdwg.org/standards/150/">http://www.tdwg.org/standards/150/</a>) in PDF format.&nbsp; That would allow

users to view the document directly in a web browser rather than having to

open a zip file and then open a Word document.&nbsp; <br>

<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">There has been a bit of a

debate over the importance of embedding "actionability" into identifiers

inherently (the Tim Berners-Lee perspective)</pre>

</blockquote>

Wrong.&nbsp; "GUIDs should be resolvable" (direct quote of recommendation 7

from the GUID applicability statement).<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">, vs thinking about

"identification" separately from how we perform some action on it.  For

example, UUIDs and Social Security numbers are extremely useful identifiers,

even though they are not inherently actionable.  It's amazingly easy to

perform action on a non-actionable identifier by simply appending it to a

actionable prefix.  For example, going back to the list of "identifiers":

A. A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523

B. urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523

C. <a class="moz-txt-link-freetext" href="http://zoobank.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

D. <a class="moz-txt-link-freetext" href="http://zoobank.org/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

E. <a class="moz-txt-link-freetext" href="http://lsid.tdwg.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://lsid.tdwg.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

F. <a class="moz-txt-link-freetext" href="http://www.google.com/search?q=Danaus+plexippus+(Linnaeus+1758)">http://www.google.com/search?q=Danaus+plexippus+(Linnaeus+1758)</a>

G. <a class="moz-txt-link-freetext" href="http://lsid.tdwg.org/summary/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://lsid.tdwg.org/summary/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

H. <a class="moz-txt-link-freetext" href="http://darwin.zoology.gla.ac.uk/~rpage/lsid/tester/?q=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&amp;submit=Go">http://darwin.zoology.gla.ac.uk/~rpage/lsid/tester/?q=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&amp;submit=Go</a>

There are two different ways of looking at this:

1) There are 8 different identifiers

2) There is one identifier (A)</pre>

</blockquote>

A is an identifier, but A does not meet the requirement of the GUID

Applicability Statement.&nbsp; To quote recommendation 2: "HTTP GET resolution <b>must</b>

be provided for non-self-resolving GUIDs".&nbsp; Pick one of your proxied

HTTP URIs, call it your GUID and stop there.&nbsp; (Note: the emphasis on

"must" is present in the standards document, not added by me.)<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">, and 6 ways to perform action on it (B-E,

G-H).

If you treat them all as distinct identifiers, then let me add a few more to

the list:

I. <a class="moz-txt-link-freetext" href="http://zoobank.org/?lsid=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/?lsid=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

J. <a class="moz-txt-link-freetext" href="http://zoobank.org/?id=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/?id=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

K. <a class="moz-txt-link-freetext" href="http://zoobank.org/?id=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/?id=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

L. <a class="moz-txt-link-freetext" href="http://zoobank.org/?uuid=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/?uuid=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</a>

  </pre>

</blockquote>

Don't add more of them to the list.&nbsp; Recommendation 3: "Providers <b>must</b>

assign at most one GUID to any particular object."&nbsp; Recommendation 4:

"Only one globally unique identifier should be assigned to each

object".&nbsp; <br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

Note that all four of the above, plus B-D in the original list, are all

resolved through zoobank.org.  Why are there so many different ways to

perform action on the "same" identifier? Because I wanted the ZooBank

resolution service to be flexible. And, because in my mind, there is only

one identifier (A); and lots of different ways to retrieve the metadata of

the object it represents.

  </pre>

</blockquote>

I would assert that what you "want" and what you have in your mind is

at odds with the TDWG standard for GUIDs.<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

Now consider this from the TB-L perspective. Eleven different identifiers

for the same object (excluding F).  Does that mean we need to generate

owl:sameAs statements for all pair-wise relationships?  That's a lot of

owl:sameAs statements! Even if I'm the bad guy in foolishly allowing so many

different ways to resolve ZooBank identifiers, and needlessly fabricated so

many "different" identifiers for the same thing unnecessarily.  Fair enough.

But I still think we're a lot better off by disentangling identifiers from

the services we use to perform action on them.

  </pre>

</blockquote>

This may be your opinion, but it is at odds with the ratified standard

which says (recommendation 2) that "HTTP GET resolution <b>must </b>be

provided for non-self-resolving GUIDs".&nbsp; <br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

One of the arguments on the TB-L side is that a non-actionable identifier by

itself is useless if you cannot inherently perform action on it.  For

example, if you were walking through the park and stumbled upon a slip of

paper with "A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523" written on it, you

probably wouldn't be able to do much with it.  But in reality, that's not

what happens.  We never expose identifiers as a simple context-free

identifiers in their non-resolvable form.  These identifiers are *always*

exposed in some context.  The problem is that if you treat the "resolution

metadata" (as I call it -- e.g., "urn:lsid:zoobank.org:act:" or

<a class="moz-txt-link-rfc2396E" href="http://zoobank.org/">"http://zoobank.org/"</a>) as *part* of the identifier (as you have to do if you

make things like "urn:lsid:ubio.org:namebank:11815"), then it becomes

difficult for an application to distinguish between

<a class="moz-txt-link-rfc2396E" href="http://zoobank.org/?id=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">"http://zoobank.org/?id=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523"</a>, and

<a class="moz-txt-link-rfc2396E" href="http://zoobank.org/?uuid=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">"http://zoobank.org/?uuid=A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523"</a>; which, to a

human, obviously refers to the same thing.  In other words, absent all those

owl:sameAs statements, an application could break if it harvests content

from different sources that use different resolution metadata for the "same"

(sensu Pyle) identifier.

  </pre>

</blockquote>

The problem here is caused by you when you create and expose so many

different HTTP URI forms of your UUID.&nbsp; Stop doing that (recommendation

4).<br>
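<br>

To put a number on the owl:sameAs worry in the quoted text, here is a small illustrative count.&nbsp; The letters are just the labels from the quoted lists (A through L, leaving out F); nothing else is assumed.<br>

<pre>
```python
from itertools import combinations

# Illustrative only: count the unordered pairs that would each need an
# owl:sameAs statement if every URI form were treated as its own identifier.
labels = list("ABCDEGHIJKL")  # the eleven forms in the quoted lists, excluding F

pairs = list(combinations(labels, 2))
print(len(pairs))  # number of pairwise links for eleven identifiers
print(len(list(combinations(["D"], 2))))  # a single identifier needs none
```
</pre>

With a single exposed GUID there is nothing to link, which is the practical argument for exposing only one.<br>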

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

Maybe what we need to think about is a registry of "persistent resolution

services", which our community relies on.  That way, we can apply the

owl:sameAs statements to the resolution services, rather than to every

single individual identifier.

  </pre>

</blockquote>

There is no need for this.&nbsp; Make a single HTTP URI version of your UUID

and stick with it.&nbsp; Preferably use one without a query string, and use

mod_rewrite (or its equivalent) to transform the simple, clear, and

permanent version of the URI into whatever flavor of temporary URL you

like at the moment.&nbsp; Every application today understands HTTP

GET.&nbsp; No need for a registry.<br>
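<br>

A minimal sketch of the kind of rewriting I mean, written in Python rather than as real server configuration.&nbsp; Both URL patterns here are assumptions for illustration (I am borrowing forms D and L from the quoted lists); I am not claiming this is how ZooBank is or should be configured.<br>

<pre>
```python
import re

# Hypothetical sketch: map one permanent, query-free URI to whatever
# internal URL currently serves the record. Both URL patterns below are
# assumptions for illustration, not ZooBank's actual configuration.

UUID_PATTERN = re.compile(
    r"^http://zoobank\.org/"
    r"([0-9A-Fa-f]{8}-[0-9A-Fa-f]{4}-[0-9A-Fa-f]{4}-[0-9A-Fa-f]{4}-[0-9A-Fa-f]{12})$"
)


def rewrite(permanent_uri):
    """Return the internal URL for a permanent URI, or None if it does not match."""
    match = UUID_PATTERN.match(permanent_uri)
    if match is None:
        return None
    return "http://zoobank.org/?uuid=" + match.group(1).upper()


print(rewrite("http://zoobank.org/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523"))
```
</pre>

If the internal URL scheme changes later, only the last line of the function changes; the permanent URI that everybody else has stored stays the same.<br>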

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">

  </pre>

  <blockquote type="cite">

    <pre wrap="">An important question that I think has been underlying much of this

discussion is whether GUIDs are actually needed for names.

    </pre>

  </blockquote>

  <pre wrap=""><!---->

I think the answer is clearly  "yes". The problem is defining what is meant

by the word "name".

  </pre>

</blockquote>

Go with the TCS standard and the TDWG ontology as it exists currently.<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap=""></pre>

  <blockquote type="cite">

    <pre wrap="">and parts thereof, then it does make sense to apply GUIDs to that

kind of entity.  I am thinking about a tn:TaxonName as defined in the

TDWG ontology (see

<a class="moz-txt-link-freetext" href="http://code.google.com/p/tdwg-ontology/source/browse/trunk/ontology/voc/TaxonName.rdf">http://code.google.com/p/tdwg-ontology/source/browse/trunk/ontology/voc/TaxonName.rdf</a>),

which comes out of the TCS schema (see

<a class="moz-txt-link-freetext" href="http://code.google.com/p/darwin-sw/wiki/ClassTaxon">http://code.google.com/p/darwin-sw/wiki/ClassTaxon</a> for info and links

regarding TCS).

A tn:TaxonName is "An object that represents a single scientific

biological name..." i.e. an "object" NOT defined as a string.

    </pre>

  </blockquote>

  <pre wrap=""><!---->

While it's nice to see the explicit representation of a "name" as an object,

rather than a string; unfortunately that doesn't address the elephant in the

room; that is, that different people have different notions of what "a

single scientific biological name" is.  I'm not talking subtly different

shades of fundamentally the same thing; I'm talking about fundamentally

different things with different implied sets of properties. This is one of

the issues I continued to hammer on during the development of TCS, and the

one that gave me the biggest qualms about TCS 1.0.  My hope was that it

would be resolved in TCS 2.0. </pre>

</blockquote>

There ain't no TCS 2.0 .&nbsp; There is only TCS 1.2 .&nbsp; I'm sorry about it,

but that's the ratified standard.<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap="">I wanted to reduce both names and concepts to

the same core entity: usage instances.  That's exactly what we're doing with

GNUB.

  </pre>

</blockquote>

There have been any number of things that I would "like" to be the way

I want.&nbsp; However, the point of standards is that they get hammered out

in a form that satisfies the community in a general way.&nbsp; Individual

people often are left without everything that they wanted.&nbsp; From within

our own personal projects, we can do anything we darn well please.&nbsp; But

when it comes to communicating with others, we should discipline

ourselves to follow the standards.&nbsp; I understand that for existing

systems, there is considerable time and money required to retrofit old

systems to a new standard.&nbsp; But GNUB is not an "old system".&nbsp; It is

being built from scratch and I would assert that when it comes to

interfacing it with the outside world, it should follow standards such

as they exist at the moment.&nbsp; At the moment, people are allowed to

think about and describe names without reducing them solely to usage

instances as you would like.&nbsp; I spent about an hour yesterday composing

a rant about how counterproductive it is for taxonomy and computer

geeks to create tools and systems that won't ever actually be used by

the people who need them.&nbsp; I decided that it wasn't helpful to actually

post it, but now I'm thinking that maybe I should have...<br>

<blockquote cite="mid:021201cc23b2$4f313ec0$ed93bc40$@bishopmuseum.org"

 type="cite">

  <pre wrap=""><!---->

That's only true to the extent that tn:TaxonName may be too broadly

(imprecisely) defined (just like dwc:Taxon).

  </pre>

</blockquote>

dwc:Taxon doesn't really have much of a useful definition, so I'm

with you there.&nbsp; tn:TaxonName is actually rather precisely defined, at

least if you look at the RDF

(<a class="moz-txt-link-freetext" href="http://code.google.com/p/tdwg-ontology/source/browse/trunk/ontology/voc/TaxonName.rdf">http://code.google.com/p/tdwg-ontology/source/browse/trunk/ontology/voc/TaxonName.rdf</a>)

and relate it to the TCS documents on which it is based

(<a class="moz-txt-link-freetext" href="http://www.tdwg.org/standards/117/">http://www.tdwg.org/standards/117/</a> , again it would be extremely

useful to have a pdf version of the User Guide directly linked to that

page so that people could look at it in their browsers rather than

having to download a zip archive.&nbsp; Note also Kennedy et al. 2005

<a class="moz-txt-link-freetext" href="http://www.springerlink.com/content/7bv5pa3falxwrrvx/">http://www.springerlink.com/content/7bv5pa3falxwrrvx/</a> which I found

helpful for understanding the rationale for TCS).&nbsp; In my opinion, TCS

(and by extension, the TDWG ontology) puts a rather restrictive collar

and leash on taxon names.&nbsp; I quote from the user guide page 9:

"&lt;TaxonName&gt; elements do not represent taxa.&nbsp; They serve only as

abstract nomenclatural data structures that encapsulate the core rules

of the different nomenclatural codes.&nbsp; Their purpose is to prevent

nomenclatural statements becoming confused with statements about the

circumscription of, and relationships between, different taxon

concepts.&nbsp; No taxonomic opinion can be expressed using

&lt;TaxonName&gt; elements in TCS.&nbsp; As a rule of thumb if you are

dealing with anything beyond a type specimen and references to it, you

are talking about a TaxonConcept of some form."&nbsp; This does not seem

like a broad and imprecise definition to me.&nbsp; One is allowed to

describe the pieces of the name and that's about it.<br>

<br>

When I look carefully at how the TDWG ontology deals with taxon names

and taxon concepts, it seems very simple and "usable" to me.&nbsp; If one

defines a Taxon to be composed of a name component and a sensu/sec.

component as several people (including you, I think) on this list have

done and as TCS has done (I think), then representing it in RDF becomes

tractable.&nbsp; One anchors the name part to a tn:TaxonName instance

(properly collared and chained and wearing a GUID as a dog tag).&nbsp; How

one anchors the sensu/sec. part is still a subject for discussion.&nbsp; I

have been thinking about the following approach.&nbsp; It is based on a Venn

diagram that I have in my head which I created from your descriptions

of TNUs on this list.&nbsp; The Venn diagram has a big rectangle labeled

"nominal taxon".&nbsp; Inside that is a smaller rectangle named "taxon name

usage (TNU)".&nbsp; Inside that is an even smaller rectangle named "taxon

concept".&nbsp; In this view, Taxon concepts are

well-described/circumscribed by a publication.&nbsp; TNUs (which include

taxon concepts) are associated with a particular person's idea of what

the taxon is, but which may or may not be described in a publication.&nbsp;

Nominal taxa are all instances of a scientific name use including those

where we have no idea who applied the name or what set of organisms

they intended to be included in the taxon.&nbsp; In terms of RDF metadata:<br>

1. Go ahead and let the rdf:type of the thing be tc:Taxon<br>

2. Make the object of tc:hasName be a GUID (i.e. as described by the

TDWG GUID Applicability Statement, not some other kind of

GUID)-identified resource, preferably from a well-known source like

uBio.<br>

3. If the sensu/sec. is described in a publication (in my mind a true

taxon concept), then the object of tc:accordingTo is an HTTP proxied

DOI, HTTP URI of a BHL-scanned publication, or if both of those fail,

something non-resolvable but globally-unique like an ISBN or URL of a

stable web page.<br>

4. If the sensu/sec. is not described in a publication, but is

associated with a particular person (in my mind a TNU that isn't a true

taxon concept), then the object of tc:accordingTo could be the URI of a

foaf:Person or foaf:Group.<br>

5. If the sensu/sec. is completely unknown, then the taxon is a nominal

taxon that is not a TNU.&nbsp; I don't know whether it is better for the

taxon to simply lack a tc:accordingTo property or to have a

tc:accordingTo property that somehow says "we don't know anything about

the sensu/sec.".&nbsp; <br>
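<br>

The five steps above can be sketched roughly as follows.&nbsp; This is only an illustration under my own assumptions: the function name is made up, the prefixed names (rdf:, tc:) stand in for full URIs, every example URI is a placeholder, and for step 5 I have arbitrarily picked the "simply lack the property" option out of the two I mentioned.<br>

<pre>
```python
# Hypothetical sketch of steps 1-5 as subject/predicate/object triples.
# Prefixed names (rdf:, tc:) stand in for full URIs; every example URI
# passed in below is a placeholder, not a real record.

def describe_taxon(taxon_uri, name_guid, according_to=None):
    """Build triples for a tc:Taxon following the five steps above.

    according_to is the URI of a publication (step 3) or of a
    foaf:Person / foaf:Group (step 4); None means the sensu/sec. is
    unknown (step 5), in which case the property is simply omitted.
    """
    triples = [
        (taxon_uri, "rdf:type", "tc:Taxon"),       # step 1
        (taxon_uri, "tc:hasName", name_guid),      # step 2
    ]
    if according_to is not None:                   # steps 3 and 4
        triples.append((taxon_uri, "tc:accordingTo", according_to))
    return triples


for triple in describe_taxon(
        "http://example.org/taxon/1",
        "http://example.org/name/1",
        "http://dx.doi.org/10.9999/example"):
    print(triple)
```
</pre>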

<br>

I realize that you probably aren't going to like this because it isn't

as sophisticated and nuanced as you would like for your GNUB TNUs to

be.&nbsp; However, there would be nothing that would prohibit you from

creating and adding a myriad of clever properties to the tc:Taxon

instance RDF to make it do all of the things you want.&nbsp; The practice I

have described would break down the act of defining a taxon into&nbsp;

well-known, standardized pieces and it is a practice that could

fairly easily be followed by people without sophisticated IT

resources.&nbsp; It would allow for the transfer and comparison of taxon

information and make it possible to reconcile at some central

location (like GNUB) the taxa that are described across a distributed

network of users.&nbsp; Doing something like this is, I believe, the entire

reason for the existence of TCS, the TDWG ontology, old TDWG TAG

roadmaps, etc.&nbsp; Please apply some self-discipline to follow the

ratified standards or risk blowing us all back to 2005 where we would

have to discuss all of the settled things again.&nbsp; If that is going to

happen, I will give up on TDWG because I'll be retired before it is

done over again.<br>

<br>

In some ways what I'm talking about here is really (as I understand it)

the principle that underlies REST.&nbsp; Within your big GNUB kingdom and my

little Bioimages kingdom, we are free to do whatever clever things we

want, structure databases as we wish, do clever programming stuff or

whatever.&nbsp; But when you and I talk, we follow commonly established

rules, namely we talk using the HTTP protocol and identify the things

that we want to talk about using HTTP URIs.&nbsp; Since we are talking

specifically about biodiversity informatics, we should choose to follow

more restrictive rules about the identifiers themselves (following the

TDWG GUID Applicability Statement) and the nature of the RDF (following

the GUID Applicability Statement, well-known vocabularies such as the

TDWG ontology, FOAF, DCMI, Darwin Core, geo, etc.).&nbsp; If we fail to do

that, then every interaction that I have with another entity requires

me to establish in advance the rules of that interaction.&nbsp; The Web

works well because people follow a defined set of rules about URLs and

HTML.&nbsp; I would assert that we now (at last) have a similar model

available to us in the biodiversity informatics community if

organizations would just have the self-discipline to use it.&nbsp; <br>

<br>

Roderic Page wrote:

<blockquote

 cite="mid:33745C5B-B724-42A0-ADF1-CDFF2CF56FBD@bio.gla.ac.uk"

 type="cite">

  <pre wrap="">Reading this thread makes me despair. It's as if we are determined not to make progress, forever debating identifiers and what they identify, with seemingly little hope of resolution, and no clear vision of what the goals are. We wallow in acronym soup, and enjoy the technical challenges, but don't actually get anywhere</pre>

</blockquote>

I have to say that I'm not as pessimistic as Rod is.&nbsp; Maybe that's just

because I haven't been involved in the process as long as he has and

haven't had sufficient time to develop appropriate cynicism.&nbsp; But I

think there has been real progress, even in the couple of years I've been

tracking TDWG.&nbsp; We DO have a GUID Applicability Statement Standard

now.&nbsp; We DO have a Darwin Core standard that defines terms which could

be used to describe properties of biodiversity resources.&nbsp; We DO have

DOIs that are HTTP proxied and which return real metadata.&nbsp; We DO have

people in our community who know how to write RDF and set up content

negotiation for GUIDs as described in standards and best practices.&nbsp; I

would also say that we do have a relatively clear vision of what the

goals are.&nbsp; When I look at the old TAG roadmaps from 2006-2008<br>

<a class="moz-txt-link-freetext" href="http://www.tdwg.org/uploads/media/TAG_Roadmap_01.doc">http://www.tdwg.org/uploads/media/TAG_Roadmap_01.doc</a> (2006)<br>

<a class="moz-txt-link-freetext" href="http://www.tdwg.org/fileadmin/subgroups/tag/TAG_Roadmap_2007_final.pdf">http://www.tdwg.org/fileadmin/subgroups/tag/TAG_Roadmap_2007_final.pdf</a>

(2007)<br>

<a class="moz-txt-link-freetext" href="http://www.tdwg.org/fileadmin/subgroups/tag/TAG_Roadmap_2008.pdf">http://www.tdwg.org/fileadmin/subgroups/tag/TAG_Roadmap_2008.pdf</a> (2008)

<br>

the goals laid out there are the same ones I hear people talking about

now.&nbsp; The difference is that we now have the tools and standards to do

what was desired in 2006-8.&nbsp; We also have a funded project (BiSciCol)

that is making progress toward developing a system that will track when

changes occur in metadata for resources that are described by GUIDs.&nbsp;

So I'm actually pretty optimistic about the whole venture assuming that

we can get people and organizations to actually read and try to follow

the standards that we have already agreed upon.&nbsp; <br>

<br>

Steve<br>

<br>

<pre class="moz-signature" cols="72">-- 

Steven J. Baskauf, Ph.D., Senior Lecturer

Vanderbilt University Dept. of Biological Sciences

postal mail address:

VU Station B 351634

Nashville, TN  37235-1634,  U.S.A.

delivery address:

2125 Stevenson Center

1161 21st Ave., S.

Nashville, TN 37235

office: 2128 Stevenson Center

phone: (615) 343-4582,  fax: (615) 343-6707

<a class="moz-txt-link-freetext" href="http://bioimages.vanderbilt.edu">http://bioimages.vanderbilt.edu</a>

</pre>

</body>

</html>