<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">

<HTML>

<HEAD>

<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">

<META NAME="Generator" CONTENT="MS Exchange Server version 6.5.7654.12">

<TITLE>Re: [tdwg-content] Producing a global taxon register (was: ITIS TSNID to uBio NamebankIDs mapping)</TITLE>

</HEAD>

<BODY>

<!-- Converted from text/plain format -->


<P><FONT SIZE=2>I agree that it is important to be clear about what the goal<BR>

of a project is.<BR>

<BR>

* an HCAL - a hierarchical catalogue of all life - is a very popular<BR>

type of project; Catalogue of Life, ITIS, NCBI, Wikispecies, etc.<BR>

all pursue this.<BR>

<BR>

* a GTR - global taxon register - is something else entirely, at least<BR>

if the term is taken literally. It would be indispensable if the purpose<BR>

&quot;to index all usages of all names in all sources&quot; is to be realized.<BR>

I don't know of any project that pursues this in a systematic way<BR>

(I suppose the French Wikipedia rates a mention, at least making some<BR>

attempt).<BR>

<BR>

And of course there are projects that focus on names, but at the moment<BR>

we still don't have something like a complete nomenclatural index<BR>

(inventorying all nomenclatural acts), and are just moving towards<BR>

lists of currently accepted names (closely connected to the HCAL).<BR>

For information on biodiversity the latter is only marginally relevant,<BR>

and the GNI is much less so.<BR>

<BR>

Names and taxa are quite different things and they are interconnected<BR>

in a complex way.<BR>

<BR>

Paul<BR>

<BR>

-----Original Message-----<BR>

From: tdwg-content-bounces@lists.tdwg.org on behalf of Tony.Rees@csiro.au<BR>

Sent: Sat 4-6-2011 1:04<BR>

To: deepreef@bishopmuseum.org; tdwg-content@lists.tdwg.org<BR>

Subject: [tdwg-content] Producing a global taxon register (was: ITIS TSNID to uBio NamebankIDs mapping)<BR>

<BR>

Hi all (jumping in with some trepidation...)<BR>

<BR>

It's good to hear that some ramp-up of activity in the GNUB space may be coming (congratulations, Rich et al.). My main concern, however, is that it does not solve my particular problem - which is, in a nutshell: given &quot;any&quot; cited taxonomic name, what can we tell about it with regard to its classification, nomenclatural and taxonomic/synonym status, and certain attributes (initially, for my use case, simple geologic time - is it extant or not - and simple habitat classification - is it marine or not - though of course infinitely expandable from there).<BR>

<BR>

To me the vision of GNUB is too grand - to index all usages of all names in all sources - and the vision of GNI is too limited - to index the names but not actually record/harmonise/verify/manage (in a structured way) any associated information. I'm after something in between - what I have tentatively previously called HCAL - a hierarchical catalogue of all life (presuming that at least one &quot;management&quot; hierarchy is incorporated) - or maybe just a GTR - global taxon register. Sort of, waiting for the Catalogue of Life and/or ITIS to be complete, for both extant and fossil taxa, and also incorporate selected &quot;taxon attributes&quot; as above. (This is the space into which my IRMNG database is cast as a preliminary/&quot;working for now&quot; solution, but obviously without the significant resourcing / community cooperation required to build and sustain the thing for the long term).<BR>

<BR>

So my question is, how can such a product emerge from ongoing developments in GN* space, or other...<BR>

<BR>

Over to the experts,<BR>

<BR>

Best - Tony<BR>

<BR>

________________________________________<BR>

From: tdwg-content-bounces@lists.tdwg.org [tdwg-content-bounces@lists.tdwg.org] On Behalf Of Richard Pyle [deepreef@bishopmuseum.org]<BR>

Sent: Saturday, 4 June 2011 8:48 AM<BR>

To: tdwg-content@lists.tdwg.org<BR>

Subject: Re: [tdwg-content] ITIS TSNID to uBio NamebankIDs mapping<BR>

<BR>

Working backwards through this thread...<BR>

<BR>

I hadn't read Dima's post until just now, and I see that at least a couple of his points (i.e., #2, #5, #6) apply to exposing the UUIDs externally. However, I think that a simple protocol (such as replacing spaces with &quot;_&quot;, and avoiding characters that look the same but are different -- such as the Cyrillic 'a') could go a long way to mitigating those problems.<BR>
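<BR>

A minimal sketch of such a protocol, in Python, might look like the following. The function names are hypothetical, and flagging every non-ASCII letter is only a simplifying assumption standing in for a proper look-alike check. Percent-encoding whatever remains also addresses the unescaped-Unicode concern in Dima's point 6.<BR>
</FONT></P>
<PRE>
```python
import unicodedata
from urllib.parse import quote


def non_ascii_letters(name):
    """Return (character, Unicode name) pairs for every non-ASCII letter in name."""
    return [
        (ch, unicodedata.name(ch, "UNKNOWN"))
        for ch in name
        if ch.isalpha() and not ch.isascii()
    ]


def name_to_path_segment(name):
    """Collapse whitespace, replace spaces with "_", and percent-encode the rest."""
    collapsed = " ".join(name.split())
    # Parentheses are kept readable; anything outside ASCII is UTF-8 percent-encoded.
    return quote(collapsed.replace(" ", "_"), safe="()_")


latin = "Betula"
mixed = "Betul\u0430"  # the last letter is CYRILLIC SMALL LETTER A

print(name_to_path_segment("Danaus plexippus (Linnaeus 1758)"))
print(non_ascii_letters(latin))   # nothing to flag
print(non_ascii_letters(mixed))   # flags the Cyrillic letter
print(name_to_path_segment(mixed))
```
</PRE>
<P><FONT SIZE=2>
Note that this simple check would also flag legitimate diacritics in author names, so a real service would probably need a narrower test for confusable characters.<BR>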

<BR>

On the other hand, it really depends on what the identifier is for.&nbsp; The string &quot;Danaus_plexippus_(Linnaeus_1758)&quot; may be more friendly to our eyes, but &quot;A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&quot; is definitely more friendly to a computer (Dima's points 1, 3 &amp; 4, among others).&nbsp; My feeling is that the push for GUIDs is more about enabling computer-computer conversations than it is about enabling human-human or human-computer interactions; and therefore we should not get bogged down in the &quot;ugliness&quot; of the identifiers.&nbsp; In the context of electronic data services, the &quot;ugliness&quot; potential of the &quot;Danaus_plexippus_(Linnaeus_1758)&quot; approach to identifiers is far greater than the ugliness potential of &quot;A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&quot;, when it comes to interlinking electronic biodiversity data.&nbsp; It is nothing for a computer to render relevant metadata of the object identified by &quot;A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&quot; into &quot;Danaus plexippus (Linnaeus 1758)&quot; on a computer screen or piece of paper for human-eyeball consumption.&nbsp; But there are many pitfalls (some noted by Dima) when a computer tries to unambiguously resolve &quot;Danaus_plexippus_(Linnaeus_1758)&quot; back to a meaningful data object.<BR>

<BR>

I guess my revised point is:&nbsp; GNI (and uBio/NameBank) are essentially the only taxonomic databases out there where a human-friendly persistent/actionable identifier of the sort being discussed is even plausible as an option.&nbsp; It may not even be wise in this context (as per Dima's points), but it *might* be, depending on the need for a human-friendly identifier.<BR>

<BR>

Maybe the simplest thing to do would be to not regard &quot;<A HREF="http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)">http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)</A>&quot; as an identifier per se, but rather as a protocol for a web service.&nbsp; In other words, if you append a text string to the root URL &quot;<A HREF="http://gni.globalnames.org/name_strings/">http://gni.globalnames.org/name_strings/</A>&quot;, GNI would run that text string against its index and return whatever metadata it finds based on a text-string match.&nbsp; This is not mutually exclusive with an &quot;identifier&quot; in the form of &quot;<A HREF="http://gni.globalnames.org/name_strings/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://gni.globalnames.org/name_strings/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</A>&quot;, which would less ambiguously resolve a known record in GNI.&nbsp; At this point, the line between &quot;identifier&quot; and &quot;service&quot; gets fuzzy, of course.&nbsp; But the analogy holds in ZooBank:<BR>

<BR>

The persistent &quot;Identifier&quot; looks like this:<BR>

A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523<BR>

<BR>

One way that this identifier can be represented as an *actionable* identifier is this:<BR>

urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523<BR>

<BR>

Another &quot;actionable&quot; form of the identifier might be this:<BR>

<A HREF="http://zoobank.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</A><BR>

<BR>

or this:<BR>

<A HREF="http://zoobank.org/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://zoobank.org/A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</A><BR>

<BR>

or even this(?):<BR>

<A HREF="http://lsid.tdwg.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://lsid.tdwg.org/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</A><BR>

<BR>

(all of which work, by the way)<BR>
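<BR>

As a rough illustration, the forms listed above can all be derived mechanically from the bare UUID. This is only a sketch: the function name and dictionary keys are hypothetical, and the URL patterns are simply the ones quoted in this message.<BR>
</FONT></P>
<PRE>
```python
import uuid


def actionable_forms(identifier):
    """Build the identifier forms listed above from a bare ZooBank act UUID."""
    # uuid.UUID raises ValueError if the string is not a well-formed UUID.
    canonical = str(uuid.UUID(identifier)).upper()
    lsid = "urn:lsid:zoobank.org:act:" + canonical
    return {
        "uuid": canonical,
        "lsid": lsid,
        "zoobank_lsid_url": "http://zoobank.org/" + lsid,
        "zoobank_uuid_url": "http://zoobank.org/" + canonical,
        "tdwg_proxy_url": "http://lsid.tdwg.org/" + lsid,
    }


forms = actionable_forms("a9f435e0-8ed7-46dd-bab4-ea8e5bf41523")
for label, value in forms.items():
    print(label, value)
```
</PRE>
<P><FONT SIZE=2>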

<BR>

However, the following are examples of what I would think of as *services*:<BR>

<A HREF="http://www.google.com/search?q=Danaus+plexippus+(Linnaeus+1758)">http://www.google.com/search?q=Danaus+plexippus+(Linnaeus+1758)</A><BR>

<A HREF="http://lsid.tdwg.org/summary/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523">http://lsid.tdwg.org/summary/urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523</A><BR>

<A HREF="http://darwin.zoology.gla.ac.uk/~rpage/lsid/tester/?q=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&submit=Go">http://darwin.zoology.gla.ac.uk/~rpage/lsid/tester/?q=urn:lsid:zoobank.org:act:A9F435E0-8ED7-46DD-BAB4-EA8E5BF41523&submit=Go</A><BR>

<BR>

But really, from the perspective of the end-user, does it matter if it's an identifier or a service?&nbsp; Ultimately, they ask the questions, and the answers appear on their computer screens.<BR>

<BR>

Aloha,<BR>

Rich<BR>

<BR>

<BR>

<BR>

<BR>

<BR>

&gt; -----Original Message-----<BR>

&gt; From: tdwg-content-bounces@lists.tdwg.org [<A HREF="mailto:tdwg-content-bounces@lists.tdwg.org">mailto:tdwg-content-bounces@lists.tdwg.org</A>] On Behalf Of Dmitry Mozzherin<BR>

&gt; Sent: Friday, June 03, 2011 4:34 AM<BR>

&gt; To: David Remsen (GBIF)<BR>

&gt; Cc: tdwg-content@lists.tdwg.org; Dmitry Mozzherin; Orrell, Thomas; Alan J<BR>

&gt; Hampson; Nicolson, David; Gerald Guala<BR>

&gt; Subject: Re: [tdwg-content] ITIS TSNID to uBio NamebankIDs mapping<BR>

&gt;<BR>

&gt; In my opinion UUIDs have a few advantages over strings --<BR>

&gt;<BR>

&gt; 1. It is a UUID, so it will work with UUID tools (current and future ones)<BR>

&gt; 2. It is less ambiguous -- for example, what is the difference between Betul&#1072; and<BR>

&gt; Betula for your eyes? (one of them has a Cyrillic 'a')<BR>

&gt; 3. Database-wise it is faster to search because it is just a 128-bit number, while<BR>

&gt; a name is at least a 245-byte varchar -- in relational databases the size of<BR>

&gt; keys directly affects search speed<BR>

&gt; 4. UUID v. 5<BR>

&gt; (<A HREF="http://en.wikipedia.org/wiki/Universally_unique_identifier">http://en.wikipedia.org/wiki/Universally_unique_identifier</A>)<BR>

&gt; allows one to generate a UUID algorithmically without looking up a database (no<BR>

&gt; need for a network connection)<BR>

&gt; 5. Links like <A HREF="http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)">http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)</A> might be ambiguous -- I can think of several ways to represent the name string<BR>

&gt; part in the URL, and they will all resolve to the same thing in GNI.<BR>

&gt; 6. Unescaped Unicode characters in URLs containing literal name strings (people<BR>

&gt; will forget to escape them) will depend on the implementation of the URL<BR>

&gt; resolver<BR>

&gt;<BR>
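<BR>

Points 3 and 4 can be illustrated with Python's standard uuid module. This is a sketch only: the namespace below is an assumption made for the example (the thread does not state which namespace GNI uses), so the printed UUID is not claimed to match the one GNI assigns.<BR>
</FONT></P>
<PRE>
```python
import uuid

# ASSUMPTION: a namespace derived from the DNS name "globalnames.org".
# The namespace GNI actually uses is not stated in this thread.
NAMESPACE = uuid.uuid5(uuid.NAMESPACE_DNS, "globalnames.org")


def name_uuid(name_string):
    """Derive a version 5 (SHA-1 based) UUID from a name string, offline."""
    return uuid.uuid5(NAMESPACE, name_string)


first = name_uuid("Danaus plexippus (Linnaeus 1758)")
second = name_uuid("Danaus plexippus (Linnaeus 1758)")

print(first)
print(first == second)     # True: same string, same UUID, no lookup needed
print(first.version)       # 5
print(len(first.bytes))    # 16 bytes, i.e. a 128-bit key
print(name_uuid("Danaus plexippus") == first)  # False: a different string
```
</PRE>
<P><FONT SIZE=2>
Because the UUID depends on the exact characters of the name string, any normalisation (spacing, punctuation, Unicode form) has to be agreed on before two parties can expect to compute the same value.<BR>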

&gt; That said, links like<BR>

&gt; <A HREF="http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)">http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)</A><BR>

&gt; are definitely attractive, and it is good to have them as another way to access<BR>

&gt; a name!<BR>

&gt; My personal preference would be not to use them as the main identifier because<BR>

&gt; of reasons 1, 2, 3 and 5.<BR>

&gt;<BR>

&gt; Dima<BR>

&gt;<BR>

&gt;<BR>

&gt;<BR>

&gt;<BR>

&gt; On Fri, Jun 3, 2011 at 7:59 AM, David Remsen (GBIF) &lt;dremsen@gbif.org&gt;<BR>

&gt; wrote:<BR>

&gt; &gt; Why not use the name as the basis for the resolvable identifier<BR>

&gt; &gt; instead of a UUID? Isn't there a 1:1 cardinality between the name and<BR>

&gt; &gt; the UUID in the GNI?&nbsp; Doesn't that mean that<BR>

&gt; &gt;<BR>

&gt; &gt; <A HREF="http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec">http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec</A><BR>

&gt; &gt; and<BR>

&gt; &gt;<BR>

&gt; &gt; <A HREF="http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)">http://gni.globalnames.org/name_strings/Danaus_plexippus_(Linnaeus_1758)</A><BR>

&gt; &gt;<BR>

&gt; &gt; are equally unique?&nbsp; The latter is certainly more readable.&nbsp; In those<BR>

&gt; &gt; cases where the namestring is a homonym like<BR>

&gt; &gt;<BR>

&gt; &gt; <A HREF="http://gni.globalnames.org/name_strings/Oenanthe">http://gni.globalnames.org/name_strings/Oenanthe</A><BR>

&gt; &gt;<BR>

&gt; &gt; couldn't you just return the addresses of the two globally unique<BR>

&gt; &gt; forms of the name when you resolve it?<BR>

&gt; &gt;<BR>

&gt; &gt; <A HREF="http://gni.globalnames.org/name_strings/Oenanthe_Smith_1899">http://gni.globalnames.org/name_strings/Oenanthe_Smith_1899</A><BR>

&gt; &gt;<BR>

&gt; &gt; <A HREF="http://gni.globalnames.org/name_strings/Oenanthe_Jones_1900">http://gni.globalnames.org/name_strings/Oenanthe_Jones_1900</A><BR>

&gt; &gt;<BR>

&gt; &gt; Wouldn't those be as globally unique and easier to read and adjust to?<BR>

&gt; &gt; Or am I missing something?&nbsp; I always wanted to do that with uBio IDs<BR>

&gt; &gt; after a back and forth with Gregor Hagedorn and wished we hadn't<BR>

&gt; &gt; exposed those integers.<BR>

&gt; &gt;<BR>

&gt; &gt; DR<BR>

&gt; &gt;<BR>

&gt; &gt;&gt; Hi Steve,<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; I don't have time to go through this in detail, and I can't speak for<BR>

&gt; &gt;&gt; the GNI, but I can tell you about how the GNI URI's work at least for now.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; A while back Dima Mozzherin and I were looking into how triples etc.<BR>

&gt; &gt;&gt; might be of use to the GNI.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; We needed a way to generate unique URI's for each name.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; We wanted to avoid having to keep these in sync and not require<BR>

&gt; &gt;&gt; everyone to look each ID up through some service.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; Dima came up with the following plan. We use the namestring as seed<BR>

&gt; &gt;&gt; to generate a unique UUID.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; Basically this is a shared algorithm which the GNI and TaxonConcept<BR>

&gt; &gt;&gt; both use. But it could be used by anyone.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; You feed the name string to the algorithm and it spits out a UUID. We<BR>

&gt; &gt;&gt; then append that to a URI and web service so it is resolvable.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; So the name Danaus plexippus (Linnaeus 1758) =&gt;<BR>

&gt; &gt;&gt; 4ef223c4-0c3e-5e84-ace9-755c34c601ec<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; So if the GNI and another group have the same namestring they<BR>

&gt; &gt;&gt; have the same UUID.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; People can then link their data set to the GNI with the following<BR>

&gt; &gt;&gt; URI<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; <A HREF="http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec">http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec</A><BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; RDF<BR>

&gt; &gt;&gt; <A HREF="http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec.rdf">http://gni.globalnames.org/name_strings/4ef223c4-0c3e-5e84-ace9-755c34c601ec.rdf</A><BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; If you think of your data set as one table and the GNI<BR>

&gt; &gt;&gt; as another, this URI serves as the foreign key that connects them<BR>

&gt; &gt;&gt; together.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; Some on the list don't like how these look, but there is a tremendous<BR>

&gt; &gt;&gt; advantage in not having to worry about syncing two large data sets<BR>

&gt; &gt;&gt; and determining if a given integer is already in use.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; Also, Rod Page has written recently about UUIDs:<BR>

&gt; &gt;&gt; <A HREF="http://iphylo.blogspot.com/2011/05/zoobank-on-couchdb-uuids-replication.html">http://iphylo.blogspot.com/2011/05/zoobank-on-couchdb-uuids-replication.html</A><BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; There may be a way to do something similar with bit.ly-like<BR>

&gt; &gt;&gt; identifiers that are shorter (mCcSp), but I think the general idea<BR>

&gt; &gt;&gt; is a good one.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; If you recall from my talk at TDWG, I was able to use these to make<BR>

&gt; &gt;&gt; statements that one namestring was a synonym, etc., of another.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; The algorithm we use is written in Ruby but it could be ported to many<BR>

&gt; &gt;&gt; different languages since UUIDs are widely supported.<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; Respectfully,<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; - Pete<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt; On Thu, Jun 2, 2011 at 11:41 PM, Steven J. Baskauf &lt;<BR>

&gt; &gt;&gt; steve.baskauf@vanderbilt.edu&gt; wrote:<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt;&gt;&nbsp; My email access has been sporadic since this thread developed, so<BR>

&gt; &gt;&gt;&gt; at this point I'll respond to points made in several of the<BR>

&gt; &gt;&gt;&gt; messages.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; First, I should note that there has been previous discussion on this<BR>

&gt; &gt;&gt;&gt; list on a similar topic from<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002231.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002231.html</A><BR>

&gt; &gt;&gt;&gt; through<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002231.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002231.html</A>.<BR>

&gt; &gt;&gt;&gt; One can review what was said at that time rather quickly by starting<BR>

&gt; &gt;&gt;&gt; on the first linked message and clicking on the &quot;Next Message&quot; link<BR>

&gt; &gt;&gt;&gt; until you get to the end of the range I gave above.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; My reason for the request for information that started this thread<BR>

&gt; &gt;&gt;&gt; was that I wanted to link to a URI that would anchor the name<BR>

&gt; &gt;&gt;&gt; portion of a name/sensu pair (TNU or Taxon Concept a la TCS if you<BR>

&gt; &gt;&gt;&gt; prefer) as in this RDF<BR>

&gt; &gt;&gt;&gt; snippet:<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;&nbsp;&nbsp;&nbsp; &lt;tc:nameString&gt;Quercus rubra L.&lt;/tc:nameString&gt;<BR>

&gt; &gt;&gt;&gt;&nbsp;&nbsp;&nbsp; &lt;tc:hasName rdf:about=&quot;http://www.ubio.org/authority/metadata.php?lsid=urn:lsid:ubio.org:namebank:448439&quot;/&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; At this point in the discussion, I'm not actually talking about<BR>

&gt; &gt;&gt;&gt; creating a link to a taxon concept but rather to a taxon name, so<BR>

&gt; &gt;&gt;&gt; some of the issues Pete raised don't apply here (e.g. what's the<BR>

&gt; &gt;&gt;&gt; &quot;right&quot; name for a concept - the question here is simply what's<BR>

&gt; &gt;&gt;&gt; a stable identifier for the name).&nbsp; In<BR>

&gt; &gt;&gt;&gt; principle, I could probably just provide the name string and be done<BR>

&gt; &gt;&gt;&gt; with it.&nbsp; However, having some degree of faith that Smart, Computer<BR>

&gt; &gt;&gt;&gt; Savvy People might some day be able to use the metadata returned by<BR>

&gt; &gt;&gt;&gt; the URI (or perhaps metadata which they already have in a triple<BR>

&gt; &gt;&gt;&gt; store onsite) to do cool things like knowing that my name is the<BR>

&gt; &gt;&gt;&gt; same as an orthographic variant or that &quot;Quercus rubra&nbsp; L.&quot; is<BR>

&gt; &gt;&gt;&gt; basically the same thing as &quot;Quercus rubra&quot;, I would like to also<BR>

&gt; &gt;&gt;&gt; provide a functional URI.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; As an end-user who isn't very interested in the technical issues<BR>

&gt; &gt;&gt;&gt; involving names, I don't really care what URI I use.&nbsp; I would prefer<BR>

&gt; &gt;&gt;&gt; for it to be widely recognized and for it to &quot;work&quot; (i.e. be<BR>

&gt; &gt;&gt;&gt; resolvable).&nbsp; In the earlier (January) thread, there was discussion<BR>

&gt; &gt;&gt;&gt; about existing identifiers.&nbsp; There were a number of posts, but in particular<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002258.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002258.html</A> and<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002259.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-January/002259.html</A><BR>

&gt; &gt;&gt;&gt; discussed the relative merits of ITIS and uBio ID numbers.&nbsp; My<BR>

&gt; &gt;&gt;&gt; take-home message from this was that uBio represented the largest<BR>

&gt; &gt;&gt;&gt; single set of names with assigned identifiers (see<BR>

&gt; &gt;&gt;&gt; <A HREF="http://gni.globalnames.org/data_sources">http://gni.globalnames.org/data_sources</A> cited in Pete's email) and<BR>

&gt; &gt;&gt;&gt; that uBio metadata provides useful references.<BR>

&gt; &gt;&gt;&gt; Hence my interest in referencing uBio ids as a URI.&nbsp; However, as a<BR>

&gt; &gt;&gt;&gt; practical matter, the organizations that I share images with either<BR>

&gt; &gt;&gt;&gt; want ITIS TSNs (EOL and Morphbank) or just names (Discover Life).<BR>

&gt; &gt;&gt;&gt; Nobody is asking for uBio identifiers or any other identifier.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I found Kevin's comment at<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-May/002486.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-May/002486.html</A><BR>

&gt; &gt;&gt;&gt; very<BR>

&gt; &gt;&gt;&gt; thought-provoking: &quot;My thoughts are that the most likely way this<BR>

&gt; &gt;&gt;&gt; will be solved is by standard market type pressures - ie the best<BR>

&gt; &gt;&gt;&gt; solution/IDs will be used the most and 'float' to the top.&quot;&nbsp; I'm not<BR>

&gt; &gt;&gt;&gt; going to make a judgment about what is the &quot;best&quot; solution or ID.<BR>

&gt; &gt;&gt;&gt; But I would say that in &quot;computer&quot;<BR>

&gt; &gt;&gt;&gt; history, being the &quot;best&quot; doesn't necessarily mean that something<BR>

&gt; &gt;&gt;&gt; will be used.&nbsp; Take for example, the FOAF vocabulary.&nbsp; What the heck<BR>

&gt; &gt;&gt;&gt; is Friend of a Friend?&nbsp; I would venture to say that most of the<BR>

&gt; &gt;&gt;&gt; people using the FOAF vocabulary don't know or care.&nbsp; The FOAF<BR>

&gt; &gt;&gt;&gt; vocabulary was the one that people started to use and once that<BR>

&gt; &gt;&gt;&gt; happened, people didn't switch even if there was something better.<BR>

&gt; &gt;&gt;&gt; I'm not familiar with the history of other stuff like YouTube and<BR>

&gt; &gt;&gt;&gt; Craig's List, but I would guess that they weren't necessarily &quot;the<BR>

&gt; &gt;&gt;&gt; best&quot; systems - they were just the one that the most people started<BR>

&gt; &gt;&gt;&gt; using first and once that happened, people didn't switch.&nbsp; I'm using<BR>

&gt; &gt;&gt;&gt; ITIS IDs because they are easy to get and the people I communicate<BR>

&gt; &gt;&gt;&gt; with want them.&nbsp; Whether they are the &quot;best&quot; or &quot;done correctly&quot;<BR>

&gt; &gt;&gt;&gt; doesn't matter to me as much as the fact that they are widely<BR>

&gt; &gt;&gt;&gt; recognized and stable (and that thus far every name that I've looked<BR>

&gt; &gt;&gt;&gt; for has been in their database).<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I think that one reason why this question has been on my mind is<BR>

&gt; &gt;&gt;&gt; that I've been waiting for GNUB (Global Name Use Bank) to come out.<BR>

&gt; &gt;&gt;&gt; I'm not really up on how it is going to work, but my impression is<BR>

&gt; &gt;&gt;&gt; that it was going to be based on the Global Name Index (GNI) which<BR>

&gt; &gt;&gt;&gt; was mentioned in that earlier January thread.&nbsp; At that point, the<BR>

&gt; &gt;&gt;&gt; GNI names didn't have any identifiers that were exposed to the<BR>

&gt; &gt;&gt;&gt; public as permanent GUIDs.&nbsp; I'm assuming that if GNUB refers to GNI<BR>

&gt; &gt;&gt;&gt; names, they will have some kind of identifiers.&nbsp; So if that happens<BR>

&gt; &gt;&gt;&gt; how is the GUID recommendation 8 going to be followed?&nbsp; As Kevin<BR>

&gt; &gt;&gt;&gt; said in<BR>

&gt; &gt;&gt;&gt; <A HREF="http://lists.tdwg.org/pipermail/tdwg-content/2011-June/002499.html">http://lists.tdwg.org/pipermail/tdwg-content/2011-June/002499.html</A><BR>

&gt; &gt;&gt;&gt; &quot;What I take from recommendation 8 of the GUID applicability guide<BR>

&gt; &gt;&gt;&gt; ... is that if you DON'T already have a record in your own database<BR>

&gt; &gt;&gt;&gt; for a taxon name/concept, then reuse an existing one.&quot;&nbsp; What we<BR>

&gt; &gt;&gt;&gt; have here with GNI is a situation where none of the records have<BR>

&gt; &gt;&gt;&gt; identifiers.&nbsp; In my mind, the &quot;best practice&quot; according to<BR>

&gt; &gt;&gt;&gt; recommendation 8 would be for the GNI to reuse existing identifiers<BR>

&gt; &gt;&gt;&gt; where they exist and NOT make up new ones.&nbsp; This is a bit more<BR>

&gt; &gt;&gt;&gt; complicated because the ITIS identifiers (which are in common use)<BR>

&gt; &gt;&gt;&gt; don't have an http URI version that is resolvable, and while the<BR>

&gt; &gt;&gt;&gt; uBio identifiers have a resolvable http URI, it's in the form of a<BR>

&gt; &gt;&gt;&gt; proxied LSID, which I've already complained is very ugly.&nbsp; So I'd<BR>

&gt; &gt;&gt;&gt; like to hear some ideas about how to have &quot;reused&quot; identifiers in<BR>

&gt; &gt;&gt;&gt; the GNI.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; One thing that comes to my mind would be to have a &quot;domain name&quot;<BR>

&gt; &gt;&gt;&gt; like &quot;<A HREF="http://purl.org/gni/">http://purl.org/gni/</A>&quot; or<BR>

&gt; &gt;&gt;&gt; &quot;<A HREF="http://purl.org/tn/">http://purl.org/tn/</A>&quot; (&quot;tn&quot; for &quot;taxon name&quot;)<BR>

&gt; &gt;&gt;&gt; and to follow it with a namespace/id combination similar to what is<BR>

&gt; &gt;&gt;&gt; done with LSIDs.&nbsp; So for example &quot;itis/19408&quot; and &quot;ubio/448439&quot;<BR>

&gt; &gt;&gt;&gt; could be appended, creating <A HREF="http://purl.org/gni/itis/19408">http://purl.org/gni/itis/19408</A> and<BR>

&gt; &gt;&gt;&gt; <A HREF="http://purl.org/gni/ubio/448439">http://purl.org/gni/ubio/448439</A> for &quot;Quercus rubra&nbsp; L.&quot;&nbsp; Both URIs<BR>

&gt; &gt;&gt;&gt; could point to the same RDF and that RDF could indicate that the two<BR>

&gt; &gt;&gt;&gt; identifiers are owl:sameAs .&nbsp; I realize from what Bob Morris has<BR>

&gt; &gt;&gt;&gt; cautioned in the past that there are problems with owl:sameAs when<BR>

&gt; &gt;&gt;&gt; the two things aren't actually the same thing (e.g. if the uBio ID<BR>

&gt; &gt;&gt;&gt; refers to a name string only but the ITIS TSN refers to the name<BR>

&gt; &gt;&gt;&gt; plus an &quot;accepted&quot; status and a relationship to parent taxa).<BR>

&gt; &gt;&gt;&gt; However, if there were an understanding that the GNI only refers to<BR>

&gt; &gt;&gt;&gt; name strings, then one could still refer to<BR>

&gt; &gt;&gt;&gt; <A HREF="http://purl.org/gni/itis/19408">http://purl.org/gni/itis/19408</A> as an identifier for the name string<BR>

&gt; &gt;&gt;&gt; of the thing (whatever it is) that is referred to by an ITIS TSN of<BR>

&gt; &gt;&gt;&gt; 19408.&nbsp; I don't think there would be a problem saying that it and the<BR>

&gt; &gt;&gt;&gt; uBio ID were &quot;owl:sameAs&quot;.&nbsp; Some kind of solution like this would<BR>

&gt; &gt;&gt;&gt; allow people to easily generate a resolvable URI for a name if they<BR>

&gt; &gt;&gt;&gt; were using ITIS TSNs or uBio IDs.&nbsp; If the name that one wanted to<BR>

&gt; &gt;&gt;&gt; use was so obscure that it was one of the 9.5 million names that<BR>

&gt; &gt;&gt;&gt; uBio has that ITIS doesn't have, then that name would only have the<BR>

&gt; &gt;&gt;&gt; ubio version.&nbsp; I have no idea whether this would be a good idea or<BR>

&gt; &gt;&gt;&gt; not, but I was really cringing to think about 19 million newly<BR>

&gt; &gt;&gt;&gt; minted UUIDs appended to<BR>

&gt; &gt;&gt;&gt; &quot;<A HREF="http://gni.globalnames.org/">http://gni.globalnames.org/</A>&quot; and<BR>

&gt; &gt;&gt;&gt; figuring out how to connect those horrid things to the names and<BR>

&gt; &gt;&gt;&gt; ITIS TSNs that I'm already using.&nbsp; I think that I said this before,<BR>

&gt; &gt;&gt;&gt; but using the purl.org domain rather than one like<BR>

&gt; &gt;&gt;&gt; <A HREF="http://gni.globalnames.org/">http://gni.globalnames.org/</A> would in the future allow somebody else<BR>

&gt; &gt;&gt;&gt; to take over management of providing the metadata when the GUIDs are<BR>

&gt; &gt;&gt;&gt; resolved without having to deal with issues of who &quot;owns&quot; the domain<BR>

&gt; &gt;&gt;&gt; name.<BR>
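<BR>

The namespace/id scheme proposed above could be sketched roughly as follows. The function names are hypothetical, the purl.org root is only the one suggested in this message (not an existing service), and the owl:sameAs statement is returned as a plain tuple rather than serialised as RDF.<BR>
</FONT></P>
<PRE>
```python
BASE = "http://purl.org/gni/"  # proposed root from the message above; hypothetical
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def purl_uri(namespace, local_id):
    """Append a namespace/id pair to the proposed root, e.g. itis/19408."""
    return BASE + namespace + "/" + str(local_id)


def same_as_statement(subject_uri, object_uri):
    """Return an owl:sameAs statement as a (subject, predicate, object) tuple."""
    return (subject_uri, OWL_SAME_AS, object_uri)


itis = purl_uri("itis", 19408)    # ITIS TSN for "Quercus rubra L."
ubio = purl_uri("ubio", 448439)   # uBio NameBank ID for the same name string

print(itis)
print(ubio)
print(same_as_statement(itis, ubio))
```
</PRE>
<P><FONT SIZE=2>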

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Steve<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Kevin Richards wrote:<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;&nbsp; Pete,<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I'm not trying to say what you are doing is a waste of time/impossible.&nbsp; I<BR>

&gt; &gt;&gt;&gt; actually think RDF + semantics are a good way forward, but this<BR>

&gt; &gt;&gt;&gt; really implies that we need to rely on the semantics and linkages<BR>

&gt; &gt;&gt;&gt; rather than having a SINGLE ID for a taxon name.&nbsp; (which is what I<BR>

&gt; &gt;&gt;&gt; thought Steve was getting at).&nbsp; Each instance of a taxon name can<BR>

&gt; &gt;&gt;&gt; have its own ID and then all these instances are connected via<BR>

&gt; &gt;&gt;&gt; ontology defined semantic links.&nbsp; This seems more appropriate to me<BR>

&gt; &gt;&gt;&gt; than insisting everyone uses the &quot;Global Taxon Name ID X&quot;.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; In your example of *Aedes triseriatus* and *Ochlerotatus<BR>

&gt; &gt;&gt;&gt; triseriatus* - these are two different names so they need two<BR>

&gt; &gt;&gt;&gt; different IDs, they may be linked by a single taxon concept, but<BR>

&gt; &gt;&gt;&gt; they are separate names.&nbsp; So which of these now 3 IDs do you expect<BR>

&gt; &gt;&gt;&gt; people to use, and according to what source??<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; For example, if we have a name, e.g. the Robin, Erithacus rubecula,<BR>

&gt; &gt;&gt;&gt; mentioned in ITIS (TSN: 559964), in EOL (www.eol.org/pages/1051567),<BR>

&gt; &gt;&gt;&gt; in GBIF (<A HREF="http://data.gbif.org/species/21266780">http://data.gbif.org/species/21266780</A>), and in Avibase (<BR>

&gt; &gt;&gt;&gt; <A HREF="http://avibase.bsc-eoc.org/species.jsp?avibaseid=C809B2B90399A43D">http://avibase.bsc-eoc.org/species.jsp?avibaseid=C809B2B90399A43D</A>),<BR>

&gt; &gt;&gt;&gt; which ID are you hoping people will use?&nbsp; Would you put the ITIS ID in your<BR>

&gt; &gt;&gt;&gt; own dataset as the ID for that name? Unlikely.&nbsp; Or would it be better to<BR>

&gt; &gt;&gt;&gt; link them up with semantic linkages?<BR>
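<BR>
Editorial sketch (not part of the original message): one way to picture "linking them up" is to keep every source's own identifier and record pairwise same-name assertions, then group whatever ends up connected. The identifiers below are the ones quoted for the Robin; the "source:id" spelling, the LINKS list, and all function names are assumptions made for illustration, not an existing vocabulary or API.<BR>

```python
from collections import defaultdict

# Identifiers for the Robin (Erithacus rubecula) as quoted in the message above.
# The "source:id" spelling is an illustrative convention, not an official URI scheme.
ROBIN_IDS = [
    "itis:559964",
    "eol:1051567",
    "gbif:21266780",
    "avibase:C809B2B90399A43D",
]

# Hypothetical pairwise assertions that two records refer to the same name.
LINKS = [
    ("itis:559964", "eol:1051567"),
    ("eol:1051567", "gbif:21266780"),
    ("gbif:21266780", "avibase:C809B2B90399A43D"),
]


def find(parent, item):
    """Return the representative of item's group, shortening the path as we go."""
    parent.setdefault(item, item)
    while parent[item] != item:
        parent[item] = parent[parent[item]]
        item = parent[item]
    return item


def group_linked_ids(links):
    """Group identifiers that are connected, directly or indirectly, by links."""
    parent = {}
    for left, right in links:
        parent[find(parent, left)] = find(parent, right)
    groups = defaultdict(set)
    for item in list(parent):
        groups[find(parent, item)].add(item)
    return list(groups.values())


def equivalents(identifier, links):
    """Return every identifier linked to the given one, including itself."""
    for group in group_linked_ids(links):
        if identifier in group:
            return sorted(group)
    return [identifier]
```

Under these assumptions nobody has to give up a local ID: calling equivalents("itis:559964", LINKS) returns all four identifiers, and an identifier with no links simply stands alone.<BR>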

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; What I take from recommendation 8 of the GUID applicability guide (as<BR>

&gt; &gt;&gt;&gt; Steve puts it, &quot;stop making up new identifiers when somebody else already has<BR>

&gt; &gt;&gt;&gt; one for the thing you are talking about&quot;) is that if you DON'T already have<BR>

&gt; &gt;&gt;&gt; a record in your own database for a taxon name/concept, then reuse an<BR>

&gt; &gt;&gt;&gt; existing one.&nbsp; NOT ditch all your current IDs and adopt someone else's<BR>

&gt; &gt;&gt;&gt; (especially hard considering how difficult it is to work out which of the<BR>

&gt; &gt;&gt;&gt; multitude of name and concept IDs directly relates to your taxon name).<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I am all for limiting the number of IDs for the &quot;same&quot; thing, but in<BR>

&gt; &gt;&gt;&gt; some<BR>

&gt; &gt;&gt;&gt; cases it is more useful to build linkages than force this tight<BR>

&gt; &gt;&gt;&gt; integration<BR>

&gt; &gt;&gt;&gt; of data and IDs.&nbsp; Especially for taxon names and concepts, where it is<BR>

&gt; &gt;&gt;&gt; complex to define if you are even talking about the &quot;same&quot; thing or not.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Kevin<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; *From:* Peter DeVries<BR>

&gt; &gt;&gt;&gt; [<A HREF="mailto:pete.devries@gmail.com">mailto:pete.devries@gmail.com</A>&lt;pete.devries@gmail.com&gt;]<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; *Sent:* Wednesday, 1 June 2011 12:38 p.m.<BR>

&gt; &gt;&gt;&gt; *To:* Kevin Richards<BR>

&gt; &gt;&gt;&gt; *Cc:* Steve Baskauf; tdwg-content@lists.tdwg.org; Gerald Guala;<BR>

&gt; &gt;&gt;&gt; Nicolson,<BR>

&gt; &gt;&gt;&gt; David; Alan J Hampson; Orrell, Thomas<BR>

&gt; &gt;&gt;&gt; *Subject:* Re: [tdwg-content] ITIS TSNID to uBio NamebankIDs mapping<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Hi Kevin,<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I forgot to mention some other things that are different about my<BR>

&gt; &gt;&gt;&gt; project.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; You can write a simple SPARQL query to get a list of all the<BR>

&gt; &gt;&gt;&gt; TaxonConcepts that have ITIS IDs, or all those that have ITIS and<BR>

&gt; &gt;&gt;&gt; NCBI IDs, etc.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; You can do this on any SPARQL endpoint that hosts the data.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; You can download the entire data set and run the queries on your own<BR>

&gt; &gt;&gt;&gt; endpoint.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; You can write a script that runs the query and downloads the ITIS<BR>

&gt; &gt;&gt;&gt; numbers<BR>

&gt; &gt;&gt;&gt; and exports them to CSV etc.<BR>
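<BR>
Editorial sketch (not part of the original message): the export step described above could look roughly like the following, assuming the endpoint returns results in the standard SPARQL JSON results layout ("head"/"vars" and "results"/"bindings"). The sample data, the variable names "concept" and "itis", and the example.org URIs are hypothetical; the TSN is the Robin's, taken from earlier in the thread. No real endpoint is contacted here.<BR>

```python
import csv
import io

# Hand-written sample shaped like the W3C SPARQL JSON results format.
# The variable names and the concept URIs are hypothetical.
SAMPLE_RESULTS = {
    "head": {"vars": ["concept", "itis"]},
    "results": {
        "bindings": [
            {
                "concept": {"type": "uri", "value": "http://example.org/concept/1"},
                "itis": {"type": "literal", "value": "559964"},
            },
            {
                # A row where the ITIS variable is unbound.
                "concept": {"type": "uri", "value": "http://example.org/concept/2"},
            },
        ]
    },
}


def results_to_csv(results):
    """Convert SPARQL JSON results into CSV text, one row per binding."""
    columns = results["head"]["vars"]
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(columns)
    for binding in results["results"]["bindings"]:
        # Unbound variables are absent from a binding; emit an empty cell.
        writer.writerow([binding.get(name, {}).get("value", "") for name in columns])
    return buffer.getvalue()
```

In a real script the dictionary would come from parsing the endpoint's JSON response, and the text returned by results_to_csv would be written to a file.<BR>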

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; - Pete<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; On Tue, May 31, 2011 at 5:16 PM, Peter DeVries &lt;pete.devries@gmail.com&gt; wrote:<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Hi Kevin,<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; On Tue, May 31, 2011 at 3:27 PM, Kevin Richards &lt;<BR>

&gt; &gt;&gt;&gt; RichardsK@landcareresearch.co.nz&gt; wrote:<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; This is exactly why this problem still exists and will be very complex<BR>

&gt; &gt;&gt;&gt; to<BR>

&gt; &gt;&gt;&gt; solve - everyone says &quot;we should have a single ID for a specific taxon<BR>

&gt; &gt;&gt;&gt; name,<BR>

&gt; &gt;&gt;&gt; there seems to be several IDs 'out there' that refer to the same taxon<BR>

&gt; &gt;&gt;&gt; name,<BR>

&gt; &gt;&gt;&gt; so I'm going to create another ID to link them all up&quot; - yet another ID<BR>

&gt; &gt;&gt;&gt; that<BR>

&gt; &gt;&gt;&gt; no one will particularly want to follow - you would have to get everyone<BR>

&gt; &gt;&gt;&gt; to<BR>

&gt; &gt;&gt;&gt; agree that your combinations/integration of taxon names is the best one<BR>

&gt; &gt;&gt;&gt; and<BR>

&gt; &gt;&gt;&gt; hope everyone follows it - unlikely in this domain.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Isn't this kind of what The Plant List and eBird already do?<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; A difference being that they tie these to a specific name and specific<BR>

&gt; &gt;&gt;&gt; classification.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; The Plant List is not really even open, so it is difficult for people to<BR>

&gt; &gt;&gt;&gt; adopt it en masse.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; For instance, if I manage a herbarium, how do I easily reconcile my<BR>

&gt; &gt;&gt;&gt; species<BR>

&gt; &gt;&gt;&gt; list with the entities represented in the Plant List?<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; eBird has millions of records which implies that they have been able to<BR>

&gt; &gt;&gt;&gt; convince the observers in the field to adopt their system. You are<BR>

&gt; &gt;&gt;&gt; correct<BR>

&gt; &gt;&gt;&gt; in that there are probably a lot of taxonomists that don't like their<BR>

&gt; &gt;&gt;&gt; list.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; It differs from many of the other classifications, but remember the<BR>

&gt; &gt;&gt;&gt; system<BR>

&gt; &gt;&gt;&gt; rewards them for not agreeing. Note the difference between the microbial<BR>

&gt; &gt;&gt;&gt; taxonomists and other taxonomists. In the case of the microbial<BR>

&gt; &gt;&gt;&gt; workers, the system rewards them for solving problems, not debating<BR>

&gt; &gt;&gt;&gt; alternatives. Also, if a good idea comes out that will make it easier<BR>

&gt; &gt;&gt;&gt; for<BR>

&gt; &gt;&gt;&gt; the microbiologists to solve the problems they are rewarded for solving,<BR>

&gt; &gt;&gt;&gt; they are less likely to care whose idea it is.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Like the microbiologists, there are lots of biologists that work with<BR>

&gt; &gt;&gt;&gt; species with the goal of addressing some non-taxonomic problem.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; They don't really care if the name is *Aedes triseriatus* or<BR>

&gt; &gt;&gt;&gt; *Ochlerotatus triseriatus*, but they do care that the identifier that<BR>

&gt; &gt;&gt;&gt; they connect their data to is stable.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; In regards to the issue of market forces, I suspect (but have no<BR>

&gt; &gt;&gt;&gt; knowledge of this) that there were probably decisions made in devising<BR>

&gt; &gt;&gt;&gt; these lists that have more to do with appeasing certain personalities<BR>

&gt; &gt;&gt;&gt; than creating the best list. With the way this system rewards people it<BR>

&gt; &gt;&gt;&gt; is likely that the &quot;correct&quot; version will float to the top only after<BR>

&gt; &gt;&gt;&gt; that person has passed away. I don't have much faith that the best<BR>

&gt; &gt;&gt;&gt; system will always float to the top. That has a lot to do with the<BR>

&gt; &gt;&gt;&gt; personalities and how the system rewards are set up. Theoretically, it<BR>

&gt; &gt;&gt;&gt; is possible for one strong personality or group to force others to<BR>

&gt; &gt;&gt;&gt; adopt their less than optimal solution - at least this seems to happen<BR>

&gt; &gt;&gt;&gt; in other environments.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Also, there are all sorts of ways that people can use the publication<BR>

&gt; &gt;&gt;&gt; record to rewrite history. Simply cite the review paper that cites the<BR>

&gt; &gt;&gt;&gt; original paper. Or don't cite it at all.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I would have used only the ITIS TSN but if the name changes the ID<BR>

&gt; &gt;&gt;&gt; changes.<BR>

&gt; &gt;&gt;&gt; This isn't &quot;wrong&quot;, it just does not solve my problem.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; * ITIS also should add the spiders from the World Spider Catalog.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Another issue that I think has inhibited adoption of a common list is<BR>

&gt; &gt;&gt;&gt; that<BR>

&gt; &gt;&gt;&gt; people can't agree on a particular name or a particular classification.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Since you can model a species concept as having many names and many<BR>

&gt; &gt;&gt;&gt; classifications why not do so?<BR>
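<BR>
Editorial sketch (not part of the original message): the idea of one species concept carrying many names and many classifications can be pictured with a small data structure. The class name, field names, the identifier "concept:example-1", and the "source A"/"source B" labels are all hypothetical; this is not how TaxonConcept.org is actually implemented. For simplicity each name is attached to a single concept here.<BR>

```python
from dataclasses import dataclass, field


@dataclass
class SpeciesConcept:
    """One stable identifier with any number of names and classifications."""

    concept_id: str  # hypothetical stable identifier
    names: set = field(default_factory=set)
    classifications: dict = field(default_factory=dict)  # lineage keyed by source

    def add_name(self, name):
        self.names.add(name)

    def add_classification(self, source, lineage):
        self.classifications[source] = list(lineage)


def build_name_index(concepts):
    """Map each name to the identifier of the concept it is attached to."""
    index = {}
    for concept in concepts:
        for name in concept.names:
            index[name] = concept.concept_id
    return index


# The two names from the thread, attached to one illustrative concept.
mosquito = SpeciesConcept("concept:example-1")
mosquito.add_name("Aedes triseriatus")
mosquito.add_name("Ochlerotatus triseriatus")
mosquito.add_classification("source A", ["Culicidae", "Aedes"])
mosquito.add_classification("source B", ["Culicidae", "Ochlerotatus"])

index = build_name_index([mosquito])
```

Data attached to the concept identifier stays put whichever name or classification a given source prefers, which is the stability argued for above.<BR>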

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; If this idea was originally accepted, I would not have needed to create<BR>

&gt; &gt;&gt;&gt; TaxonConcept.org.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; My plan has always been to get something that works to solve some<BR>

&gt; &gt;&gt;&gt; problems<BR>

&gt; &gt;&gt;&gt; and then let some larger group take it over.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; In a sense, I am more like the microbiologists in that I am not being<BR>

&gt; &gt;&gt;&gt; paid<BR>

&gt; &gt;&gt;&gt; to solve this or debate this problem.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; I am doing it because I think something like this is needed, and it is<BR>

&gt; &gt;&gt;&gt; an<BR>

&gt; &gt;&gt;&gt; interesting and personally rewarding puzzle.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; - Pete<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; My thoughts are that the most likely way this will be solved is by<BR>

&gt; &gt;&gt;&gt; standard market type pressures - i.e. the best solution/IDs will be<BR>

&gt; &gt;&gt;&gt; used the most and &quot;float&quot; to the top.&nbsp; It is easy to say that the<BR>

&gt; &gt;&gt;&gt; global taxon name data is a mess, but if you think about it, 30 years<BR>

&gt; &gt;&gt;&gt; ago taxon name data were very disparate, duplicated, unconnected, many<BR>

&gt; &gt;&gt;&gt; with NO IDs at all.&nbsp; So I believe we are making progress and that we<BR>

&gt; &gt;&gt;&gt; will continue to do so, albeit at a fairly slow rate.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; Kevin<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; &quot;I agree. This was one of the reasons that I set up TaxonConcept the<BR>

&gt; &gt;&gt;&gt; way I did. It attempts to connect both the LOD entities and the<BR>

&gt; &gt;&gt;&gt; foreign key based entities.&quot;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;&nbsp; Please consider the environment before printing this email<BR>

&gt; &gt;&gt;&gt; Warning:&nbsp; This electronic message together with any attachments is<BR>

&gt; &gt;&gt;&gt; confidential. If you receive it in error: (i) you must not read, use,<BR>

&gt; &gt;&gt;&gt; disclose, copy or retain it; (ii) please contact the sender immediately<BR>

&gt; &gt;&gt;&gt; by<BR>

&gt; &gt;&gt;&gt; reply email and then delete the emails.<BR>

&gt; &gt;&gt;&gt; The views expressed in this email may not be those of Landcare<BR>

&gt; Research<BR>

&gt; &gt;&gt;&gt; New<BR>

&gt; &gt;&gt;&gt; Zealand Limited. <A HREF="http://www.landcareresearch.co.nz">http://www.landcareresearch.co.nz</A><BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; --<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; ------------------------------------------------------------------------------------<BR>

&gt; &gt;&gt;&gt; Pete DeVries<BR>

&gt; &gt;&gt;&gt; Department of Entomology<BR>

&gt; &gt;&gt;&gt; University of Wisconsin - Madison<BR>

&gt; &gt;&gt;&gt; 445 Russell Laboratories<BR>

&gt; &gt;&gt;&gt; 1630 Linden Drive<BR>

&gt; &gt;&gt;&gt; Madison, WI 53706<BR>

&gt; &gt;&gt;&gt; Email: pdevries@wisc.edu<BR>

&gt; &gt;&gt;&gt; TaxonConcept &lt;<A HREF="http://www.taxonconcept.org/">http://www.taxonconcept.org/</A>&gt;&nbsp; &amp;<BR>

&gt; &gt;&gt;&gt; GeoSpecies&lt;<A HREF="http://about.geospecies.org/">http://about.geospecies.org/</A>&gt; Knowledge<BR>

&gt; &gt;&gt;&gt; Bases<BR>

&gt; &gt;&gt;&gt; A Semantic Web, Linked Open Data &lt;<A HREF="http://linkeddata.org/">http://linkeddata.org/</A>&gt;&nbsp; Project<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; --------------------------------------------------------------------------------------<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>


&gt; &gt;&gt;&gt;<BR>


&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; --<BR>

&gt; &gt;&gt;&gt; Steven J. Baskauf, Ph.D., Senior Lecturer<BR>

&gt; &gt;&gt;&gt; Vanderbilt University Dept. of Biological Sciences<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; postal mail address:<BR>

&gt; &gt;&gt;&gt; VU Station B 351634<BR>

&gt; &gt;&gt;&gt; Nashville, TN&nbsp; 37235-1634,&nbsp; U.S.A.<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; delivery address:<BR>

&gt; &gt;&gt;&gt; 2125 Stevenson Center<BR>

&gt; &gt;&gt;&gt; 1161 21st Ave., S.<BR>

&gt; &gt;&gt;&gt; Nashville, TN 37235<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt; office: 2128 Stevenson Center<BR>

&gt; &gt;&gt;&gt; phone: (615) 343-4582,&nbsp; fax: (615) 343-6707<BR>

&gt; &gt;&gt;&gt; <A HREF="http://bioimages.vanderbilt.edu">http://bioimages.vanderbilt.edu</A><BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;&gt;<BR>

&gt; &gt;&gt;<BR>

&gt; &gt;&gt;<BR>


&gt; &gt;&gt; _______________________________________________<BR>

&gt; &gt;&gt; tdwg-content mailing list<BR>

&gt; &gt;&gt; tdwg-content@lists.tdwg.org<BR>

&gt; &gt;&gt; <A HREF="http://lists.tdwg.org/mailman/listinfo/tdwg-content">http://lists.tdwg.org/mailman/listinfo/tdwg-content</A><BR>

&gt; &gt;&gt;<BR>

&gt; &gt;<BR>

&gt; &gt;<BR>

&gt; &gt;<BR>

&gt; &gt; ----------------------------------------------------------------------------<BR>

&gt; &gt; David Remsen, Senior Programme Officer<BR>

&gt; &gt; Electronic Catalog of Names of Known Organisms<BR>

&gt; &gt; Global Biodiversity Information Facility Secretariat<BR>

&gt; &gt; Universitetsparken 15, DK-2100 Copenhagen, Denmark<BR>

&gt; &gt; Tel: +45-35321472&nbsp;&nbsp; Fax: +45-35321480<BR>

&gt; &gt; Skype: dremsen<BR>

&gt; &gt; ----------------------------------------------------------------------------<BR>

&gt; &gt;<BR>

&gt; &gt;<BR>

&gt; &gt;<BR>


<BR>

<BR>


<BR>

</FONT>

</P>


</BODY>

</HTML>