On Aug 28, 2010, at 6:59 PM, Blum, Stan wrote:
Regarding #1, and assuming that this concerns molecular data where individuals often function as OTUs, I think it would be even more important for the long-term usefulness of the data to support the ability to reference the specimen with a resolvable GUID
...
On the other hand, a full name backed up by a URL or source/GUID would be a big improvement on codes and abbreviations
My main aim is to support data integration (rather than validation), and the two most important integrating variables for the foreseeable future (at least in my limited vision) are species name (or other taxonomic identifier) and geographic coordinates. These are important partly because the great mass of users outside of TDWG are committed to using the same kinds of species names and the same kinds of coordinates.
Most phylogeny information artefacts (e.g., files) out there don't have either one of these , so integrating phylogenetic information into the global web of data isn't going to get very far until we make it easy for users to put this information into their trees.
To the extent that the scientific community is committed in the same way to specimen identifiers, then this makes the problem simpler because the specimen source would become the integrating variable, and would mediate the integration of data by species or location (because the specimen would have a species and a location). But I don't think we are there yet.
Arlin