- tdwg-content - lists.tdwg.org

Key to the Pseudosaurs of the Natural History Museums of the World
by Bob Morris 27 Apr '05

27 Apr '05

A key to the Pseudosaurs of the Natural History Museums of the World has been released, with initial emphasis on specimens held by the Smithsonian Institution National Museum of Natural History. Please see http://aardvark.cs.umb.edu:8080/keys We welcome commentary and learned discussion at http://wiki.cs.umb.edu/twiki/bin/view/BDEI/Pseudosaurs Bob Morris -- Robert A. Morris Professor of Computer Science UMASS-Boston ram(a)cs.umb.edu http://www.cs.umb.edu/efg http://www.cs.umb.edu/~ram phone (+1)617 287 6466

1 0

SDD 1.0 beta 4
by Gregor Hagedorn 19 Mar '05

19 Mar '05

I have uploaded a new version of the SDD schema that is accessible from: http://wiki.cs.umb.edu/twiki/bin/view/SDD/CurrentSchemaVersion This upload is in accordance with the regulations of TDWG which require a final draft 180 days prior to the voting session (at TDWG 2005, St. Petersburg). I very urgently look forward to criticism. However at the moment only schema- wise people with a schema viewer are able to do this. Those of you looking for example files and generated html schema documentation - please bear with me a little longer, these items are not yet finished. All the best Gregor---------------------------------------------------------- Gregor Hagedorn (G.Hagedorn(a)bba.de) Institute for Plant Virology, Microbiology, and Biosafety Federal Research Center for Agriculture and Forestry (BBA) Königin-Luise-Str. 19 Tel: +49-30-8304-2220 14195 Berlin, Germany Fax: +49-30-8304-2203

1 0

LSID protocol handler for Mozilla/Firefox - oops
by Roderic D. M. Page 13 Dec '04

13 Dec '04

Of course, it would have helped if I'd included the address of the web site where the LSID protocol handler is available (duh!). The address is http://darwin.zoology.gla.ac.uk/~rpage/lsid/ . Regards Rod -- -------------------------------------------------------- Professor Roderic D. M. Page Editor Elect, Systematic Biology DEEB, IBLS Graham Kerr Building University of Glasgow Glasgow G12 8QP United Kingdom Phone: +44 141 330 4778 Fax: +44 141 330 2792 email: r.page(a)bio.gla.ac.uk web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html Subscribe to Systematic Biology through the Society of Systematic Biologists Website: http://systematicbiology.org

1 0

LSID protocol handler for Mozilla/Firefox
by Roderic D. M. Page 13 Dec '04

13 Dec '04

I've put together a very simple extension that enables Mozilla and Firefox to handle the "lsidres" protocol used in IBM's Launchpad to resolve LSIDs. The extension redirects the browser to the http://lsid.biopathways.org LSID resolver. Once the extension is installed, links such as lsidres://urn:lsid:ncbi.nlm.nih.gov.lsid.biopathways.org:pubmed:12441807 become clickable. This is a bit primitive compared to Launchpad, but means that people who don't use Internet Explorer 6 on Windows can have clickable LSIDs. The extension has been tested (I use that term loosely) on Windows 2000, Mac OS X, and Red Hat 8, with Firefox 0.9.2 - 1.0. Regards Rod -- -------------------------------------------------------- Professor Roderic D. M. Page Editor Elect, Systematic Biology DEEB, IBLS Graham Kerr Building University of Glasgow Glasgow G12 8QP United Kingdom Phone: +44 141 330 4778 Fax: +44 141 330 2792 email: r.page(a)bio.gla.ac.uk web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html Subscribe to Systematic Biology through the Society of Systematic Biologists Website: http://systematicbiology.org

1 0

LSIDs and taxonomic schema - an alternative view
by Roderic D. M. Page 15 Nov '04

15 Nov '04

Based on playing with LSIDs as part of the Taxonomic Search Engine (http://darwin.zoology.gla.ac.uk/~rpage/portal ), and following some of the recent efforts on developing taxonomic name schema (such as the LinneanCore), I've become concerned that I don't think the implications of LSIDs have been fully thought through. Specifically, I worry that by focusing on schema for names in their present format, the community is making more work for itself, and missing the point (and potential) of LSIDs. Metadata -------- LSIDs are not simply identifiers, they come with associated metadata. For example, the LSID urn:lsid:ipni.org.lsid.zoology.gla.ac.uk:Id:20012729-1 has associated metadata which can be viewed directly at http://ipni.org.lsid.zoology.gla.ac.uk/authority/metadata/?lsid=urn:lsid:ip… , or by using a LSID resolver (http://biopathways.ibm.nebiogrid.org/resolver/urn:lsid:ipni.org.lsid.zoolog… ). This metadata is in RDF (Resource Description Format), and provides information about the name, and links to other resources (via LSIDs). For example, the above record for *Poissonia heterantha*" has a link to its basionym, *Tephrosia heterantha*. We've been here before ----------------------- It seems to me that there is an assumption that all we do with LSIDs is stick them in an XML document as an identifier ("GUID"), and our work is done. I feel this rather misses the point. The key point is that, if we serve LSIDs we need to serve metadata about the names. So, we need a standard for the metadata. But, hang on, we've just spent energy on a standard for our data...? So, once a schema is agreed, someone then someone has to create a new schema for the metadata for LSIDs..? And, how do these schema relate...? Hmmmm. One response to these ideas might be to simply serve a document based on one of the current schema (such as the LinneanCore) as metadata. But I think is a poor solution that doesn't exploit the potential of RDF metadata. RDF --- RDF is very cool, in that there are tools that can take RDF and reason about them. For instance, given metadata for the LSIDs urn:lsid:ipni.org.lsid.zoology.gla.ac.uk:Id:20012728-1 (Poissonia heterantha), and urn:lsid:ipni.org.lsid.zoology.gla.ac.uk:Id:944651-1 (Coursetia heterantha), we can infer that these two names are synonyms, because they share the same basionym. If we use RDF, we get this kind of ability for "free." There is a lot of work in the semantic web, onotology, and bioinformatics communities about making inferences like this. Isn't this the kind of thing we want to do, rather than simply pass XML documents around? Wouldn't it be nice to take two or more LSIDs and workout their relationship, automatically (where possible). Client databases that stored LSIDs could work out whether names were synonyms in a standard way, without actually having to be told that the names are synonyms. A radical view -------------- Instead of developing schema for exchanging information that are specific to taxonomy, a radical approach would be to adopt LSIDs as identifiers and standardise the associated metadata (which would convey all the information about the name). By adopting RDF we can tap into a lot of existing work, as well as existing external standards (e.g., Dublin core, and the emerging use of LSIDs in bioinformatics). It also offers an opportunity to serve up information on taxonomic names in a much more useful form than a simple XML document. I wonder if an opportunity is being missed here. Regards Rod -- -------------------------------------------------------- Professor Roderic D. M. Page Editor Elect, Systematic Biology DEEB, IBLS Graham Kerr Building University of Glasgow Glasgow G12 8QP United Kingdom Phone: +44 141 330 4778 Fax: +44 141 330 2792 email: r.page(a)bio.gla.ac.uk web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html Subscribe to Systematic Biology through the Society of Systematic Biologists Website: http://systematicbiology.org Search for taxonomic names at http://darwin.zoology.gla.ac.uk/~rpage/portal

1 0

Taxonomic Search Engine - Now with "Did you mean"
by Roderic D. M. Page 11 Nov '04

11 Nov '04

Because I can't spell to save myself, I've added a "did you mean" feature to the Taxonomic Search Engine (http://darwin.zoology.gla.ac.uk/~rpage/portal/ ). You now have the option of asking for suggested spellings, based on a list of names I have stored on the server, as well as any suggestions offered by Google. For example, if you search on "Physeter catadon", you will be asked if you really meant "Physeter catodon" (note the "o" instead of the "a"). Both the spelling suggestion, and the name search itself are now available as web services. Documentation and example clients are available at the site (I hope to flesh these out as time permits). Regards Rod -- -------------------------------------------------------- Professor Roderic D. M. Page Editor Elect, Systematic Biology DEEB, IBLS Graham Kerr Building University of Glasgow Glasgow G12 8QP United Kingdom Phone: +44 141 330 4778 Fax: +44 141 330 2792 email: r.page(a)bio.gla.ac.uk web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html Subscribe to Systematic Biology through the Society of Systematic Biologists Website: http://systematicbiology.org

1 0

Web Service on ITIS
by Bob Morris 07 Nov '04

07 Nov '04

http://blade63.cs.umb.edu:13200/axis/ITIS has a limited web interface to an experimental SOAP Web Service against a copy of ITIS data running on an Oracle installation here. The WSDL is pointed to on that page, and we welcom experiments with clients other than our own. We don't keep the data particularly current, and the service is in use by my course, so don't expect production quality. But please do try to break it and send me mail if you succeed. So, where is the XML-Schema for ITIS query results? Hah, hah just serious. Bob Morris

1 0

Re: Globally Unique Identifier
by Richard Pyle 31 Oct '04

31 Oct '04

> Richard Pyle writes: > > > "a central question, which Donald included in his > > PowerPoint file, is whether the GUID is assigned to the physical > > object, or to the electronic representation (data record). Most of my > > comments have been from the standpoint that the GUID applies to the > > physical specimen. If it is the electronic records that we wish to > > uniquely identify, then it seems to me that the <objectID> component > > of an LSID should apply to the physical specimen, and multiple > > database records should be uniquely identified using the <version> > > component." > > I think this is not practical. Do you mean those GLOPP organism- > interaction-data that have specimen voucher information can not be > published/referenced in GBIF until I figure out whether a collection > has digitized them (most have never digitized elsewhere!)? Not necessarily. I don't think the issue is whether or not the collection has been digitized, but rather whether GUIDs have already been assigned to the vouchers you want to document in the GLOPP dataset. So, if your question is more along the lines of "do I need to check to see if GUIDs have already been issued to voucher specimens that I cite, before I issue new GUIDs", then my answer -- in the long run, at least -- would be, "well....yes!" That's sort of the fundamental point of the GUIDs, isn't it? But I don't see this as being necessarily burdensome. For example, if your GLOPP dataset included unambiguous pointers to specific voucher specimens (e.g., via InstitutionCode+CollectionCode+CatalogNumber), then it *should* be a relatively quick and straightforward process to find out if GUIDs have already been assigned (if it's not quick & easy, then the GUID service would be horribly inadequate!) If, on the other hand, the GLOPP dataset does not provide unambiguous pointers to specific voucher specimens, then the "vouchered" aspect of those specimen citations seems unsupported, in which case your GUIDs would need to be assigned to virtual/unvouchered "specimens" (analogous to observation records), and hence non-duplicate. > Or if I > find they have not been, when the collection starts to digitize them, > they would have to create for those that have already been published > in GLOPP use a new version of the GLOPP LSID? I would hope that if you assigned GUIDs to GLOPP-relevant voucher specimens that belong to a collection that is not-yet digitized, you would do the courtesy of providing the manager of that collection with a listing of the GUIDs you created for the specific relevant specimens. I would further hope that, when that collection is eventually digitized, the manager would have the wherewithal to assign new GUIDs only to those specimens that did not yet have them. But, as someone who has worked in a natural history collection for nearly two decades (and who bore witness to the transition of the collection from non-digitized to digitized), I certainly do understand the "realities" of this, and fully recognize that my optimistic perspective is likely to be overly idealistic. This is why I feel that duplicate assignment of GUIDs is inevitable (that is, two different numbers for one object; not duplicate GUIDs), and MUST be accommodated in any GUID system that is developed. My main point is that such "redundant" GUID issuance should be minimized (i.e., never done intentionally), and quickly/easily identified as such whenever it is discovered. So....if/when the situation does come up that (for example) GLOPP assigns GUIDs to vouchers on behalf of a non-digitized collection, and that collection later (inadvertently) re-assigns redundant GUIDs to the same set of specimens; that eventual discovery of this duplication should be accommodated by a mechanism for "retiring" one of the IDs into "objective synonomy" of the other ID, and automated systems should be implemented in the resolver service that "auto-forward" the retired ID to the active ID. If your question is more about whether the collection, when it later becomes digitized, should use the same <objectID> ID as was assigned for the GLOPP dataset, but qualify that same Object ID with a unique version number -- then my answer is, "I don't know". That is sort of the question I was trying to ask ('though I didn't ask it very effectively). Basically, I was suggesting that/asking whether it would make sense to pin the <ObjectID> portion of a GUID to the physical object, and using the <version> feature as a unique identifier to electronic representations thereof? > The same applies to taxonomic data - most revisions contain voucher > data. Same solution, I think. For the most part, though -- I see these as "growing pains" of a GUID system during its first years of existence. I would predict that two decades from now, if one were to do an analysis of redundant GUIDs, one would find the bulk of those having been issued relatively early on. Aloha, Rich

1 0

MultilingualDesignPattern
by Gregor Hagedorn 26 Oct '04

26 Oct '04

I try to finish the changes on SDD and UBIF. A major issue in my eyes is to get a final vote on the question how to go along with the MultilingualDesignPattern i.e. use Label/Representation or skip the outer collection container in this case? I apologize for reopening a question that was already debated a long time ago, but in the context of finding common ground with other biodiversity schemata (in UBIF) it seems to be one of the issues people tend to take up as a case of being to complicated. Can you comment and vote on: http://efgblade.cs.umb.edu/twiki/bin/view/UBIF/MultilingualDesignPatte rn Those not wanting to use the WIKI can send email back to me and I will collate their opinion on the WIKI. Many thanks! Gregor ---------------------------------------------------------- Gregor Hagedorn (G.Hagedorn(a)bba.de) Institute for Plant Virology, Microbiology, and Biosafety Federal Research Center for Agriculture and Forestry (BBA) Königin-Luise-Str. 19 Tel: +49-30-8304-2220 14195 Berlin, Germany Fax: +49-30-8304-2203

1 0

Re: Taxonomic Search Engine - Now with GenBank
by Roderic D. M. Page 26 Oct '04

26 Oct '04

This is next on the to do list. The site already makes extensive use of SOAP to talk to two of the source databases, I just need to tidy up the code a bit in preparation for making it a SOAP server. There are also performance issues to address, for which caching might help. Regards Rod >Nice. > >It would be great if you exposed this as a Web Service. There seem to be >several frameworks for doing this with PHP applications. > >Bob Morris > > >Roderic D. M. Page wrote: > >>The Taxonomic Search Engine I recently developed >>(http://darwin.zoology.gla.ac.uk/~rpage/portal/ ) now queries the >>GenBank taxonomy, in addition to ITIS, Index Fungorum, uBio, and >>IPNI. Although GenBank is not an authoritative source of taxonomic >>names, they do have many names not in these other databases. >> >>Regards >> >>Rod >> >>-- >>-------------------------------------------------------- >>Professor Roderic D. M. Page >>Editor Elect, Systematic Biology >>DEEB, IBLS >>Graham Kerr Building >>University of Glasgow >>Glasgow G12 8QP >>United Kingdom >> >> >>Phone: +44 141 330 4778 >>Fax: +44 141 330 2792 >>email: r.page(a)bio.gla.ac.uk >>web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html >>reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html >> >>Subscribe to Systematic Biology through the Society of Systematic >>Biologists Website: http://systematicbiology.org > > >-- >Robert A. Morris >Professor of Computer Science >UMASS-Boston >ram(a)cs.umb.edu >http://www.cs.umb.edu/efg >http://www.cs.umb.edu/~ram >phone (+1)617 287 6466 -- -------------------------------------------------------- Professor Roderic D. M. Page Editor Elect, Systematic Biology DEEB, IBLS Graham Kerr Building University of Glasgow Glasgow G12 8QP United Kingdom Phone: +44 141 330 4778 Fax: +44 141 330 2792 email: r.page(a)bio.gla.ac.uk web: http://taxonomy.zoology.gla.ac.uk/rod/rod.html reprints: http://taxonomy.zoology.gla.ac.uk/rod/pubs.html Subscribe to Systematic Biology through the Society of Systematic Biologists Website: http://systematicbiology.org

1 0