I thought I would explain how I have been marking up occurrence records in RDF in this blog post.

http://www.taxonconcept.org/taxonconcept-blog/2011/2/21/species-occurrence-records-represented-in-rdf.html

The quad store view of the different RDF entities mentioned in the blog can be seen here:

Via this bit.ly bundle http://bit.ly/fnlTZd

* Note that some of these links get change in email so you might want to use the bit.ly bundle above

The RDF page: http://lsd.taxonconcept.org/describe/?url=http://ocs.taxonconcept.org/ocs/f522444a-2dd9-400e-be59-47213ef38cb9.rdf

The Occurrence: http://lsd.taxonconcept.org/describe/?url=http://ocs.taxonconcept.org/ocs/f522444a-2dd9-400e-be59-47213ef38cb9%23Occurrence

The Area: http://lsd.taxonconcept.org/describe/?url=geo:44.86528100,-87.23147800;u=10

The Species Concept: http://lsd.taxonconcept.org/describe/?url=http://lod.taxonconcept.org/ses/ICmLC%23Species

The Individual: http://lsd.taxonconcept.org/describe/?url=http://ocs.taxonconcept.org/ocs/f522444a-2dd9-400e-be59-47213ef38cb9%23Individual

The Identification: http://lsd.taxonconcept.org/describe/?url=http://ocs.taxonconcept.org/ocs/f522444a-2dd9-400e-be59-47213ef38cb9%23Identification_0001

The Continent: http://lsd.taxonconcept.org/describe/?url=http://sws.geonames.org/6255149/

The County: http://lsd.taxonconcept.org/describe/?url=http://sws.geonames.org/5250768/


Although the exact format of these could be changed there are several existing advantages and some possible future advantages.

1) Each data provider could expose their own data to the web without a central authority.
     This could be in the form of RDF, RDFa, or as gzipped RDF dump files. (records.rdf.gz)
     They would be automatically discoverable on the LOD cloud.
     A third party service could add value by pointing to the various repositories, perhaps with an annotation system like or as part of CKAN. (http://ckan.net/)

2) Data validation in terms of being "well-formed" could be checked with the existing RDF and RDFa validators.

   http://www.w3.org/RDF/Validator/

   http://validator.w3.org/

3)  The next level of validation e.g:
      Is this a valid geographic coordinate?
      Does the coordinate match the geographical names listed *(You could even have a service that takes the geo area and determines the correct geographic names etc.)
      Does the scientific name match a known species?
      Are the other required fields present and well formed.
      Is this a new record for that species in that specific geographic area

      Would have to be done by a validator created for the community, but it would benefit from the existing tools.

4) All the third party LOD tools and services would be able to use these records.

Respectfully,

- Pete

--
---------------------------------------------------------------
Pete DeVries
Department of Entomology
University of Wisconsin - Madison
445 Russell Laboratories
1630 Linden Drive
Madison, WI 53706
TaxonConcept Knowledge Base / GeoSpecies Knowledge Base
About the GeoSpecies Knowledge Base
------------------------------------------------------------