[tdwg-content] Of Evidence and Individuals (Was Plea for competency questions)
Steve Baskauf
steve.baskauf at vanderbilt.edu
Wed Jul 27 16:50:24 CEST 2011
I would like to clarify that although RDF files were mentioned in my
previous post and although I've used some terms from the OWL world in my
post, the discussion we are having is not about OWL or RDF. It's about
addition of two classes to Darwin Core which is general-purpose and at
the moment doesn't even have any sort of guidelines for its use in RDF.
There has been a general criticism of the "Individual class" proposal
that it doesn't really "do anything" that is beneficial to the community
or that it cannot be defined in a way that makes it usable. The
examples that I gave involved darwin-sw and my website because they
aren't theoretical. They actually exist and (I believe) demonstrate
that the classes CollectionObject and BiologicalEntity (or Token and
IndividualOrganism) CAN actually serve a useful purpose. So at this
point I would rather not let the discussion get distracted by the
technical OWL and RDFS issues that Bob raises below - the discussion is
about John's BiologicalEntity and CollectionObject term proposals, not
technical problems with darwin-sw.
So keeping this on a general level, it is my understanding that classes
in DwC do not have any formal relationship to the terms that are listed
under them because DwC terms do not have ranges and domains. So what
then are DwC classes for? To quote the DwC Quick Reference guide
(http://rs.tdwg.org/dwc/terms/index.htm): "The terms are organized by
categories (in bold) in the index. The categories correspond to Darwin
Core terms that are classes (terms that have other terms to describe
them). The terms that describe a given class (the class properties)
appear in the list immediately below the name of the category in the
index." It seems to me that our goal should be to have a DwC class for
each type of basic "thing" that needs to be tracked in our community and
that the terms listed under each class should truly "describe a given
class" (i.e. actually be a property of instances of that class). Right
now, I think there is a general consensus that there is a problem with
the Occurrence class because not all of the properties under that class
actually describe Occurrences the way we've been talking about them.
John's proposal is one way to fix this problem. Is there a more
sensible way to organize terms within classes that fit the resources we
are tracking in our community? If so, then let's put it on the table.
It is my belief that if we fix some of the problems that have been
discussed over the last couple years, the DwC classes could be
rdfs:Class's and the terms listed under them in DwC could serve as
rdf:Property's of instances of those classes. But that is not a
necessity - one could just as easily say that the DwC classes are
"things" that deserve their own table in a database and that the terms
listed under that class are appropriate column headings in that table.
Rich has made some suggestions for a different way to define the classes
than what I suggested - that's great! If we had "Organism" and
"Documentation" classes, what would be the column headings in database
tables holding the metadata for resources in those classes? How would
the page http://rs.tdwg.org/dwc/terms/index.htm change if we followed
Rich's suggestions? How would we shift existing terms under the new
classes?
After thrashing for two years about what the classes should be part of
DwC and what those classes represent, it looks like we may be close to
some kind of a consensus that could actually be incorporated into the
standard. Let's not get distracted - there are ongoing efforts like
BiSciCol and work to clarify how RDF can be used in our community that
would only be hampered by that.
Steve
Bob Morris wrote:
> On Wed, Jul 27, 2011 at 12:52 AM, Steve Baskauf
> <steve.baskauf at vanderbilt.edu> wrote:
>
>> [...]
>> Well you could say this about any non-disjoint classes. Why not combine the
>> classes Teacher and ElectedOfficial if a person can be an instance of both?
>> We don't do that because there are instances where persons are Teachers and
>> not ElectedOfficials and other instance where people are ElectedOfficials
>> and not Teachers. Instances of the class Teacher share properties like
>> numberOfStudents and schoolOfEmployment, while instances of ElectedOfficial
>> share properties like votesReceived and yearElected.
>>
>
> OWL supports intersection and union. In OWL the set of things that
> are both a Teacher and an Elected Official would form a class. It
> would perhaps have fewer individuals in it than either class, but in
> OWL it is nevertheless a class. Similarly the set of things that are
> either a Teacher or an ElectedOfficial, though not necessarily both,
> is an OWL class. Note that
> http://code.google.com/p/darwin-sw/source/browse/trunk/dsw.owl is at
> this writing not an OWL ontology, as you will see if you try to put it
> into the Manchester OWL validator. Much of that failure is minor
> syntactic botches, but when you fix them, some issues remain about
> dsw's secondary goal to "(hopefully) making design choices that do not
> constrain full SW reasoning downstream".
>
> RDFS does not guarantee that the intersection of two classes is a
> class, but allows you to avow that in particular cases if you wish,
> whereas OWL always requires it. More to the point, RDFS allows you to
> conceptually think of classes and properties as having the same kind
> of structure, so that you can effectively talk about intersecting
> properties, e.g. define a property that is true for those individuals
> for whom two particular properties are true. This is discussed on p.
> 135 of [1], though you won't like the word "infer" used there. Chapter
> 8 of [1] introduces a subset RDFS-Plus of OWL that is largely
> motivated by being content with the expressiveness of SPARQL, which I
> guess fits in your comfort zone, especially as to "AND". My
> understanding of LOD is that its community is indeed content with the
> expressiveness of SPARQL for now, and dsw probably corresponds to the
> approach of Chapter 8 in a way that meets your primary goals for dsw.
>
> In summary, I suspect that you are struggling about modeling
> individuals because:
> a. You believe in your heart that there is a fundamental difference
> between properties and classes but
> b. RDFS does not require that of one's models and
> c. dsw is expressed in RDFS and not in a modeling language that
> requires you to abandon a.
>
>
> [1] Dean Alemang and Jim Hendler "Semantic Web for the Working
> Ontologist", 2nd Edition 2011
>
>
> Bob
>
>
>
>
--
Steven J. Baskauf, Ph.D., Senior Lecturer
Vanderbilt University Dept. of Biological Sciences
postal mail address:
VU Station B 351634
Nashville, TN 37235-1634, U.S.A.
delivery address:
2125 Stevenson Center
1161 21st Ave., S.
Nashville, TN 37235
office: 2128 Stevenson Center
phone: (615) 343-4582, fax: (615) 343-6707
http://bioimages.vanderbilt.edu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.tdwg.org/pipermail/tdwg-content/attachments/20110727/b5a2fa6f/attachment.html
More information about the tdwg-content
mailing list