<div dir="ltr"><span id="docs-internal-guid-3724c683-f092-4b6d-9cdc-ec765aac9a3c"><p style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">Since the original proposal was from a group of folks, we decided to put our heads together to construct a general response to the various issues and ideas expressed on this thread. </span></p>
<p style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></p><p style="line-height:1.15;margin-top:0pt;margin-bottom:0pt">
<span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">John Deck for Rob Guralnick, Ramona Walls, and John Wieczorek</span></p><p style="line-height:1.15;margin-top:0pt;margin-bottom:0pt">
<span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></p><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt">
<span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">In the text of the issue submitted for MaterialSample (</span><a href="https://code.google.com/p/darwincore/issues/detail?id=167" style="text-decoration:none"><span style="font-size:15px;font-family:Arial;background-color:transparent;text-decoration:underline;vertical-align:baseline;white-space:pre-wrap">https://code.google.com/p/darwincore/issues/detail?id=167</span></a><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">) we noted cases where the current basisOfRecord terms pertaining to the Occurrence class (Occurrence, PreservedSpecimen, LivingSpecimen, FossilSpecimen, HumanObservation, MachineObservation) do not adequately cover certain cases, including: environmental sample (for metagenomic analysis), transcriptomes (measuring genes but not taxa), and destructive samples (e.g. tissues destructively sampled in order to generate genomic DNA). The term we borrowed from OBI (<a href="http://purl.obolibrary.org/obo/OBI_0000747">http://purl.obolibrary.org/obo/OBI_0000747</a>) is broad enough to be utilized across various cases that fulfill our criteria while still maintaining a consistent, clear and human understandable meaning. 
For our purposes, we can think of “Material Sample” as any type of matter that we can use in order to derive further evidence needed for identifications and taxa, whether it is taxonomically homogeneous, heterogeneous,</span><span style="background-color:rgb(255,255,255)"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap"> </span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap">a single individual</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap">,</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);vertical-align:baseline;white-space:pre-wrap"> </span></span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">sets of individuals, or populations. </span></p>
<br><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"></span><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">How is MaterialSample different from Individual? The intent of individualID is fairly clear: since an Occurrence represents an organism at a place and time (per Markus’ email), the individualID term allows us to assign an instance identifier for a particular organism that can be present in multiple events. MaterialSampleID, on the other hand, is intended to allow users to say that the basis of an occurrence is a material entity (i.e. matter) that has been sampled according to some particular method. Whether or not this material entity is an individual (sensu individualID in DwC) is an independent axis of classification. As was already pointed out, there is no restriction on specifying that an occurrence is associated with more than one type, so any occurrence can have both an individualID and a materialSampleID.</span></p>
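<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">As a rough illustration of that last point, here is a minimal Python sketch of a record that can carry both identifiers. It is not part of Darwin Core or of the proposal itself: the class OccurrenceRecord, its field names, and all identifier values are hypothetical, and the only things taken from the discussion are the basisOfRecord values and the two ID terms.</span></p>

```python
from dataclasses import dataclass
from typing import Optional

# basisOfRecord values named in this message, plus the proposed MaterialSample.
BASIS_OF_RECORD_VALUES = frozenset({
    "Occurrence", "PreservedSpecimen", "LivingSpecimen", "FossilSpecimen",
    "HumanObservation", "MachineObservation", "MaterialSample",
})


@dataclass(frozen=True)
class OccurrenceRecord:
    """Hypothetical flat record; field names are illustrative, not DwC-normative."""
    occurrence_id: str
    basis_of_record: str
    individual_id: Optional[str] = None        # the organism, if there is a single one
    material_sample_id: Optional[str] = None   # the sampled matter, if any

    def __post_init__(self) -> None:
        # Reject values outside the controlled list used in this sketch.
        if self.basis_of_record not in BASIS_OF_RECORD_VALUES:
            raise ValueError(f"unknown basisOfRecord: {self.basis_of_record}")


# Two yearly tissue samples from one tree: same individual, different samples.
tissue_2012 = OccurrenceRecord("occ-1", "MaterialSample",
                               individual_id="tree-1", material_sample_id="ms-1")
tissue_2013 = OccurrenceRecord("occ-2", "MaterialSample",
                               individual_id="tree-1", material_sample_id="ms-2")

# An environmental sample for metagenomic analysis: no single individual applies.
soil_sample = OccurrenceRecord("occ-3", "MaterialSample", material_sample_id="ms-3")
```

<p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">In this sketch the two identifiers are independent optional fields, which is one simple way to express that "is this an individual?" and "is this a material sample?" are separate axes of classification.</span></p>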
<br><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"></span><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">We maintain our position on the proposal for MaterialSample as a value for the basisOfRecord, with an associated materialSampleID to identify instances of them. Per Steve’s initial comments, we have already withdrawn the proposal for a MaterialSample class distinct from that in the Darwin Core type vocabulary, which should make it easier to evaluate the implications of what we’re discussing. </span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:rgb(0,255,255);vertical-align:baseline;white-space:pre-wrap"></span></p>
<br><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"></span><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">********************</span></p>
<br><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"></span><p dir="ltr" style="line-height:1.15;margin-top:0pt;margin-bottom:0pt"><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">NOTES, MaterialSample from OBI:</span></p>
<br><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"></span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">OBI has fairly broad definitions of samples and specimens that are meant to be utilized across many different scientific activities. Material Sample is defined as a “</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-style:italic;vertical-align:baseline;white-space:pre-wrap">material entity that has the material sample role</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">”, while a material sample role is defined as “</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-style:italic;vertical-align:baseline;white-space:pre-wrap">a specimen role borne by a material entity that is the output of a material sampling process</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">”, and a material sampling process is “</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-style:italic;vertical-align:baseline;white-space:pre-wrap">a specimen gathering process with the objective to obtain a specimen that is representative of the input material entity</span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap">”. </span></span><br>
<div><span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></span></div><div><span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br>
</span></span></div><div><span><span style="font-size:15px;font-family:Arial;color:rgb(0,0,0);background-color:transparent;vertical-align:baseline;white-space:pre-wrap"><br></span></span></div></div><div class="gmail_extra">
<br><br><div class="gmail_quote">On Mon, May 27, 2013 at 11:59 PM, Richard Pyle <span dir="ltr"><<a href="mailto:deepreef@bishopmuseum.org" target="_blank">deepreef@bishopmuseum.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi Markus,<br>
<br>
Great question! Particularly because this is exactly the sort of use case<br>
we designed our model around.<br>
<div class="im"><br>
> if you take a tissue sample of the same tree every year, would the<br>
identifier<br>
> in individualID be the same for all of them or be different? WIth the<br>
current<br>
> dwc:individualID definition it would be the same for all samples. If I<br>
> understand you correct each sample would have its own "individual"<br>
> identifier in your proposal? It can't see how you can collapse these two<br>
things<br>
> into one definition.<br>
<br>
</div>No, that is not how we would handle it.<br>
<br>
In our model, there would be one IndividualID to represent the tree,<br>
spanning the time period beginning (more or less) when the seed was<br>
germinated, until the time at which the entire physical structure of the<br>
tree was disintegrated. It is an individual tree.<br>
<br>
There would be multiple Occurrence instances, for each time that someone<br>
observed or sampled or otherwise wished to document some condition of that<br>
tree. All of these Occurrence instances would refer to the same individualID<br>
value (i.e., the "tree"). In the example above, this means there would be a<br>
different Occurrence instance for each year that a sample is taken --<br>
because in each case, an assertion that the full tree existed at a certain<br>
time and place can be made (I understand that trees tend not to move around<br>
very much, so the Location for each event associated with each Occurrence<br>
would, in this case, remain the same; but the other Event properties -- such<br>
as eventID, samplingProtocol, samplingEffort, eventDate, eventTime,<br>
startDayOfYear, endDayOfYear, year, month, day, verbatimEventDate, habitat,<br>
fieldNumber, fieldNotes, eventRemarks -- would be documented accordingly for<br>
each sampling Occurrence instance).<br>
<br>
Suppose that the tree is visited every month, but only sampled once per<br>
year. In that case, there would be an Occurrence record for every monthly<br>
visit. In other words, an Occurrence instance exists regardless of whether<br>
a physical sample was made or not. Any in-situ images made of the tree<br>
would likewise be associated with the specific Occurrence instance, and each<br>
image would represent a separate instance of "Evidence".<br>
<br>
Now, let's focus on the annual samplings. Every time a new sample is taken<br>
from the tree, at least one new instance of Individual (with a unique<br>
individualID value) is created to represent the sample. This sample<br>
(individual instance) may be a "gathering" (set of multiple individual<br>
specimens gathered at the same time), or it may be a single specimen, or it<br>
may be simply a tissue sample intended for destructive analysis. In any<br>
case, it's a new individual instance derived from the "parent" individual<br>
instance representing the whole tree. In our implementation, "Individual"<br>
can be hierarchical, such that a whole-organism tree can be sub-sampled with<br>
many "child" instances of "gatherings" (say, one gathering each year), and<br>
each gathering may have multiple child "specimen" individuals (e.g.<br>
individual botanical sheets created from the multiple items of a single<br>
gathering), and each specimen may have further "child" subsamples extracted<br>
for DNA analysis (or whatever), and the hierarchy can continue on down to<br>
whatever derivatives that people feel a need to keep track of (e.g.,<br>
aliquot).<br>
<br>
The point is, all Individual instances are well-defined physical objects (or<br>
well-defined sets of physical objects), and they can be arranged in an<br>
n-tiered hierarchy.<br>
<br>
Moreover, each Individual that can be characterized as a "sample" (what we<br>
refer to as a "CollectionObject") may also have a property value for<br>
"CollectionOccurrenceID" -- which refers to the specific Occurrence instance<br>
at which the sample was obtained.<br>
<br>
So, for example, if the tree is visited on May 27, 2013 and a specimen<br>
(sample) is taken, then:<br>
1) An Event instance is generated to represent the event where the tree was<br>
visited;<br>
2) An Occurrence instance is generated, which refers to the new EventID, and<br>
the existing IndividualID for the whole tree, and includes whatever other<br>
Occurrence properties are relevant for the tree at the time of this<br>
Occurrence;<br>
3) An Individual instance is generated for the specimen, which has a<br>
property value for parentIndividualID that refers to the individualID for<br>
the whole tree, and a property value for collectionOccurrenceID that refers<br>
to the Occurrence instance where the specimen was collected.<br>
<br>
So, to summarize the answer to your question:<br>
- There are multiple Occurrence instances that refer to the same Individual<br>
instance representing the whole tree (and, hence can be collapsed to the<br>
same IndividualID value).<br>
- Any Individual can have derivatives that are themselves unique Individual<br>
instances.<br>
- Individuals are arranged hierarchically, and certain properties can be<br>
inherited up or down the hierarchy, depending on the properties and their<br>
associated logical constraints.<br>
<br>
At some point, I will assemble a set of other specific use cases, and how we<br>
manage them through our use of the "Individual" instance (although I will<br>
probably not use the word "Individual", as this seems to cause too much<br>
confusion in these discussions).<br>
<br>
Aloha,<br>
Rich<br>
<br>
</blockquote></div><br><br clear="all"><div><br></div>-- <br>John Deck<br>(541) 321-0689<br>
</div>