Ebird and NCBI have just genus and specific epithet, most publications have just the genus and specific epithet.<div><br></div><div>That is what you have to match against first and then try to determine what is the most appropriate or intended authority.</div>
<div><br></div><div>Essentially you have a has many relationship with </div><div><br></div><div>scientificName hasMany authorities (authorship strings)</div><div><br></div><div>Also what is not made clear in your earlier example is that</div>
<div><br></div><div>Every <meta charset="utf-8"><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">scientificName: Lobelia spicata var. spicata</span></div>
<div><font class="Apple-style-span" color="#500050" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse;"><br></span></font></div><div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; ">is an instance of </span></font></div>
<div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse;"><br></span></font></div><div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><meta charset="utf-8"><span class="Apple-style-span" style="font-size: 13px; ">scientificName: Lobelia spicata</span></span></font></div>
<div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><br></span></font></div><div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; ">In relation to occurrence records you will have specimens of <meta charset="utf-8"><span class="Apple-style-span" style="font-size: 13px; color: rgb(80, 0, 80); ">Lobelia spicata var. spicata</span><span class="Apple-style-span" style="font-size: 13px; "> that were</span></span></font></div>
<div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; ">identified as <meta charset="utf-8"><span class="Apple-style-span" style="font-size: 13px; ">Lobelia spicata.</span></span></font></div>
<div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><span class="Apple-style-span" style="font-size: 13px; "><br></span></span></font></div><div>
<font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><span class="Apple-style-span" style="font-size: 13px; ">This should be done in away where those searching for specimens etc of </span></span></font><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; ">Lobelia spicata also get</span></div>
<div><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; ">those entries labeled </span><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">Lobelia spicata var. spicata and </span><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; color: rgb(80, 0, 80); ">Lobelia spicata ssp. spicata</span><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; "> etc.</span></div>
<div><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; "><br></span></div><div><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; ">Respectfully,</span></div>
<div><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; "><br></span></div><div><span class="Apple-style-span" style="font-family: arial, sans-serif; font-size: 13px; border-collapse: collapse; ">- Pete</span></div>
<meta charset="utf-8"><meta charset="utf-8"><meta charset="utf-8"><div><font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><br></span></font></div><div>
<font class="Apple-style-span" face="arial, sans-serif"><span class="Apple-style-span" style="border-collapse: collapse; "><br></span></font><br><div class="gmail_quote">On Thu, Dec 9, 2010 at 10:54 AM, Richard Pyle <span dir="ltr"><<a href="mailto:deepreef@bishopmuseum.org">deepreef@bishopmuseum.org</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div lang="EN-US" link="blue" vlink="purple"><div><p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D">They cannot provide a verbatimScientificName???? That would imply they have no text field whatsoever.</span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D"> </span></p><div style="border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt"><div><div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-size:10.0pt">From:</span></b><span style="font-size:10.0pt"> Peter DeVries [mailto:<a href="mailto:pete.devries@gmail.com" target="_blank">pete.devries@gmail.com</a>] <br><b>Sent:</b> Thursday, December 09, 2010 6:47 AM<br>
<b>To:</b> Richard Pyle<br><b>Subject:</b> Re: [tdwg-content] proposed term: dwc:verbatimScientificName</span></p></div></div><div><div></div><div class="h5"><p class="MsoNormal"> </p><p class="MsoNormal">So basically what you are saying is that the entire NCBI taxonomy database as well as the ebird database cannot output the required format.</p>
<div><p class="MsoNormal"> </p></div><div><p class="MsoNormal" style="margin-bottom:12.0pt">- Pete</p><div><p class="MsoNormal">On Thu, Dec 9, 2010 at 9:44 AM, Richard Pyle <<a href="mailto:deepreef@bishopmuseum.org" target="_blank">deepreef@bishopmuseum.org</a>> wrote:</p>
<div><div><p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D">I think this is *<b>exactly</b>* the right solution. I would go further to make it clear that:</span></p><p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D"> </span></p>
<p><span style="font-size:11.0pt;color:#1F497D">-</span><span style="font-size:7.0pt;color:#1F497D"> </span><span style="font-size:11.0pt;color:#1F497D">verbatimScientificName is the required field (with scientificName and scientificNameAuthorship as optional)</span></p>
<p><span style="font-size:11.0pt;color:#1F497D">-</span><span style="font-size:7.0pt;color:#1F497D"> </span><span style="font-size:11.0pt;color:#1F497D">When a source database maintains separate fields corresponding to scientificName and scientificNameAuthorship, they should be concatenated (with a single space between them) to form the required verbatimScientificName</span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D"> </span></p><p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D">Aloha,</span></p><p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D">Rich</span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:#1F497D"> </span></p><div style="border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt"><div><div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-size:10.0pt">From:</span></b><span style="font-size:10.0pt"> <a href="mailto:tdwg-content-bounces@lists.tdwg.org" target="_blank">tdwg-content-bounces@lists.tdwg.org</a> [mailto:<a href="mailto:tdwg-content-bounces@lists.tdwg.org" target="_blank">tdwg-content-bounces@lists.tdwg.org</a>] <b>On Behalf Of </b>David Remsen (GBIF)<br>
<b>Sent:</b> Wednesday, December 08, 2010 6:10 AM<br><b>To:</b> <a href="mailto:tdwg-content@lists.tdwg.org" target="_blank">tdwg-content@lists.tdwg.org</a> List<br><b>Subject:</b> [tdwg-content] proposed term: dwc:verbatimScientificName</span></p>
</div></div><div><div><p class="MsoNormal"> </p><div><p class="MsoNormal">Markus and I wanted to try to consolidate the issues related to the current use and definition of scientificName that have been the focus of last weeks discussion in as simple a way as we can and leave it with a simple proposal which we will add to the issue tracking on the project site.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">1. We propose that a new term, dwc:verbatimScientificName carry the existing definition for dwc:scientificName and </p></div><div><p class="MsoNormal">2. dwc:scientificName follow the more accepted convention that is better represented by the earlier proposed definition for Canonical Name</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">The intention is to enable data publishers to distinguish unparsed, complex scientific names from more cleanly separated scientific name data. This will relieve consumers of these data from testing each instance of a name for one of these two conditions.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Here are the definitions for the two existing terms that have been part of the discussion:</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">
<b>dwc:scientificName </b> - The full scientific name, with authorship and date information if known. When forming part of an Identification, this should be the name in lowest level taxonomic rank that can be determined. This term should not contain identification qualifications, which should instead be supplied in the IdentificationQualifier term.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal"><b>dwc:scientificNameAuthorship</b> - The authorship information for the scientificName formatted according to the conventions of the applicable nomenclaturalCode.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Here are terms and definitions used in the following 5 source data configurations we came up with. They don't have to be exact for this purpose.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal"><b>canonical name</b> - The nomenclatural components of a scentific name without authorship information.</p></div><div><p class="MsoNormal"><b>authorship</b> - the authorship information that follows a scientific name</p>
</div><div><p class="MsoNormal"><b>verbatim name</b> - the verbatim text stored in a source database when it differs from or combines the two definitions above. This is a bit more broad than the def for scientificName.</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">We identified the following configurations in a source database and how they would be mapped to the existing terms. In cases 4 and 5 we also propose how we would map these were there a 3rd available term (called 'mapping b:')</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">When a source database contains:</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">1. canonical names only</p></div><div><p class="MsoNormal">
</p></div><div><p class="MsoNormal">Mapping: canonical name -> dwc:scientificName </p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">2. canonical name and authorship in two fields</p></div><div>
<p class="MsoNormal"> </p></div><div><p class="MsoNormal">Mapping: canonical name -> dwc:scientificName / authorship->dwc:scientificNameAuthorship</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">
3. verbatim name only</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Mapping: verbatim name -> dwc:scientificName</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">4. all three: canonical name, authorship, and verbatim name in 3 diff. columns </p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Mapping a: verbatim name -> dwc:scientificName / authorship->dwc:scientificNameAuthorship</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">
Mapping b: canonical name -> dwc:scientificName / authorship->dwc:scientificNameAuthorship / verbatim name -> dwc:verbatimScientificName</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">5. a mix of canonical and verbatim names in a single column</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Mapping a: verbatim name + canonical names -> dwc:scientificName </p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Mapping b: verbatim name + canonical names -> dwc:verbatimScientificName </p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Summary - with the current two terms are left with no choice but to support both canonical and verbatim names in a single term, which makes consuming these data difficult. </p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">We propose that a new term, dwc:verbatimScientificName carry the existing definition for dwc:scientificName and that dwc:scientificName follow the more accepted convention that is better represented by the definition for Canonical Name</p>
</div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal">Best,</p></div><div><p class="MsoNormal">David Remsen / Markus Döring</p></div><div><p class="MsoNormal"> </p></div><div><p class="MsoNormal"> </p></div>
<div><p class="MsoNormal"> </p></div></div></div></div></div></div><p class="MsoNormal" style="margin-bottom:12.0pt"><br>_______________________________________________<br>tdwg-content mailing list<br><a href="mailto:tdwg-content@lists.tdwg.org" target="_blank">tdwg-content@lists.tdwg.org</a><br>
<a href="http://lists.tdwg.org/mailman/listinfo/tdwg-content" target="_blank">http://lists.tdwg.org/mailman/listinfo/tdwg-content</a></p></div><p class="MsoNormal"><br><br clear="all"><br>-- <br>---------------------------------------------------------------<br>
Pete DeVries<br>Department of Entomology<br>University of Wisconsin - Madison<br>445 Russell Laboratories<br>1630 Linden Drive<br>Madison, WI 53706<br><a href="http://www.taxonconcept.org/" target="_blank">TaxonConcept Knowledge Base</a> / <a href="http://lod.geospecies.org/" target="_blank">GeoSpecies Knowledge Base</a><br>
<a href="http://about.geospecies.org/" target="_blank">About the GeoSpecies Knowledge Base</a><br>------------------------------------------------------------</p></div></div></div></div></div></div></blockquote></div><br>
<br clear="all"><br>-- <br>---------------------------------------------------------------<br>Pete DeVries<br>Department of Entomology<br>University of Wisconsin - Madison<br>445 Russell Laboratories<br>1630 Linden Drive<br>
Madison, WI 53706<br><a href="http://www.taxonconcept.org/" target="_blank">TaxonConcept Knowledge Base</a> / <a href="http://lod.geospecies.org/" target="_blank">GeoSpecies Knowledge Base</a><br><a href="http://about.geospecies.org/" target="_blank">About the GeoSpecies Knowledge Base</a><br>
------------------------------------------------------------<br>
</div>