Reverting the process of DwC standardization
Dear colleagues,
Here on SiBBr we're using the new eventCore and measurementOrFacts and after the process of standardization to DwC and publishing we think some users/researchers will want the "original" table format because of multiple reasons.
Is possible to have a vertabimTable or some place where we can store the original table/column format?
Regards
David,
It's certainly possible, within the context of a Darwin Core Archive, to include other files within the ZIP file that lie outside the schema of the archive. Both GBIF and iDigBio do this when generating downloads for various reasons (RIGHTS & LICENSE files, additional EML metadata, etc). However, I do not believe it is possible to do this within IPT. You might submit an issue on the IPT issue tracker (https://github.com/gbif/ipt/issues) for potential inclusion of this feature in a future version of IPT.
There are workarounds you can use to include additional data in Darwin Core archives, but none of them will exactly match your old format. For instance, including an additional Occurrence file with the values as JSON in dynamicProperties or in some other verbatim format in the occurrenceRemarks field. Both of those would at least give some method of single-row access (vs joining multiple measurementOrFacts to a single event id) if that is the primary concern, even if they would require additional parsing steps to be useful.
Alex Thompson iDigBio Infrastructure
On 10/23/2015 09:40 AM, David Valentim Dias wrote:
Dear colleagues,
Here on SiBBr we're using the new eventCore and measurementOrFacts and after the process of standardization to DwC and publishing we think some users/researchers will want the "original" table format because of multiple reasons.
Is possible to have a vertabimTable or some place where we can store the original table/column format?
Regards
--
ass_sibbr
*David Valentim Dias*
*Biodiversity Data Management*
SiBBr | MCTI | PNUMA (UNEP)
Phone: +55 61 3329 6045 Phone: +55 61 9359 6151 Skype: ctenidae
dvdias@sibbr.gov.br mailto:dvdias@sibbr.gov.br
www.sibbr.gov.br http://www.sibbr.gov.br/
tdwg-content mailing list tdwg-content@lists.tdwg.org http://lists.tdwg.org/mailman/listinfo/tdwg-content
Hi David (CC’ing the IPT list as this might be an IPT specific thread - http://lists.gbif.org/mailman/listinfo/ipt http://lists.gbif.org/mailman/listinfo/ipt)
For clarification - is your question specific to the DwC-A standard which is possible as Alex says or is it specific to the IPT tool please?
Do you imagine a scenario where you’d effectively map the same extension 2 times - once to interpreted and once to verbatim - or do you envisage a different data schema for each?
Thanks, Tim
On 23 Oct 2015, at 16:00, Alex Thompson godfoder@acis.ufl.edu wrote:
David,
It's certainly possible, within the context of a Darwin Core Archive, to include other files within the ZIP file that lie outside the schema of the archive. Both GBIF and iDigBio do this when generating downloads for various reasons (RIGHTS & LICENSE files, additional EML metadata, etc). However, I do not believe it is possible to do this within IPT. You might submit an issue on the IPT issue tracker (https://github.com/gbif/ipt/issues https://github.com/gbif/ipt/issues) for potential inclusion of this feature in a future version of IPT.
There are workarounds you can use to include additional data in Darwin Core archives, but none of them will exactly match your old format. For instance, including an additional Occurrence file with the values as JSON in dynamicProperties or in some other verbatim format in the occurrenceRemarks field. Both of those would at least give some method of single-row access (vs joining multiple measurementOrFacts to a single event id) if that is the primary concern, even if they would require additional parsing steps to be useful.
Alex Thompson iDigBio Infrastructure
On 10/23/2015 09:40 AM, David Valentim Dias wrote:
Dear colleagues,
Here on SiBBr we're using the new eventCore and measurementOrFacts and after the process of standardization to DwC and publishing we think some users/researchers will want the "original" table format because of multiple reasons.
Is possible to have a vertabimTable or some place where we can store the original table/column format?
Regards
--
David Valentim Dias
Biodiversity Data Management
SiBBr | MCTI | PNUMA (UNEP)
Phone: +55 61 3329 6045 Phone: +55 61 9359 6151 Skype: ctenidae
dvdias@sibbr.gov.br mailto:dvdias@sibbr.gov.br www.sibbr.gov.br http://www.sibbr.gov.br/
tdwg-content mailing list tdwg-content@lists.tdwg.org mailto:tdwg-content@lists.tdwg.org http://lists.tdwg.org/mailman/listinfo/tdwg-content http://lists.tdwg.org/mailman/listinfo/tdwg-content
tdwg-content mailing list tdwg-content@lists.tdwg.org http://lists.tdwg.org/mailman/listinfo/tdwg-content
participants (3)
-
Alex Thompson
-
David Valentim Dias
-
Tim Robertson