7.1 Map data fields to Darwin Core
There are many possible ways of setting up your datasheets, and if you are new to OBIS you likely did not use standardized Darwin Core (DwC) or BODC vocabulary before samples were collected. In mapping your data fields to DwC we recommend documenting your choices so you have a reference to go back to should the need arise. In such a document you should take notes on the choices you made, as well as any actions you had to take (e.g. separate one column into many, convert dates or coordinates, etc.).
For example, a DwC mapping reference table could look like the following:
Verbatim field name | Mapped DwC term | Actions taken | Notes |
---|---|---|---|
date | eventDate | convert dates to ISO | |
coordinates | decimalLongitude, decimalLatitude | convert ddmmss to decimal degrees, separated one column into 2 for longitude and latitude | put original coordinates into verbatimCoordinates |
In order to help you map your data to DwC terms, we have provided the table below which outlines some common data fields, their associated Darwin Core vocabulary, and which data table the field is likely to go in:
Common Raw Terms | DwC Field | Data table |
---|---|---|
Date, Time | eventDate | Event, Occurrence |
Species, g_s, taxa | scientificName | Occurrence |
Any biotic/abiotic measurements* | measurementType, measurementValue, measurementUnit* | eMoF |
Depth | maximumDepthInMeters or minimumDepthInMeters | Event, Occurrence |
Lat/Latitude, Lon/Long/Longitude, dd | decimalLatitude, decimalLongitude | Event, Occurrence |
Sampling method | samplingProtocol | Event, eMoF |
Sample size, N, #, No. | sampleSizeValue | Event, eMoF |
Location | locality | Event |
Presence, absence | occurrenceStatus | Occurrence |
Type of record/ specimen | basisofRecord | Occurrence |
Person/ people that recorded the original Occurrence | recordedBy | Occurrence |
OrcID of person/ people that recorded the original Occurrence | recordedByID | Occurrence |
Person/ people that identified the organism | identifiedBy | Occurrence |
OrcID of person/ people that identified the organism | identifiedByID | Occurrence |
Data collector, data creator | recordedBy | Event, Occurrence |
Taxonomist, identifier | identifiedBy | Occurrence |
Record number, sample number, observation number | occurrenceID (either ID or incorporated into ID) | Occurrence |
Note that mapping abiotic/biotic measurement fields (sex, temperature, abundance, lengths, etc.) will occur within the extendedMeasurementOrFact extension. Here this data will go from being a separate column to being condensed into the measurementType
and measurementValue
fields.
The obistools
R package also has the map_fields
function that you can use to map your dataset fields to a DwC term.