4 Data formatting workflow
Preparing data can be a challenging process at first. This section of the manual provides guidance on formatting data for OBIS so that it complies with Darwin Core.
The general data formatting workflow you can follow is:
- Identify your dataset structure
  - Understand the structure of your data and how it fits into Darwin Core
- Create unique identifiers
  - Assign unique identifiers to distinguish between events, nested events, and biological occurrences
- Match taxon names
  - Align taxonomic names to the World Register of Marine Species (WoRMS) to ensure consistency and retrieve their associated identifiers
- Map data column names to Darwin Core
  - Rename data columns to match Darwin Core terms
- Organize measurements, facts, and information
  - Structure measurement and fact data in long format in the extendedMeasurementOrFact table
- Identify controlled vocabularies to include with your measurements
  - Ensure measurements and facts reference appropriate controlled vocabularies for interoperability and clarity
- Standardize other fields
  - Verify that all other fields, such as dates and coordinates, conform to standards
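As a rough illustration of three of these steps (creating identifiers, mapping column names, and restructuring measurements into long format), the sketch below uses plain Python. The input rows, the source column names, the `DATASET_PREFIX` value, and the identifier pattern are all hypothetical and chosen only for this example; the output uses a small subset of Darwin Core terms and is not a complete set of the fields OBIS requires. Taxon matching against WoRMS and controlled vocabulary lookups are left out.

```python
# Hypothetical raw records; column names and values are invented for illustration.
raw_rows = [
    {"station": "ST01", "date": "2021-06-03", "species": "Mytilus edulis",
     "lat": 51.23, "lon": 2.91, "count": 12, "length_mm": 34.5},
    {"station": "ST01", "date": "2021-06-03", "species": "Asterias rubens",
     "lat": 51.23, "lon": 2.91, "count": 3, "length_mm": 81.0},
]

# Assumed mapping from source column names to Darwin Core terms.
COLUMN_MAP = {
    "date": "eventDate",
    "species": "scientificName",
    "lat": "decimalLatitude",
    "lon": "decimalLongitude",
    "count": "individualCount",
}

# Source measurement columns mapped to (measurementType, measurementUnit).
MEASUREMENTS = {"length_mm": ("length", "mm")}

DATASET_PREFIX = "demo2021"  # hypothetical prefix to keep identifiers unique


def format_rows(rows):
    """Return (occurrences, emof) built from the raw rows."""
    occurrences, emof = [], []
    for index, row in enumerate(rows, start=1):
        # One possible identifier pattern: the event is a station visit on a date,
        # and each occurrence gets a running number within the dataset.
        event_id = f"{DATASET_PREFIX}_{row['station']}_{row['date']}"
        occurrence_id = f"{event_id}_occ{index:03d}"

        # Rename source columns to Darwin Core terms.
        occurrence = {"eventID": event_id, "occurrenceID": occurrence_id}
        for source_name, dwc_term in COLUMN_MAP.items():
            occurrence[dwc_term] = row[source_name]
        occurrences.append(occurrence)

        # Long format: one row per measurement, linked back by occurrenceID.
        for source_name, (measurement_type, unit) in MEASUREMENTS.items():
            emof.append({
                "occurrenceID": occurrence_id,
                "measurementType": measurement_type,
                "measurementValue": row[source_name],
                "measurementUnit": unit,
            })
    return occurrences, emof


occurrences, emof = format_rows(raw_rows)
```

Here the wide `length_mm` column becomes one extendedMeasurementOrFact row per occurrence, so further measurement types can be added by extending `MEASUREMENTS` rather than by adding columns.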
The following pages provide a detailed breakdown of each step, including examples and tips to help you through the formatting process. Remember, the OBIS Helpdesk and OBIS Nodes are available to help you.