Schema Documentation

The OAE Data Management Protocol, developed by Carbon To Sea and Submarine Scientific in collaboration with NOAA and the broader OAE community, outlines recommendations for producing consistent data and metadata across Ocean Alkalinity Enhancement (OAE) projects.

This site serves as the technical documentation for the machine-readable schemas that turn those recommendations into something software systems can validate and exchange.

Alpha Software

These schemas are under active development. We do not currently guarantee backwards compatibility between versions.

What is this?

The OAE Data Protocol is a structured approach to collecting, documenting, and sharing data from ocean alkalinity enhancement field trials and related experiments. This site provides the technical schema documentation — browse the class hierarchy, controlled vocabularies, and field definitions that make up the protocol's metadata model.

Schema Scope

  • Project — OAE field trials, leads, permits, spatial/temporal coverage
  • Experiment — Interventions, tracer studies, model experiments, dosing details
  • Dataset — Field datasets, model output datasets, platform info
  • Variables — Measured, calculated, and contextual variables with instrument and calibration metadata
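As a rough illustration of how these four levels could relate, the sketch below models them as nested Python dataclasses and serializes one record to JSON. The class layout, the nesting (Project contains Experiments, which contain Datasets, which contain Variables), and every field name here are assumptions made for illustration; the actual classes and fields are defined in the LinkML source schema.

```python
from __future__ import annotations

import json
from dataclasses import asdict, dataclass, field


# All field names below are hypothetical placeholders, not the real schema.
@dataclass
class Variable:
    name: str
    unit: str
    instrument: str | None = None


@dataclass
class Dataset:
    title: str
    platform: str
    variables: list[Variable] = field(default_factory=list)


@dataclass
class Experiment:
    name: str
    experiment_type: str  # e.g. "intervention", "tracer_study", "model"
    datasets: list[Dataset] = field(default_factory=list)


@dataclass
class Project:
    name: str
    lead: str
    experiments: list[Experiment] = field(default_factory=list)


project = Project(
    name="Example OAE field trial",
    lead="A. Researcher",
    experiments=[
        Experiment(
            name="Pilot dosing",
            experiment_type="intervention",
            datasets=[
                Dataset(
                    title="Mooring time series",
                    platform="mooring",
                    variables=[Variable(name="total_alkalinity", unit="umol/kg")],
                )
            ],
        )
    ],
)

# asdict() recurses through nested dataclasses and lists.
metadata_json = json.dumps(asdict(project), indent=2)
print(metadata_json)
```

Keeping each level as its own class mirrors the way the schema documentation is organized, so a field can be looked up under the class it belongs to.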

For protocol requirements pertaining to general metadata management, Excel metadata templates, dataset formatting, and column header names, see the published protocol.

Published Artifacts & Resources

  • OAE Metadata Builder — Web app for creating and managing JSON metadata files
  • JSON Schema — Machine-readable schemas used for validation
  • LinkML Source Schema — The source of truth for generating all schema artifacts and documentation (contributors should only edit these files)
  • Python Dataclasses — For managing metadata directly in Python workflows (WIP)
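To give a feel for what validating a metadata file against a schema involves, here is a deliberately small stand-in written with the Python standard library only. It understands just the `type`, `required`, and `properties` keywords and is not a full JSON Schema validator; in practice a complete validator library would be pointed at the published JSON Schema. The `PROJECT_SCHEMA` fragment and its field names are hypothetical and are not taken from the published schema.

```python
import json

# Hypothetical, heavily simplified schema fragment (not the published schema).
PROJECT_SCHEMA = {
    "type": "object",
    "required": ["name", "lead"],
    "properties": {
        "name": {"type": "string"},
        "lead": {"type": "string"},
        "experiments": {"type": "array"},
    },
}

# Only the JSON types this sketch needs.
JSON_TYPES = {"string": str, "object": dict, "array": list}


def check(instance, schema):
    """Return a list of error messages; an empty list means the checks passed."""
    expected = JSON_TYPES.get(schema.get("type"))
    if expected is not None and not isinstance(instance, expected):
        return [f"expected {schema['type']}, got {type(instance).__name__}"]

    errors = []
    if isinstance(instance, dict):
        # Report required keys that are absent.
        for key in schema.get("required", []):
            if key not in instance:
                errors.append(f"missing required field: {key}")
        # Recurse into the properties that are present.
        for key, subschema in schema.get("properties", {}).items():
            if key in instance:
                errors.extend(f"{key}: {msg}" for msg in check(instance[key], subschema))
    return errors


good = json.loads('{"name": "Example trial", "lead": "A. Researcher"}')
bad = json.loads('{"name": 42}')

print(check(good, PROJECT_SCHEMA))  # no errors
print(check(bad, PROJECT_SCHEMA))   # one missing field, one wrong type
```

Collecting every error instead of stopping at the first one matches how metadata is usually corrected: the author sees all problems in a file in a single pass.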

Built with LinkML to support FAIR data practices

The OAE Data Protocol schema is defined using LinkML, a "linked data modeling language" that lets schemas be authored as YAML files, integrated with external data standards and vocabularies, and exported in a variety of machine-readable formats such as JSON Schema and Python dataclasses, along with generated documentation.

A key feature of LinkML is its support for RDF and JSON-LD mappings and serialization, which improves interoperability with existing data standards. Where applicable, this project strives to align with existing scientific data standards (such as science-on-schema.org) and with controlled vocabularies hosted on the NERC Vocabulary Server.
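The sketch below illustrates the general idea behind a JSON-LD mapping: a plain metadata record gains an `@context` that ties its local field names to terms in a shared vocabulary, so other systems can interpret the fields without knowing this schema. The context here is written by hand, and the local field names (`title`, `summary`, `time_range`) and their pairing with schema.org terms are assumptions for illustration, not the project's actual mappings.

```python
import json

# Hypothetical mapping from local field names to schema.org terms.
CONTEXT = {
    "title": "https://schema.org/name",
    "summary": "https://schema.org/description",
    "time_range": "https://schema.org/temporalCoverage",
}


def to_jsonld(record, context, type_iri):
    """Return a copy of `record` with JSON-LD `@context` and `@type` keys added."""
    return {"@context": context, "@type": type_iri, **record}


# Illustrative record, not real data.
record = {
    "title": "Example mooring time series",
    "summary": "Illustrative record, not real data.",
    "time_range": "2024-06-01/2024-06-30",
}

doc = to_jsonld(record, CONTEXT, "https://schema.org/Dataset")
print(json.dumps(doc, indent=2))
```

The original record is left unchanged, so the same data can be written out both as plain JSON and as JSON-LD.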

Because the OAE Data Protocol has been developed in close collaboration with NOAA and the OCADS team, several parts of this schema (the Variable class and its subclasses in particular) aim to align closely with NOAA-PMEL's OAPMetadata XSD schemas.

Questions or Feedback?

Visit the GitHub repository or contact data@carbontosea.org.


Development of the OAE Data Protocol has been made possible with funding and steering support from Carbon To Sea.