The Linux Foundation Projects
Skip to main content

We are excited to unveil Open Data Contract Standard (ODCS) Version 3, featuring significant improvements and new functionalities to better support modern data management needs. This release introduces updates to the fundamentals, schema, and data quality, as well as adding a dedicated Support & Communication Channels section. Our goal with these changes is to streamline the data contract process, making it more adaptable, scalable, and easier to use in various contexts. OCDS v3 is now available on Bitol’s GitHub.

This release represents the culmination of a lot of tireless, focused, and inspiring work by the Technical Steering Committee and many other volunteers.

Gene StakhovInformation Governance Practice Lead at enChoice, USA
Announce OCDS v3

OCDS v3 is now available on Bitol’s GitHub.

Highlights of ODCS v3

One of the key advancements in ODCS v3 is the modification of the schema structure, which now enables support not only for relational databases (as in v2) but also for hierarchical and complex data storage, streaming messages, and unstructured data. This greatly expands the types of data environments that can seamlessly integrate with ODCS, facilitating better management of complex data types and structures.

 

With this enormous amount of unstructured data around us, Bitol is the project to drive order in the chaos.

Atanas IlievSystem Architect at ALH Gruppe, Germany

Additionally, the definition of infrastructure has been extended, improving integration by encompassing metadata around systems, applications, and platforms. This enhancement enables ODCS to serve as an even more powerful source of truth, capturing essential context from a broader range of system metadata, which helps create a unified view of the data and the systems interacting with it.

Data quality support has also been significantly boosted in v3 to enable compatibility with virtually any data quality tool, making it easier for organizations to integrate their preferred tools. Looking ahead, these capabilities will be further extended in the upcoming ODCS v3.1.

 

ODCS v3 is a breakthrough release as it marks ODCS, a real standard supported by a Technical Steering Committee of 16 people from 14 different companies in Europe, America, and Asia.

Dr. Simon HarrerCo-founder of Data Contract Manager, Germany

Visit OCDS v3 on Bitol’s GitHub. Don’t forget to star it while you are there!

ODCS v3 is now divided into 11 sections:

  • Fundamentals.
  • Schema.
  • Data quality.
  • Pricing.
  • Team (fka stakeholders).
  • Security.
  • Service-Level Agreement (SLA).
  • Infrastructure.
  • Support.
  • Business rules.
  • Custom properties.

Link to full-size PNG and SVG. Available under the Apache 2 license.

Changes in ODCS v3

ODCS v3 brings some changes, including breaking ones, to ODCS v2. Here is a list of the changes. Details can be found in the changelog.

Support & Communication Channels

We’ve added a new section dedicated to providing a more structured approach to support, including Slack channels, feedback URLs, and distribution lists, which were previously scattered throughout the contract.

Changes to Fundamentals

The fundamentals have been streamlined.

  • UUID renamed to id
  • Name was added for easier identification.
  • quantumName was renamed to dataProduct and made optional.
  • datasetDomain renamed to domain, removing the dataset prefix.
  • Dropped unused fields: datasetKind, userConsumptionMode, sourceSystem, sourcePlatform, productSlackChannel, productFeedbackUrl, productDl, username, password, driverVersion, driver, server, project, datasetName, database, schedulerAppName.

Schema Overhaul

Major changes have been applied to the schema, including support for non-table formats, hierarchies, and arrays.

  • New fields: name, items.
  • Deprecated: priorTableName, table (consider using name instead).
  • columns renamed to properties.
  • dataGranularity renamed to dataGranularityDescription.
  • Other field changes:
    • encryptedColumnName → encryptedName
    • partitionStatus → partitioned
    • sampleValues → examples
    • isNullable → required
    • isUnique → unique
    • isPrimaryKey → primaryKey
    • criticalDataElementStatus → criticalDataElement

We also introduced restrictions to schema.*.logicalType, limiting it to string, date, number, integer, object, array, and boolean. New logicalTypeOptions have been added to offer more flexibility.

Data Quality Improvements

ODCS v3 has reimagined its Data Quality section, offering more support for tools and use cases. Key changes include:

  • Deprecated toolName, replaced by type=custom; engine: <engine name>.
  • scheduleCronExpression replaced with a simplified schedule and scheduler.

Role and Team Updates

They stay mostly the same in their structure but clarify names.

  • Stakeholders were renamed to Team, with the content remaining the same.
  • Role updates: Descriptions have been added, and access is no longer required.

SLA and Security

In response to the new schema structure, we’ve adjusted the SLA to reflect that data is no longer solely based on tables and columns.

  • Columns are renamed to elements across the SLA.
  • The SLA now includes explicit references to Data QoS.

Custom Properties

Several elements, such as systemInstance, are no longer supported but can be included as custom properties where necessary. Custom properties can be used in more sections as well.

No Changes in Pricing and Security

While much has evolved, our pricing and security models remain unchanged.

 

The Team Behind ODCS v3

ODCS v3 represents the collaborative effort of a global team of experts and volunteers who worked tirelessly to make this version a reality:

  • Andrew Jones, Principal Engineer and Independent Consultant, UK
  • Andy Petrella, CEO at Kensu, Belgium
  • Atanas Iliev, System Architect at ALH Gruppe, Germany
  • Christian Foyer, Data Engineer, Denmark
  • Dirk Van de Poel, Co-founder of Klarrio, Belgium
  • Eugene (Gene) Stakhov, Information Governance Practice Lead at enChoice, USA
  • Georges Kopp, Economist and Business Analyst, France
  • Jean-Georges Perrin (Chair), Chief Innovation Officer at AbeaData, USA
  • Jochen Christ, Co-founder of Data Contract Manager, Germany
  • Manuel Destouesse, Software Engineer, USA
  • Martin Meermeyer, Head of Global Data Governance at Lidl e-commerce, Germany
  • Peter Flook, Data Engineer, Taiwan
  • Sandro Pugliese, Head of Enterprise Data Engineering at TelevisaUnivision, USA
  • Simon Harrer, Co-founder of Data Contract Manager, Germany
  • Todd Nemanich, Founder and CTO at AbeaData, USA
  • Tom Baeyens, CTO at Soda, Belgium

To avoid missing anyone, this list does not mention the volunteers outside the TSC who contributed in v3.

 

About Bitol

Bitol is an incubation project from the Linux Foundation, focused on creating open standards for data management. This initiative fosters industry-wide collaboration and innovation, allowing organizations to adopt a unified approach to handling data contracts, metadata, and system interoperability.

The Technical Steering Committee (TSC), which oversees the development and direction of Bitol, is chaired by Jean-Georges Perrin (USA) and includes Andrew Jones (UK), Andy Petrella (Belgium), Atanas Iliev (Germany), Christian Foyer (Denmark), Dirk Van de Poel (Belgium), Eugene (Gene) Stakhov (USA), Georges Kopp (France), Jochen Christ (Germany), Manuel Destouesse (USA), Martin Meermeyer (Germany), Peter Flook (Taiwan), Sandro Pugliese (USA), Simon Harrer (Germany), Todd Nemanich (USA), and Tom Baeyens (Belgium).

Currently, Bitol’s primary focus is on advancing the Open Data Contract Standard (ODCS) and the Open Data Product Standard (ODPS). These standards aim to provide comprehensive frameworks for data contracts and data products, supporting organizations in managing data with greater consistency, transparency, and collaboration across systems.

The Bitol project seeks to provide a shared framework that promotes consistency, transparency, and collaboration across diverse data systems and environments. For more information, visit bitol.io.

Jean-Georges Perrin

Jean-Georges “jgp” Perrin is the Chief Innovation Officer at AbeaData, where he spearheads the development of cutting-edge, modern data tools designed to transform the way businesses manage and leverage data. As the Chair of the Open Data Contract Standard (ODCS) at the Linux Foundation’s Bitol project, Jean-Georges leads global efforts to standardize data contracts, making data ecosystems more reliable and efficient. Jean-Georges is also a co-founder of the AIDA User Group and the author of several influential books, including Implementing Data Mesh (O’Reilly), a pivotal resource in the data community, and Spark in Action, 2nd Edition (Manning), which empowers developers to harness the power of Apache Spark. His thought leadership extends to data engineering, governance, the industrialization of data science, and his personal favorite, Data Mesh—where he continuously pushes the boundaries of modern data architecture. Recognized for his contributions to the tech world, Jean-Georges is proud to hold the titles of Lifetime IBM Champion, PayPal Champion, and Data Mesh MVP, reflecting his enduring influence in data innovation. With over 25 years in the IT industry, Jean-Georges is a sought-after speaker at international conferences, regularly contributes to both print and online media, and shares his insights through his blog at jgp.ai. When he’s not immersed in the world of data, Jean-Georges enjoys exploring the scenic landscapes of Upstate New York and New England with his wife and children, balancing his love for tech with a passion for nature and adventure.