Data Entity Groups & Domains

Data Entity Groups + the Domain framing — logical containers for related entities, how the Domain flag surfaces them on the Catalog Overview, and the relationship to ML experiments.

A Data Entity Group (DEG) is the platform's logical-grouping primitive — a catalog-level container that gathers related data entities (datasets, transformers, quality tests, consumers) under one umbrella with their own metadata, owners, tags, and terms. A Domain is a particular use of a DEG: a DEG flagged as a domain surfaces in the Catalog Overview home page's Domains section as a top-level discovery surface.

This page covers both framings — DEGs as the underlying primitive, and Domains as the operator-flag use of them.

What a DEG is

Create groups to gather similar entities (datasets, transformers, quality tests, etc.). Each group can be enriched with specific metadata, owners, and terms.

Example. An organisation has ingested metadata related to its finances into the ODD Platform. All the entities are united into the Finance Namespace by default. To categorise entities, one creates Revenue and Payrolls DEGs.

A DEG is itself a Data Entity (of type DATA_ENTITY_GROUP) — it has a detail page, an ODDRN, and participates in lineage (Group lineage returns the union of the group's children's lineage).

The Domain framing

Flagging a DEG as a domain unlocks one extra surface: the Catalog Overview page renders a Domains section listing all domain-flagged DEGs as quick-jump tiles. This makes domain-flagged DEGs the platform's first-class discovery axis on the home page.

Operationally:

  1. Create or open a DEG.

  2. Use the DEG's edit surface to flag it as a domain.

  3. The DEG now appears on the Catalog Overview's Domains section (which is conditional — the section appears only when at least one domain-flagged DEG exists).

The Domains section is also reachable through the Search Groups facet, where domain DEGs participate alongside non-domain DEGs.

DEG metadata

A DEG carries the same metadata model as any other Data Entity — owners, tags, terms, descriptions, statuses — at the group level. The convention is:

  • Owners — set on the DEG to mark domain-stewardship at the group level instead of duplicating per-child.

  • Tags — applied at the DEG level surface the group on Tag-faceted searches.

  • Terms — link glossary terms to the DEG to capture meaning.

  • Description — narrative authoring describing what the group represents.

Children of the DEG keep their own metadata; the DEG-level metadata supplements it rather than replacing it.

Relationship to ML Experiments

In ODD, an ML experiment is a Data Entity Group of class ML_EXPERIMENT that collects the entities produced by a training run — input datasets, feature tables, training jobs, model instances, and resulting model artifacts — into one logical container. Lineage, ownership, tags, and alerts follow through at the experiment level instead of being scattered across each child entity.

ML experiments are DEGs of a particular shape — the same primitive, used for a specific workflow.

ML experiments in ODD are a catalog view over the assets that participated in the run. The platform does not track metrics, compare runs, or select a "best" model — it has no experiment-tracking UI or API of its own. For tracking, keep using MLflow, Weights & Biases, Comet, or your tool of choice, and push the resulting entities (datasets, models, runs) into ODD through a push adapter or the ODD Specification so the experiment and its lineage are browsable alongside the rest of your data platform.

Each experiment's training-run inputs and outputs participate in the catalog-wide lineage graph — an operator opening an ML experiment's Lineage tab sees its dataset / model / run-level edges alongside the rest of the data platform.

Group lineage

The dedicated Group lineage endpoint returns the lineage graph for the DEG's children, not the DEG itself. This is what an operator usually wants when reasoning about a domain or pipeline group — "what does the Finance domain depend on, and what depends on it?" is a question about the union of the children's edges.

Where to next

Last updated