Tamr Documentation

Schema Mapping Workflow

To harmonize multiple data sources, curators align attributes from input data sources with the attributes in a unified schema.

After an admin creates the project and uploads one or more datasets into Tamr, you can review all of the attributes from all input datasets on the Schema Mapping page of the project. On this page, you create a unified schema from multiple tabular datasets.

You define:

  • the unified attributes in the target dataset
  • how input attributes map to unified attributes
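
As a rough illustration of these two definitions, the sketch below models a unified schema and its mappings with plain Python data structures. It is not Tamr's API: the dataset names (crm, billing), the to_unified helper, and the sample records are hypothetical, and the attribute names are borrowed from the first-name example on this page.

```python
# Illustrative only: plain Python data structures, not Tamr's API.
# The dataset names "crm" and "billing" and all sample values are hypothetical.

# 1. The unified attributes in the target dataset.
unified_attributes = ["firstName_original"]

# 2. How input attributes map to unified attributes:
#    (input dataset, input attribute) -> unified attribute
mappings = {
    ("crm", "givenName"): "firstName_original",
    ("billing", "First_Name"): "firstName_original",
}


def to_unified(dataset, record):
    """Project one input record onto the unified schema."""
    unified = {attr: None for attr in unified_attributes}
    for input_attr, value in record.items():
        target = mappings.get((dataset, input_attr))
        if target is not None:  # unmapped input attributes are skipped
            unified[target] = value
    return unified


rows = [
    to_unified("crm", {"givenName": "Ada", "id": 1}),
    to_unified("billing", {"First_Name": "Grace"}),
]
```

Keying the mapping on the pair of dataset and attribute name keeps attributes that share a name in different input datasets from colliding.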

When you manually map some of the input attributes to unified attributes, you train a Tamr model so that it can recommend additional mappings to help automate the process.

For example, suppose your input datasets have attributes named givenName, First_Name, and Name. Using your knowledge of both the input datasets and the downstream needs of data consumers, you decide that a unified attribute named firstName_original should store all first name values, and you map the givenName input attribute to it. This initial mapping trains a Tamr model, which can then suggest additional mappings, potentially including First_Name and Name to the same unified attribute.
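
To make the idea of suggested mappings more concrete, here is a toy sketch that ranks unmapped input attributes by name similarity to a mapping the curator has already confirmed. This page does not describe how Tamr's model works, so the string-similarity heuristic below is an assumption chosen purely for illustration, and the normalize and suggest functions are hypothetical names, not part of any Tamr interface.

```python
# Toy illustration only: a name-similarity heuristic, NOT Tamr's model.
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase and drop non-alphanumerics, so First_Name becomes firstname."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


def similarity(a, b):
    """Similarity of two attribute names, between 0.0 and 1.0."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def suggest(unmapped, confirmed):
    """Return (input_attr, unified_attr, score) tuples, best score first.

    confirmed: dict of input attribute -> unified attribute, set by a curator.
    """
    suggestions = []
    for attr in unmapped:
        best = None
        for mapped_attr, unified in confirmed.items():
            # Compare against both the confirmed input name and the unified name.
            score = max(similarity(attr, mapped_attr), similarity(attr, unified))
            if best is None or score > best[2]:
                best = (attr, unified, score)
        if best is not None:
            suggestions.append(best)
    return sorted(suggestions, key=lambda s: s[2], reverse=True)


# The curator has confirmed one mapping; the rest are candidates.
confirmed = {"givenName": "firstName_original"}
suggestions = suggest(["First_Name", "Name"], confirmed)
```

In this sketch a suggestion is only a ranked candidate. A curator would still accept or reject each one, which matches the workflow above, where manual mappings come first and recommendations help automate the rest.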

You can then add data transformations to attributes in the unified dataset, in specified input datasets, or in both.

Updated 19 days ago


