HomeGuidesAPI ReferenceChangelog
HomeGuidesTamr API ReferenceTutorialsEnrichment API ReferenceSupport Help CenterLog In

Schema Mapping Workflow

To harmonize multiple data sources, curators align attributes from input data sources with the attributes in a unified schema.

After an admin creates the project, you can upload one or more datasets into Tamr. You then can access and review all of the attributes from all input datasets on the Schema Mapping page of the project. On this page you create a unified schema from one or more tabular datasets.

You define:

  • the unified attributes in the target dataset
  • how input attributes map to unified attributes

When you manually map some of the input attributes to unified attributes, you provide information to a Tamr machine learning model so that it can recommend additional mappings to help automate the process.

For example, your input datasets have attributes for givenName, First_Name, and Name. Using your knowledge of both the input datasets and of the downstream needs of data consumers, you decide that a unified attribute of firstName_original should store all first name values, and you map the givenName input attribute to that unified attribute. This initial mapping trains a Tamr model, which can then suggest additional mappings, potentially including First_Name and Name to firstName_original. As you iteratively accept suggestions, like First_Name, and reject others, like Name, the suggestions that the model makes can become increasingly helpful.

You can then add data transformations to attributes in the unified dataset, specified input datasets, or both.


Did this page help you?