After an admin creates the project, you can upload one or more datasets into Tamr. You then can access and review all of the attributes from all input datasets on the Schema Mapping page of the project. On this page you create a unified schema from one or more tabular datasets.
- the unified attributes in the target dataset
- how input attributes map to unified attributes
When you manually map some of the input attributes to unified attributes, you provide information to a Tamr machine learning model so that it can recommend additional mappings to help automate the process.
For example, your input datasets have attributes for
Name. Using your knowledge of both the input datasets and of the downstream needs of data consumers, you decide that a unified attribute of
firstName_original should store all first name values, and you map the
givenName input attribute to that unified attribute. This initial mapping trains a Tamr model, which can then suggest additional mappings, potentially including
firstName_original. As you iteratively accept suggestions, like
First_Name, and reject others, like
Name, the suggestions that the model makes can become increasingly helpful.
You can then add data transformations to attributes in the unified dataset, specified input datasets, or both.
Updated 2 months ago