To create a unified dataset
- Select the Schema Mapping page. The left-hand side of the page displays all of the input attributes from each uploaded dataset. The right-hand side lists attributes in the unified dataset as you create them. Tamr uses the following default name for the unified dataset:
- Optionally, edit the unified dataset's name in the text box or leave the default.
- Choose Create Unified Dataset. The screen changes to show that there are no unified attributes. You can begin adding them and building your unified schema.
Unified attributes must have names that conform to the following requirements:
- Names cannot contain the
- Names must be unique: they cannot match (case-insensitively) any other unified attribute in your project
- Names cannot be empty, and cannot contain a leading or trailing whitespace
- Names cannot match (case-insensitively) the reserved, Tamr-generated attribute names (see Understanding Tamr-Generated Data Attributes).
If you do use one of these names for an attribute, the error message "This name is reserved by the system" appears.
As a best practice, use a consistent style for your attribute names. For example, always use "camelCase" or "snake_case".
You can create unified attributes manually, or use the Tamr "bootstrap" feature to create unified attributes from specified input attributes.
To create a unified attribute manually
- On the right-hand side of the Schema Mapping page select Create.
- Enter a unique identifying name. See Naming Unified Attributes.
- Optionally, enter a description for the attribute.
- Select Create.
To bootstrap unified attributes from input attributes
- On the left side of the Schema Mapping page use Ctrl+click or Cmd+click to select one or more unmapped attributes.
- Choose Bootstrap. Tamr launches the attribute bootstrapping process to create unified attributes and map the selected input attributes to them. See Approaches to Creating a Unified Schema.
After you create unified attributes you can map input attributes to them one at a time by dragging and dropping them.
Tip: Use the Unmapped check box at the top of the list of input attributes to show only unmapped attributes.
You can map multiple input attributes by using Ctrl+click (Cmd+click) or Shift+clic.
If there is an input attribute that you choose not to map, select it and then choose Do not map from the Map drop-down menu.
After you map a few attributes manually, you can leverage the Tamr machine learning model to suggest additional attribute mappings for you.
If an input attribute is incorrectly mapped to a unified attribute, move your cursor over the count of attributes in the Mappings column for the unified attribute to see a list of mapped input attributes. Find the incorrectly mapped attribute in the list and select Unmap.
You can also unmap input attributes from Mappings column in the left-hand panel. Move your cursor over the name of the unified attribute to show a popup. Within the popup, move your cursor over the name of the unified attribute to reveal the Unmap option.
You can filter to the set of source attributes that are mapped to a unified attribute by selecting View all mappings.
When you view unified data in tables on subsequent pages of the project, you can sort records by attribute value. By default, Tamr assigns a sort value of alphabetical to every unified attribute.
To change the sort value, choose the More menu (⁝ tricolon icon) to the right of the unified attribute and then choose Alphabetically or Numerically.
To delete a unified attribute, select it and choose Remove.
To save your work and apply your changes and additions, you update the unified dataset.
To update a unified dataset:
- Navigate to the Schema Mapping tab of any project.
- Choose Update Unified Dataset.
Updated 4 months ago
|Attribute Recommendations from Machine Learning|
|Generating Attribute Recommendations|
|Previewing the Unified Dataset|