To create a unified dataset:
- Select the Schema Mapping page. The left side of the page displays all of the input attributes from each uploaded dataset. The right side lists attributes in the unified dataset as you create them. Tamr Core uses the following default name for the unified dataset:
- (Optional) Edit the unified dataset's name in the text box.
- Choose Create Unified Dataset. The screen changes to show that there are no unified attributes. Now, you can begin adding them and building your unified schema.
Unified attributes must have names that conform to the following requirements:
- Names cannot contain the
- Names must be unique: they cannot match (case-insensitively) any other unified attribute in your project.
- Names cannot be empty, and cannot contain a leading or trailing whitespace.
- Names cannot match (case-insensitively) the reserved, system-generated attribute names. See Understanding Tamr-Generated Data Attributes.
If you do use one of these names for an attribute, you receive the error message "This name is reserved by the system."
As a best practice, use a consistent style for your attribute names. For example, always use "camelCase" or "snake_case".
You can create unified attributes manually, or use the "bootstrap" feature to create unified attributes from specified input attributes.
To create a unified attribute manually:
- On the right side of the Schema Mapping page select Create.
- Enter a unique identifying name. See Naming Unified Attributes.
- (Optional) Enter a description for the attribute.
- Select Create.
To bootstrap unified attributes from input attributes:
- On the left side of the Schema Mapping page, use Ctrl+Select or Cmd+Select to select one or more unmapped attributes.
- Choose Bootstrap. Tamr Core launches the attribute bootstrapping process to create unified attributes and map the selected input attributes to them. See Approaches to Creating a Unified Schema.
After you create unified attributes you can map input attributes to them one at a time by dragging them onto the desired unified attribute.
Tip: Use the Unmapped checkbox at the top of the list of input attributes to show only unmapped attributes.
You can map multiple input attributes by using Ctrl+Select (Cmd+Select) or Shift+Select.
If there is an input attribute that you choose not to map, select it and then choose Do not map from the Map dropdown menu.
After you map a few attributes manually, you can leverage the machine learning model to suggest additional attribute mappings.
If an input attribute is incorrectly mapped to a unified attribute, move your cursor over the count of attributes in the Mappings column for the unified attribute to see a list of mapped input attributes. Find the incorrectly mapped attribute in the list and select Unmap.
You can also unmap input attributes from the Mappings column in the left side. Move your cursor over the name of the unified attribute to show a popup. Within the popup, move your cursor over the name of the unified attribute to reveal the Unmap option.
You can filter to the set of source attributes that are mapped to a unified attribute. Move your cursor over the name of the unified attribute to show a popup. Within the popup, select View all mappings.
When you view unified data in tables on subsequent pages of the project, you can sort records by attribute value. By default, Tamr Core assigns a sort value of alphabetical to every unified attribute.
To change the sort value, to the right of the unified attribute, choose More and then choose Alphabetically or Numerically.
To delete a unified attribute, select it and choose Remove.
To save your work and apply your changes and additions, you update the unified dataset.
To update a unified dataset:
- Navigate to the Schema Mapping page.
- Choose Update Unified Dataset.
Updated 6 days ago