Categorizing Records

Overview

Once you upload the data and map it to a unified schema, you can begin classifying it.

To start, you can search and filter to find records that match a specific category, and then label these by selecting New Categorization.

You can assign records to other users, and then accept or contest their categorizations. If your project has multiple Reviewers, they can upvote or downvote an assigned category to indicate agreement. A Curator then reviews the votes and chooses a winning categorization, known as the verified categorization. Tamr learns only from the verified categorization.

Records that have been categorized.

Once reviewers have assigned category labels to at least three transactions for each unique category, and have configured each dataset to indicate which fields the classifier should use, you can select Update Categorizations. This launches the classification job in Tamr for the remaining records. The job does not use any category that has no user-generated suggestions.
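The labeling threshold above can be checked before launching a job. The following is a minimal sketch, not part of Tamr's API: it assumes the labels have been exported as `(record_id, category)` pairs, and the function names, category names, and data are all hypothetical.

```python
from collections import Counter

MIN_LABELS_PER_CATEGORY = 3  # threshold stated in the guide


def split_by_readiness(taxonomy, labeled_records, minimum=MIN_LABELS_PER_CATEGORY):
    """Group categories by how many user-assigned labels they have.

    `labeled_records` is a hypothetical list of (record_id, category) pairs.
    Returns (ready, under_labeled, unlabeled), each in taxonomy order.
    """
    counts = Counter(category for _, category in labeled_records)
    ready = [c for c in taxonomy if counts[c] >= minimum]
    under_labeled = [c for c in taxonomy if 0 < counts[c] < minimum]
    unlabeled = [c for c in taxonomy if counts[c] == 0]
    return ready, under_labeled, unlabeled


# Illustrative data only.
taxonomy = ["Fasteners", "Bearings", "Adhesives"]
labels = [
    (1, "Fasteners"), (2, "Fasteners"), (3, "Fasteners"),
    (4, "Bearings"),
]
ready, under_labeled, unlabeled = split_by_readiness(taxonomy, labels)
print(ready)          # ['Fasteners']
print(under_labeled)  # ['Bearings']
print(unlabeled)      # ['Adhesives']
```

In this sketch, categories in `unlabeled` correspond to those the job would not use, and categories in `under_labeled` are the ones that still need more examples.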

Adding New Categories

If you identify a category that is missing from the taxonomy, you can add it from the Parts page. Select a record by checking the box to its left, then choose New Categorization from the top menu, or choose add categorization in the Categorization column. You can then add the missing category in the dialog that opens.

The small "plus" sign allows you to add a category at whichever tier it is missing. For example, if you add a third-tier category whose parent nodes do not exist yet, you can also add the missing node at each level above it.
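Adding a category together with its missing parents can be pictured as inserting a path into a tree. The sketch below is illustrative only: it models the taxonomy as a nested dictionary, which is an assumption and not how Tamr stores it, and `add_category` is a hypothetical helper.

```python
def add_category(taxonomy, path):
    """Insert `path` into a nested-dict taxonomy, creating missing parents.

    Returns the nodes that had to be created, each as a tuple of path segments.
    """
    created = []
    node = taxonomy
    for depth, name in enumerate(path, start=1):
        if name not in node:
            node[name] = {}  # create the missing node at this tier
            created.append(tuple(path[:depth]))
        node = node[name]
    return created


# Illustrative data: the first tier exists, the second and third do not.
taxonomy = {"USA": {}}
created = add_category(taxonomy, ["USA", "Midwest", "Ohio"])
print(created)   # [('USA', 'Midwest'), ('USA', 'Midwest', 'Ohio')]
print(taxonomy)  # {'USA': {'Midwest': {'Ohio': {}}}}
```

Calling `add_category` again with the same path creates nothing and returns an empty list, which mirrors adding only the tiers that are actually missing.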

Configuring the Parts Page

You can customize what you see in the Parts page for easier labeling. Use the filter tab to specify which records are displayed. For example, you can filter by user responses, Tamr responses, and the datasets from which the records originated.

You can also specify which columns will be displayed, and in which order, using the small gears icon in the upper right corner.

Reviewing Category Results

In the taxonomy review page, the panel on the right-hand side shows details about each category at each tier. In the following example, Tamr reports how many records have been classified for the Midwest, how many of those records were suggested by Tamr, and the total spend in this category. There is also information about the category tree: the category's parent (USA) and children (the states in the Midwest). This view is sorted by number of records, but it can be re-sorted to display categories in order of total spend. Note that the dark green part of the bar indicates finalized categorizations, while light green indicates Tamr suggestions.
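The per-category figures described above (record count, Tamr suggestions, total spend) can be reproduced offline as a simple aggregation. This is a rough sketch under stated assumptions: the records are assumed to be available as `(category, spend, is_verified)` tuples, and `CategorySummary`, `summarize`, and the sample data are hypothetical, not Tamr objects.

```python
from dataclasses import dataclass


@dataclass
class CategorySummary:
    name: str
    verified: int = 0         # finalized categorizations (dark green in the bar)
    suggested: int = 0        # Tamr suggestions (light green in the bar)
    total_spend: float = 0.0

    @property
    def records(self):
        return self.verified + self.suggested


def summarize(records):
    """Aggregate (category, spend, is_verified) tuples into per-category summaries."""
    summaries = {}
    for category, spend, is_verified in records:
        summary = summaries.setdefault(category, CategorySummary(category))
        if is_verified:
            summary.verified += 1
        else:
            summary.suggested += 1
        summary.total_spend += spend
    return summaries


# Illustrative data only.
records = [
    ("Ohio", 100.0, True),
    ("Ohio", 50.0, False),
    ("Ohio", 25.0, False),
    ("Illinois", 900.0, True),
]
summaries = summarize(records)
by_records = sorted(summaries.values(), key=lambda s: s.records, reverse=True)
by_spend = sorted(summaries.values(), key=lambda s: s.total_spend, reverse=True)
print([s.name for s in by_records])  # ['Ohio', 'Illinois']
print([s.name for s in by_spend])    # ['Illinois', 'Ohio']
```

The two sort keys correspond to the two orderings mentioned in the text: by number of records and by total spend.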
