Tamr Documentation

Filtering Record Pairs

You can filter record pairs by priority, assignment, and other characteristics.

Tips for using filters to review and label record pairs effectively:

To reduce the number of record pairs that display further, you can use more than one filter.

Filtering to High-Impact Record Pairs

High-impact record pairs are pairs which have the most utility for training the Tamr machine learning model. In Tamr, high-impact pairs are identified by a lightning bolt icon. To help you locate and prioritize high-impact record pairs for review and labeling, a filtering option is provided for high-impact pairs.

To filter to high-impact record pairs:

  1. Navigate to the Pairs page in a mastering project.
  2. Select the Filter icon to open the filtering options.
  3. In the PAIRS section, select High-impact pairs.
Filtering to high-impact pairs helps you prioritize your work.Filtering to high-impact pairs helps you prioritize your work.

Filtering to high-impact pairs helps you prioritize your work.

  1. To close the filtering options panel on the left side, select the Filter icon again.

To remove all filters, select the X next to the Filter icon.

Filtering to My Assigned Record Pairs

You can filter record pairs to display only the ones that are assigned to you. You can also choose a status to show only records that you haven't labeled yet, or those that you skipped earlier.

To filter to record pairs with my assignments:

  1. On the Pairs page in a mastering project select the Filter icon to open the filtering options.
  2. In the ASSIGNMENT section, choose one of the following options:
  • All: Show all of the record pairs assigned to you, in any status.
  • Open: Show record pairs that you have not yet labeled or skipped.
  • Skipped: Show record pairs you previously skipped.
  • Completed. Show record pairs that you have labeled either match or no match.
  • No Filter: Remove this filter.
  1. To close the filtering options panel on the left side, select the Filter icon again.

To remove all filters, select the X next to the Filter icon.

Filtering to Record Pairs with Specified Responses

You set filters in the RESPONSES SECTION to find the record pair(s) that:

  • Has expert responses: All record pairs with a response from any project contributor.
  • Has inferred responses: This feature is in limited release. For information, contact [email protected]
  • My response: All record pairs where your response is one of Anything, Match, No Match, Skip, or No Response.
  • Experts agree on: All record pairs where all contributors agree on Match, No Match, Skipped.
  • Verified Response: All record pairs where the expert response has been verified as Match, No Match, or Anything (either match or no match), or Nothing.
  • Tamr's Response: All record pairs where Tamr's response is Match, No Match, or Anything (either match or no match), or Nothing.
  • Tamr's Confidence: All record pairs where Tamr's confidence is High, Medium, or Low.

The No Filter option, or clearing the checkbox, removes these filters.

To filter to record pairs with specific responses:

  1. On the Pairs page in a mastering project select the Filter icon to open the filtering options.
  2. In the RESPONSES section select the type(s) of responses the record pairs should have.
  3. To close the filtering options panel on the left side, select the Filter icon again.

To remove all filters, select the X next to the Filter icon.

Filtering Record Pairs by Attribute Similarity

When Tamr pairs two records, it compares the values for each attribute and calculates a similarity score from 0 to 100 for that pair of values. You can filter record pairs by selecting one or more attributes of interest and then specifying a range for the similarity score. This option can help you find and label record pairs that, based on your knowledge of the data, have a greater likelihood of matching or not matching.

Because Tamr cannot compute a similarity score if one or both of the values are null, empty, or blank, you can also define a filter for pairs that do not have a similarity score for an attribute.

Adding a Filter on Attribute Similarity

To filter on attribute similarity:

  1. On the Pairs page in a mastering project select the Filter icon to open the filtering options.
  2. Scroll to the ATTRIBUTE SIMILARITIES section and select Add Attributes.
  3. Select one or more attributes then select Okay.
  4. Enter values between 0 and 100 for the top and bottom of the similarity range. You can also drag the ends of the slider to adjust the range.
  5. Use the icons to specify Greater than or Greater than or equal to, and Less than or Less than or equal to for these values.
Filtering pairs based on attribute similarity.Filtering pairs based on attribute similarity.

Filtering pairs based on attribute similarity.

Adding a Filter on Attribute Nulls

To filter on an attribute with no similarity score:

  1. Follow the steps above to add a filter on attribute similarity.
  2. Toggle the Pairs without similarity scores option to the on (right) position.
  3. If you only want to include pairs that do not have a similarity score for the selected attribute, toggle the Pairs with similarity scores option to the off (left) position.

Filtering to Record Pairs with Comments

To filter to record pairs that have comments:

  1. On the Pairs page in a mastering project select the Filter icon to open the filtering options.
  2. In the PAIRS section select Pairs with comments.

Filtering Record Pairs by Source

You can specify the source dataset of each record in a pair:

  • Top row from: The top record in a pair can be selected from a specific source.
  • Bottom row from: The bottom record in a pair can be selected from a specific source.

To filter record pairs by source:

  1. On the Pairs page in a mastering project select the Filter icon to open the filtering options.
  2. In the SOURCES section select All Sources.
  3. Select one or more source datasets then select Okay.
  4. Optionally, select a source for Top row from and/or Bottom row from.

Updated 3 months ago



Filtering Record Pairs


You can filter record pairs by priority, assignment, and other characteristics.

Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.