User GuidesAPI ReferenceRelease Notes
Doc HomeHelp CenterLog In
User Guides

Working with Golden Records

Curators and Reviewers compose rules to consolidate the values of specific attributes, and review and edit the resulting golden records.

The dataset arrives from a Mastering Project and is obtained after you publish clusters. Next, a curator composes attribute-specific consolidation rules that Tamr uses to generate golden records from a dataset with a grouping key.

As a curator or reviewer, you can review the resulting records, directly edit their values, and refine the consolidation rules that Tamr used to generate them.

Creating and Previewing Consolidation Rules

For each golden record attribute, you create a consolidation rule. Each consolidation rule consists of:

  • input attributes
  • conditions
  • aggregation functions
  • expression aggregation functions.

For convenience, when you create a Golden Records project, Tamr automatically generates a golden records dataset using the default rule, most common value with golden record attributes matching 1-to-1 with records from the input dataset. You can then customize the rules, aggregation functions, and conditions for each attribute. For more information, see Editing Golden Record Consolidation Rules.

Rules Preview

It is often convenient to run a preview for how your rules will behave. You can do this, after you have created a rule but before you have applied it to the entire cluster or group of records.

Conditions

You can use conditions in the consolidation rule on golden records. The filter uses the condition for the grouping key in the input datasets. Tamr applies the condition before it applies the aggregation function. This way, you can use a condition to filter down to the records that meet it, and then apply the aggregation function. For the list of conditions, see Working with Golden Records.

Aggregation Functions

You can use an aggregation function in the consolidation rule on the golden records. The aggregation function is applied to the grouping key of the input dataset. For example, if you apply the function most common value to the attribute state in the following dataset with the grouping key published_id, the rule returns the value Massachusetts.

published_idstate
101Massachusetts
101Massachusetts
101Ohio
101Massachusetts
101Ohio
101Ohio
101Massachusetts

For the list of aggregation functions you can use, see Aggregation Functions.

Expression Aggregation Functions

Use a code editor to compose custom aggregation functions. All such functions are based on Tamr transformations. Tamr first applies conditions, filtering down to the records that meet it, and then it applies expression agregation functions. For more information, see Expression Aggregation Functions.

Editing Values

As a curator or reviewer, you review golden records and can directly edit their values. The values you enter override the values selected by the consolidation rule for a golden record attribute. See Creating or Editing a Value Override for a Golden Record.

If you create new or update existing consolidation rules, value overrides remain unchanged. You can see the number of value overrides for a given attribute in the rules panel and filter to them. See Filtering To Records with Value Overrides.