Jump to Content
User Guides
API Reference
Release Notes
v2019.019
v2020.004
v2020.13.0-2020.16.4
v2021.7.0-2021.10.0
v2021.19.0-2022.1.0
v2022.2.0-2022.5.0
v2022.6.0-2022.9.0
v2022.10.0-2022.13.0
v2023.1.0-2023.2.0
v2023.004
v2024.001
v2024.002
Doc Home
Help Center
Log In
User Guides
Doc Home
Help Center
Log In
v2024.002
User Guides
API Reference
Release Notes
Configuration Variable Reference
Tamr Core Overview
Solving Data Quality Challenges with Tamr Core
Working with Machine Learning Models
User Roles and Tamr Core Documentation
Schema Mapping Projects
Mastering Projects
Golden Records Projects
Categorization Projects
Data Quality and Enrichment Services
Navigating Data
Viewing Data in Tamr Core
Searching Records
Advanced Project Features
Tokenizers and Similarity Functions
Transformations
Core Connect
Geospatial Data in Mastering Projects
Understanding Primary Keys
Glossary
Tamr Support
Reviewer Guide
Reviewer Tasks and Responsibilities
Reviewing Pairs
Labeling Pairs
Adding Comments to a Pair
Filtering Record Pairs
Commenting on Cluster Records
Reviewing Categorizations
Categorizing Records
Filtering Categorizations
Verifier Guide
Verifier Tasks and Responsibilities
Coordinating a Mastering Project
Training Initial Pairs
Assigning Pairs
Viewing and Verifying Pairs
Filtering Clusters
Searching Cluster Records
Assigning Clusters
Verifying Clusters
Coordinating a Categorization Project
Using the Categorization Dashboard
Navigating a Taxonomy
Training Tamr Core to Categorize Records
Assigning Records in Categorization Projects
Filtering Records in Categorization Projects
Verifying Record Categorizations
Curator Guide
Curator Tasks and Responsibilities
Working with Datasets in Projects
Adding a Dataset to a Project
Uploading a Dataset into a Project
Profiling a Dataset
Previewing an Input Dataset
Managing Dataset Tags
Exporting a Dataset
Removing a Dataset from a Project
Schema Mapping Workflow
Creating a Unified Schema
Mapping Unified Attributes
Mapping Recommendations
Generating Attribute Recommendations
Previewing the Unified Dataset
Mastering Project Workflow
Creating the Unified Dataset for Mastering
Working with Pairs
Grouping Obvious Duplicates
Defining the Blocking Model
Curating Project Jobs and Viewing Metrics
Working with Clusters
Publishing Clusters
Examples of Cluster ID Changes
Golden Records Workflow
Working with Golden Records
Golden Record Consolidation Rules
Categorization Workflow
Creating the Unified Dataset for Categorization
Uploading a Taxonomy File
Updating Categorization Results
Managing Record Categorizations
Managing a Taxonomy
Taxonomy Design Principles
Managing Jobs
Viewing Job Details
Monitoring Job Status
Cancelling a Job
Managing Projects
Viewing and Editing a Project's Settings
Deleting a Project
Author Guide
Author Tasks and Responsibilities
Working with Projects and Datasets
Creating a Project
Updating Project Access
Updating Dataset Access
Administrator Guide
Admin Tasks and Responsibilities
Managing User Accounts and Access
Roles for Users and Groups
Permissions Matrix by User Role
Using Policies to Control Access
Navigating the Users Page
Creating a User or User Group
Editing a User's Information or Password
Editing Roles and Groups
Activating or Deactivating a User
Auditing User Policies and Access
Managing Datasets
Using the Dataset Catalog
Datasets Generated by Tamr Core
Data Attributes Generated by Tamr Core
Deleting a Dataset from All Projects
Auditing the Policies for a Dataset
System Administrator Guide
Deployment Options
Single-Node Deployments
Deploying Single-Node Tamr Core on AWS (Commercial Marketplace)
Deploying Single-Node Tamr Core on AWS (ICMP)
Deploying Single-Node Tamr Core on Google Cloud Platform
Deploying Single-Node Tamr Core on Azure
Deploying a Scalable Tamr Core Instance on AWS
Terraform IAM Principal Permissions for AWS
Tamr Core AWS Network Reference Architecture
Deploying a Scalable Tamr Core Instance on Google Cloud Platform
Installation
Requirements for Installing Tamr Core
Installing NGINX
Installing PostgreSQL
Creating the Database and Database User
Installing Tamr Core
Setting the License Key
Restarting Tamr Core
Security Configurations
Verifying Tamr Core Installation
Adding a Custom Toolbar Button
Project Movement
Configuration
Configuring Tamr Core
Configuring Tamr Core Backup
Configuring HDFS
Configuring Low-Latency Match Service
Configuring PostgreSQL
Configuring HTTPS
Configuring the Spark Environment
Working with the YARN Cluster Manager
Installing and Configuring Auxiliary Services
Configuring HBase
Configuring Core Connect
Command Reference
Configuration Variable Reference
Logging
Logging in Single-Node Deployments
Logging in Cloud-Native Deployments
User Interface Logs
Reading and Navigating Logs
Using Logs for Troubleshooting
Validation, Upgrades, and Backups
Using the Formula Option
Utilities for Validation and System Processes
Upgrading Tamr Core
Upgrading PostgreSQL
Backup
Restore
AWS Backup and Restore
System Health Status
User Authentication
LDAP Authentication and Authorization
SAML Authentication
Monitoring
Metrics
Transformations
Overview
Getting Help with Transformations
Managing Primary Keys
Data Types and Transformations
Using Metadata in Transformations
Working with Transformations for Geospatial Data
Using Fill, Formula, MultiFormula, and Unpivot
Using the Fill Option
Using the MultiFormula Option
Using the Unpivot Option
Writing Transformation Scripts
Example Tasks for Transformation Scripts
Working with Statements
Aggregating Records
Disaggregating Records
Removing Records
Referencing Other Datasets
Checkpoint
Drop
Explode
Filter
Group By
Join
Lookup
Merge
Order
Pivot
Repartition
Rows
Sample
Select
Union All
Unpivot
Use
Window
Statement Modifiers
Working with Expressions
Aggregating Expressions
Arithmetic Expressions
Case
Spread
Using Logical Comparators
Working with Dates
Working with Regular Expressions
Functions
General Functions
Map, Filter, and Reduce
Array Functions
ARRAY.OF
Mathematical Functions
Aggregate Functions
GIS Functions
Tips for Troubleshooting Transformations
Geospatial Data (Limited Release)
Working with Geospatial Data
Geospatial Data Types
Tamr Python Client
Tamr Python Client
Tamr Toolbox
Tamr Toolbox
Playbooks
Mastering Playbook
Mastering Pipeline
Bulk Matching External Records
Low-Latency Matching External Records
Categorization Pipeline
Configuration Variable Reference
Complete list of Tamr Core configuration variables.
Suggest Edits