
Uploading a Dataset Into Tamr

Upload a dataset into Tamr.

📘

Dataset File Format

Format: Comma-separated values (.csv). By default, the delimiter is a comma (,), and both the quote character and the escape character are a double quotation mark ("). To change these, select Show advanced CSV options in the dataset upload window.

Encoding: UTF-8, or UTF-8 with BOM.

Header: A file must contain a header row.
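
The requirements above can be checked locally before you upload a file. The following is a minimal sketch using only the Python standard library; it is not part of Tamr, and the function name `read_header` is hypothetical. It assumes that an escape character equal to the quote character means a quote inside a quoted field is written twice, which is how Python's `csv` module models it with `doublequote`. It can only confirm that a first row exists, not that the row is really a header.

```python
import csv
import io


def read_header(raw, delimiter=",", quote='"', escape='"'):
    """Return the first row of CSV bytes, or raise ValueError.

    Hypothetical helper for a local pre-upload check; not a Tamr API.
    """
    try:
        # "utf-8-sig" accepts plain UTF-8 and strips a leading BOM if present.
        text = raw.decode("utf-8-sig")
    except UnicodeDecodeError as exc:
        raise ValueError("file is not valid UTF-8") from exc

    # Assumption: escape == quote means embedded quotes are doubled ("").
    doubled = escape == quote
    reader = csv.reader(
        io.StringIO(text, newline=""),
        delimiter=delimiter,
        quotechar=quote,
        doublequote=doubled,
        escapechar=None if doubled else escape,
    )
    header = next(reader, None)
    if not header or all(not name.strip() for name in header):
        raise ValueError("file has no header row")
    return header
```

For example, `read_header(open("customers.csv", "rb").read())` would return the column names of a hypothetical `customers.csv`, or raise `ValueError` if the file is empty or not UTF-8.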

🚧

Dataset Primary Key

Datasets must have a primary key column so that Tamr can reference individual records. Each primary key value must be non-null and is expected to be unique. If Tamr encounters a record whose primary key value duplicates that of an existing record, it overwrites the existing record.

When you upload a dataset without a primary key, Tamr generates a Primary Key attribute and populates it with internally generated values that are guaranteed to be unique. You must reference these IDs when updating the dataset’s records using the Tamr API. See Modify a Dataset's Records.
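
Because duplicate primary key values cause records to be overwritten, it can help to look for empty and repeated keys before uploading. The sketch below is one way to do that with the Python standard library; the function name `check_primary_key` and the column name `id` are hypothetical, and treating an empty string as a null value is an assumption, since CSV has no native null.

```python
from collections import Counter


def check_primary_key(rows, key):
    """Return (row numbers with an empty key, sorted duplicated key values).

    `rows` is an iterable of dicts, such as the output of csv.DictReader.
    Row numbers count data rows from 1 and exclude the header row.
    """
    empty_rows = []
    counts = Counter()
    for number, row in enumerate(rows, start=1):
        value = row.get(key)
        # Assumption: a missing column or an empty string counts as null.
        if value is None or value == "":
            empty_rows.append(number)
        else:
            counts[value] += 1
    duplicates = sorted(value for value, n in counts.items() if n > 1)
    return empty_rows, duplicates
```

To run it on a file, pass `csv.DictReader(open(path, newline="", encoding="utf-8-sig"))` as `rows`. Two empty results suggest that the column is usable as a primary key.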

To upload a dataset into Tamr:

  1. Navigate to the Dataset Catalog tab.
  2. Select Add New Dataset.
  3. Select Choose File and then choose the .csv file.
  4. Optionally, choose Show advanced CSV options to configure the delimiter, quote, and escape characters.
  5. Optionally, choose Description and edit it.
  6. Choose Save.

📘

Access to a Dataset

To ensure that users in your group can access your dataset, attach a policy to it.

