Uploading a Dataset Into Tamr
Upload a dataset into Tamr.
Dataset File Format
Format: Comma separated values,
.csv
. Defaults for the delimiter, quote and escape characters are,
,"
and"
, respectively. To configure these, select Show advanced CSV options in the dataset upload window.Encoding: UTF-8, or UTF-8 with BOM.
Header: A file must contain a header row.
Dataset Primary Key
Datasets must have a primary key column for Tamr to reference individual records. Each primary key value must not be
null
and is expected to be unique. If Tamr encounters a record that has duplicate primary key values, it overwrites it.When you upload datasets without a primary key, Tamr generates a Primary Key attribute and populates it using internally-generated values that are guaranteed to be unique. You must reference these IDs when updating the dataset’s records using the Tamr API. See Modify a Dataset's Records.
To upload a dataset into Tamr:
- Navigate to the Dataset Catalog tab.
- Select Add New Dataset.
- Select Choose File and then choose the
.csv
file. - Optionaly, choose Show advanced CSV options to configure the delimiter, quote, and escape characters.
- Optionally, choose Description and edit it.
- Choose Save.
Access to a DataSet
To ensure that users in your group can access your dataset, attach a policy to it.
Updated over 5 years ago