User GuidesAPI ReferenceRelease Notes
Doc HomeHelp CenterLog In
User Guides

Union All

UNION ALL concatenates tables together.

For example, if you have a table for every state, you can use UNION ALL to concatenate all of them into a single table with values for the whole country.

UNION ALL WITH table1, table2, ...;

The schema of each table is carried into the unioned table; no attribute is dropped. If the initial schemas match, then the unioned table has the same exact schema. If there is an attribute in one schema that is different, then the unioned table includes it at the end of the schema.

Note: Matching attributes must have matching types. If the schemas have a matching attribute, the attribute must be the same type in each schema for them to be unioned correctly.

important Important: Changing a UNION ALL statement can result in an unintentional change to tamr_id values in the unified dataset. To avoid data loss and maintain the stability of tamr_id in mastering and categorization projects, as a best practice Tamr recommends that you explicitly set the tamr_id to a value you're sure is stable in the last transformation. See Managing Primary Keys. In addition, both the origin_source_name and origin_entity_id must exist in the datasets being unioned or you must create and populate them with strings.

UNION ALL Example

For example, you have Massachusetts and Vermont datasets containing resident information, and you want to union them.

Massachusetts.csv contains:

IDNameSurname
002JohnSmith
003SarahBrown
004AlexJones

Vermont.csv contains:

First_NameSurname
AbbyElliot
BenAnderson
CarolBaldwin

To union these two tables together, you write a script such as:

USE Massachusetts.csv; UNION ALL WITH Vermont.csv

After the transformation runs, the new, unioned table contains:

IDNameSurnameFirst_Name
002JohnSmith
003SarahBrown
004AlexJones
ElliotAbby
AndersonBen
BaldwinCarol