Databricks Clean Rooms Integration Guide
This guide is for advertisers and data providers who want to convert their user data to raw UID2s in a Databricks environment.
Integration Overview
This solution enables you to securely share consumer identifier data without exposing sensitive directly identifying information (DII), by processing your data in an instance of the Databricks Clean Rooms feature. This feature provides a secure and privacy-protecting environment for working on sensitive data.
When you've set up the Databricks Clean Rooms environment, you establish a trust relationship with the UID2 service and allow the service to convert your data, which you share in the clean room, to raw UID2s.
Functionality
The following table summarizes the functionality available with the UID2 Databricks integration.
| Encrypt Raw UID2 to UID2 Token for Sharing | Decrypt UID2 Token to Raw UID2 | Generate UID2 Token from DII | Refresh UID2 Token | Map DII to Raw UID2s |
|---|---|---|---|---|
| — | — | — | — | ✅ |
Key Benefits
Here are some key benefits of integrating with Databricks for your UID2 processing:
- Native support for managing UID2 workflows within a Databricks data clean room.
- Secure identity interoperability between partner datasets.
- Direct lineage and observability for all UID2-related transformations and joins, for auditing and traceability.
- Streamlined integration between UID2 identifiers and The Trade Desk activation ecosystem.
- Self-service support for marketers and advertisers through Databricks.
Integration Steps
At a high level, the following are the steps to set up your Databricks integration and process your data:
- Create a clean room for UID2 collaboration.
- Send your Databricks sharing identifier to your UID2 contact.
- Add data to the clean room.
- Map DII by running the clean room notebook.
Create Clean Room for UID2 Collaboration
As a starting point, create a Databricks Clean Rooms environment—a secure environment for you to collaborate with UID2 to process your data.
Follow the steps in Create clean rooms in the Databricks documentation. Use the correct sharing identifier based on the UID2 environment you want to connect to: see UID2 Sharing Identifiers.
After you've created a clean room, you cannot change its collaborators. If you have the option to set clean room collaborator aliases—for example, if you’re using the Databricks Python SDK to create the clean room—your collaborator alias must be creator and the UID2 collaborator alias must be collaborator. If you’re creating the clean room using the Databricks web UI, the correct collaborator aliases are set for you.
Send Sharing Identifier to UID2 Contact
Before you can use the clean room notebook, you'll need to send your Databricks sharing identifier to your UID2 contact.
The sharing identifier is a string in this format: <cloud>:<region>:<uuid>.
Follow these steps:
-
Find the sharing identifier for the Unity Catalog metastore that is attached to the Databricks workspace where you’ll work with the clean room.
For information on how to find this value, see Finding a Sharing Identifier.
-
Send the sharing identifier to your UID2 contact.
Add Data to the Clean Room
Add one or more tables or views to the clean room. You can use any names for the schema, tables, and views. Tables and views must follow the schema detailed in Input Table.