Sync data from
Databricks to Apache Kafka
Connect your data from Databricks to Apache Kafka with Hightouch. No APIs, no months-long implementations, and no CSV files. Just your data synced forever.
Trusted by data teams at
Trusted by data teams at
Integrate your data in 3 easy steps
Add your source and destination
Connect to 15+ data sources, like Databricks, and 150+ destinations, like Apache Kafka.
Define your model
Use SQL or select an existing dbt or Looker model.
Sync your data
Define how fields from your model map to Apache Kafka, and start syncing.
Model your Databricks data using any of these methods
dbt Model Selector
Sync directly with your dbt models saved in a git.
Create and Edit SQL from your browser. Hightouch supports SQL native to Databricks.
Select available tables and sheets from Databricks and sync using existing views without having to write SQL.
For less technical users, pass traits and audiences from Databricks using our visual segmentation builder.
Does this integration support in-warehouse planning?
Yes, if you integerate Databricks and Apache Kafka using Hightouch, in-warehouse planning is supported.
Great, but what is in-warehouse planning?
Between every sync, Hightouch notices any and all changes in your data model. This allows you to only send updated results to your destination (in this case Apache Kafka). With the baseline setup, Hightouch picks out only the rows that need to be synced by querying every row in your data model before diffing using Hightouch’s infrastructure.
The issue here is this can be slow for large models.
Warehouse Planning allows Hightouch to do this diff directly in your warehouse. Read more on how this works here.
Publish messages into different topics whenever rows are added, changed, or removed in your data models.
Compose your messages using SQL or our Liquid-based templating engine, which supports variable injection, control flow, and loops.
Define custom ordering and partition keys.
Authenticate with SASL (SCRAM, AWS IAM, etc.) and bring your own certificate authority.
Hightouch supports all managed Kafka services (e.g., Amazon MSK and Confluent Cloud), as well as self-hosted instances.
Databricks is a data science and analytics platform built on top of Apache Spark. Databricks implement the Data Lakehouse concept in a single unified, cloud based platform.Learn more about Databricks
About Apache Kafka
This integration is part of the Hightouch custom destination toolkit, a suite of developer-focused destinations that make it easy to build custom connectors.Learn more about Apache Kafka
Other Databricks Integrations
Other Apache Kafka integrations
Hightouch Playbooks: Best practices to leverage reverse ETL
Read more about Hightouch
What is Operational Analytics & Why You Should Use It
Operational Analytics shifts the focus from simply understanding data to taking action on it in the tools that run business processes. Instead of using dashboards to make decisions, Operational Analytics is focused on turning insights into action – automatically.Read
Activate data to any of your marketing and advertising tools
This might be one of the greatest inventions for technical marketers since the advent of legacy CDPs back in 2015.
Head of Marketing Technology
Your data is always secure
SOC 2 Type 2 compliant
Your data stays secure, available, and confidential. To see our report, .
If you’re in the EU, your data is only processed on EU data centers.
Healthcare companies like ThirtyMadison, Chapter Health, and Headway trust Hightouch.
To see our DPA (Data Processing Addendum), .
increase in return on ad spend
improvement in email engagement
lift in customer acquisition