The Hightouch Blog - For data and analytics engineers
Oct 17
Predictable, painless, powerful: How to migrate to Hightouch Events
A step-by-step plan for making your event collection migration seamless
Should you be migrating to an Iceberg Lakehouse?
Apache Iceberg might be the coolest topic in data right now. When should your organization seriously consider migrating?
Introducing Hightouch’s Privado integration: enable proactive privacy
Privado provides full personal data visibility and privacy governance across the tech stack.
Power real-time use cases from Google Cloud’s BigQuery with continuous queries
Announcing our integration with BigQuery continuous queries
The ultimate guide to data modeling
Learn why data modeling underpins everything you do in your business and is essential for driving business outcomes.
What is data collection?
Learn why data collection is important for powering your analytics and business use cases.
What is marketing analytics? (Everything you need to know)
Learn why marketing analytics is so important and how to use it to increase the ROI across your marketing channels.
Governance shouldn’t be a four-letter word
Our commitment to deeply invest in features that make it easier than ever to govern data effectively.
Iceberg and the rise of the Lakehouse
Apache Iceberg won’t just change data storage. It’s going to change how data works in every industry.
What is product analytics?
Learn how to leverage product analytics to understand behavioral data to deliver personalized experiences to your customers.
Introducing: Identity Resolution Waterfalls and Profile Inspector
Two powerful new features that make the best warehouse-native identity solution even better.
What is web analytics? (Use cases + tools)
Learn how you can use web analytics to build personalized experiences for your users and increase conversions on your website.
What are data contracts & how do they work?
Master the art of Data Contracts to safeguard your data quality and improve the impact data has on your business.
What is behavioral data? The complete guide
Learn everything you need about behavioral data and how to leverage it to predict customer behavior and deliver personalized experiences.
Don’t let BI become another data silo
Move past analytics and get to the action with Data Activation. Data activation is the way modern data teams empower business users with data.
How I made $100,000 as a 16-year-old software engineer
Read about how Tejas became a software engineer and landed his role at Segment, earning $100,000 at just 16.
How dbt created analytics engineering
Learn how a little-known consultancy created dbt and an entirely new category in the data world.
Introducing Data Contracts: collect events correctly the first time
Never collect bad data again. Data contracts help you collect the behavioral customer data you need.
It's time to end the build vs buy debate.
How the rapid adoption of composability changes the age-old Build vs Buy debate
How we reduced our Snowflake compute costs
How we uncovered 4-figure dollar savings monthly, and 5-figure dollar savings annually with 2 simple actions
What is clickstream data? (Tools + use cases)
Learn how to leverage clickstream to deliver better user experiences.
What is an identity graph?
Identity graphs are a powerful tool to navigate the complexities of customer data in the digital age. Discover how identity graphs can empower your marketing efforts and unlock valuable customer insights.
How Amazon Redshift pricing works: everything you need to know
Learn everything there is to know about Redshift pricing so you understand your usage and costs.
Modernizing ETL in the cloud data warehouse
Why data teams should replace legacy ETL with a combination of ELT and Reverse ETL.
What is entity resolution?
Learn why and how you should deduplicate and consolidate data for the key concepts that drive your business.
What is data curation? (Examples and use cases)
Learn the 8-step data curation framework to ensure your data assets are of high quality, usable, and accessible across teams and organizations.
Embracing data warehouse layers: how to build scalable data modeling
How to leverage the data warehouse layers for efficient data engineering and data activation with just the right amount of abstraction.
What is data governance? How to design and implement a framework
Find out how you can design and implement a data governance framework.
Data downtime destroys value
As you unlock more value from data, downtime becomes more costly. A Monte Carlo survey reveals data downtime has increased nearly twofold year over year.
The seven stages of data lifecycle management
Learn about each of the seven stages of data lifecycle management and how it can reduce costs and increase data quality.
How to set up lead scoring in Salesforce
Learn how you can start scoring your leads and accounts in Salesforce.
The definitive guide to HubSpot lead scoring
Learn how you can set up manual and predictive lead scoring in HubSpot.
How to set up the CRM Snowflake Connector
We have compiled the complete guide to setting up the Snowflake Connector tool to get data into Salesforce. Read on to get a step-by-step guide on the tool's implementation, benefits, and drawbacks.
Welcome to the Data Movement Movement
How Hightouch and Fivetran work together to enable leading organizations with the capabilities necessary to move data at scale.
The definitive guide to the Snowflake Marketplace
Learn how data consumers and data providers are leveraging the Snowflake Marketplace to build data products and enrich their Snowflake data with third-party datasets.
What is Apache Airflow?
Find out what Airflow is and how it's used to orchestrate data pipelines.
What is a data dictionary?
Learn everything there is to know about data dictionaries so you can leverage them to provide relevant context and create transparency across teams.
Why team matters more than tech: an enterprise framework for customer data acceleration
Actable's detailed framework for accelerating enterprise-level customer data initiatives and driving business outcomes.
What is an ETL pipeline?
Learn why an ETL pipeline is essential for data analytics and data activation.
A technical deep dive into Snowflake pricing
Learn everything there is to know about Snowflake pricing so you can estimate your usage and costs.
What's the difference between a data warehouse and a database?
Learn the differences between a data warehouse and a database, including the different types and use cases of each.
Indeed’s Darrell Alfonso on the cloud data warehouse’s role in breaking customer data out of silos
Darrell Alfonso, Director of Marketing Strategy and Operations at Indeed, discusses the flexibility and power of the cloud data warehouse and how MarTech leaders can bridge technical and marketing teams.
M1’s Jake Larson on the future of marketing: how the cloud data warehouse is a game changer
Jake Larson, MarTech leader at M1, discusses how the cloud data warehouse has transformed his marketing organization and M1’s customer experiences for the better.
How to set up dataloader.io
Learn how to leverage dataloader.io to sync data into Salesforce and why it may not be the best option for you.
What is data observability and how does it impact data teams?
Learn how data observability can produce better data products and reduce data downtime.
How do the 6 Snowflake data types work?
Learn about the 6 different data types that you can use in Snowflake.
What is data discovery and how can it benefit you?
Find out how you can leverage data discovery to build trust and create better data products.
Usage-based billing is way too hard - how the warehouse makes it easy
Transform your B2B SaaS billing with Hightouch's automated solution.
What is data integration and how does it work?
Learn what data integration is and how it's used to extract, load, and transform data for analytics and activation.
The five steps for data analysis
Learn why data analysis is so important and what the five steps for data analysis are.
What is data transformation and how does it work?
Learn what data transformation is and how it's used to modify, reformat, cleanse, and restructure data.
Top 10 skills to learn as a data engineer
Data engineer is an in-demand job. Find out the top 10 skills you must know as a data engineer.
What is data taxonomy?
Find out what data taxonomy is and how it can benefit your data and your business.
A practical guide to using the new Hightouch Fivetran extension
How we automated a near real-time end-to-end pipeline in 15 minutes, without an orchestrator
What is data replication?
Find out what data replication is, the benefits it can give your business, and the different types depending on the platform.
What's in store for data teams in 2023?
Here’s what three dozen industry practitioners had to say.
The definitive guide to Product-Qualified Leads (PQLs)
Learn why you should care about product-qualified leads and how you can build a product-qualified lead score to increase conversions.
What is operational analytics & why you should use it
Operational Analytics shifts the focus from simply understanding data to taking action on it in the tools that run business processes. Instead of using dashboards to make decisions, Operational Analytics is focused on turning insights into action – automatically.
The ultimate guide to B2B SaaS metrics & how to calculate them
Find out what B2B SaaS metrics you should be measuring and how to calculate them.
Data mapping tools: what is the best option for your business?
Learn everything there is to know about data mapping and discover which tool is right for your business.
Data teams need to break out of their bubble
Data teams and stakeholders are often deeply siloed from each other, and the data team/marketing team divide is a powerful example of this dysfunction.
Reverse ETL has crossed the chasm
All signs point to widespread industry validation and a rapid acceleration in adoption.
How to get more out of your analytics
Leverage Data Activation to help your analytics team drive action and business impact.
How to calculate a PQL (Product Qualified Lead) in SQL
Learn how you can calculate a product qualified lead in SQL.
How to calculate MRR (Monthly Recurring Revenue) in SQL
Learn how you can calculate monthly recurring revenue in SQL.
How to calculate LTV (Lifetime Value) in SQL
Learn how you can calculate lifetime value score in SQL.
The definitive guide to Tableau CRM
Learn how you can leverage Tableau CRM for data integration, predictive analytics, and cloud-based visualization.
How to calculate a SQL (Sales Qualified Lead) in SQL
Learn how you can calculate a sales qualified lead score in SQL.
How to calculate an MQL (Marketing Qualified Lead) in SQL
Learn how you can calculate a marketing qualified lead in SQL.
How to calculate ARR (Annual Recurring Revenue) in SQL
Learn how you can calculate your annual recurring revenue in SQL.
Power your product experiences with Hightouch
Combine the analytical power of your warehouse with the low-latency performance of your application databases to build real-time, personalized customer experiences.
Building a product-led growth engine on top of the data warehouse
Learn how we evolved our PLG engine to take advantage of our biggest asset: our data warehouse.
How to measure the impact of your data team
Learn why you should stop focusing on metrics and why you should start tracking outcomes to measure the impact of your data team.
Understanding data enrichment: a comprehensive guide
Learn what data enrichment is, why it matters, and how you can implement it today.
Should you learn dbt?
A look at why everyone seems to be learning dbt, and some practical tips on getting started
Data apps: fad or trend?
Learn about the future of Data Apps, the advantages and disadvantages, and whether or not they are here to stay.
Heroku Connect: the definitive guide
Learn everything there is to know about Heroku Connect and why Reverse ETL is a better alternative.
The definitive guide to API integrations
Learn everything you need to know about API Integrations including API types, authentication, reading, writing, deployment, monitoring, and scheduling.
Why your alerting should be configurable
Learn how you can adopt configurable alerting across your entire data stack.
What is data ingestion? | The definitive guide
Learn what data ingestion is, why it matters, and how you can use it to power your analytics and activate your data.
Airflow alternatives: a look at Prefect and Dagster
We take a deep dive into Airflow, Prefect, and Dagster and the differences between the three!
ETL vs. Reverse ETL: the technical differences
Discover the technical differences between Reverse ETL and ETL/ELT and learn how they work behind the scenes.
Azure Synapse vs Snowflake: the definitive guide
Discover the key differences between Azure Synapse and Snowflake around architecture, pricing, security, compliance, data support, administration, data protection, performance, and more.
7 useful Zapier integrations or “Zaps”
Review the 7 most useful Zapier integrations or "Zaps" and learn why Reverse ETL is a simpler solution.
The top 5 most useful Workato integrations & recipes
Review the top 5 Workato integrations and recipes and learn why Reverse ETL is a simpler solution.
The top 5 most useful Tray.io integrations & recipes
Review the top 5 Tray.io integrations and recipes and learn why Reverse ETL is a simpler solution.
Why every tool in the Modern Data Stack needs Git
Leverage the power of software development and DevOps and implement the same best practices you use in your production code in the context of data integration with Git.
How to enrich HubSpot data in 6 steps
Learn how Hightouch can help you enrich your HubSpot data in 6 easy steps.
How to enrich Salesforce data in 6 steps
Learn how Hightouch can help you enrich your Salesforce data in 6 easy steps.
A deep dive into Hex and the future of data apps
Learn about Hex, a collaborative data analysis platform, and the future of data apps built on top of the data warehouse
Looker Actions: the definitive guide
Learn everything about Looker Actions, how it automates workflows to take action on data, where it falls flat, and the alternatives for your data needs.
Matillion: the definitive guide
Learn everything about Matillion and Matillion Data Loader, how they offer ETL & ELT services to solve data integration problems, where they fall flat, and alternatives for your data needs.
The state of automated testing in the data warehouse
A deep dive into automated data testing, why it matters and the top tools to consider
Exploring data lineage with OpenLineage
In this post, we ask what data lineage is and take a detailed dive into OpenLineage and how it aims to unify metadata and lineage across tools to make data lineage easier to reason about.
Automate your external reporting with the Modern Data Stack
With the right infrastructure, reporting can become truly routine. Even more than that, it’s a chance to demonstrate your value to the company and customers.
What is a data Lakehouse?
The Data Warehouse and the Data Lake both have their strengths and weaknesses. Like yin and yang, they often coexist within the same data stack and this has given rise to a hybrid category — the Data Lakehouse!
Why data engineers should not manage dbt
dbt transformations are critical and should be owned by data analysts and scientists because they have the most context to make the data a lot more usable for business teams.
Data engineers shouldn't write Reverse ETL: a guide to building a happy data engineering department
This guide sheds light on the state of data engineering in 2021 and discusses the rise of Reverse ETL as a core component of the Modern Data Stack.
Modern data warehouse modelling: the definitive guide - part 2
This guide on modern data warehouse modelling explores the current sentiment toward Kimball and sheds some light on Wide Tables and what the data community thinks of them.
Modern data warehouse modelling: the definitive guide - part 1
A guide on modern data warehouse modelling, exploring best practices from the community and famous modelling paradigms like Kimball’s Dimensional Modelling, Inmon, Data Vault and Wide Tables.
dbt Snapshots: the definitive guide
One of the most important questions that any analytics-focused company should strive to answer is “How has my data changed over time?” dbt provides a simple solution to this exact problem: dbt snapshots.
dbt Cloud: 4 reasons for data teams to embrace it
The biggest benefit that dbt Cloud offers to data teams and analytics engineers? Freedom from distractions, and the ability to focus where you can add unique value making sense of your company's data.
How Datadog operationalizes their data warehouse to supercharge their business teams
In conversation with Romoli Bakshi, Engineering Team Lead at Datadog, who walks us through the process her team follows to operationalize their data warehouse in order to supercharge their business teams.
How to monitor customer usage in Slack via SQL
The Slack integration on Hightouch enables you to pull the resources and metrics important to your business via SQL and then get notified when these resources and metrics change, right within Slack of course!