Canadian Data Guy Unfiltered
Inside Delta Lake’s Idempotency Magic: The Secret to Exactly-Once Spark
Learn how txnAppId and epochId work together to create a bulletproof distributed two-phase commit. Achieve true exactly-once semantics for your production pipelines
Latest · 6:18
Most Popular
How to Choose Between Liquid Clustering and Partitioning with Z-Order in Databricks
Jan 15 • Canadian Data Guy and Geethu
Decode the Join: A Spark Data Engineer’s Visual Handbook
May 9, 2025 • Canadian Data Guy and Harathi Pasam
Spark Join Strategies Explained: Broadcast Hash Join
Apr 14, 2025 • Canadian Data Guy
A Deep Dive into Skewed Joins, GroupBy Bottlenecks, and Smart Strategies to Keep Your Spark Jobs Flying
Jun 6, 2025 • Canadian Data Guy
Latest
Unlocking Sub-Second Latency with Databricks
How Spark Real-Time Mode Achieves Millisecond Latency with a Simple Trigger Switch
Jan 14 • Canadian Data Guy • 17:55
I Knew the Answer. I Just Couldn’t Remember It.
How you can turn your notes into a personal Knowledge Agent, no code required
Jan 10 • Canadian Data Guy • 13:41
How to Choose Between Liquid Clustering and Partitioning with Z-Order in Databricks
The views expressed in this blog are my own and do not represent official guidance from Databricks
Jan 15 • Canadian Data Guy and Geethu
Stop Waiting for Connectors: Stream ANYTHING into Spark (It's 4 Functions)
How to ingest data from any source into Apache Spark, demystified with a real-world example of blockchain ingestion
Nov 3, 2025 • Canadian Data Guy and Yogita Nesargi • 26:05
4 Surprising Truths That Will Change How You Think About Spark Streaming
Spark gives you Real-Time without the complexity and pain
Dec 15, 2025 • Canadian Data Guy
Why I Materialize Delta History for Debugging
Just a Quick Tip
Nov 27, 2025 • Canadian Data Guy
How to write your first Spark application with Stream-Stream Joins with working code
A Practical, Hands-On Guide to Joining Real-Time Data Streams in Spark Structured Streaming
Oct 15, 2025 • Canadian Data Guy
Build an Ethereum ETL Pipeline for Free Using Databricks Free Edition
Build a zero-infrastructure streaming pipeline: Step-by-step Ethereum data ingestion, schema evolution, and Delta storage
Sep 23, 2025 • Yogita Nesargi
How to ace and structure your Data Modelling Interview
Prescriptive guidance for conducting your Data Modelling Interview
Jun 18, 2025 • Canadian Data Guy
Canadian Data Guy Unfiltered
The engineer who writes documentation-grade deep dives with production code you can run today
Recommendations
Databricksters
Canadian Data Guy