Canadian Data Guy Unfiltered
How to Choose Between Liquid Clustering and Partitioning with Z-Order in Databricks
The views expressed in this blog are my own and do not represent official guidance from Databricks
Jan 15 • Canadian Data Guy and Geethu
I Knew the Answer. I Just Couldn’t Remember It.
How you can turn your notes into a personal Knowledge Agent — no code required
Jan 10 • Canadian Data Guy
December 2025
4 Surprising Truths That Will Change How You Think About Spark Streaming
Spark gives you Real-Time without the complexity and pain
Dec 15, 2025 • Canadian Data Guy
November 2025
Why I Materialize Delta History for Debugging
Just a Quick Tip
Nov 27, 2025 • Canadian Data Guy
Stop Waiting for Connectors: Stream ANYTHING into Spark (It's 4 Functions)
How to ingest data from any source into Apache Spark, demystified with a real-world example of blockchain ingestion
Nov 3, 2025 • Canadian Data Guy and Yogita Nesargi
October 2025
How to write your first Spark application with Stream-Stream Joins with working code
A Practical, Hands-On Guide to Joining Real-Time Data Streams in Spark Structured Streaming
Oct 15, 2025 • Canadian Data Guy
September 2025
Build an Ethereum ETL Pipeline for Free Using Databricks Free Edition
Build a zero-infrastructure streaming pipeline: Step-by-step Ethereum data ingestion, schema evolution, and Delta storage
Sep 23, 2025 • Yogita Nesargi
June 2025
How to ace and structure your Data Modelling Interview
Prescriptive guidance for conducting your Data Modelling Interview
Jun 18, 2025 • Canadian Data Guy
A Deep Dive into Skewed Joins, GroupBy Bottlenecks, and Smart Strategies to Keep Your Spark Jobs Flying
Unlock comprehensive, practical solutions to conquer data skew in Apache Spark—step-by-step from basics to advanced strategies for perfectly balanced…
Jun 6, 2025 • Canadian Data Guy
May 2025
Decode the Join: A Spark Data Engineer’s Visual Handbook
Understand when and why to use Broadcast, Shuffle, or Sort-Merge Joins in Spark, with clear visuals, real-world use cases, and strategy tips tailored…
May 9, 2025 • Canadian Data Guy and Harathi Pasam
April 2025
Why Your PySpark UDF Is Slowing Everything Down
An in-depth exploration of architecture, execution flow, bottlenecks, and optimization strategies for PySpark UDFs
Apr 24, 2025 • Canadian Data Guy
What a Netflix Senior Data Engineer Taught Us About Winning in Tech—And It’s Not What You Think
Spoiler: Tech is easy. Business is hard. And your ability to communicate might just be your biggest flex
Apr 17, 2025 • Canadian Data Guy