CanadianDataGuy’s No Fluff Newsletter
Subscribe
Sign in
Home
Notes
TL;DR
Deep Dive
Blogs on Medium
Youtube
Whatsapp Community
About
Latest
Top
Discussions
A Deep Dive into Skewed Joins, GroupBy Bottlenecks, and Smart Strategies to Keep Your Spark Jobs Flying
Unlock comprehensive, practical solutions to conquer data skew in Apache Spark—step-by-step from basics to advanced strategies for perfectly balanced…
Jun 6
•
Canadian Data Guy
3
Share this post
CanadianDataGuy’s No Fluff Newsletter
A Deep Dive into Skewed Joins, GroupBy Bottlenecks, and Smart Strategies to Keep Your Spark Jobs Flying
Copy link
Facebook
Email
Notes
More
1
May 2025
Decode the Join: A Spark Data Engineer’s Visual Handbook
Understand when and why to use Broadcast, Shuffle, or Sort-Merge Joins in Spark— with clear visuals, real-world use cases, and strategy tips tailored…
May 9
•
Canadian Data Guy
and
Harathi Pasam
11
Share this post
CanadianDataGuy’s No Fluff Newsletter
Decode the Join: A Spark Data Engineer’s Visual Handbook
Copy link
Facebook
Email
Notes
More
4
How to Read Delta Log Statistics (and Why You Should)
Learn how to extract and validate column-level stats from your Delta Lake logs to optimize performance and debug configurations
May 2
•
Canadian Data Guy
5
Share this post
CanadianDataGuy’s No Fluff Newsletter
How to Read Delta Log Statistics (and Why You Should)
Copy link
Facebook
Email
Notes
More
2:47
April 2025
When Data Engineering Met AI
Teaching AI to Play Nice in Data Engineering
Apr 26
•
Canadian Data Guy
Share this post
CanadianDataGuy’s No Fluff Newsletter
When Data Engineering Met AI
Copy link
Facebook
Email
Notes
More
5:03
Why Your PySpark UDF Is Slowing Everything Down
An in-depth exploration of architecture, execution flow, bottlenecks, and optimization strategies for PySpark UDFs
Apr 24
•
Canadian Data Guy
3
Share this post
CanadianDataGuy’s No Fluff Newsletter
Why Your PySpark UDF Is Slowing Everything Down
Copy link
Facebook
Email
Notes
More
What a Netflix Senior Data Engineer Taught Us About Winning in Tech—And It’s Not What You Think
Spoiler: Tech is easy. Business is hard. And your ability to communicate might just be your biggest flex
Apr 17
•
Canadian Data Guy
Share this post
CanadianDataGuy’s No Fluff Newsletter
What a Netflix Senior Data Engineer Taught Us About Winning in Tech—And It’s Not What You Think
Copy link
Facebook
Email
Notes
More
How Do I Think About Setting Spark Shuffle Partitions in 2025?
TLDR: A Quick Guide to setting Spark.Shuffle.Partitions, No Deep Dive Required
Apr 15
•
Canadian Data Guy
2
Share this post
CanadianDataGuy’s No Fluff Newsletter
How Do I Think About Setting Spark Shuffle Partitions in 2025?
Copy link
Facebook
Email
Notes
More
Spark Join Strategies Explained: Broadcast Hash Join
Everything You Need to Know About Broadcast Hash Join
Apr 14
•
Canadian Data Guy
3
Share this post
CanadianDataGuy’s No Fluff Newsletter
Spark Join Strategies Explained: Broadcast Hash Join
Copy link
Facebook
Email
Notes
More
One Spark Function to Rule All Your Custom Streaming Needs
Understand the power of the foreachBatch function, incremental processing, and S3 integration in this hands-on intro to Spark Streaming.
Apr 10
•
Canadian Data Guy
Share this post
CanadianDataGuy’s No Fluff Newsletter
One Spark Function to Rule All Your Custom Streaming Needs
Copy link
Facebook
Email
Notes
More
3:38
Spark Join Strategies Explained: Shuffle Hash
Everything You Need to Know About Shuffle Hash Join
Apr 10
•
Canadian Data Guy
1
Share this post
CanadianDataGuy’s No Fluff Newsletter
Spark Join Strategies Explained: Shuffle Hash
Copy link
Facebook
Email
Notes
More
Spark Join Strategies Explained: Sort Merge Join
Slow and Steady always wins the race
Apr 10
•
Canadian Data Guy
1
Share this post
CanadianDataGuy’s No Fluff Newsletter
Spark Join Strategies Explained: Sort Merge Join
Copy link
Facebook
Email
Notes
More
March 2025
Your Degree Isn't Enough: How to Actually Break Into Data
Practical Tips for Building Real Experience, Networking Authentically, and Winning Interviews
Mar 20
•
Canadian Data Guy
Share this post
CanadianDataGuy’s No Fluff Newsletter
Your Degree Isn't Enough: How to Actually Break Into Data
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts