CanadianDataGuy’s No Fluff Newsletter

Sitemap - 2025 - CanadianDataGuy’s No Fluff Newsletter

Why I Materialize Delta History for Debugging

Stop Waiting for Connectors: Stream ANYTHING into Spark (It's 4 Functions)

How to write your first Spark application with Stream-Stream Joins with working code

How Spark Structured Streaming Recovers After Failures

Build an Ethereum ETL Pipeline for Free Using Databricks Free Edition

How Many Spark Streaming Jobs Can You REALLY Run on One Cluster?

How to ace and structure your Data Modelling Interview

A Deep Dive into Skewed Joins, GroupBy Bottlenecks, and Smart Strategies to Keep Your Spark Jobs Flying

Decode the Join: A Spark Data Engineer’s Visual Handbook

How to Read Delta Log Statistics (and Why You Should)

When Data Engineering Met AI

Why Your PySpark UDF Is Slowing Everything Down

What a Netflix Senior Data Engineer Taught Us About Winning in Tech—And It’s Not What You Think

How Do I Think About Setting Spark Shuffle Partitions in 2025?

Spark Join Strategies Explained: Broadcast Hash Join

One Spark Function to Rule All Your Custom Streaming Needs

Spark Join Strategies Explained: Shuffle Hash

Spark Join Strategies Explained: Sort Merge Join

Your Degree Isn't Enough: How to Actually Break Into Data

How to Generate 1TB of Synthetic Data Faster Than a Coffee Break

© 2025 Canadian Data Guy