I'm Niraj Zade.

I work at the intersection of computing and data. Have spent a decade working on computers, and 2.5 years of it working at tech companies. I bring a lot to the table beyond just engineering.
Past work includes:

  • IoT data acquisition and device control platform on AWS
  • Enterprise data pipelines
  • Pre-delivery cloud infra compliance clearance
  • ...and a lot more...

CONTACT:

Are you looking for PySparkGuide.com?

Blog

Tags: blog gen-ai

Computers understanding humans makes codebases irrelevant
A consequence of good code generation models
2023 Apr 08   8 min   blog gen-ai

Own your email's domain
Protect identity loss by owning the email's domain
2023 Feb 12   6 min   blog

Why interpreted languages make superior scripting languages
Why {python, bash, ruby, ...} are preferred for scripting over {go, c, ...}
2023 Jan 03   1 min   blog

Resources

Tags: conventions data-engineering git python spark

Git based development - Standards and Etiquette
Standards and guidelines on writing commit messages, creating branches, creating pull requests.
2024 Aug 28   9 min   git conventions

Python unicode string comparison and normalization schemes
Universally applicable string comparison in python. For string comparisons, and for de-duplication in datasets.
2024 May 06   8 min   python

Spark Adaptive Query Engine (AQE) - all the details you need to know
Spark Adaptive Query Engine - what problems it solves and how it works in detail.
2024 Feb 20   15 min   spark data-engineering

Spark join strategies
Spark join strategies
2024 Jan 22   16 min   spark data-engineering