Blog

Make ML Systems Ship Again

Use the Theory of Constraints to replace incremental tweaks with step-function wins.

MLSys

MLOps

Machine Learning

Data Science

Deep Learning

Devising a Plan: The Creative Heart of ML Problem Solving

How Pólya’s Step 2 powers reliable DS/ML/AI systems under real-world limits.

MLSys

Deep Learning

Machine Learning

Data Science

Understand, Then Build

How Pólya’s Step 1 powers reliable DS/ML/AI systems under real-world limits.

MLSys

Deep Learning

Machine Learning

Data Science

The Production ML Survival Guide

What I Learned Deploying Models That Serve Millions While Others Failed at Launch

MLSys

MLOps

Machine Learning

Data Science

Deep Learning

Hard-Learned Lessons in Shipping Software (AI/ML) Projects

A Guide for Engineers and Product Managers

Product Management

Machine Learning

From Forgetting to Fluency: How to Learn Smarter, Not Harder

Evidence-based tips from cognitive science to transform the way you absorb, retain, and apply knowledge in any field.

Personal Growth

Why Your Final Layer Shouldn’t Have Softmax

The log-sum-exp trick, numerical instability, and when raw logits are the right choice

Deep Learning

Cutting the Fat: A Practical Guide to Neural Network Pruning

From weight sparsity to hardware-aware strategies—learn how to shrink deep learning models without shrinking their performance

Deep Learning

Building GPT(2/3) from Scratch: Turning Theory into a Working Transformer

A hands-on journey through implementing the 124M GPT architecture in PyTorch—complete with attention mechanisms, optimizations, and the lessons learned along the way

NLP

Deep Learning

Byte Pair Encoding from Scratch

Building a BPE tokenizer step by step — the algorithm that decides how language models see text

NLP

Deep Learning

The RAG Optimization Playbook

Proven tactics, pitfalls, and fine-tuning methods to build faster, smarter, and more accurate retrieval-augmented generation systems

NLP

RAG

Inside Python’s Modules and Packages: The Machinery Behind import

Explore the inner workings of Python’s import system, including path resolution, package types, importlib internals, and advanced import hacks

Python

SWE

Automatic Differentiation Demystified

From dual numbers to backpropagation — the intuition, the trade-offs, and what breaks in practice for LLM training

MLSys

Git from the Inside Out

An in-depth guide to Git’s object model, branching mechanics, and the hidden workflows that make it so powerful

SWE

I Built My Own PyTorch (Tiny Version) — Here’s Everything I Learned

Inside the engineering decisions, optimizations, and trade-offs behind a homegrown deep learning framework

MLSys

Attention Is All You Need… But Here’s the Rest

A practical, code-first breakdown of Transformers—covering the theory, the math, and how to implement every architecture variant

NLP

Deep Learning

Breaking Text Apart (The Smart Way)

From single characters to advanced subword splits — see how modern tokenizers like WordPiece and SentencePiece prepare language for AI.

NLP

Inside LSTMs: Implementing and Optimizing Sequential Models from First Principles

A deep dive into LSTM internals—covering the math, gates, performance considerations, and a full PyTorch-aligned implementation from scratch.

NLP

Deep Learning

C Program Startup

Does C program really start at main?

SWE

C

Anomaly Detection

Machine Learning

Data Science

Gradient Descent Algorithm and Its Variants

Deep dive into gradient descent algorithm: Batch vs. Mini-batch vs. Stochastic.

Machine Learning

Deep Learning

Conda Essentials

Enough background about Conda to be productive!

SWE

K-means Clustering: Algorithm, Applications, Evaluation Methods, and Drawbacks

Deep dive into K-means algorithm to find subgroups within data.

Machine Learning

Data Science

Unsupervised Learning

Coding Neural Network Part 5 - Dropout

What is Dropout, its use, and how to implement it?

Machine Learning

Deep Learning

Coding Neural Network Part 4 - Regularization

What is regularization and how it helps NN generalizes better?

Machine Learning

Deep Learning

Coding Neural Network Part 3 - Parameters’ Initialization

The role of parameter initialization in training and different ways to initialize parameters.

Machine Learning

Deep Learning

Coding Neural Network Part 2 - Gradient Checking

How to check numerically if the implementation of backward propagation is correct?

Machine Learning

Deep Learning

Coding Neural Network Part 1 - Forward & Backward Propagation

What it takes to go from input to output? And how to compute the gradients?

Machine Learning

Deep Learning

Bandit Algorithms: epsilon-Greedy Algorithm

What is epsilon-Greedy Algorithm and how to use it in A/B testing?

Data Science

Website Optimization

Predicting Loan Repayment

Trying different modeling techniques to deal with imbalanced data, missing values, and ensemble models.

Machine Learning

Data Science

Character-Level Language Model

How an RNN learns to generate names one character at a time — and what it teaches us about language models

NLP

Deep Learning

Predicting Employee Turnover

Experimenting with different models on employee turnover data.

Machine Learning

Data Science