Latest Blog Posts

May 7, 2026 • 20 min read Policy Gradient Methods: REINFORCE, Actor-Critic, and the Policy Gradient Theorem (S&B Ch. 13)

Notes on policy approximation and its advantages, the policy gradient theorem and its proof, REINFORCE (Monte Carlo policy gradient), REINFORCE with baseline, actor-critic methods, policy gradient for continuing problems, and policy parametrization for continuous actions.
Apr 17, 2026 • 3 min read Transformer Architecture Explained: Self-Attention, Encoders, and Decoders

Quick overview of the Transformer architecture covering tokenization, attention, multi-head attention, encoders, decoders, masked attention, and cross attention.
Apr 17, 2026 • 14 min read SAM 2 Explained: Meta's Promptable Visual Segmentation Model

Quick overview on SAM 2, a foundation model for promptable visual segmentation in images and videos, covering the PVS task, model architecture, data engine, SA-V dataset, zero-shot experiments, and conclusions.
Apr 4, 2026 • 13 min read Muon Optimizer Explained: Newton-Schulz Orthogonalization Beyond Adam

From Muon optimizer to MuonClip covering Adam’s limitations, momentum 2D matrix, SVD-based orthogonalization, odd polynomial approximation, Newton-Schulz 5 iteration, the exploding attention logit crisis, and QK-Clip for MHA & MLA.
Mar 16, 2026 • 4 min read Inkcast: Turn Any EPUB or PDF into an Audiobook in Your Browser

A privacy-first, text-to-speech reader for EPUBs, PDFs, and web articles. Built in a single HTML file, no backend required.
Mar 13, 2026 • 33 min read Eligibility Traces Explained: TD(λ), Sarsa(λ), and the λ-Return (S&B Ch. 12)

Notes on the lambda-return and its variants - TD, n-step truncated, online, true online, Sarsa, Watkins’s Q, Tree-Backup - dutch traces in MC learning, variable bootstrapping and discounting, off-policy traces with control variates, stable off-policy methods with traces, and implementation issues.
Mar 9, 2026 • 22 min read The Deadly Triad in RL: Off-Policy Learning with Function Approximation (S&B Ch. 11)

Notes on semi-gradient off-policy methods, examples of divergence, the deadly triad, linear value-function geometry, gradient descent in the Bellman error, learnability, Gradient-TD methods, Emphatic-TD, and reducing variance.
Mar 9, 2026 • 8 min read Semi-Gradient Sarsa and the Average Reward Setting in RL (S&B Ch. 10)

Notes on episodic semi-gradient control, n-step Sarsa, average reward for continuing tasks, deprecating the discounted setting, and differential semi-gradient n-step Sarsa.
Mar 4, 2026 • 10 min read Why AI Voice Assistants Fail Tonal Languages: ASR Bias in Igbo

A systematic evaluation of tonal fidelity in facebook/omniASR-CTC-1B reveals 75% diacritic loss on Igbo language tone marks. Evidence suggests the model generates tones probabilistically rather than acoustically.
Mar 1, 2026 • 13 min read Evaluating Tone Preservation in ASR: How Meta's omniASR Handles Igbo Diacritics

Controlled evaluation of omniASR-CTC-1B on Igbo language using 21 systematically designed samples. Diacritic Error Rate (DER) quantifies tone-specific failures. Bootstrap resampling (10,000 iterations) yields 75.5% loss (95% CI [57.1%, 89.7%]). Monotone control reveals orthographic bias. Includes full methodology, code, and reproducibility guide.
Feb 27, 2026 • 26 min read Value Function Approximation in RL: Tile Coding, Fourier Basis, and SGD (S&B Ch. 9)

Notes on value-function approximation, the prediction objective (VE), stochastic gradient descent, linear methods, feature construction, ANNs, LSTD, memory-based and kernel-based approximation, and interest & emphasis.
Feb 24, 2026 • 30 min read Model-Based RL Explained: Dyna-Q, MCTS, and Prioritized Sweeping (S&B Ch. 8)

Notes on model-based RL, Dyna-Q, prioritized sweeping, trajectory sampling, RTDP, heuristic search, rollout algorithms, and Monte Carlo Tree Search.
Feb 9, 2026 • 20 min read Machine Learning Math: From Linear Algebra to Attention — A Rigorous Deep Dive

A rigorous exploration of machine learning mathematics, from information theory and linear algebra to optimization dynamics and generative models, with correct implementations and theoretical connections.
Jan 22, 2026 • 17 min read Neural Network Optimizers Compared: SGD, Adam, AdamW, and Muon

From SGD to Muon, explore how each optimizer builds on its predecessors to solve gradient descent challenges in deep learning.
May 30, 2025 • 17 min read Machine Learning Equations Cheat Sheet: From Entropy to Attention

A reference guide to fundamental machine learning equations with correct implementations and clear explanations of how they connect to each other.
May 30, 2025 • 10 min read Backpropagation from Scratch: Chain Rule, Math, and Python Code

A highly technical yet understandable exploration of backpropagation, detailing its mechanics, mathematical foundations, and practical applications, making it accessible to those with a basic understanding of calculus and machine learning.
Feb 24, 2025 • 7 min read America's Energy Strategy After COVID-19: Natural Gas, Nuclear, and Renewables Reassessed

A post-COVID-19 critical review of Robert Bryce’s “Power Hungry: The Myths of 'Green' Energy and the Real Fuels of the Future.”
Aug 2, 2024 • 9 min read Building an RNN from Scratch in Python: Character-Level Language Model with BPTT

In this blog post, we’ll dive deep into the implementation of a character-level language model using a vanilla Recurrent Neural Network (RNN). This type of model can learn to generate text one character at a time, capturing the patterns and structure of the language it’s trained on. We’ll walk through…
Jul 15, 2024 • 3 min read Karpathy's Neural Networks Zero to Hero: Complete Implementation Notes

This blog post presents my detailed implementation of Andrej Karpathy’s Neural Networks: Zero to Hero YouTube lecture series and exercises in Jupyter Notebook. The articles go deeply from NNs to MLPs to CNNs to LLMs to ensure a thorough and robust understanding of neural networks.
Sep 20, 2020 • 10 min read Why Natural Gas to Nuclear (N2N) Can't Solve America's Energy Problem: A Critical Review

A pre-COVID-19 critical review of Robert Bryce’s “Power Hungry: The Myths of 'Green' Energy and the Real Fuels of the Future.”