Machine Learning

Machine learning notes, course projects, and research writeups on supervised learning, data analysis, evaluation, and model behavior.

Two things, mostly. I took a graduate ML course and wrote up a retrospective that covers the whole thing: supervised learning, reinforcement learning, and the theory behind them. The other posts are notes from papers I've read and topics I keep coming back to.

This is the general-purpose end of what I work on. Deep learning, GPUs, and AI safety have their own pages.

A riff on Tullawallal Circuit by Rachel Gaffney Dawson - go buy her art!

Project Draft 49 min read

Same Parts, Different Wiring: Mechanistic Interpretability of Moral Fine-Tuning

An exploration of how moral fine-tuning changes LLMs

Thread Retro 1 part Updated Feb 25, 2026

Retro Complete 29 min read

GPU Hardware and Software

What I Learned in GPU Hardware and Software (CS 8803) - A Retrospective

Feb 25, 2026

Project Complete 19 min read

FlashAttention & LLM Inference on GPUs

Writing a FlashAttention CUDA kernel from scratch, tiling the attention computation so the full N×N score matrix is never materialized in memory, building a KV cache for token generation, and running GPT-2 with custom kernels end-to-end.

Feb 25, 2026

Note In-progress 9 min read

Notes on Effective ML Research

A summary of my takeaways from three influential articles on conducting effective empirical AI alignment research.

Note In-progress 34 min read

BlueDot AI Safety Evals Paper Club Notes

My notes and takeaways from the BlueDot AI Safety Evals paper club, covering recent papers on AI alignment, security, and evaluations.

Talk Complete 2 min read

Hierarchical Reasoning Models

A talk exploring novel neural architectures for complex reasoning tasks, featuring two-level recurrence and adaptive computation.

Thread Retro 1 part Updated Aug 10, 2025

Retro In-progress 70 min read

Machine Learning

A survey of concepts covered in my graduate Machine Learning course

Aug 9, 2025

Project In-progress 19 min read

A Practical Guide to Supervised Learning

A conceptual walkthrough of the supervised learning process, covering data analysis, model evaluation, and the fundamental trade-offs involved in building effective classifiers.

Aug 10, 2025

Article Complete 2 min read

Reflections on ICML 2025

Some notes and observations from my time at the 2025 International Conference on Machine Learning (ICML).

Talk Complete 2 min read

Language Diffusion Survey

A talk surveying diffusion models for language, from DDPM foundations to modern masked diffusion models that compete with autoregressive models.

The Water-Lily Pond 1896 by Claude Monet

Article Complete 6 min read

Semantic Search

What's an embedding vector, and how can we use neural networks to improve the relevance of search results?

Article Complete 10 min read

Language Models

From n-grams to ChatGPT, how language models work and how they can be used to solve real-world problems.