Tyler Crosse

I work on machine learning and AI safety, from research experiments to production systems.
Georgia Tech MSCS. 7+ years building production software and leading teams.

A riff on Tullawallal Circuit by Rachel Gaffney Dawson - go buy her art!

Project AI-safety Feb 28, 2026 43 min read

Mechanistic interpretability of moral fine-tuning

How moral fine-tuning on iterated prisons changes LLMs.

Retro GPU Feb 25, 2026 29 min read

GPU Hardware and Software

What I learned in GPU Hardware and Software (CS 8803).

Retro machine-learning Aug 9, 2025 70 min read

Machine Learning

A survey of concepts covered in my graduate Machine Learning course

Recent Work

Writing & Papers

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety
Paper, ICML 2026 AI4GOOD, AI-WILD, and FAGEN, Jun 2026 · arXiv opens in a new tab
When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction
Paper, ICML 2026 DEMO Workshop, Jun 2026 · arXiv opens in a new tab
Same Facts, Different Updates: Inference Setup Shapes LLM Behavior in Medical Allocation
Paper, ICML 2026 AI4GOOD + Pluralistic Alignment Workshops, May 2026 · OpenReview opens in a new tab
Attack Selection in Agentic AI Control Evals Can Decrease Safety opens in a new tab
Essay, LessWrong, Apr 2026
Asymmetric Goal Drift in Coding Agents Under Value Conflict
Paper, AI-WILD Workshop at ICLR 2026, Mar 2026 · arXiv opens in a new tab
Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals
Paper, AI-WILD Workshop at ICLR 2026, Mar 2026 · arXiv opens in a new tab

Talks

Hierarchical Reasoning Models
Talk, Latent Space Paper Club, Sep 2025
Language Diffusion Survey
Talk, Latent Space Paper Club, May 2025

Selected Posts

all ideas →

Note Jul 20, 2026 8 min read

Persona vectors and the persona selection model

Notes on two accounts of character in language models: persona vectors, which locate traits as steerable directions in activation space, and the persona selection model, which explains why such directions exist.

Note Jul 19, 2026 14 min read

Notes on Tracing Attention Computation Through Feature Interactions

Notes on QK attributions, attention-head loadings, and what feature interactions reveal about why a transformer attends to one token rather than another.

Article Jul 8, 2026 17 min read

How collaborative apps handle conflicting edits

A survey of last-writer-wins, operational transformation, CRDTs, optimistic concurrency, and locks, and what each one does when two people edit the same thing at once.

Calorie Tracker add-food composer with search, photo, barcode, and recent-food quick-add controls

Project Jun 27, 2026 6 min read

A full-stack AI calorie tracker PWA

A mobile-first nutrition PWA that estimates calories and macros from food photos, scans barcodes and menus, and turns model output into a reviewable first draft.

Article Apr 30, 2026 9 min read

Git worktrees: branches without the context switch

Work on multiple branches at once without ever touching git stash.

Project Feb 25, 2026 17 min read

CUDA Fundamentals: Tiled Matrix Multiply & Bitonic Sort

Writing real GPU kernels, exploring shared memory tiling, parallel sorting algorithms, and performance optimization on an NVIDIA H100.

Note Oct 18, 2025 9 min read

Notes on Effective ML Research

A summary of my takeaways from three influential articles on conducting effective empirical AI alignment research.

Article Oct 17, 2025 9 min read

Using uv: A Modern Python Workflow

An introduction to uv, a fast, Rust-based tool for Python packaging