← All posts

Tagged With

llm

1 post connected to this tag.

dL/d(LLM): The Full Backward Pass

Jun 12, 2026

dL/d(LLM): The Full Backward Pass

A capstone walkthrough of the full LLM backward pass: loss to LM head, final norm, decoder layers, attention, FFN, residual splits, embedding scatter-add, training loop, AdamW, and C-Kernel-...

Read post →

Subscribe

Get my rants delivered to your inbox

I will send new posts as and when I write. No fixed cadence, just engineering notes, rants, and things I am thinking through.