Jun 21, 2026
Flash Attention on CPU: Online Softmax, Cache Discipline, and C-Kernel-Engine
ML fundamentals This ShivasNotes deep dive is written for CPU silicon teams asking a very specific question: does C-Kernel-Engine really own attention on CPU down at the kernel layer? The an...
Read post →