Vik's Newsletter
Subscribe
Sign in
Home
Podcast
Notes
Chat
Semi Doped Podcast
Archive
About
Computing
TurboQuant: Inner Workings and Implications
Google's year old research resurfaced in a recent blog post and spooked memory investors.
Mar 26
•
Vikram Sekar
37
2
3
Beyond GTC: A Deep Dive into Compute, LPX, and the Untold Story of SpecDec
Analyzing the CPU, GPU, and LPU chip ratios unveiled at the Nvidia GTC keynote, the impact of the Groq LPX chip on disaggregated decoding, and its…
Mar 18
•
Vikram Sekar
35
6
GTC 2026 Preview | Implications of Nvidia's SRAM-Decode Hardware on the Inference Market
The case for dedicated decode hardware and what it means for AMD, HBM, and the SRAM startup market.
Mar 4
•
Vikram Sekar
40
8
6
The AI Datacenter CPU Yellow Pages
Grace, Vera, Venice, Turin, Diamond/Granite Rapids, Clearwater/Sierra Forest, Graviton, Cobalt, Phoenix, AmpereOne.
Feb 24
•
Vikram Sekar
39
1
The CPU Bottleneck in Agentic AI and Why Server CPUs Matter More Than Ever
How agent orchestration shifts the CPU-GPU ratio, a framework for scoring server CPUs across reasoning and action workloads, applied to 17 datacenter…
Feb 17
•
Vikram Sekar
80
1
9
How d-Matrix's In-Memory Compute Tackles AI Inference Economics
A deep look into the architecture from chip construction to rack-scale deployments, performance metrics, and end applications.
Dec 9, 2025
•
Vikram Sekar
21
2
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts