~/devreads

#dynamo

3 posts

12 Feb

Shruti Koparkar 6 min read

A diagnostic insight in healthcare. A character’s dialogue in an interactive game. An autonomous resolution from a customer service agent. Each of these AI-powered interactions is built on the same unit of intelligence: a token. Scaling these AI interactions requires businesses to consider whether they can afford more tokens. The answer lies in better tokenomics […]

ai, ai infrastructure, agentic ai, dynamo, inference

3 Dec 2025

Shruti Koparkar 8 min read

The top 10 most intelligent open-source models all use a mixture-of-experts architecture. Kimi K2 Thinking, DeepSeek-R1, Mistral Large 3 and others run 10x faster to enable one-tenth the cost per token on NVIDIA GB200 NVL72. A look under the hood of virtually any frontier model today will reveal a mixture-of-experts (MoE) model architecture that mimics […]

ai infrastructure, artificial intelligence, dynamo, inference, nvidia blackwell

13 Nov 2025

Shruti Koparkar 4 min read

Editor’s note: This post is part of Think SMART, a series focused on how leading AI service providers, developers and enterprises can boost their inference performance and return on investment with the latest advancements from NVIDIA’s full-stack inference platform. NVIDIA Blackwell delivers the highest performance and efficiency, and lowest total cost of ownership across every […]

ai infrastructure, dynamo, inference, think smart