AgentPerf Benchmark Launches, vLLM v0.23.0 Ships: AI Infrastructure This Week
This week in AI infrastructure: the first AgentPerf benchmark launched, vLLM v0.23.0 shipped with DeepSeek-V4 and multi-tier KV cache support, and NVIDIA detailed how Dynamo and DOCA are being rebuilt for agentic workloads. Here is what matters.