SpinKube runs Spin WebAssembly apps on Kubernetes without containers, using a containerd shim and Kubernetes primitives. Pairing it with the Gateway API gives teams a cleaner, role-oriented way to expose WASM services without annotation sprawl.
EKS Capabilities package Argo CD, AWS Controllers for Kubernetes (ACK), and Kube Resource Orchestrator (kro) as managed, Kubernetes-native building blocks. Here’s what changes when platform teams can compose AWS resources and Kubernetes resources behind custom APIs — without running the controllers themselves.
AWS and the vLLM community describe multi-LoRA serving for Mixture-of-Experts models, with kernel and execution optimizations that let many fine-tuned variants share a single GPU. The pitch: higher utilization, better latency, and a clearer path to serving ‘dozens of models’ without dozens of endpoints.
vLLM 0.16.0 landed with ROCm-focused fixes and ongoing production hardening. Even when a release looks incremental, inference runtimes are now platform-critical dependencies—affecting cost, reliability, and model portability.
OpenTelemetry’s eBPF Instrumentation project (OBI) just hit its first release. That’s a milestone for low-overhead, zero-code observability—but it also raises new questions about privilege, fleet rollout, and data governance.
Cloudflare says one engineer and an AI model rebuilt a drop-in Next.js replacement on Vite (vinext) in a week—with big build-time and bundle-size claims. Whether or not the benchmarks hold for every app, the real story is how AI is compressing framework and platform rewrites.
Flux 2.8 GA ships with Helm v4 support, bringing server-side apply and kstatus-based health checking to Helm releases. Here’s why that’s bigger than it sounds—and how platform teams should approach the upgrade.
AWS is packaging common platform components (GitOps and infrastructure orchestration) as managed, Kubernetes-native ‘capabilities’ for Amazon EKS. Here’s what it changes for day-2 ops, how it compares to rolling your own controllers, and what to watch before you standardize on it.
vLLM 0.16.0 isn’t a routine release. It signals a shift toward higher-throughput, more interactive open model serving—plus the operational primitives (sync, pause/resume) teams need for RLHF and agentic workloads.
GitHub is tightening the screws on enterprise governance: enterprise-defined custom org roles are GA, and IP allow lists now extend deeper into EMU user namespaces. Here’s what it changes for platform teams.
Harbor is easy to install, hard to productionize. Here’s a practical checklist for HA, storage, signing/scanning, and day-2 ops when Harbor becomes your cluster’s artifact backbone.
Logs are expensive because repetition is free to emit and costly to store. The OTel Collector’s log deduplication processor offers a new middle path: compress noise at ingest while preserving incident context.
OpenStack’s 6‑month cycles continue into 2026 (Gazpacho, Hibiscus), but the bigger story is OpenInfra’s positioning: open source infrastructure as a foundation for digital sovereignty and AI-era resilience.
Kubernetes v1.35 continues a trend: clusters are increasingly asked to run mixed AI workloads (training, batch, and latency-sensitive inference) alongside traditional services. Here’s what’s new that matters for platform teams—especially around scheduling, resizing, and safer config workflows.
OpenTelemetry is now mainstream, and the project’s own ‘2025 year in review’ highlights a less-discussed scaling story: documentation localization, contributor growth, and the operational maturity required when observability becomes an industry baseline.
GitHub is rolling Copilot usage metrics down from enterprise to organization scope, enabling least-privilege reporting. For platform and security teams, this is the missing layer for governing AI coding tools without centralizing all visibility at the enterprise tier.
LiteLLM continues to evolve from a simple proxy into an operational layer: recent releases include a Prompt Management API and access-control improvements. For teams running multiple model providers, this is a step toward repeatable prompt governance and safer rollout.
Agentic systems are moving into production, and the cloud native community is converging on interoperable protocols for connecting models to tools and data. CNCF’s Agentics Day framing around MCP highlights the shift: reliability and governance are now the hard part.
AWS published a reference controller that connects Amazon Application Recovery Controller (ARC) zonal shifts to Karpenter node pools. Here’s what the integration changes operationally, how it works under the hood, and how to adopt it safely in production EKS.
Cloudflare’s February 20, 2026 incident withdrew customer BYOIP routes via BGP. The postmortem is a masterclass in failure domains for ‘network-as-code.’ Here are the actionable cloud-native lessons for change management, blast radius, and rollback.