How Morgan Stanley Scaled Flux to 500 Kubernetes Clusters
A five-year journey from push-based pipelines to a self-service GitOps platform managing over 500 clusters, 2,000 nodes, and 100,000 containers.
A five-year journey from push-based pipelines to a self-service GitOps platform managing over 500 clusters, 2,000 nodes, and 100,000 containers.
The CNCF's new Kubernetes AI conformance program aims to solve portability and predictability challenges for AI workloads running on the 80% of enterprises already using Kubernetes.
AWS AFT now supports native OIDC integration with HCP Terraform, eliminating manual IAM configuration. Here's how to implement secure, short-lived credentials for your infrastructure automation.
The latest Cilium release addresses critical L7 policy handling bugs, memory leaks, and KVStore initialization issues. Here's what platform teams need to know.
On April 9, 2026, Virtru announced integration between its Data Security Platform and Cloudflare R2 object storage. The move enables organizations to enforce cryptographic, attribute-based access…
The 2023 debate was about licensing. The 2026 decision is about control plane ownership. Three years after HashiCorp moved Terraform from MPL to BSL, teams that…
We’re experiencing an “everything changed” moment for IT operations and site reliability engineering. Driven by AI-assisted development, cloud adoption, and Kubernetes auto-scaling, infrastructure deployments are scaling…
At KubeCon EU 2026 in Amsterdam, Broadcom announced that Velero—the Kubernetes-native backup, restore, and migration tool—has been accepted into the CNCF Sandbox. The move traces a…
The vLLM Korea Meetup 2026, held in Seoul on April 2nd, delivered more than just technical presentations—it offered a window into how AI inference infrastructure is…
Learn how to connect private PostgreSQL databases to Grafana Cloud using Private Data Source Connect (PDC) and leverage the AI assistant to translate complex queries into visualizations without exposing data to the public internet.
vLLM v0.19.0 brings full Google Gemma 4 architecture support, speculative decoding with zero-bubble async scheduling, and significant Model Runner V2 maturation for improved throughput and efficiency.
Cloudflare's global network now exceeds 500 Tbps of external capacity, enabling autonomous DDoS mitigation at unprecedented scale using eBPF and XDP.
KubeCon EU 2026 highlighted how diversity, inclusion, and belonging are becoming core design principles for successful platform engineering teams.
The declarative configuration specification for OpenTelemetry hits stable 1.0, bringing consistent YAML-based SDK configuration across five languages with more implementations underway.
The latest containerd patch release fixes critical CRI bugs including registry mirror configuration, CNI DEL handling after restarts, and an AppArmor regression affecting unix domain sockets.
The latest vLLM release adds Google Gemma 4 architecture support with MoE, multimodal, and tool-use capabilities, plus breakthrough performance improvements through zero-bubble async scheduling.
Continuous production profiling becomes a first-class OpenTelemetry signal as Profiles enters public Alpha, featuring an eBPF-based profiler and unified OTLP format compatible with pprof.
OpenTelemetry's eBPF-based zero-code instrumentation now captures HTTP headers for span enrichment, enabling faster incident response by adding request context like tenant and user segment without code changes.
Learn how to migrate from Ingress-NGINX to Gateway API using the stable 1.0 release of Ingress2Gateway, featuring support for over 30 annotations and comprehensive integration testing.
Flux 2.8.0 introduces Helm v4 support, server-side apply for HelmReleases, kstatus-based health checking, faster recovery from failed deployments, and GitHub App integration for source authentication.