Two new genAI tests (Llama 3.1 8B, Flux.1) align with production stacks as multi-node results climb. NVIDIA posts many fastest times; University of Florida, Wiwynn, and Datacrunch expand the ecosystem.
Allyson Klein talks with author and Google/Intel alum Wanjiku Kamau on moving past AI skepticism, learning fast, and using new tools with intention—so readers start where they are and explore AI with hope.
AI racks are blowing past air’s limits. Here’s a frank framework for when cold plate still wins, when it fails, and how to plan the pivot to immersion—without stranding today’s investments.
On Day 1 of KubeCon + CloudNativeCon Atlanta, CNCF unveiled Kubernetes AI Conformance to make workloads portable—arriving as inference surges to ~1.33 quadrillion tokens/month across Google’s systems.
FinTech expert Anusha Nerella shares insights on staying ahead of fraud, navigating regulation, and building collaborative teams to scale responsible AI across the financial services sector.
Modern software-defined cars blend multiple links—CAN/LIN, MIPI, SerDes, and Ethernet/TSN—to shrink wiring and cost, manage EMI, and deliver reliable, deterministic timing from sensors to actuators.
Enterprise AI doesn’t create fragility; it reveals undocumented assumptions, missing ownership, and invisible pipeline debt. Fix the foundations and AI gets cheaper, faster, and more trusted.
The deal moves Synopsys’ ARC processor IP and ASIP Designer/Programmer tools to GF’s MIPS business, while Synopsys keeps interface and foundation IP and leans further into AI-era engineering.
Hedgehog CEO Marc Austin joins Data Insights to break down open-source, automated networking for AI clusters—cutting cost, avoiding lock-in, and keeping GPUs fed from training to inference.
From SC25 in St. Louis, Nebius shares how its neocloud, Token Factory PaaS, and supercomputer-class infrastructure are reshaping AI workloads, enterprise adoption, and efficiency at hyperscale.
Runpod head of engineering Brennen Smith joins a Data Insights episode to unpack GPU-dense clouds, hidden storage bottlenecks, and a “universal orchestrator” for long-running AI agents at scale.
Billions of customer interactions during peak seasons expose critical network bottlenecks, which is why critical infrastructure decisions must happen before you write a single line of code.
Recorded at #OCPSummit25, Allyson Klein and Jeniece Wnorowski sit down with Giga Computing’s Chen Lee to unpack GIGAPOD and GPM, DLC/immersion cooling, regional assembly, and the pivot to inference.
Durgesh Srivastava unpacks a data-loop approach that powers reliable edge inference, captures anomalies, and encodes technician know-how so robots weld, inspect, and recover like seasoned operators.
Innovative power delivery unlocks a shift in data-center design. CelLink PowerPlane routes thousands of amps in a flat, flexible circuit—cutting cabling and accelerating AI factory builds.
CEO Carl Schlachte joins TechArena at OCP Summit to share how Ventiva’s solid-state cooling—proven in dense laptops—scales to servers, cutting noise, complexity and power while speeding deployment.
From OCP Summit, Metrum AI CEO Steen Graham unpacks multi-agent infrastructure, SSD-accelerated RAG, and the memory-to-storage shift—plus a 2026 roadmap to boost GPU utilization, uptime, and time-to-value.
From the OCP Global Summit in San Jose, Allyson Klein sits down with Chris Butler of Flex to unpack how the company is collapsing the gap between IT and power—literally and figuratively.
From OCP Summit 2025, Kelley Mullick joins Allyson Klein and co-host Jeniece Wnorowski for a Data Insights episode on rack-scale design, hybrid cooling (incl. immersion heat recapture), and open standards.
At OCP’s 2025 global summit, Momenthesis founder Matty Bakkeren joins Allyson Klein to explore why open standards and interoperability are vital to sustaining AI innovation at datacenter scale.