The pace of AI innovation is accelerating, and Sema4.ai’s vision goes beyond large language models (LLMs) to the transformative potential of AI agents. These agents, unlike traditional software, complete tasks autonomously, acting as knowledge workers that can reason, collaborate, and deliver work products. Sema4 pioneers this technology, offering AI agents optimized for specific industries, enhancing productivity significantly.
Lisa Spelman introduced as new CEO of Cornelis Networks
Unveiling the Role of Advanced Semiconductor Packaging in Powering AI: Explore the innovations in 2.5D and 3D packaging, high bandwidth memory, and chiplet solutions driving AI infrastructure into the future.
TechArena’s take from Satya Nadella’s keynote at MSBuild 2024. This post covers infrastructure, silicon collaborations and service delivery.
TechArena’s take on custom silicon advancements in the AI era with Alphawave Semi.
CNCF + SlashData’s latest report counts 15.6M cloud-native developers as IDPs pull backend teams into the fold; hybrid + multi-cloud rise with AI demand while inference stacks + agentic frameworks coalesce.
Two new genAI tests (Llama 3.1 8B, Flux.1) align with production stacks as multi-node results climb. NVIDIA posts many fastest times; University of Florida, Wiwynn, and Datacrunch expand the ecosystem.
Allyson Klein talks with author and Google/Intel alum Wanjiku Kamau on moving past AI skepticism, learning fast, and using new tools with intention—so readers start where they are and explore AI with hope.
AI racks are blowing past air’s limits. Here’s a frank framework for when cold plate still wins, when it fails, and how to plan the pivot to immersion—without stranding today’s investments.
On Day 1 of KubeCon + CloudNativeCon Atlanta, CNCF unveiled Kubernetes AI Conformance to make workloads portable—arriving as inference surges to ~1.33 quadrillion tokens/month across Google’s systems.
FinTech expert Anusha Nerella shares insights on staying ahead of fraud, navigating regulation, and building collaborative teams to scale responsible AI across the financial services sector.
Hedgehog CEO Marc Austin joins Data Insights to break down open-source, automated networking for AI clusters—cutting cost, avoiding lock-in, and keeping GPUs fed from training to inference.
Rose-Hulman Institute of Technology shares how Azure Local, AVD, and GPU-powered infrastructure are transforming IT operations and enabling device-agnostic access to high-performance engineering software.
From SC25 in St. Louis, Nebius shares how its neocloud, Token Factory PaaS, and supercomputer-class infrastructure are reshaping AI workloads, enterprise adoption, and efficiency at hyperscale.
Runpod head of engineering Brennen Smith joins a Data Insights episode to unpack GPU-dense clouds, hidden storage bottlenecks, and a “universal orchestrator” for long-running AI agents at scale.
Billions of customer interactions during peak seasons expose critical network bottlenecks, which is why critical infrastructure decisions must happen before you write a single line of code.
Recorded at #OCPSummit25, Allyson Klein and Jeniece Wnorowski sit down with Giga Computing’s Chen Lee to unpack GIGAPOD and GPM, DLC/immersion cooling, regional assembly, and the pivot to inference.
Hedgehog CEO Marc Austin joins Data Insights to break down open-source, automated networking for AI clusters—cutting cost, avoiding lock-in, and keeping GPUs fed from training to inference.
Rose-Hulman Institute of Technology shares how Azure Local, AVD, and GPU-powered infrastructure are transforming IT operations and enabling device-agnostic access to high-performance engineering software.
From SC25 in St. Louis, Nebius shares how its neocloud, Token Factory PaaS, and supercomputer-class infrastructure are reshaping AI workloads, enterprise adoption, and efficiency at hyperscale.
Runpod head of engineering Brennen Smith joins a Data Insights episode to unpack GPU-dense clouds, hidden storage bottlenecks, and a “universal orchestrator” for long-running AI agents at scale.
Billions of customer interactions during peak seasons expose critical network bottlenecks, which is why critical infrastructure decisions must happen before you write a single line of code.
Recorded at #OCPSummit25, Allyson Klein and Jeniece Wnorowski sit down with Giga Computing’s Chen Lee to unpack GIGAPOD and GPM, DLC/immersion cooling, regional assembly, and the pivot to inference.