MLPerf Inference 5.0 signals the rise of large language models, with LLAMA 2 70B surpassing ResNet-50 in submissions and driving next-gen AI performance across compute platforms.
MemryX, a provider of edge AI acceleration hardware, recently closed its latest round of funding, serving as a potential bellwether for the next growth edge in AI compute.
From VAST Data to Weka, Graid to Solidigm — storage disruptors shined bright at NVIDIA GTC 2025. Here’s how storage innovators are redefining AI infrastructure and why it matters to the future of AI.
Deloitte and VAST Data share how secure data pipelines and system-level integration are supporting the shift to scalable, agentic AI across enterprise environments.
This video explores how Nebius and VAST Data are partnering to power enterprise AI with full-stack cloud infrastructure—spanning compute, storage, and data services for training and inference at scale.
Weka’s new memory grid raises new questions about AI data architecture—exploring how shifts in interface speeds and memory tiers may reshape performance, scale, and deployment strategies.
Stanford’s Daniel Wu unpacks AI democratization — exploring agentic & embodied AI, multi-modal models, and trustworthy systems. Learn more at Daniel’s AI Infra Summit 2025 live presentation.
CoreWeave acquires Core Scientific in a $9B all-stock deal, unlocking 1.3 GW of power and advancing its vision of vertically integrated AI infrastructure for next-gen hyperscale workloads.
A bold $1B move unites Clio and vLex to build the first AI-native platform connecting legal practice with firm management — signaling a new era of AI-driven legal transformation.
Google DeepMind's AlphaGenome uses AI to decode the mysteries of non-coding DNA — a leap that could transform how we understand disease, evolution, and what it means to be human.
Intel's decision to outsource marketing to Accenture and generative AI sparks debate: is this a visionary leap into the future of work or a symptom of a deeper retreat from innovation leadership?
Feeling overwhelmed by AI? You’re not alone. This new series cuts through the hype to explore practical tools, evolving trends, and smart strategies to help you navigate the AI ecosystem.
From CPU orchestration to scaling efficiency in networks, leaders reveal how to assess your use case, leverage existing infrastructure, and productize AI instead of just experimenting.
From the OCP Global Summit, hear why 50% GPU utilization is a “civilization-level” problem, and why open standards are key to unlocking underutilized compute capacity.
In the Arena: Allyson Klein with Axelera CMO Alexis Crowell on inference-first AI silicon, a customer-driven SDK, and what recent tapeouts reveal about the roadmap.
In this episode of Data Insights, host Allyson Klein and co-host Jeniece Wnorowski sit down with Dr. Rohith Vangalla of Optum to discuss the future of AI in healthcare.
From OCP Summit, Metrum AI CEO Steen Graham unpacks multi-agent infrastructure, SSD-accelerated RAG, and the memory-to-storage shift—plus a 2026 roadmap to boost GPU utilization, uptime, and time-to-value.
Anusha Nerella joins hosts Allyson Klein and Jeniece Wnorowski to explore responsible AI in financial services, emphasizing compliance, collaboration, and ROI-driven adoption strategies.
From CPU orchestration to scaling efficiency in networks, leaders reveal how to assess your use case, leverage existing infrastructure, and productize AI instead of just experimenting.
From the OCP Global Summit, hear why 50% GPU utilization is a “civilization-level” problem, and why open standards are key to unlocking underutilized compute capacity.
In the Arena: Allyson Klein with Axelera CMO Alexis Crowell on inference-first AI silicon, a customer-driven SDK, and what recent tapeouts reveal about the roadmap.
In this episode of Data Insights, host Allyson Klein and co-host Jeniece Wnorowski sit down with Dr. Rohith Vangalla of Optum to discuss the future of AI in healthcare.
From OCP Summit, Metrum AI CEO Steen Graham unpacks multi-agent infrastructure, SSD-accelerated RAG, and the memory-to-storage shift—plus a 2026 roadmap to boost GPU utilization, uptime, and time-to-value.
Anusha Nerella joins hosts Allyson Klein and Jeniece Wnorowski to explore responsible AI in financial services, emphasizing compliance, collaboration, and ROI-driven adoption strategies.