Deepgram Chooses Penguin Solutions for Optimized Enterprise Voice AI Infrastructure at GTC 2026
Penguin Solutions has teamed with Deepgram and Dell Technologies to build a production-ready voice AI environment. The project gives Deepgram’s speech services an infrastructure optimized for enterprise deployments.
Solution design and hardware
Penguin Solutions led the architecture, deployment, and ongoing management. The design uses Dell PowerEdge XE7745 servers and Dell PowerScale storage tuned for AI workloads.
Nvidia RTX PRO 6000 Blackwell Server Edition GPUs power real-time inference. The stack supports Deepgram’s speech-to-text, text-to-speech, and voice agent capabilities.
Performance, scale, and reliability
Enterprises require low latency and high concurrent throughput to meet strict SLAs. The deployment focuses on predictable scaling and consistent performance under heavy load.
Penguin’s approach combines a purpose-built architecture with continuous performance optimization. That combination aims to keep voice AI responsive in mission-critical settings.
Industry use cases and governance
The platform targets sectors such as healthcare and retail. It enables accurate, real-time transcription and speech synthesis while maintaining data governance.
Deepgram’s API-driven voice features pair with Penguin Solutions’ AI services and Dell’s infrastructure. Together they offer control over sensitive enterprise data.
Executive perspectives
Joe Castillo, VP of sales at Penguin Solutions, emphasized the need for infrastructure that scales reliably for real-time inference. He highlighted Penguin’s validated, end-to-end architecture for demanding voice AI workloads.
Abe Pursell, Deepgram’s VP of partnerships and business development, said the collaboration among the three vendors produced a robust environment. He noted that the infrastructure aligns with Deepgram’s performance and enterprise requirements.
David Noy, Dell Technologies’ VP for unstructured data solutions product management, stressed the role of Dell PowerScale and PowerEdge servers. He said the configuration accelerates enterprise AI adoption at scale.
GTC 2026 presence
Attendees at the Nvidia GTC AI Conference and Expo can learn more about the collaboration. The conference runs March 16-19, 2026, in San Jose, California.
Details will be available at Dell’s Booth #721 during GTC 2026.