Lenovo, together with Nvidia, is launching the AI Cloud Gigafactory program, which enables AI cloud providers to build and deploy large-scale AI factories faster through a combination of hardware, software, services, and manufacturing.
On the sidelines of CES, Lenovo, together with Nvidia, is introducing the AI Cloud Gigafactory program, which should help AI cloud providers build and bring large-scale AI factories into production faster. The solution combines hardware, software, services, and production capacity to deploy millions of GPUs for complex AI applications.
Faster deployment
With the AI Cloud Gigafactory program, Lenovo and Nvidia are focusing on accelerating the so-called ‘time-to-first-token’ (TTFT). This value, invented by the parties themselves, measures how quickly an AI investment leads to a working model in production. The program includes pre-developed building blocks, guidance, and industrial construction processes that allow AI cloud providers to set up a functional AI infrastructure in a matter of weeks.
The factories are intended for processing demanding AI workloads and high-performance computing (HPC). The program includes the use of Lenovo’s Neptune liquid cooling, Nvidia’s accelerated computing platforms, and a global production network.
Complete AI ecosystem
The collaboration makes it possible for customers, with Lenovo’s help, to build a complete AI ecosystem, from cloud and on-premises data centers to edge environments and robotic systems. Lenovo’s Gigafactory solution supports, among other things, the new Nvidia GB300 NVL72 platform, which combines 72 Blackwell Ultra GPUs and 36 Grace CPUs in one liquid-cooled rack system. The Nvidia Vera Rubin NVL72 system, also announced at CES, is also supported.
Finally, Lenovo offers additional services through Hybrid AI Factory Services, which simplify the design, implementation, and management of AI factories. Thanks to ready-made use cases from the Lenovo AI Library and integration with Nvidia AI Enterprise, companies can, according to Lenovo, develop and deploy AI solutions faster. The goal is to attract customers with the promise of making AI deployable faster in business processes.
