IBM Collaborates with Nvidia Alternative Groq for Cheaper Inference

IBM and Groq are combining their technologies to offer businesses faster and more efficient AI solutions, with IBM integrating Groq’s inference capabilities into its own platform.

IBM and Groq have announced a strategic partnership in which GroqCloud, Groq’s AI inference platform, is being integrated into IBM’s watsonx Orchestrate.

As part of the partnership, IBM and Groq also plan to combine Red Hat’s open source vLLM technology with Groq’s LPU architecture. Additionally, IBM’s Granite models will be made available on GroqCloud for IBM customers.

Groq, not Grok

Groq is a self-proclaimed inference specialist that relies on its own chips, the so-called LPUs (Language Processing Units). The company, founded in 2016, positions the LPU as an alternative accelerator tailored for inference. The chips are designed to deliver efficiency and performance at a lower cost than GPUs from competitors Nvidia and AMD. These capabilities are available via GroqCloud, which promises faster and more efficient inference than traditional GPU-based systems.

Don’t confuse Groq with Elon Musk’s more recently established Grok. Grok is an LLM and a ChatGPT alternative that is popular among users who don’t mind an AI generating racy images of celebrities like Taylor Swift without permission.

Performance and Integrations

The collaboration focuses on three main points: high performance in inference, support for secure and privacy-oriented AI applications, and seamless integration with IBM’s agentic AI platform watsonx Orchestrate.

In addition, the integration of vLLM with Groq’s hardware will help developers with inference orchestration, load balancing, and hardware acceleration. Customers can continue to work in their familiar tools while benefiting from faster processing via GroqCloud.

With this collaboration, IBM can differentiate itself from other AI technology providers. Groq, for its part, gains access to the IBM ecosystem and can thus tap into a new market. By linking its lesser-known LPU technology to IBM’s name, Groq can demonstrate that its approach is mature and offers added value. The new capabilities are available immediately.