Establishing a Framework to Optimize Scale-up Interconnect for AI and HPC Clusters
Today, the Open Compute Project Foundation (OCP), the nonprofit organization bringing hyperscale innovations to all, and the Ultra Accelerator Link™ (UALink™) Consortium announced a new collaboration to enhance scale-up interconnect performance in AI clusters and High-Performance Computing (HPC). The UALink Consortium is developing an open industry standard for high-performance accelerated compute scale-up interconnects tailored for AI and HPC workloads, while the OCP Community is actively designing sustainable, large-scale data center infrastructure with a focus on Open Systems for AI. Together, OCP and UALink aim to integrate UALink’s scale-up AI interconnect technology into OCP Community-delivered AI clusters, providing the high-bandwidth, low-latency, low-power connectivity required for high-performance AI training and inference.
“The rapid adoption of AI across industries, from autonomous systems to enterprise analytics, is driving unprecedented demand for scalable, high-performance AI infrastructure. This has created a pivotal moment for data center investments, with hyperscale operators deploying large-scale AI clusters to meet these needs. By collaborating, the UALink Consortium and the OCP Community can shape system specifications to address critical challenges in interconnect bandwidth and scalability posed by advanced AI models,” said George Tchaparian, CEO at the OCP Foundation.
Key aspects of the collaboration will focus on aligning OCP’s community-led infrastructure development with UALink’s interconnect innovations, ensuring seamless integration and shared objectives. The alliance will leverage the expertise of both organizations to advance scale-up AI interconnect performance. Following the release of UALink 1.0 Specification earlier this month, both organizations and their communities are setting up for collaboration across OCP’s Open Systems for AI Strategic Initiative and OCP’s Future Technologies Initiative Short-Reach Optical Interconnect workstream.
“AI and HPC workloads require ultra-low latency and massive bandwidth to handle the scale and complexity of accelerated compute data processing to meet LLM requirements. The UALink Consortium was formed to create an open standard for accelerated compute interconnects that meets these demands, enabling faster and more efficient data exchange. Partnering with the OCP Community will accelerate the adoption of UALink’s innovations into complete systems, delivering transformative performance for AI markets,” said Peter Onufryk, UALink Consortium President.
“The surge in generative AI and HPC applications is placing immense pressure on data center interconnects to deliver the bandwidth and responsiveness needed for training and inference. The alliance between OCP and UALink creates a powerful collaborative framework to develop and integrate advanced interconnect solutions, enhancing the performance of large-scale AI clusters. This alliance has the potential to redefine industry solutions for AI infrastructure,” said Sameh Boujelbene, VP at Dell’Oro Group.
Discover the latest trends and insights—explore the Business Insights Journal for up-to-date strategies and industry breakthroughs!