基于以太无损网络的智算中心光网络架构研究(特邀)OA北大核心CSTPCD
Research on Optical Network of Intelligent Computing Center based on Ethernet Lossless Networking
[目的]近年来,生成式人工智能(AIGC)掀起了人工智能革命,智算中心(ICC)的网络联接也随之向超高带宽、智能无损和算网融合等方向发展,因此 ICC光网络需要降低卡间通信时间,以提升数据访问效率.[方法]文章针对 ICC场景光网络的组网架构进行了研究,实现了大带宽、低时延和中央处理器(CPU)效率高的无损网络,满足了 ICC的大模型训练和推理需求.文章详细分析了 ICC的流量分布特征和人工智能(AI)大模型训练组网场景下的通信流特征,深入研究了基于远程直接内存访问(RDMA)的以太无损传输方案的 ICC组网架构,并最终在 ICC场景下进行了组网实践和时延测试.[结果]文章提出的基于以太网的 RDMA(RoCE)传输方案具备基于优先级的流控制、显示拥塞通知、增强传输选择和数据中心桥能力交换协议(DCBX)等能力,可实现数据中心内基于以太协议的无损传输.测试结果显示,使用 RoCE协议的传输时延大约稳定在 1μs,并且显著优于互联网广域 RDMA协议(iWARP).[结论]文章基于智算场景下的流量特征分析,深入研究了 ICC的无损以太网络关键特性,利用 RDMA技术实现了 ICC场景下光交换网络传输效率的提升,并提出了一种在 ICC 大模型推理场景下的无损以太网络方案,为 RDMA技术在智算场景下的应用探索出了可行的方向.
[Objective]In recent years,Artificial Intelligence Generated Content(AIGC)has set off the artificial intelligence revo-lution.The network connection of the Intelligent Computing Center(ICC)has also developed in the direction of ultra-high band-width,intelligent lossless,and computing network convergence.Therefore,the optical network of the ICC needs to reduce the inter-card communication time in order to improve the efficiency of data access.[Methods]The paper addresses the networking architecture of optical networks for ICC scenarios to realize a lossless network with large bandwidth,low latency and high Cen-tral Processor Unit(CPU)efficiency,which can satisfy the demand of large model training and reasoning in ICC.This paper analyzes in detail the traffic distribution characteristics of the ICC and the communication flow characteristics under the AI large model training networking scenario.It also conducts in-depth research on the technologies such as Ethernet lossless network based on Remote Direct Memory Access(RDMA)technology and optoelectronic co-encapsulation.Finally it carries out the net-working practice and latency test under the ICC scenario.[Results]The RDMA over Converged Ethernet(RoCE)-based trans-port scheme proposed in this paper has the capabilities of priority-based flow control,displaying congestion notification,en-hanced transport selection and data center bridge capability switching protocols,which can realize lossless transmission based on Ethernet protocols in data centers.The test results in this paper show that the transmission delay using the RoCE protocol is approximately stable at around 1 μs and significantly outperforms the Internet Wide Area RDMA Protocol(iWARP).[Con-clusion]In this paper,based on the traffic characterization in the intelligent computing scenario,we have studied the key char-acteristics of the lossless Ethernet network in the ICC,and used the RDMA technology to realize the enhancement of the trans-mission efficiency of the optical switching network in the scenario of the ICC.We have also put forward a lossless Ethernet net-work scheme under the large model inference scenario of the ICC,and explored the feasible direction for the application of the RDMA technology in the intelligent computing scenario.The proposed scheme explores a feasible direction for the application of RDMA technology in the smart computing scenario.
翟锐;李壮志;侯广营;马艺嘉;徐化朗
中国联合网络通信有限公司山东省分公司,济南 250002
电子信息工程
长距直接内存访问以太无损网络智算中心光交换
RDMAEthernet lossless networkICCoptical switching
《光通信研究》 2024 (005)
70-75 / 6
评论