机电工程技术2024,Vol.53Issue(6):105-108,4.DOI:10.3969/j.issn.1009-9492.2024.06.024
面向自动化领域AI模型训练的光交换计算集群系统设计
Design of Optical Switching Computing Cluster System for AI Model Training in Automation Field
摘要
Abstract
In response to the increasing demand for computing power in AI model training in the field of automation,an optical switching computing cluster system has been designed,which includes complete control and communication processes and can achieve larger bandwidth and smaller latency than an electric switching computing cluster.At the system level,detailed performance modeling was conducted from the internal hardware and software costs,network costs,algorithm costs,and communication costs of the AI server.The performance calculation of the AI model trained optical switching computing cluster system was quantified,and an AI model trained optical switching computing cluster system performance simulation software was developed.The calculation results of the developed simulation software under different parameter settings are consistent with the theoretical calculation results,and the average running time of the software simulation is 0.432 seconds.The software inputs parameters through the UI interactive interface,then calculates them into the modeling formula,and displays the calculated results on the interface.Building a modular system with menu bar style parameter settings,this software can reduce the difficulty of users′entry and operation,facilitate performance simulation of the optical switching computing cluster system,and guide the design and optimization of the entire optical switching computing cluster system.关键词
人工智能/光交换/AI分布式训练/系统开发Key words
AI/optical switching/AI distributed training/system development分类
信息技术与安全科学引用本文复制引用
黎泽,彭慧斌..面向自动化领域AI模型训练的光交换计算集群系统设计[J].机电工程技术,2024,53(6):105-108,4.基金项目
广州铁路职业技术学院人才引进项目"人工智能业务驱动的光交换计算集群关键技术研究"(GTXYR2318) (GTXYR2318)