信息通信技术与政策2024,Vol.50Issue(6):2-9,8.DOI:10.12267/j.issn.2096-5931.2024.06.001
大模型算力基础设施技术趋势、关键挑战与发展路径
Large model computing infrastructure technological trends,key challenges,and development trajectories
张政 1冯少飞1
作者信息
- 1. 浪潮电子信息产业股份有限公司,北京 100089
- 折叠
摘要
Abstract
Starting from the latest technological development trends of large models, this paper first analyzes the architectural characteristics and computing power demand features of multimodal, long sequence, and mixture of experts models. Further, it focuses on the requirements of the latest large models for massive computing power scale and complex communication patterns. It quantitatively analyzes the current development problems and technical challenges faced by large model computing infrastructure from two aspects: computating efficiency and cluster interconnection technology. Finally, it proposes a high-quality computing infrastructure development trajectory oriented by applications, centered on systems, and targeted at efficiency.关键词
多模态模型/长序列模型/混合专家模型/算力利用效率/集群互联/高质量算力Key words
multimodal model/long sequence model/mixture of experts model/computating efficiency/cluster interconnection/high-quality computing power分类
信息技术与安全科学引用本文复制引用
张政,冯少飞..大模型算力基础设施技术趋势、关键挑战与发展路径[J].信息通信技术与政策,2024,50(6):2-9,8.