高技术通讯(英文版)2025,Vol.31Issue(2):118-130,13.DOI:10.3772/j.issn.1006-6748.2025.02.002
StM:a benchmark for evaluating generalization in reinforcement learning
StM:a benchmark for evaluating generalization in reinforcement learning
摘要
关键词
reinforcement learning(RL)/generalization/benchmark/environmentKey words
reinforcement learning(RL)/generalization/benchmark/environment引用本文复制引用
袁凯钊..StM:a benchmark for evaluating generalization in reinforcement learning[J].高技术通讯(英文版),2025,31(2):118-130,13.基金项目
Supported by the National Key R&D Program of China(No.2023YFB4502200),the National Natural Science Foundation of China(No.U22A2028,61925208,62222214,62341411,62102398,62102399,U20A20227,62302478,62302482,62302483,62302480,62302481),the Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDB0660300,XDB0660301,XDB0660302),the Chinese Academy of Sciences Project for Young Scientists in Basic Research(No.YSBR-029)and the Youth Innovation Promotion Associa-tion of Chinese Academy of Sciences and Xplore Prize. (No.2023YFB4502200)