基于深度强化学习的炼钢车间天车调度方法

doi:10.13228/j.boyuan.issn1006-9356.20200402

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (0 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要针对炼钢车间天车任务产生的动态不确定性,提出了基于深度强化学习算法的炼钢车间天车调度方法。首先,基于强化学习将天车调度问题转化为对天车操作动作序列的求解,采用DQN(Deep Q-network)算法构建动作价值网络模型进行求解;然后,以某钢厂出钢跨天车调度为研究对象,以任务完成总时间最短为目标,介绍了基于深度强化学习的天车调度方法的具体设计;最后,采用实际数据对天车动作价值网络模型进行训练,与目前现场广泛使用的基于固定分区的天车调度方案进行仿真试验对比。结果表明,基于深度强化学习的天车调度方法在任务完成总时间上减少了11.52%,提高了天车任务的完成效率,说明了方法的可行性和优化性,为天车调度研究提供了新的思路。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	作者相关文章
	林时敬
	徐安军
	刘成
	冯凯
	李稷

关键词 ：天车调度, 深度强化学习, DQN, 炼钢厂, 仿真, 神经网络

Abstract：Aiming at the dynamic uncertainty caused by overhead crane operation in steelmaking workshop, a crane scheduling method in steelmaking workshop based on deep reinforcement learning algorithm is proposed.Firstly, based on reinforcement learning, the crane scheduling problem is transformed into solving the sequence of crane operation, and DQN (Deep Q-network) algorithm is used to build the action value network model for solving.Then, taking a steel plant as the research object, taking the shortest time to complete the task as the goal, the specific design of the crane scheduling method based on deep reinforcement learning is introduced.Finally, the actual data is used to train the crane action value network model, and the method proposed in this paper is compared with the current widely used crane scheduling method based on fixed partition by simulation experiments. The results show that the cranescheduling method based on deep reinforcement learning reduces the total task completion time by 11.52%, improves the completion efficiency of the crane task, and proves the feasibility of the method. It provides a new idea for the research of the crane scheduling.

Key words： crane scheduling deep reinforcement learning DQN steel works simulation neural network

收稿日期: 2020-07-01

基金资助:国家自然科学基金资助项目(51674030); 国家重点研发计划资助项目(2017YFB0304001)

作者简介: 林时敬(1994—),男,硕士生;E-mail:17801055073@163.com

引用本文:

林时敬, 徐安军, 刘成, 冯凯, 李稷. 基于深度强化学习的炼钢车间天车调度方法[J]. 中国冶金, 2021, 31(3): 37-43. LIN Shi-jing, XU An-jun, LIU Cheng, FENG Kai, LI Ji. Crane scheduling method in steelmaking workshop based on deep reinforcement learning[J]. China Metallurgy, 2021, 31(3): 37-43.

链接本文:

http://www.zgyj.ac.cn/CN/10.13228/j.boyuan.issn1006-9356.20200402 或 http://www.zgyj.ac.cn/CN/Y2021/V31/I3/37

[1]	俞侠.炼钢-精炼-连铸生产过程天车调度问题研究[D].沈阳:东北大学,2012.
[2]	王勇,胡建光,孙玉军,等.智能制造在梅钢炼钢厂的应用实践[J].中国冶金,2018,28(1):32.
[3]	颉建新,张福明.钢铁制造流程智能制造与智能设计[J].中国冶金,2019,29(2):1.
[4]	张强.钢厂天车多机多任务的动态调度模型研究[C]//中国计量协会冶金分会2011年会暨全国自动化应用技术学术交流会论文.西安:中国计量协会冶金分会,2011: 246.
[5]	雷兆明,王鹏程,廖文喆,等.钢铁企业同轨多天车调度方法研究[J].计算机仿真,2019,36(6):465.
[6]	李维刚,王肖,赵云涛,等.基于栅格法的钢厂无人天车调度系统[J].系统仿真学报,2020,32(4):687.
[7]	ZHAO G D,LIU J,DONG Y.Scheduling the operations of a double-load crane in slab yards[J].International Journal of Production Research,2020,58(9):1.
[8]	庞新富,刘炜,李海波,等.炼钢-连铸生产过程运输设备天车调度方法[J].信息与控制,2019,48(6):745.
[9]	臧雪松,徐安军,李稷,等.炼钢-连铸区段天车调度的多目标建模与求解[J].中国冶金,2020,30(2):68.
[10]	LI J,XU A J,ZANG X S.Simulation-based solution for a dynamic multi-crane-scheduling problem in a steelmaking shop[J].International Journal of Production Research,2019(9):1.
[11]	郑忠,周超,陈开.基于免疫遗传算法的车间天车调度仿真模型[J].系统工程理论与实践,2013,33(1):223.
[12]	马长波.基于多智能体的炼钢厂车间天车调度仿真方法研究[D].昆明:昆明理工大学,2011.
[13]	Mnih V,Kavukcuoglu K,Silver D,et al.Playing atari with deep reinforcement learning[DB/OL].(2013-12-19) [2020-02-13].https://arxiv.org/pdf/1312.5602.pdf.
[14]	MENG H,CHAO D,GUO Q,et al.Delay-sensitive task scheduling with deep reinforcement learning in mobile-edge computing systems[J].Journal of Physics: Conference Series,2019,1229(1):484.
[15]	WANG H J,YANG Z,ZHOU W G,et al.Online scheduling of image satellites based on neural networks and deep reinforcement learning[J].Chinese Journal of Aeronautics,2019,32(4):1011.
[16]	Sutton R S,Barto A G.Reinforcement learning: An introduction[J].Machine Learn,1992,8(3):279.
[17]	Watkins C J,Dayan P.Q-learning[J].Machine Learn,1992,8(3):225.
[18]	Mnih V,Kavukcuoglu K,Silver D,et al.Human-level control through deep reinforcement learning[J].Nature,2015,518(7540): 529.