Design of a Reinforcement Learning PID Controller

Zhe Guan; Toru Yamamoto

doi:10.1002/tee.23430

journal article Jul 27, 2021

Design of a Reinforcement Learning PID Controller

Zhe Guan Toru Yamamoto

IEEJ Transactions on Electrical and Electronic Engineering Vol. 16 No. 10 pp. 1354-1360 · Wiley

View at Publisher Save 10.1002/tee.23430

Abstract

This paper addresses a design scheme of a proportional‐integral‐derivative (PID) controller with a new adaptive updating rule based on reinforcement learning (RL) approach for nonlinear systems. A new design scheme that RL can be used to complement the conventional PID control technology is presented. In the proposed scheme, a single radial basis function (RBF) network is considered to calculate the control policy function of Actor and the value function of Critic simultaneously. Regarding the PID controller structure, the inputs of RBF network are system errors, the difference of output as well as the second‐order difference of output, and they are composed of system states. The temporal difference (TD) error in the proposed scheme involves the reinforcement signal, the current and the previous stored value of the value function. The gradient descent method is adopted based on the TD error performance index, then, the updating rules can be yielded. Therefore, the network weights and the kernel function can be calculated in an adaptive way. Finally, the numerical simulations are conducted in nonlinear systems to illustrate the efficiency and robustness of the proposed scheme. © 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.

Topics

No keywords indexed for this article. Browse by subject →

References

25

[1]

Åström KJ (1995)

[2]

Ziegler JG "Optimum settings for automatic controllers" Transactions of the ASME (1942)

[3]

Chien KL "On the automatic control of generalized passive systems" Transactions of the ASME (1952)

[4]

10.1016/s0952-1976(03)00023-x

[5]

10.1049/ip-cta:20040853

[6]

10.1016/j.conengprac.2007.02.004

[7]

10.1016/s0959-1524(03)00039-8

[8]

10.1002/asjc.1806

[9]

10.1109/tie.2016.2636126

[10]

10.1007/978-0-387-45528-0

[11]

Sutton RS (2018)

[12]

10.1109/mcas.2009.933854

[13]

A Proposal of Adaptive PID Controller Based on Reinforcement Learning

Xue-song Wang, Yu-hu CHENG, Wei Sun

Journal of China University of Mining and Technolo... 10.1016/s1006-1266(07)60009-1

[14]

10.1016/s0967-0661(99)00141-0

[15]

10.1007/s00170-018-2864-2

[16]

10.1016/0893-6080(95)00042-9

[17]

10.1016/j.ifacol.2016.07.276

[18]

10.1016/j.apenergy.2017.03.081

[19]

10.1016/j.compchemeng.2019.05.029

[20]

Neuronlike adaptive elements that can solve difficult learning control problems

Andrew G. Barto, Richard S. Sutton

IEEE Transactions on Systems, Man, and Cybernetics 10.1109/tsmc.1983.6313077

[21]

10.1109/72.298229

[22]

10.1109/72.182710

[23]

Omatu S (1995)

[24]

Identification and control of dynamical systems using neural networks

K.S. Narendra, K. Parthasarathy

IEEE Transactions on Neural Networks 10.1109/72.80202

[25]

10.1109/9.280761

Cited By

44

Application of Reinforcement Learning-Based Adaptive PID Controller for Automatic Generation Control of Multi-Area Power System

Rasananda Muduli, Debashisha Jena · 2025

IEEE Transactions on Automation Sci...

Metrics

44

Citations

25

References

Details

Published: Jul 27, 2021
Vol/Issue: 16(10)
Pages: 1354-1360
License: View

Authors

Z

Zhe Guan

KOBELCO Construction Machinery, Dream‐Driven Co‐Creation Research Center Hiroshima University 1‐4‐1 Kagamiyama, Higashi‐Hiroshima 739‐8527 Japan

T

Toru Yamamoto

Academy of Science and Technology Hiroshima University 1‐4‐1 Kagamiyama, Higashi‐Hiroshima 739‐8527 Japan

Funding

Cabinet Office, Government of Japan

Cite This Article

Zhe Guan, Toru Yamamoto (2021). Design of a Reinforcement Learning PID Controller. IEEJ Transactions on Electrical and Electronic Engineering, 16(10), 1354-1360. https://doi.org/10.1002/tee.23430

Design of a Reinforcement Learning PID Controller

You May Also Like