单控制器与多控制器自适应动态规划（英文版）

单控制器与多控制器自适应动态规划（英文版）

分享

作者: 宋睿卓 , 魏庆来 , 李擎

出版社: 科学出版社

出版时间: 2019-06

版次: 1

ISBN: 9787030605276

定价: 168.00

装帧: 其他

开本: 16开

分类: 计算机与互联网

2人买过

Contents

1 Introduction 1

1.1 Optimal Control 1

1.1.1 Continuous-Time LQR 1

1.1.2 Discrete-Time LQR 2

1.2 Adaptive Dynamic Programming 3

1.3 Review of Matrix Algebra 5

References 6

2 Neural-Network-BasedApproach for Finite-TimeOptimal Control 7

2.1 Introduction 7

2.2 Problem Formulation and Motivation 9

2.3 The Data-Based Identifier 9

2.4 Derivation of the Iterative ADP Algorithm with Convergence Analysis 11

2.5 Neural Network Implementation of theIterative Control Algorithm 17

2.6 Simulation Study 18

2.7 Conclusion 20

References 22

3 Nearly Finite-HorizonOptimalControlfor Nonafiine Time-Delay Nonlinear Systems 25

3.1 Introduction 25

3.2 Problem Statement 26

3.3 The Iteration ADP Algorithm and ItsConvergence 30

3.3.1 The Novel ADP Iteration Algorithm 30

3.3.2 Convergence Analysis of the Improved Iteration Algorithm 33

3.3.3 Neural Network Implementation of the Iteration ADP Algorithm 38

3.4 Simulation Study 40

3.5 Conclusion 48

References 48

4 Multi-objective Optimal Control for Time-Delay Systems 49

4.1 Introduction 49

4.2 Problem Formulation 50

4.3 Derivation of the ADP Algorithm for Time-Delay Systems 51

4.4 Neural Network Implementation for the Multi-objective Optimal Control Problem of Time-Delay Systems 54

4.5 Simulation Study 55

4.6 Conclusion 61

References 62

5 Multiple Actor-Critic Optimal Control via ADP 63

5.1 Introduction 63

5.2 Problem Statement 65

5.3 SIANN Architecture-Based Classification 66

5.4 Optimal Control Based on ADP 69

5.4.1 Model Neural Network 70

5.4.2 Critic Network and Action Network 74

5.5 Simulation Study 82

5.6 Conclusion 91

References 91

6 Optimal Control for a Class of Complex-Valued Nonlinear Systems 95

6.1 Introduction 95

6.2 Motivations and Preliminaries 96

6.3 ADP-Based Optimal Control Design 99

6.3.1 Critic Network 99

6.3.2 Action Network. 101

6.3.3 Design of the Compensation Controller 102

6.3.4 Stability Analysis 103

6.4 Simulation Study 107

6.5 Conclusion. 110

References 110

7 Off-Policy Neuro-Optimal Control for Unknown Complex-Valued Nonlinear Systems 113

7.1 Introduction 113

7.2 Problem Statement 114

7.3 Off-Policy Optimal Control Method 115

7.3.1 Convergence Analysis of Off-Policy PI Algorithm 117

7.3.2 Implementation Method of Off-Policy Iteration Algorithm 119

7.3.3 Implementation Process 122

7.4 Simulation Study 122

7.5 Conclusion 125

References 125

8 Approximation-Error-ADP-Based Optimal Tracking Control for Chaotic Systems 127

8.1 Introduction 127

8.2 Problem Formulation and Preliminaries 128

8.3 Optimal Tracking Control Scheme Basedon Approximation-Error ADP Algorithm 130

8.3.1 Description of Approximation-Error ADP Algorithm 130

8.3.2 Convergence Analysis of the Iterative ADP Algorithm 132

8.4 Simulation Study 136

8.5 Conclusion 144

References 144

9 Off-Policy Actor-Critic Structure for Optimal Controlof Unknown Systems with Disturbances 147

9.1 Introduction 147

9.2 Problem Statement 148

9.3 Off-Policy Actor-Critic Integral Reinforcement Learning 151

9.3.1 On-Policy IRL for Nonzero Disturbance 151

9.3.2 Off-Policy IRL for Nonzero Disturbance 152

9.3.3 NN Approximation for Actor-Critic Structure 154

9.4 Disturbance Compensation Redesign andStability Analysis 157

9.4.1 Disturbance Compensation Off-Policy Controller Design 157

9.4.2 Stability Analysis 158

9.5 Simulation Study 161

9.6 Conclusion 163

References 163

10 An Iterative ADP Method to Solve for a Class of Nonlinear Zero-Sum DifferentialGames 165

10.1 Introduction 165

10.2 Preliminaries and Assumptions 166

10.3 Iterative Approximate Dynamic Programming Method for ZS Differential Games 169

10.3.1 Derivation of the Iterative ADP Method 169

10.3.2 The Procedure of theMethod 174

10.3.3 The Properties of theIterativeADP Method 176

10.4 Neural Network Implementation 190

10.4.1 The Model Network 191

10.4.2 The Critic Network 192

10.4.3 The Action Network 193

10.5 Simulation Study 195

10.6 Conclusion 204

References 204

11 Neural-Network-Based Synchronous Iteration Learning Method for Multi-player Zero-Sum Games 207

11.1 Introduction 207

11.2 Motivations and Preliminaries 208

11.3 Synchronous Solution of Multi-playerZSGames 213

11.3.1 Derivation of Off-Policy Algorithm 213

11.3.2 Implementation Method for Off-Policy Algorithm 214

11.3.3 Stability Analysis 218

11.4 Simulation Study 219

11.5 Conclusion 224

References 224

12 Off-Policy Integral Reinforcement Learning Method for Multi-player Non-Zero-Sum Games 227

12.1 Introduction 227

12.2 Problem Statement 228

12.3 Multi-player Learning PI SolutionforNZSGames 229

12.4 Off-Policy Integral ReinforcementLearningMethod 234

12.4.1 Derivation of Off-Policy Algorithm 234

12.4.2 Implementation Method for Off-Policy Algorith
目录:
Contents

1 Introduction 1

1.1 Optimal Control 1

1.1.1 Continuous-Time LQR 1

1.1.2 Discrete-Time LQR 2

1.2 Adaptive Dynamic Programming 3

1.3 Review of Matrix Algebra 5

References 6

2 Neural-Network-BasedApproach for Finite-TimeOptimal Control 7

2.1 Introduction 7

2.2 Problem Formulation and Motivation 9

2.3 The Data-Based Identifier 9

2.4 Derivation of the Iterative ADP Algorithm with Convergence Analysis 11

2.5 Neural Network Implementation of theIterative Control Algorithm 17

2.6 Simulation Study 18

2.7 Conclusion 20

References 22

3 Nearly Finite-HorizonOptimalControlfor Nonafiine Time-Delay Nonlinear Systems 25

3.1 Introduction 25

3.2 Problem Statement 26

3.3 The Iteration ADP Algorithm and ItsConvergence 30

3.3.1 The Novel ADP Iteration Algorithm 30

3.3.2 Convergence Analysis of the Improved Iteration Algorithm 33

3.3.3 Neural Network Implementation of the Iteration ADP Algorithm 38

3.4 Simulation Study 40

3.5 Conclusion 48

References 48

4 Multi-objective Optimal Control for Time-Delay Systems 49

4.1 Introduction 49

4.2 Problem Formulation 50

4.3 Derivation of the ADP Algorithm for Time-Delay Systems 51

4.4 Neural Network Implementation for the Multi-objective Optimal Control Problem of Time-Delay Systems 54

4.5 Simulation Study 55

4.6 Conclusion 61

References 62

5 Multiple Actor-Critic Optimal Control via ADP 63

5.1 Introduction 63

5.2 Problem Statement 65

5.3 SIANN Architecture-Based Classification 66

5.4 Optimal Control Based on ADP 69

5.4.1 Model Neural Network 70

5.4.2 Critic Network and Action Network 74

5.5 Simulation Study 82

5.6 Conclusion 91

References 91

6 Optimal Control for a Class of Complex-Valued Nonlinear Systems 95

6.1 Introduction 95

6.2 Motivations and Preliminaries 96

6.3 ADP-Based Optimal Control Design 99

6.3.1 Critic Network 99

6.3.2 Action Network. 101

6.3.3 Design of the Compensation Controller 102

6.3.4 Stability Analysis 103

6.4 Simulation Study 107

6.5 Conclusion. 110

References 110

7 Off-Policy Neuro-Optimal Control for Unknown Complex-Valued Nonlinear Systems 113

7.1 Introduction 113

7.2 Problem Statement 114

7.3 Off-Policy Optimal Control Method 115

7.3.1 Convergence Analysis of Off-Policy PI Algorithm 117

7.3.2 Implementation Method of Off-Policy Iteration Algorithm 119

7.3.3 Implementation Process 122

7.4 Simulation Study 122

7.5 Conclusion 125

References 125

8 Approximation-Error-ADP-Based Optimal Tracking Control for Chaotic Systems 127

8.1 Introduction 127

8.2 Problem Formulation and Preliminaries 128

8.3 Optimal Tracking Control Scheme Basedon Approximation-Error ADP Algorithm 130

8.3.1 Description of Approximation-Error ADP Algorithm 130

8.3.2 Convergence Analysis of the Iterative ADP Algorithm 132

8.4 Simulation Study 136

8.5 Conclusion 144

References 144

9 Off-Policy Actor-Critic Structure for Optimal Controlof Unknown Systems with Disturbances 147

9.1 Introduction 147

9.2 Problem Statement 148

9.3 Off-Policy Actor-Critic Integral Reinforcement Learning 151

9.3.1 On-Policy IRL for Nonzero Disturbance 151

9.3.2 Off-Policy IRL for Nonzero Disturbance 152

9.3.3 NN Approximation for Actor-Critic Structure 154

9.4 Disturbance Compensation Redesign andStability Analysis 157

9.4.1 Disturbance Compensation Off-Policy Controller Design 157

9.4.2 Stability Analysis 158

9.5 Simulation Study 161

9.6 Conclusion 163

References 163

10 An Iterative ADP Method to Solve for a Class of Nonlinear Zero-Sum DifferentialGames 165

10.1 Introduction 165

10.2 Preliminaries and Assumptions 166

10.3 Iterative Approximate Dynamic Programming Method for ZS Differential Games 169

10.3.1 Derivation of the Iterative ADP Method 169

10.3.2 The Procedure of theMethod 174

10.3.3 The Properties of theIterativeADP Method 176

10.4 Neural Network Implementation 190

10.4.1 The Model Network 191

10.4.2 The Critic Network 192

10.4.3 The Action Network 193

10.5 Simulation Study 195

10.6 Conclusion 204

References 204

11 Neural-Network-Based Synchronous Iteration Learning Method for Multi-player Zero-Sum Games 207

11.1 Introduction 207

11.2 Motivations and Preliminaries 208

11.3 Synchronous Solution of Multi-playerZSGames 213

11.3.1 Derivation of Off-Policy Algorithm 213

11.3.2 Implementation Method for Off-Policy Algorithm 214

11.3.3 Stability Analysis 218

11.4 Simulation Study 219

11.5 Conclusion 224

References 224

12 Off-Policy Integral Reinforcement Learning Method for Multi-player Non-Zero-Sum Games 227

12.1 Introduction 227

12.2 Problem Statement 228

12.3 Multi-player Learning PI SolutionforNZSGames 229

12.4 Off-Policy Integral ReinforcementLearningMethod 234

12.4.1 Derivation of Off-Policy Algorithm 234

12.4.2 Implementation Method for Off-Policy Algorith

查看详情

相关分类

计算机理论编程与开发操作系统大数据与云计算图形图像/多媒体网站设计与网页开发网络与通讯硬件、嵌入式开发办公软件信息安全辅助设计与工程计算软件工程/开发项目管理

单控制器与多控制器自适应动态规划（英文版）正版品相完好，套书和多封面版本咨询客服后再下单

九品

新起点书店

北京市海淀区

平均发货23小时成功完成率89.81%

￥156.86

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）正版现货，品相完整，套书只发一本,多版面书籍只对书名

九品

旧书香书城

北京市昌平区

平均发货23小时成功完成率88.61%

￥156.14

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版未拆封

全新

A小二郎书舍A

湖南省长沙市

平均发货33小时成功完成率85.56%

￥146.14

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版快速发货

全新

星尘大海书店

天津市河北区

平均发货25小时成功完成率78.71%

￥148.16

券

100减20

立即购买加入购物车
【正版新书】单控制器与多控制器自适应动态规划（英文版）出版社，新华书店库存新书，保证正版，放心下单

全新

精诚所至正品书专营店

浙江省杭州市

平均发货5小时成功完成率82.41%

￥147.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）正版认证假一赔十支持七天无理由退货

全新

兴文书店

北京市海淀区

平均发货16小时成功完成率88.46%

￥132.30

券

100减20

立即购买加入购物车不属于本条目
单控制器与多控制器自适应动态规划（英文版）正版全新

全新

闲暇一卷书的书店

上海市浦东新区

平均发货21小时成功完成率85.09%

￥119.70

券

100减20

立即购买加入购物车不属于本条目
单控制器与多控制器自适应动态规划（英文版）全新正版现货

全新

天涯淘书阁

四川省成都市

平均发货22小时成功完成率91.19%

￥126.00

券

100减20

立即购买加入购物车不属于本条目
单控制器与多控制器自适应动态规划（英文版）【标题与图片不一致时,请质询，正版有货可开发票】

全新

雅逸阁书店

海南省海口市

平均发货22小时成功完成率80.86%

￥148.00

券

100减20

立即购买加入购物车不属于本条目
单控制器与多控制器自适应动态规划（英文版）【正版有货可开发票；库存情况请咨询，及标题与图片不一致时】

全新

书香静谧书店

广东省广州市

平均发货8小时成功完成率91.89%

￥190.00

券

100减20

立即购买加入购物车不属于本条目
3

单控制器与多控制器自适应动态规划（英文版）

全新

帅帅书社

北京市朝阳区

平均发货9小时成功完成率96.21%

￥110.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版

全新

品雅轩文斋

北京市通州区

平均发货34小时成功完成率84.27%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）按需印刷全新正版

全新

知汇文轩书店

北京市通州区

平均发货36小时成功完成率88.1%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版

全新

博文苑

北京市通州区

平均发货25小时成功完成率89.17%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）按需印刷

全新

藏典阁书店

北京市通州区

平均发货25小时成功完成率90.24%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版按需印刷

全新

专利文献资料汇编

北京市通州区

平均发货38小时成功完成率87.76%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）全新正版按需印刷

全新

启慧知远书店

北京市通州区

平均发货39小时成功完成率88.64%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）单控制器与多控制器自适应动态规划（英文版） 3I30j ax预售介意者慎拍拍下即表示认可祝您购物愉快！版次更新不同步以实际收到书为准

全新

东方博古书城

北京市房山区

平均发货30小时成功完成率78.26%

￥168.00

券

100减20

立即购买加入购物车
单控制器与多控制器自适应动态规划（英文版）正版现货，品相完整，套书只发一本,多版面书籍只对书名

九品

RUC书店

北京市昌平区

平均发货20小时成功完成率76.19%

￥156.65

券

100减20

立即购买加入购物车
3

单控制器与多控制器自适应动态规划（英文版） 9.78703E+12

全新

仟寻书局

北京市朝阳区

平均发货31小时成功完成率86.82%

￥154.00

券

100减20

立即购买加入购物车不属于本条目
单控制器与多控制器自适应动态规划(英文版)(精)

全新

煤老板就是不一样

山东省德州市

平均发货12小时成功完成率90.82%

￥129.34

券

100减20

立即购买加入购物车不属于本条目
3

[按需印刷]单控制器与多控制器自适应动态规划（英文版）/宋睿卓，魏庆来，李擎 9787030605276

全新

井大书店

江西省吉安市

平均发货55小时成功完成率84.3%

￥151.12

券

100减20

立即购买加入购物车不属于本条目