Book contents
- Frontmatter
- Contents
- Preface
- Acknowledgments
- 1 Introduction
- 2 The perception–action cycle
- 3 Power-spectrum estimation for sensing the environment
- 4 Bayesian filtering for state estimation of the environment
- 5 Dynamic programming for action in the environment
- 6 Cognitive radar
- 7 Cognitive radio
- 8 Epilogue
- Glossary
- References
- Index
5 - Dynamic programming for action in the environment
Published online by Cambridge University Press: 05 June 2012
- Frontmatter
- Contents
- Preface
- Acknowledgments
- 1 Introduction
- 2 The perception–action cycle
- 3 Power-spectrum estimation for sensing the environment
- 4 Bayesian filtering for state estimation of the environment
- 5 Dynamic programming for action in the environment
- 6 Cognitive radar
- 7 Cognitive radio
- 8 Epilogue
- Glossary
- References
- Index
Summary
The previous two chapters of the book have focused on the basic issue: perception of the environment from two different viewpoints. Specifically, Chapter 3 focused on perception viewed as power-spectrum estimation, which is applicable to applications such as cognitive radio, where spectrum sensing of the radio environment is of paramount importance. In contrast, Chapter 4 focused on Bayesian filtering for state estimation, which is applicable to another class of applications exemplified by cognitive radar, where estimating the parameters of a target in a radar environment is the issue of interest. Although those two chapters address entirely different topics, they also have a common perspective: they both pertain to the perceptor of a cognitive dynamic system, the basic function of which is to account for perception in the perception–action cycle. In this chapter, we move on to the action part of this cycle, which is naturally the responsibility of the actuator of the cognitive dynamic system.
Just as the Bayesian framework provides the general formalism for the perceptor to perceive the environment by estimating the hidden state of the environment, Bellman's dynamic programming provides the general formalism for the actuator to act in the environment through control or decision-making.
Dynamic programming is a technique that deals with situations where decisions are made in stages (i.e. different time steps), with the outcome of each decision being predictable to some extent before the next decision is made.
- Type
- Chapter
- Information
- Cognitive Dynamic SystemsPerception-action Cycle, Radar and Radio, pp. 125 - 166Publisher: Cambridge University PressPrint publication year: 2012