Sequential decision problems, dependent types and generic solutions

We present a computer-checked generic implementation for solving finite-horizon sequential decision problems. This is a wide class of problems, including inter-temporal optimizations, knapsack, optimal bracketing, scheduling, etc. The implementation can handle time-step dependent control and state spaces, and monadic representations of uncertainty (such as stochastic, non-deterministic, fuzzy, or combinations thereof). This level of genericity is achievable in a programming language with dependent types (we have used both Idris and Agda). Dependent types are also the means that allow us to obtain a formalization and computer-checked proof of the central component of our implementation: Bellman’s principle of optimality and the associated backwards induction algorithm. The formalization clarifies certain aspects of backwards induction and, by making explicit notions such as viability and reachability, can serve as a starting point for a theory of controllability of monadic dynamical systems, commonly encountered in, e.g., climate impact research.

Keywords

Dynamical systems, Stochastic systems, Central component, Climate impact researches, Dependent types, Generic implementation, Generic solutions, Induction algorithms, Principle of optimality, Sequential decisions, Problem oriented languages

Publication Type

Article

Version

publishedVersion

URI

https://doi.org/10.34657/3748
https://oa.tib.eu/renate/handle/123456789/5119

Collections

Mathematik

License

CC BY 4.0 Unported

https://creativecommons.org/licenses/by/4.0/

Full item page

Sequential decision problems, dependent types and generic solutions

Files

Date

Authors

Editor

Advisor

Volume

Issue

Journal

Series Titel

Book Title

Publisher

Supplementary Material

Other Versions

Link to publishers' Version

Abstract

Description

Keywords

Keywords GND

Conference

Publication Type

Version

URI

Collections

License