Reinforced optimal control

dc.bibliographicCitation.seriesTitleWIAS Preprintseng
dc.bibliographicCitation.volume2792
dc.contributor.authorBayer, Christian
dc.contributor.authorBelomestny, Denis
dc.contributor.authorHager, Paul
dc.contributor.authorPigato, Paolo
dc.contributor.authorSchoenmakers, John G. M.
dc.contributor.authorSpokoiny, Vladimir
dc.date.accessioned2022-06-30T13:24:03Z
dc.date.available2022-06-30T13:24:03Z
dc.date.issued2020
dc.description.abstractLeast squares Monte Carlo methods are a popular numerical approximation method for solving stochastic control problems. Based on dynamic programming, their key feature is the approximation of the conditional expectation of future rewards by linear least squares regression. Hence, the choice of basis functions is crucial for the accuracy of the method. Earlier work by some of us [Belomestny, Schoenmakers, Spokoiny, Zharkynbay, Commun. Math. Sci., 18(1):109?121, 2020] proposes to reinforce the basis functions in the case of optimal stopping problems by already computed value functions for later times, thereby considerably improving the accuracy with limited additional computational cost. We extend the reinforced regression method to a general class of stochastic control problems, while considerably improving the method?s efficiency, as demonstrated by substantial numerical examples as well as theoretical analysis.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/9442
dc.identifier.urihttps://doi.org/10.34657/8480
dc.language.isoeng
dc.publisherBerlin : Weierstraß-Institut für Angewandte Analysis und Stochastik
dc.relation.doihttps://doi.org/10.20347/WIAS.PREPRINT.2792
dc.relation.issn2198-5855
dc.rights.licenseThis document may be downloaded, read, stored and printed for your own use within the limits of § 53 UrhG but it may not be distributed via the internet or passed on to external parties.eng
dc.rights.licenseDieses Dokument darf im Rahmen von § 53 UrhG zum eigenen Gebrauch kostenfrei heruntergeladen, gelesen, gespeichert und ausgedruckt, aber nicht im Internet bereitgestellt oder an Außenstehende weitergegeben werden.ger
dc.subject.ddc510
dc.subject.otherReinforced regressioneng
dc.subject.otherleast squares Monte Carloeng
dc.subject.otherstochastic optimal controleng
dc.titleReinforced optimal controleng
dc.typeReporteng
dc.typeTexteng
dcterms.extent30 S.
tib.accessRightsopenAccess
wgl.contributorWIAS
wgl.subjectMathematik
wgl.typeReport / Forschungsbericht / Arbeitspapier
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
wias_preprints_2792.pdf
Size:
505.77 KB
Format:
Adobe Portable Document Format
Description: