• Non ci sono risultati.

A model of online learning as a Linear Quadratic Gaussian (LQG) optimal control problem with random matrices

N/A
N/A
Protected

Academic year: 2021

Condividi "A model of online learning as a Linear Quadratic Gaussian (LQG) optimal control problem with random matrices"

Copied!
1
0
0

Testo completo

(1)

A model of online learning as a Linear Quadratic Gaussian (LQG) optimal control problem with random matrices

Giorgio Gnecco, Alberto Bemporad, Marco Gori, Rita Morisi, and Marcello Sanguineti

Abstract - Machine learning and optimal control are combined to develop and solve in closed form an optimal-control formulation of online learning from supervised examples. In the proposed framework, the examples are used to learn an unknown vector parameter that models the relationship between inputs and outputs. The connections with the classical Linear Quadratic Gaussian (LQG) optimal control problem are investigated. The proposed problem is a non-trivial variation of the latter, as it involves random matrices. The optimal solution is compared with the Kalman-filter estimate of the parameter vector to be learned and the improvements in terms of smoothness and robustness to outliers are shown. These properties are useful, e.g., for the extension of the proposed online-learning framework to the case of a slowly-time-varying parameter vector.

Riferimenti

Documenti correlati

This chapter has the aim to introduce the reader to option pricing, implicit contracts and wage bargaining issues and how deterministic and stochastic optimal control theory can

Come abbiamo ripetuto più volte nel primo capitolo, il cambiamento del finale potrebbe causare un mutamento di tono dell’intera storia, ma, il senso di malinconia

The fifth chapter by Dagmar Sternad focuses on a particular motor task, bouncing a ball on the tennis racket to address several basic aspects of the produc- tion of rhythmic actions

introduced by Aronson in [1], in the study of uniformly parabolic equations, only relies on the local upper bound proved by Moser and provides an pointwise bound of the

A mysterious character who looks at us from a pictorial scene in which nobody looks at what we are looking at, that is, the flayed and monstrous hand of a hanged and dissected

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/233771716 Hospital admissions for food-induced anaphylaxis in

Como resultado de todos estos estudios, investigaciones y compromisos internacionales se ha consensuado, aunque está todavía en construcción el concepto, de la economía

6, cassa la tua manina (arre- stando il tratto di penna prima del punto fermo dattiloscritto) e inserisce pende dal letto (seguito da punto fermo) nell’interlinea superiore (notevole