Linear Complementarity for Regularized Policy Evaluation and Improvement
Recent work in reinforcement learning has emphasized the power of L1 regularization to perform feature selection and prevent overfitting. We propose formulating the L1 regularized linear fixed point problem as a linear complementarity problem (LCP). This formulation offers several advantages over the LARS-inspired formulation, LARS-TD. The LCP formulation allows the use of efficient off-the-shelf solvers, leads to a new uniqueness result, and can be initialized with starting points from similar problems (warm starts). We demonstrate that warm starts, as well as the efficiency of LCP solvers, can speed up policy iteration. Moreover, warm starts permit a form of modified policy iteration that can be used to approximate a "greedy" homotopy path, a generalization of the LARS-TD homotopy path that combines policy evaluation and optimization.
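The abstract does not spell out what a linear complementarity problem looks like or how a warm start enters, so the following is a minimal sketch under stated assumptions. The standard LCP is: given a matrix M and vector q, find x >= 0 such that Mx + q >= 0 and x . (Mx + q) = 0. The solver below is a generic projected Gauss-Seidel iteration, not necessarily the off-the-shelf solver the authors use, and the M and q arising from the L1 regularized fixed point problem are not reproduced here; the toy M and q are stand-ins.

import numpy as np

def solve_lcp_pgs(M, q, x0=None, max_iters=10000, tol=1e-10):
    # Projected Gauss-Seidel for the LCP:
    #   find x >= 0 such that Mx + q >= 0 and x . (Mx + q) = 0.
    # x0 is an optional warm start; reusing the solution of a nearby
    # LCP typically cuts the number of sweeps needed.
    n = q.shape[0]
    x = np.zeros(n) if x0 is None else x0.astype(float).copy()
    for _ in range(max_iters):
        x_old = x.copy()
        for i in range(n):
            # Coordinate-wise complementarity update, clamped at zero.
            x[i] = max(0.0, x[i] - (M[i] @ x + q[i]) / M[i, i])
        if np.max(np.abs(x - x_old)) < tol:
            break
    return x

# Toy problem with a symmetric positive definite M, a regime in which
# projected Gauss-Seidel is known to converge.
rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
M = A @ A.T + 5.0 * np.eye(5)
q = rng.standard_normal(5)

x = solve_lcp_pgs(M, q)                # cold start
x2 = solve_lcp_pgs(M, q + 0.01, x0=x)  # warm start on a perturbed problem
w = M @ x2 + (q + 0.01)
print("x >= 0:", np.all(x2 >= -1e-12),
      "w >= 0:", np.all(w >= -1e-8),
      "x.w ~ 0:", abs(x2 @ w) < 1e-8)

The warm-started call mirrors the abstract's claim about warm starts across policy iteration: when successive policies induce nearby LCPs, initializing from the previous solution typically converges in far fewer sweeps than a cold start.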
Channel: VideoLectures
Category: Educational
Date Found: January 15, 2011
Date Produced: January 12, 2011