LSTD with Random Projections

Tools | Bookmark & Share | Make MrWhy My Homepage

Answers Shopping eBay Amazon More Web Search Videos Search Recent News

MrWhy.com » Videos » LSTD with Random Projections

Watch Video

LSTD with Random Projections

We consider the problem of reinforcement learning in high-dimensional spaces when the number of features is bigger than the number of samples. In particular, we study the least-squares temporal difference (LSTD) learning algorithm when a space of low dimension is generated with a random projection from a high-dimensional space. We provide a thorough theoretical analysis of the LSTD with random projections and derive performance bounds for the resulting algorithm. We also show how the error of LSTD with random projections is propagated through the iterations of a policy iteration algorithm and provide a performance bound for the resulting least-squares policy iteration (LSPI) algorithm.

Channel: VideoLectures

Category: Educational

Video Length: 0

Date Found: March 26, 2011

Date Produced: March 25, 2011

View Count: 0

MrWhy.com Special Offers

About Us: About MrWhy.com | Advertise on MrWhy.com | Contact MrWhy.com | Privacy Policy | MrWhy.com Partners

Answers: Questions and Answers | Browse by Category

Comparison Shopping: Comparison Shopping | Browse by Category | Top Searches

Shop eBay: Shop eBay | Browse by Category

Shop Amazon: Shop Amazon | Browse by Category

Videos: Video Search | Browse by Category

Web Search: Web Search | Browse by Searches