Warm-Start Reinforcement Learning: From Function Approximation Error to Sub-Optimality Gap

Conventional reinforcement learning (RL) techniques face the formidable challenge of high sample complexity and heavy computational load, which hinders RL's applicability in real-world tasks. To tackle this challenge, Warm-Start RL is emerging as a promising new paradigm: the basic idea is to accelerate online learning by starting with an initial policy trained offline. Indeed, owing to the knowledge transferred from the initial policy, Warm-Start RL has been successfully applied in AlphaZero and ChatGPT, demonstrating its great potential to speed up online learning. Despite these remarkable successes, a fundamental understanding of Warm-Start RL is lacking. The primary objective of this study is to quantify the impact of function approximation errors on the sub-optimality gap of Warm-Start RL. We consider the widely used Actor-Critic method. Our findings reveal that a "good" warm-start policy (obtained by offline training) may be insufficient on its own, and that bias reduction during online learning also plays an essential role in lowering the sub-optimality gap.
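To make the setup concrete, below is a minimal Python sketch (not the speaker's method) of warm-starting an actor-critic loop: the actor parameters theta are initialized from an assumed offline-trained policy, and an online TD(0) critic supplies the advantage signal, whose approximation error is exactly the bias the abstract argues must be reduced online. The toy MDP, step sizes, and warm-start values are all illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy MDP (hypothetical, for illustration): 2 states, 2 actions.
    N_S, N_A, GAMMA = 2, 2, 0.9
    P = np.array([[[0.9, 0.1], [0.1, 0.9]],   # P[s, a, s']: transition probabilities
                  [[0.8, 0.2], [0.2, 0.8]]])
    R = np.array([[1.0, 0.0],                 # R[s, a]: rewards
                  [0.0, 1.0]])

    def softmax_policy(theta, s):
        z = np.exp(theta[s] - theta[s].max())
        return z / z.sum()

    # Warm start: the actor's parameters are assumed to come from offline
    # training (here a fixed array standing in for a pretrained policy).
    theta = np.array([[2.0, 0.0], [0.0, 2.0]])   # "good" warm-start policy
    w = np.zeros(N_S)                            # critic: tabular state values

    alpha_w, alpha_theta = 0.1, 0.05
    s = 0
    for t in range(5000):
        pi = softmax_policy(theta, s)
        a = rng.choice(N_A, p=pi)
        s_next = rng.choice(N_S, p=P[s, a])
        r = R[s, a]

        # Critic: TD(0) update. Any approximation error in w biases the
        # TD error that drives the actor; this is the online bias the
        # abstract says must be reduced, warm start or not.
        td_err = r + GAMMA * w[s_next] - w[s]
        w[s] += alpha_w * td_err

        # Actor: policy-gradient step using the (possibly biased) TD error.
        grad_log = -pi
        grad_log[a] += 1.0
        theta[s] += alpha_theta * td_err * grad_log
        s = s_next

    print("learned values:", w)
    print("policy at s=0:", softmax_policy(theta, 0))

Even with a strong warm-start theta, a poorly estimated critic w can push the actor in the wrong direction early on, which is one way to read the abstract's claim that a good initial policy alone is not enough.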
Speaker: Junshan Zhang, UC Davis
Attend in person or online. Passcode: 2009A
Thursday, 10/19/23
Cost: Free
Sonoma State Dept. of Engineering Science
Cerent Engineering Science Complex, Salazar Hall Room #2009A
Rohnert Park, CA 94928
Phone: (707) 664-2030
