We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C), which we’ve found performs as well as A3C. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requires only slightly more computation than A2C per update.
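To make "synchronous" concrete, here is a minimal sketch of the advantage computation an A2C-style update might perform once every parallel actor has finished its rollout segment. It is not taken from the Baselines code: the function names (`n_step_returns`, `synchronous_advantages`), the list-based data layout, and the default discount factor are all assumptions chosen for illustration.

```python
from typing import List


def n_step_returns(rewards: List[float], dones: List[bool],
                   bootstrap_value: float, gamma: float = 0.99) -> List[float]:
    """Discounted returns for one actor's rollout segment (hypothetical helper).

    dones[t] is True when the episode ended at step t, in which case nothing
    after step t contributes to the return at step t.
    """
    returns = [0.0] * len(rewards)
    running = bootstrap_value  # critic's value estimate for the state after the segment
    for t in reversed(range(len(rewards))):
        if dones[t]:
            running = 0.0  # episode boundary: do not carry value across it
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def synchronous_advantages(batch_rewards: List[List[float]],
                           batch_dones: List[List[bool]],
                           batch_values: List[List[float]],
                           bootstrap_values: List[float],
                           gamma: float = 0.99) -> List[List[float]]:
    """Advantages (return minus value estimate) for every actor in one batch.

    All actors' segments are processed together, so a single gradient step
    can then be taken over the whole batch instead of one step per actor.
    """
    advantages = []
    for rewards, dones, values, boot in zip(
            batch_rewards, batch_dones, batch_values, bootstrap_values):
        returns = n_step_returns(rewards, dones, boot, gamma)
        advantages.append([ret - v for ret, v in zip(returns, values)])
    return advantages
```

Because every actor's segment enters the same batch, the result does not depend on the order in which the actors happen to finish, which is one way to read the "deterministic" in the description above.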
Originally published on OpenAI News.