Offline Learning

1 article in this category

AI News, Reinforcement Learning, Offline Learning

Training Safety-Critical Reinforcement Learning Agents Offline

Conservative Q-Learning achieves a 25% higher mean return than Behavior Cloning in safety-critical environments.
