What is ART?
Can I start RL from an existing SFT LoRA adapter?
base_model
when creating your TrainableModel
.How does ART work under the hood?
Why separate frontend from backend? Doesn't that increase complexity?
How do I know whether ART can help my agent improve performance?
Which pieces of ART are open source?
How expensive are ART training runs?
Can I use ART to train on user feedback in production directly?
Is ART just for agents? I'm interested in training a non-agentic model with RL.