Train open source models on ART.
<think>
tokens from previous turns, which makes training more complicated. It is still possible to use for multi-turn workflows by splitting each turn into a separate message history with our additional_histories
trajectory parameter (see Additional Histories).