v0.1.3
- Form Filler Agent described in paper now available in the examples
- uv as the package manager
- Agent optimization examples
- One agent can use many LLMs now
- Faster RL loops with batching, multi-GPU and multi-node support
- Using VLLM with log probability and token ids during RL training