The current TouchStone v0.1 test results cover only single-turn conversations, but we plan to extend the benchmark to multi-turn dialogues in the future.
We will soon release a technical report to provide an overview of the evaluation process and the detailed results. The benchmark will be open-sourced in the future.
Is TouchStone a VQA benchmark or a multi-turn dialogue benchmark? Will this benchmark be open-sourced?