The final project for Applied Deep Learning (ADL) 2024 at NTU, taught by Prof. Yun-Nung (Vivian) Chen. The project is based on the paper StreamBench: Towards Benchmarking Continuous Improvement of Language Agents.
We propose a framework for LLMs to seek user support, design evaluation metrics to measure the trade-off between performance boost and user burden, and empirically assess this ability on Text-to-SQL generation.
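To make the trade-off between performance boost and user burden concrete, here is a minimal sketch of one way it could be measured. It is an illustration only, not the project's actual metric: the `Outcome` record, the `tradeoff` function, and the definitions used (boost as accuracy gain over a no-help baseline, burden as the fraction of queries where the model asked for support) are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Outcome:
    """Hypothetical per-query record for a Text-to-SQL evaluation run."""
    asked: bool            # did the model request user support on this query?
    correct_without: bool  # was the generated SQL correct with no help?
    correct_with: bool     # was the generated SQL correct after receiving help?


def tradeoff(outcomes):
    """Return an assumed (performance boost, user burden) summary.

    performance_boost: accuracy under the ask policy minus baseline accuracy.
    user_burden: fraction of queries on which the model asked for support.
    """
    n = len(outcomes)
    if n == 0:
        raise ValueError("outcomes must not be empty")
    # Baseline: accuracy if the model never asks for help.
    baseline = sum(o.correct_without for o in outcomes) / n
    # Policy accuracy: help only counts on queries where the model asked.
    final = sum(
        (o.correct_with if o.asked else o.correct_without) for o in outcomes
    ) / n
    burden = sum(o.asked for o in outcomes) / n
    return {"performance_boost": final - baseline, "user_burden": burden}


if __name__ == "__main__":
    runs = [
        Outcome(asked=True, correct_without=False, correct_with=True),
        Outcome(asked=False, correct_without=True, correct_with=True),
        Outcome(asked=True, correct_without=True, correct_with=True),
        Outcome(asked=False, correct_without=False, correct_with=True),
    ]
    print(tradeoff(runs))
```

Under these assumed definitions, a model that asks on every query maximizes the boost but also the burden, while a model that asks only when it would otherwise fail gets the same boost at lower burden.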