Toolkit&Dataset

metatool
MetaTool
Dataset | ICLR 2024 | A benchmark/dataset designed to evaluate whether LLMs have tool usage awareness and can correctly choose tools.
datagen
DataGen
Toolkit | ICLR 2025 | DataGen is an LLM-powered framework designed to generate diverse, accurate, and highly controllable text datasets.
trustllm
TrustLLM
Toolkit | ICML 2024 | Trustllm (python package) help you assess the performance of your LLM in trustworthiness more quickly.
trusteval
TrustEval
Toolkit | NAACL 2025 Demo | TrustEval is a modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs). This toolkit enables you to evaluate models across various dimensions such as safety, fairness, robustness, privacy, and more.