TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness [paper]

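Environment: `trustscore_environment.yml`, exported with `conda env export > trustscore_environment.yml`. To recreate it, run `conda env create -f trustscore_environment.yml`.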

Behavior_consistency_example.ipynb: Example code showing how $Trust_{BC}$ works.

qa_human_check.json: Contains the mixed QA data used in this project, the predictions of Flan-T5-XXL, LLaMA-7B, and GPT-3.5-turbo, and the human evaluations of those predictions.
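
The snippet below is a minimal sketch for taking a first look at `qa_human_check.json`. The file's schema is not documented here, so the code makes no assumption about field names: it only reports the top-level type, the number of entries, and up to ten keys. The helper names `load_json` and `summarize` are illustrative and are not part of this repository.

```python
import json
from pathlib import Path


def load_json(path):
    """Load a JSON file and return the parsed Python object."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def summarize(data):
    """Describe the top-level shape of parsed JSON without assuming a schema."""
    if isinstance(data, dict):
        # Report the size and up to ten keys of a top-level object.
        return {"type": "dict", "size": len(data), "keys": sorted(data)[:10]}
    if isinstance(data, list):
        # For a top-level array, report the keys of its first element if that is an object.
        first = data[0] if data else None
        keys = sorted(first) if isinstance(first, dict) else []
        return {"type": "list", "size": len(data), "keys": keys[:10]}
    return {"type": type(data).__name__, "size": 1, "keys": []}


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    path = Path("qa_human_check.json")
    if path.exists():
        print(summarize(load_json(path)))
```

Run it from the repository root to see which fields are available before writing any analysis code against them.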
