# DebUnc: Mitigating Hallucinations in Large Language Model Agent Communication with Uncertainty Estimations
This repo contains the code and data for the paper "DebUnc: Mitigating Hallucinations in Large Language Model Agent Communication with Uncertainty Estimations".
## Setup

```bash
git clone https://github.com/lukeyoffe/debunc.git
cd debunc
conda create --name debunc python=3.10 -y
conda activate debunc
pip install -U pip==24.0
pip install -e .
```
To use gated models, log in to Hugging Face with the following command:

```bash
huggingface-cli login
```
## Usage

The scripts to run and evaluate on the various benchmarks can be found in `src/debate/`.
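As a rough illustration of how these scripts are typically used (the benchmark directory and script names below are placeholders, not the repository's actual file names):

```bash
# Placeholder paths: substitute the actual benchmark directory and
# script names found under src/debate/.
python src/debate/<benchmark>/run.py    # run the multi-agent debate
python src/debate/<benchmark>/eval.py   # evaluate the generated answers
```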
To use Llama 3 instead of Mistral 7B, replace `"mistralai/Mistral-7B-Instruct-v0.2"` with `"meta-llama/Meta-Llama-3-8B-Instruct"`. Other models are not currently supported.
To use TokenSAR instead of Mean Token Entropy, replace `ue_method = MeanTokenEntropy()` with `ue_method = TokenSAR()`, as in the sketch below.
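A minimal sketch of these two settings together, assuming the estimators are importable from the bundled `lm_polygraph` package as in upstream LM-Polygraph (the exact import path used by the debate scripts may differ):

```python
# Sketch of the two configuration knobs described above; the import path
# is an assumption based on upstream LM-Polygraph.
from lm_polygraph.estimators import MeanTokenEntropy, TokenSAR

model_name = "mistralai/Mistral-7B-Instruct-v0.2"      # default model
# model_name = "meta-llama/Meta-Llama-3-8B-Instruct"   # to use Llama 3 instead

ue_method = MeanTokenEntropy()  # default uncertainty estimation method
# ue_method = TokenSAR()        # to use TokenSAR instead
```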
`src/models/demo.ipynb` contains a demonstration of attention scaling applied to retrieval-augmented generation (RAG).
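For intuition, the core idea of attention scaling can be sketched as follows: attention weights over another agent's (or a retrieved passage's) tokens are multiplied by a per-token confidence score and renormalized, so less reliable text receives less attention. This is a minimal illustration, not the repository's exact implementation in `src/models`:

```python
import torch

def scale_attention(attn_weights: torch.Tensor,
                    confidence: torch.Tensor) -> torch.Tensor:
    """Downweight attention to low-confidence tokens and renormalize.

    attn_weights: post-softmax attention weights of shape (..., num_keys)
    confidence:   per-key confidence scores of shape (num_keys,),
                  e.g. derived from an uncertainty estimate
    """
    scaled = attn_weights * confidence              # broadcast over the key axis
    return scaled / scaled.sum(-1, keepdim=True)    # rows sum to 1 again
```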
## Citation

```bibtex
@article{yoffe2024debunc,
  title={DebUnc: Mitigating Hallucinations in Large Language Model Agent Communication with Uncertainty Estimations},
  author={Yoffe, Luke and Amayuelas, Alfonso and Wang, William Yang},
  journal={arXiv preprint arXiv:2407.06426},
  year={2024}
}
```
## Acknowledgements

The code in `src/lm-polygraph` is based on the LM-Polygraph project and contains implementations of various uncertainty metrics.

The `modeling_*.py` files in `src/models` are based on the Hugging Face Transformers library, with modifications to perform attention scaling. The attention scaling code is located between the `##### <AttentionScaling> #####` and `##### </AttentionScaling> #####` markers.