DynamicGPTSwarm

Introduction

Recent progress in Large Language Models (LLMs) and language agents shows significant promise for applications across many disciplines. Traditional approaches to language agents often rely on fixed, handcrafted designs. Our research aims to develop agents that are both learnable and dynamic, using a graph framework whose edges are generated dynamically from the input.

In this framework, we learn a model that generates edges representing the flow of communication within the graph, adjusting the internal communication of a language agent. By fine-tuning a pretrained LLM with reinforcement learning on multiple datasets, we demonstrate that our approach surpasses static methods in accuracy and adaptability across various tasks. Specifically, our approach achieves nearly 6% higher accuracy on a combined dataset of MMLU and CMMLU, and over 10% higher with a sparsity-inducing loss.
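The idea of generating edges from the input can be illustrated with a minimal sketch in plain Python. This is not the repository's implementation: the single linear layer standing in for the edge network, the function names `edge_probabilities` and `sample_edges`, and the toy weights are all assumptions made for illustration. The sketch maps an input embedding to one probability per candidate edge, then samples a binary edge mask together with its log-probability, which is the quantity a policy-gradient method would need.

```python
import math
import random


def edge_probabilities(input_embedding, weights, biases):
    """Map an input embedding to one probability per candidate edge.

    Hypothetical stand-in for an edge network: one linear layer + sigmoid.
    """
    probs = []
    for w_row, b in zip(weights, biases):
        logit = sum(w * x for w, x in zip(w_row, input_embedding)) + b
        probs.append(1.0 / (1.0 + math.exp(-logit)))
    return probs


def sample_edges(probs, rng):
    """Sample a binary edge mask and return it with its log-probability."""
    mask, log_prob = [], 0.0
    for p in probs:
        keep = rng.random() < p
        mask.append(keep)
        log_prob += math.log(p if keep else 1.0 - p)
    return mask, log_prob


# Toy example: a 3-dimensional input embedding and 3 candidate edges.
rng = random.Random(0)
embedding = [0.5, -1.0, 2.0]
weights = [[0.1, 0.2, 0.3], [-0.4, 0.0, 0.1], [0.0, 0.0, 0.0]]
biases = [0.0, 0.5, 0.0]

probs = edge_probabilities(embedding, weights, biases)
mask, log_prob = sample_edges(probs, rng)
print(probs, mask, log_prob)
```

Because the probabilities depend on the embedding, two different inputs can yield two different communication graphs, in contrast to a static design where the edge set is fixed once.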

Features

Dynamic edge generation based on input
Train edge generation with reinforcement learning
Supports multiple datasets simultaneously
Superior performance on MMLU, CMMLU, and Mini Crossword Puzzles datasets

Installation

To install and set up the repository, follow these steps:

Clone the repository:

    git clone https://github.com/lukasVierling/DynamicGPTSwarm.git
    cd DynamicGPTSwarm

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

To use the code, run one of the following scripts:

    python run_mmlu.py <arguments>

or

    python run_crosswords.py <arguments>

You can include any of the following arguments:

mode: str (default: 'OptimizedSwarm') - Mode of operation.
num_truthful_agents: int (default: 1) - Number of truthful agents.
num_adversarial_agents: int (default: 1) - Number of adversarial agents.
num_iterations: int (default: 200) - Number of optimization iterations.
model_name: str or List[str] (default: ["google/gemma-7B-it"]) - Model names.
domain: str (default: "mmlu") - Domain (same as dataset name).
debug: bool (default: False) - Set for a quick debug cycle.
edge_network_enable: bool (default: False) - Enable edge network.
reproduce: bool (default: False) - Set seed to 0 for deterministic training data.
lr: float (default: 0.0001) - Learning rate for edge network optimization.
reduce_edges: bool (default: False) - Reduce edges.
delta: float (default: 0.2) - Weight for edge reduction.
embedding_only: bool (default: False) - Set for only embedding optimization.
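For readers who want to see how an option set like this fits together, the following sketch declares the same names and defaults with Python's standard `argparse` module. It is only an illustration: the `--name` flag syntax, the use of `store_true` for the boolean options, and the helper `build_parser` are assumptions, and the actual scripts may parse their arguments differently.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented names and defaults."""
    parser = argparse.ArgumentParser(
        description="Illustrative option set; not the repository's own parser."
    )
    parser.add_argument("--mode", type=str, default="OptimizedSwarm")
    parser.add_argument("--num_truthful_agents", type=int, default=1)
    parser.add_argument("--num_adversarial_agents", type=int, default=1)
    parser.add_argument("--num_iterations", type=int, default=200)
    # One or more model names, matching the "str or List[str]" description.
    parser.add_argument("--model_name", nargs="+", default=["google/gemma-7B-it"])
    parser.add_argument("--domain", type=str, default="mmlu")
    # Boolean options default to False and are switched on by their presence.
    parser.add_argument("--debug", action="store_true")
    parser.add_argument("--edge_network_enable", action="store_true")
    parser.add_argument("--reproduce", action="store_true")
    parser.add_argument("--lr", type=float, default=0.0001)
    parser.add_argument("--reduce_edges", action="store_true")
    parser.add_argument("--delta", type=float, default=0.2)
    parser.add_argument("--embedding_only", action="store_true")
    return parser


# Example: enable the edge network and edge reduction with a custom weight.
args = build_parser().parse_args(
    ["--edge_network_enable", "--reduce_edges", "--delta", "0.3"]
)
print(args)
```

Options that are not passed keep the defaults listed above, so a run only needs to name the settings it changes.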

Results

We demonstrate that our approach surpasses the previous static approach by nearly 6% accuracy on a combined dataset of MMLU and CMMLU, and by more than 10% when trained with a sparsity-inducing loss. It also shows superior performance in additional experiments conducted with the MMLU and Mini Crossword Puzzles datasets.
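As a rough illustration of what a sparsity-inducing loss can look like, the sketch below combines a REINFORCE-style policy-gradient surrogate with a penalty on the expected number of edges. The source only states that reinforcement learning and a sparsity-inducing loss are used, so the REINFORCE form, the baseline, the function name `sparse_policy_loss`, and the use of `delta` as the penalty weight are assumptions rather than the paper's exact objective.

```python
def sparse_policy_loss(log_prob, utility, baseline, edge_probs, delta=0.2):
    """Surrogate loss for one sampled graph (hypothetical formulation).

    log_prob:   log-probability of the sampled edge mask
    utility:    task reward obtained with that graph (e.g. 1.0 if correct)
    baseline:   reference reward used to reduce variance
    edge_probs: per-edge probabilities produced for this input
    delta:      weight of the sparsity penalty
    """
    # Graphs that beat the baseline get their log-probability pushed up.
    policy_term = -(utility - baseline) * log_prob
    # The expected number of active edges is penalised to encourage sparsity.
    sparsity_term = delta * sum(edge_probs)
    return policy_term + sparsity_term


# Toy values: a correct answer, a baseline of 0.5, and two candidate edges.
loss = sparse_policy_loss(
    log_prob=-1.0, utility=1.0, baseline=0.5, edge_probs=[0.5, 0.5], delta=0.2
)
print(loss)
```

Under this formulation, a larger `delta` trades task reward against the number of edges kept, which is one way to read the "weight for edge reduction" option.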

Acknowledgments

This research builds upon the work by Zhuge et al. Their original code base can be found here.

Contact

For any questions or issues, please open an issue on this repository or contact us at [email protected].

Paper

The paper related to this repository is available on arXiv (arXiv:2406.11555).

Citation

If you find this work useful for your research, please consider citing:

@misc{vierling2024input,
      title={Input Conditioned Graph Generation for Language Agents}, 
      author={Lukas Vierling and Jie Fu and Kai Chen},
      year={2024},
      eprint={2406.11555},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
