Itera: Self-improving Multi-agent Systems via Bootstrapped Reasoning

Overview

Itera is a self-improving multi-agent framework designed to continuously enhance reasoning capabilities. It achieves this by maintaining an experience library of successful trajectories and bootstrapping from failed ones, effectively allowing agents to learn from their own interactions.

This repository implements the framework described in the paper Itera: Self-improving Multi-agent Systems via Bootstrapped Reasoning.

Modules

The project is organized into three main multi-agent settings:

problem_solving/: Pipelines for collaborative Question Answering (QA), covering College Physics/Chemistry and PubMedQA-style reasoning.
actor_critic/: An iterative refinement pipeline involving an Actor, a Judgment agent, and a Critic agent.
competitive/: Environments for negotiation and game-theoretic interactions (e.g., Resource Exchange, Ultimatum Game).

Installation

Prerequisites

Python 3.10+
Conda (recommended)

Setup

Clone the repository:

git clone https://github.com/Alkamal01/itera.git
cd itera

Create and activate the environment:
```
conda env create -f environment.yml
conda activate itera
```
Alternatively, you can install dependencies using pip:
```
pip install .
```
Configure API Access: Set your OpenAI API key as an environment variable:
```
export OPENAI_API_KEY="your-api-key-here"
```

Usage

1. Collect Trajectories

Run the multi-agent system to collect interaction trajectories. For example, to run the physics problem-solving task:

python problem_solving/PhyChem/get_a_sol.py --model='gpt-3.5-turbo' --task='MMLU_physics' --prompt_type='multi_agent' --mode='generate' --subject='phy'

2. Process and Regrow

Filter successful trajectories and generate feedback for failed ones:

# Filter trajectories
python libs/merge.py

# Generate feedback for incorrect solutions
python problem_solving/PhyChem/get_b_feedback.py ...

# Regenerate solutions based on feedback
python problem_solving/PhyChem/get_c_regenerate.py ...

3. Fine-Tune

Fine-tune your agents using the collected experience library:

python problem_solving/PhyChem/get_finetune_data.py
python problem_solving/PhyChem/fine_tune.py

Structure

problem_solving/: Code for reasoning tasks.
actor_critic/: Code for actor-critic feedback loops.
competitive/: Code for game-theory scenarios.
libs/: Shared utility libraries.
dataset/: Directory for storing training and evaluation datasets.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
actor_critic		actor_critic
assets		assets
competitive		competitive
dataset		dataset
libs		libs
problem_solving		problem_solving
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Itera: Self-improving Multi-agent Systems via Bootstrapped Reasoning

Overview

Modules

Installation

Prerequisites

Setup

Usage

1. Collect Trajectories

2. Process and Regrow

3. Fine-Tune

Structure

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Itera: Self-improving Multi-agent Systems via Bootstrapped Reasoning

Overview

Modules

Installation

Prerequisites

Setup

Usage

1. Collect Trajectories

2. Process and Regrow

3. Fine-Tune

Structure

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages