
# In-Context Reinforcement Learning with Algorithm Distillation

This repository is an environment for running experiments with algorithm distillation: testing hypotheses about distilling a learning algorithm on multiple environments. Link to work in this area: arXiv.


## How to run the project?

| Tool | Version |
| --- | --- |
| Python | 3.10 |
| CUDA Toolkit | 12.8 |
| nvcc Compiler | 12.8.61 |
| CUDA Build | cuda_12.8.r12.8/compiler.35404655_0 |

Installing the CUDA toolchain is optional, and I'll leave that for you to explore.
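
If you do set up CUDA, one way to confirm that your Python stack can see the GPU is the snippet below, run after step 3. It assumes the project uses the CUDA build of PyTorch, which this README does not confirm.

```python
# Assumes PyTorch with CUDA support is installed (run after step 3).
import torch

print(torch.cuda.is_available())  # True if a usable CUDA device is found
print(torch.version.cuda)         # CUDA version this PyTorch build targets
```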

### 1. Create a virtual environment

If you only have one Python interpreter installed, you can create a virtual environment with:

```bash
python -m venv <your_env_name>
```

If you have multiple Python versions installed, select the one you need (on Windows, via the `py` launcher):

```bash
py -3.10 -m venv <your_env_name>
```

Make sure you create the environment in the correct directory.

### 2. Activate the environment

Windows:

```bat
<your_env_name>\Scripts\activate.bat
```

Linux:

```bash
source <your_env_name>/bin/activate
```
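
Once the environment is activated, a quick sanity check that it runs the expected interpreter:

```python
# Run inside the activated environment; it should report Python 3.10.x.
import sys

print(sys.version)
```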
### 3. Install libraries

I'd like to point out that the virtual environment's total size is about 6 GB. This isn't a warning, just a fact. Be prepared.

```bash
pip install -r requirements.txt
```
### 4. Start the MLflow server

Create a directory in advance to store the MLflow server data:

```
<mlflow_dir_path>
├── data_local/
└── artifacts/
```

Next, run the command:

```bash
mlflow server \
  --backend-store-uri "file:///<mlflow_dir_abspath>/data_local" \
  --default-artifact-root "file:///<mlflow_dir_abspath>/artifacts" \
  --host localhost \
  --port 5000
```

Now you have a local server running on port 5000; open http://localhost:5000 to check it.
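
For reference, a minimal sketch of how a training script could log to this server. The experiment name, parameter, and metric below are placeholders, not names used by the repository's scripts:

```python
# Minimal MLflow logging sketch against the local server started above.
import mlflow

mlflow.set_tracking_uri("http://localhost:5000")
mlflow.set_experiment("ad-demo")  # placeholder experiment name

with mlflow.start_run():
    mlflow.log_param("learning_rate", 3e-4)           # placeholder param
    mlflow.log_metric("episode_return", 1.0, step=0)  # placeholder metric
```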

### 5. Run the experiment

Go to the `scripts` folder. It contains one directory per environment on which the experiments were run. Each directory contains scripts for training and evaluating models (to change hyperparameters, edit the values inside the script; see the sketch below).

```bash
python scripts/<experiment_name>/<experiment_script>.py
```
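
As an illustration of the kind of values you would edit, a script might expose its hyperparameters like this; the names below are hypothetical and may not match the repository's scripts:

```python
# Hypothetical hyperparameter block; the actual names and values in the
# repository's scripts may differ.
config = {
    "num_train_steps": 100_000,
    "batch_size": 64,
    "learning_rate": 3e-4,
    "seed": 0,
}
```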
