Speculative Forecasting

This repo contains some experiments for controlling how many tokens to predict ahead for speculative decoding. The RL environment scaffolding and DQN implementation is from CS 285 at UC Berkeley.

Install

conda create --name dman
conda activate dman
conda install python=3.10 swig
pip install -r requirements.txt
pip install -e .

Generate Dataset

This repo makes heavy use of offline preprocessing. For now, we preprocess all sequences with scripts/process_dataset.py and cache the main and draft model hidden states for each token. To generate the dataset, download lmsys chat 1m and run scripts/process_dataset.py to generate the caches.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
experiments		experiments
fspec		fspec
scripts		scripts
.gitignore		.gitignore
README.md		README.md
graph.ipynb		graph.ipynb
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speculative Forecasting

Install

Generate Dataset

About

Releases

Packages

Contributors 2

Languages

skrider/speculative-forecasting

Folders and files

Latest commit

History

Repository files navigation

Speculative Forecasting

Install

Generate Dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages