This project showcases a neural tokenization technique. Because the inputs are compressed into a smaller shape, the LLM can be downsized accordingly.
For example, llama3-8b is brought down from 8 billion parameters to 34 million.
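
A minimal sketch of the idea, assuming the neural tokenizer compresses fixed-size byte chunks into dense vectors so the downstream transformer sees a much shorter sequence. `ChunkCompressor`, the chunk size, and all dimensions below are illustrative assumptions, not the project's actual code.

```python
# Illustrative sketch (not the project's real implementation): compress fixed-size
# byte chunks into dense vectors, shrinking the sequence the transformer processes,
# which in turn allows a much smaller backbone.
import torch
import torch.nn as nn

class ChunkCompressor(nn.Module):
    """Maps each chunk of `chunk_size` byte embeddings to a single dense vector."""
    def __init__(self, vocab_size=256, chunk_size=16, embed_dim=64, token_dim=256):
        super().__init__()
        self.chunk_size = chunk_size
        self.byte_embed = nn.Embedding(vocab_size, embed_dim)
        self.project = nn.Linear(chunk_size * embed_dim, token_dim)

    def forward(self, byte_ids):
        # byte_ids: (batch, seq_len) with seq_len divisible by chunk_size
        b, n = byte_ids.shape
        x = self.byte_embed(byte_ids)            # (b, n, embed_dim)
        x = x.view(b, n // self.chunk_size, -1)  # group bytes into chunks
        return self.project(x)                   # (b, n / chunk_size, token_dim)

# A transformer sized for the compressed sequence can be far smaller than one
# operating on raw byte or token positions.
compressor = ChunkCompressor()
backbone = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=256, nhead=4, dim_feedforward=512, batch_first=True),
    num_layers=4,
)
bytes_in = torch.randint(0, 256, (2, 128))  # 128 bytes -> 8 compressed positions
hidden = backbone(compressor(bytes_in))     # (2, 8, 256)
```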
Final model:
- pretraining: file / Google Colab
- fine-tuning: file / Google Colab
See TODO.
This project winks at llama3 from Meta, but doesn't actually use its weights or code.
Licensed under the AGPLv3.