
Finetuning-Serving Llama 3

In this repo I experiment with finetuning the Llama 3 8B model on a synthetic dataset created with GPT-4o, followed by a full deployment to both a sync streaming endpoint with LangServe and an async endpoint with FastAPI.

Problem statement

I want to generate a medical report based on the patient's history and some measurements of the patient's current health status.

I want the report to be detailed and to describe each measurement.

I want the report to deduce a diagnosis.

Data

The dataset was created from an existing dataset, the Pima Indians Diabetes Database.

The steps to create the data were as follows:

  1. Turn every column of the dataset into a text description of the patient using prepared text templates.
  2. Prompt GPT-4o for a complete analysis, a detailed illustration, and a diagnosis.
  3. Use the GPT-4o output to train a smaller model, Llama 3 8B.

Finetuning

I used the finetuning notebook offered by Unsloth with the data in the data/synthetic.csv file.

The complete notebook can be found in notebooks/Create_synthetic_dataset or on Google Colab.

The finetuned Llama 3 8B adheres very well to the format in the training data.
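For finetuning, each (description, GPT-4o report) pair is typically assembled into a single training string. The sketch below uses an Alpaca-style prompt format like the one in Unsloth's example notebooks; the wording and field names are assumptions, not the repo's exact schema:

```python
# Sketch: format one (description, report) pair into a training string.
# The instruction wording is an assumption modeled on the Alpaca-style
# format used in Unsloth's example notebooks.

PROMPT_TEMPLATE = """### Instruction:
Write a detailed medical report for the following patient, describing
each measurement and concluding with a diagnosis.

### Input:
{description}

### Response:
{report}"""

def format_example(description: str, report: str) -> str:
    """Combine a patient description and its GPT-4o report into one sample."""
    return PROMPT_TEMPLATE.format(description=description, report=report)

sample = format_example(
    "The patient is 50 years old with a plasma glucose of 148 mg/dL.",
    "A glucose level of 148 mg/dL is above the normal fasting range.",
)
print(sample)
```

Applying this over every row of data/synthetic.csv yields the text column that the Unsloth trainer consumes.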

Inference Examples

Examples can be found in the examples directory.

Deployment

Sync-architecture

(sync architecture diagram)

The web server is created using the LangServe library; check it out here.

To run the server:

  1. Install the requirements: `pip install -r requirements.txt`
  2. Make sure you have the model in GGUF format saved in the /model directory.
  3. Run `poetry run langchain serve --port=8100`

Async-architecture

The async API is built using FastAPI and BackgroundTasks.

To run the server:

  1. Install the requirements: `pip install -r requirements.txt`
  2. Make sure you have the model in GGUF format saved in the /model directory.
  3. Run `uvicorn main:app --host 0.0.0.0 --port 8000 --reload`
