API | Usage | llamafile | Embeddings | Installation
pgllm brings LLMs to Postgres.
It does so primarily by embedding CPython and wrapping the beautiful llm Python library.
CREATE EXTENSION pgllm;
Text generation with both local and remote models.
SELECT llm_generate('hello world', 'markov', '{"length": 20, "delay": 0.2}');
SELECT llm_generate('hello world', 'mistral', '{"mistral": "abc0123"}');
llamafile support
SELECT llm_generate('A story about a frog', 'llamafile');
Embedding models and pgvector support
SELECT llm_embed('hello world', 'jina-embeddings-v2-small-en')::vector;
You can use any llm plugin.
llm_generate(input text, model text[, params jsonb]) → text
llm_embed(input text/bytea, model text[, params jsonb]) → float8[]
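The bytea variant of llm_embed accepts binary input. A minimal sketch, assuming an image-capable embedding plugin such as llm-clip is installed in the same Python as pgllm (the file path is a placeholder):
-- assumes the llm-clip plugin; '/tmp/cat.jpg' is a placeholder path readable by the server
SELECT llm_embed(pg_read_binary_file('/tmp/cat.jpg'), 'clip');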
Let's start by installing a simple generative model:
python3 -m llm install llm-markov
IMPORTANT:
Make sure the python3 you're using is the same one you pointed to during Installation; it's best to be explicit.
select llm_generate('hello world', 'markov');
llm_generate
--------------------------------
world hello world world hello ....
(1 row)
Model parameters can be passed as a jsonb argument.
select llm_generate('hello world', 'markov', '{"length": 20, "delay": 0.2}');
llm_generate
--------------------------------------------------------------------------------------------------------------------------
world world hello world hello world hello world world hello world world world world world world world world world hello
(1 row)
Install a dummy embedding model
python3 -m llm install llm-embed-hazo
select llm_embed('hello world', 'hazo');
llm_embed
-----------------------------------
{5,5,0,0,0,0,0,0,0,0,0,0,0,0,0,0}
(1 row)
If you already have pgvector installed, you can cast the resulting float8[] to a vector type and use pgvector as usual. For example, to get the L2 distance:
select llm_embed('hello world', 'hazo')::vector <-> llm_embed('world hold on', 'hazo')::vector;
?column?
------------------
2.23606797749979
(1 row)
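As a further sketch, you can store embeddings in a table and query for nearest neighbours; the docs table below is made up, and vector(16) matches hazo's 16-dimensional output shown above:
-- hypothetical table; hazo embeddings have 16 dimensions
CREATE TABLE docs (id bigserial PRIMARY KEY, body text, emb vector(16));
INSERT INTO docs (body, emb) VALUES ('hello world', llm_embed('hello world', 'hazo')::vector);
SELECT id, body FROM docs ORDER BY emb <-> llm_embed('world hold on', 'hazo')::vector LIMIT 5;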
pgllm supports llamafile by using curl to query its web API. This does not use the llm-llamafile plugin!
Install with the WITH_LLAMAFILE=1 flag (e.g. passed to make, alongside PYTHON, as in the Installation section below).
Start the llamafile server:
- Download llava-v1.5-7b-q4.llamafile (4.29 GB).
- Open your computer's terminal.
- If you're using macOS, Linux, or BSD, you'll need to grant permission for your computer to execute this new file (you only need to do this once):
chmod +x llava-v1.5-7b-q4.llamafile
- If you're on Windows, rename the file by adding ".exe" on the end.
- Run the llamafile, e.g.:
./llava-v1.5-7b-q4.llamafile
- Your browser should open automatically and display a chat interface. (If it doesn't, just open your browser and point it at http://localhost:8080)
- When you're done chatting, return to your terminal and hit Control-C to shut down llamafile.
SELECT llm_generate('3 neat characteristics of a pelican', 'llamafile')::jsonb;
llm_generate
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
{"id": "chatcmpl-qLMCLi9ghpvobVxrwr83DOVVdvaMICef", "model": "LLaMA_CPP", "usage": {"total_tokens": 132, "prompt_tokens": 58, "completion_tokens": 74}, "object": "chat.completion", "choices": [{"index": 0, "message": {"role": "assistant", "content": "1. Pelicans have a large, broad beak that is adapted for catching fish.\n2. They have a pouch under their beak, which they use to hold their catch.\n3. Pelicans are known for their distinctive wading and fishing behavior, where they stand on one leg while waiting for fish to swim by.</s>"}, "finish_reason": "stop"}], "created": 1725269144}
(1 row)
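Since the llamafile backend returns the raw chat-completion JSON, you can pull out just the assistant's reply with standard jsonb operators:
-- extract choices[0].message.content as text
SELECT llm_generate('3 neat characteristics of a pelican', 'llamafile')::jsonb #>> '{choices,0,message,content}';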
LLM plugins for remote APIs should work easily.
Start by installing the model plugin you want, for example:
python3 -m llm install llm-mistral
You can then pass the API key as a model parameter.
select llm_generate('hello world', 'mistral', '{"mistral": "abc0123"}');
WARNING:
A simple SELECT query can easily exhaust any credits you may have. Hence, use with caution!
- jina-embeddings-v2-small-en: 33 million parameters.
- jina-embeddings-v2-base-en: 137 million parameters.
python3 -m llm install llm-embed-jina
select llm_embed('hello world', 'jina-embeddings-v2-small-en');
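The result can be cast to vector as before; for example, the pgvector cosine distance between two embeddings:
select llm_embed('hello world', 'jina-embeddings-v2-small-en')::vector <=> llm_embed('hello there', 'jina-embeddings-v2-small-en')::vector;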
- onnx-bge-micro
- onnx-gte-tiny
- onnx-minilm-l6
- onnx-minilm-l12
- onnx-bge-small
- onnx-bge-base
- onnx-bge-large
python3 -m llm install llm-embed-onnx
select llm_embed('hello world', 'onnx-bge-micro');
The crucial thing in the installation process is to be sure which python3 Postgres uses.
git clone https://github.com/Florents-Tselai/pgllm.git
cd pgllm
# make sure that Python 3.XX minor versions match
make all PYTHON=/path/to/bin/python3.11 PYTHON_CONFIG=/path/to/python3.11-config
make install
make installcheck
The host postgres process must have access to the Python library too.
You can set LD_LIBRARY_PATH before starting postgres. For example, you may have to do something like:
export LD_LIBRARY_PATH="$pythonLocation"/lib:/usr/local/lib:/usr/lib:$HOME/local/lib:$LD_LIBRARY_PATH
$PGBIN/initdb $PGDATA
$PGBIN/pg_ctl --pgdata $PGDATA start
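With the server up, a quick smoke test from psql (assuming llm-markov is installed in that same Python, as above):
CREATE EXTENSION pgllm;
select llm_generate('hello world', 'markov');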
See build.yml.