feat: vectorize #177

RihanArfan · 2024-06-19T15:14:05Z

Closes #174, Related #173

Adds support for using Vectorize indexes.

For docs: Vectorize through Cloudflare bindings accessed via const vectorize = hubVectorize(<index>) so their docs apply. https://developers.cloudflare.com/vectorize/reference/client-api/

How to use now?

While vector databases is still a wip PR, using it is pretty straightforward to use now and if you're fine with temporary caveats like manually adding bindings to environments, and developing via --remote then you can use it today. There may be breaking changes, so review when updating @nuxt-hub/core.

1. Create Index

You'll currently need to manually create a binding via wrangler while this PR is still in progress. This will eventually be handled by Nuxt Hub while deploying.

An index's dimensions and metrics should be set based on the embeddings model you're using. I'm using bge-base-en-v1.5, which needs 768/cosine. You cannot change this later without recreating the index (and triggering a new Pages deployment)

pnpx wrangler vectorize create ecommerce-products --dimensions=768 --metric=cosine

Once you've made the index , you can add a binding for it via the Cloudflare dash (Pages -> Settings -> Functions Vectorize index bindings).

Update: Make sure the binding name follows this format: VECTORIZE_<index name in upper case>. In this scenario it'd be VECTORIZE_PRODUCTS.

2. Use `@nuxt-hub/core` version built from this PR

npm i https://pkg.pr.new/nuxt-hub/core/@nuxthub/core@177

3. Enable Vectorize

// nuxt.config.ts
export default defineNuxtConfig({
  hub: {
    vectorize: {
      products: {
        metric: 'cosine',
        dimensions: '768',
        metadataIndexes: { name: 'string', price: 'number' }
      },
    },
    // ...
  },
})

4. Deploy the website

As --remote is required because Cloudflare doesn't support local Vectorize bindings yet, you'll need to push and deploy now so we can use Vectorize via bindings on the deployed application.

5. Done!

You need to use --remote for now.

pnpm dev --remote

Docs

What are vector databases for?

Read https://developers.cloudflare.com/vectorize/reference/what-is-a-vector-database/

Usage

See operations here https://developers.cloudflare.com/vectorize/reference/client-api/#operations

const vectorize = hubVectorize('products')
const { matches } = await vectorize.query(vectors, { topK: 5 })

vectorize.insert()
vectorize.upsert()
// etc.

Usage example

Querying

In this example, 1. a vector is generated from the query, 2. search is via Vectorize, 3. then data is enriched by querying the database. https://developers.cloudflare.com/vectorize/reference/what-is-a-vector-database/#vector-search

If you wanted to build a RAG experience, you'd have a 4th step where you pass all this information to an LLM as context in a prompt. See https://developers.cloudflare.com/workers-ai/tutorials/build-a-retrieval-augmented-generation-ai/

Code

import { z } from "zod";

interface EmbeddingResponse {
  shape: number[];
  data: number[][];
}

const Query = z.object({
  query: z.string().min(1).max(256),
  limit: z.coerce.number().int().min(1).max(20).default(10),
});

export default defineEventHandler(async (event) => {
  const { query, limit } = await getValidatedQuery(event, Query.parse);

  // 1. generate embeddings for search query
  const ai = hubAi();
  const embeddings: EmbeddingResponse = await ai.run(
    "@cf/baai/bge-base-en-v1.5",
    { text: [query] },
    // cache using ai gateway - https://developers.cloudflare.com/ai-gateway/
    // commented it out as requires creating an AI gateway from cf dash
    // { gateway: { id: "new-role" } },
  );
  const vectors = embeddings.data[0];

  // 2. query vectorize to find similar results
  const vectorize: VectorizeIndex = hubVectorize('jobs');
  const { matches } = await vectorize.query(vectors, {
    topK: limit,
    namespace: "job-titles",
  });

  // 3. get details for matching jobs
  const jobMatches = await useDrizzle().query.jobs.findMany({
    where: (jobs, { inArray }) =>
      inArray(
        jobs.id,
        matches.map((match) => match.id),
      ),
    with: {
      division: true,
      department: true,
      subDepartment: true,
    },
  });

  // 4. add score to job matches
  const jobMatchesWithScore = jobMatches.map((job) => {
    const match = matches.find((match) => match.id === job.id);
    return { ...job, score: match!.score };
  });

  // 5. sort by score
  jobMatchesWithScore.sort((a, b) => b.score - a.score);

  return jobMatchesWithScore;
});

Bulk vector generation and import

This example bulk generates and imports vectors for items in a database using a text embeddings model to create search experience.

Code

// server/tasks/generate-embeddings.ts

import { jobs } from "../database/schema";
import { asc, count } from "drizzle-orm";

import type { VectorizeIndex } from "@nuxthub/core";

export default defineTask({
  meta: {
    name: "vectorize:seed",
    description: "Generate vector text embeddings",
  },
  async run() {
    console.log("Running Vectorize seed task...");

    // count all rows
    const jobCount = (await useDrizzle().select({ count: count() }).from(tables.jobs))[0].count;

    // loop through total job row count in increments of X. Get job rows (id and jobTitle columns) with paginated based on loop
    const INCREMENT_AMOUNT = 20;

    // log total batches
    const totalBatches = Math.ceil(jobCount / INCREMENT_AMOUNT);
    console.log(`Total jobs: ${jobCount} total jobs (${totalBatches} batches)`);

    for (let i = 0; i < jobCount; i += INCREMENT_AMOUNT) {
      console.log(`⏳ Processing jobs ${i} - ${i + INCREMENT_AMOUNT}...`);

      // get id and job titles for batch
      const jobsChunk = await useDrizzle()
        .select()
        .from(tables.jobs)
        .orderBy(asc(jobs.id))
        .limit(INCREMENT_AMOUNT)
        .offset(i);

      // generate embeddings for job titles
      const ai = hubAi();
      const embeddings = await ai.run(
        "@cf/baai/bge-base-en-v1.5",
        { text: jobsChunk.map((job) => job.jobTitle) },
        { gateway: { id: "new-role" } },
      );
      const vectors = embeddings.data;

      // format embeddings with id and metadata (jobTitle) for vectorize index
      const formattedEmbeddings = jobsChunk.map((job, index) => {
        const { sufaCode: id, ...metadata } = job;

        return {
          id,
          namespace: "job-titles",
          metadata: { ...metadata },
          values: vectors[index],
        };
      });

      // save embeddings to vectorize index
      const vectorize: VectorizeIndex = hubVectorize('jobs');
      await vectorize.upsert(formattedEmbeddings);

      console.log(`✅ Processed jobs ${i} - ${i + INCREMENT_AMOUNT}...`);
    }

    console.log("Vectorize seed task completed!");
    return { result: "success" };
  },
});

Vectorize supports upserting 1000 via the Workers API and 5000 via the HTTP API currently, so it's unlikely that the looping for batches is necessary, however, I ran into some issues before which I didn't have time to debug so I made the chunks smaller.

You can likely simplify this code a lot, but it's a starting point.

More

It's possible to store core data in Vectorize directly as metadata on the record. If you fetch from Vectorize with metadata or values, you're limited to the top 30 results. If you only want to get IDs and match % back, you can get the top 100 results. (https://developers.cloudflare.com/vectorize/platform/limits/)

pkg-pr-new · 2024-06-19T15:26:10Z

Open in Stackblitz

pnpm add https://pkg.pr.new/nuxt-hub/core/@nuxthub/core@177

commit: cf33615

RihanArfan · 2024-06-19T18:29:31Z

Turns out Vectorize doesn't support local development, only with wrangler with --remote. This is unlike Workers AI, which supports local development, however models are actually ran on Cloudflare with your account.

Issue tracking Vectorize local bindings: cloudflare/workers-sdk#4360
https://developers.cloudflare.com/workers/testing/local-development/#supported-resource-bindings-in-different-environments

For now, this feature could only be supported with --remote (either via NuxtHub's proxy or wrangler remote). This would involve adding Vectorize to endpoints to NuxtHub's backend and I don't think that's OSS. Alternatively it could be blocked until local development is supported with Vectorize. Alternatively, t

atinux · 2024-06-20T06:16:48Z

Thanks for looking at it so quickly.

I think this could anyway be possible within the OSS as you would need to deploy your application at first in order to use Vectorize.

Would you be happy to work on the proxy API routes?

RihanArfan · 2024-06-20T12:55:19Z

I didn't realise those routes were for anything more than just devtools preview with --remote for some reason lol 😄 I've added the proxy routes, ~~but I don't think I can test them yet. If I understand correctly, bindings are added once the build hook is ran. Could you support adding the Vectorize bindings?~~ Edit: With a fresh mind I realised I can manually add the bindings from CF dash myself 🤦

I'll continue my dissertation where I'll be testing test both AI and Vectorize integrations to build a simple vector search engine.

RihanArfan · 2024-06-25T19:44:46Z

Vectorize and AI works ✨

Got some small things to clean up code wise, which I'll get sorted hopefully by mid ~~July~~ August.

src/runtime/vectorize/server/api/_hub/vectorize/query.post.ts

RihanArfan · 2024-08-15T22:27:51Z

45 minutes of rebasing 😓 git is not my passion

…o feat/vectorize

src/runtime/vectorize/server/utils/vectorize.ts

Co-authored-by: Sébastien Chopin <[email protected]>

RihanArfan force-pushed the feat/vectorize branch 2 times, most recently from d374cd8 to df6a752 Compare June 19, 2024 15:25

RihanArfan force-pushed the feat/vectorize branch 2 times, most recently from 69f60a0 to b2ee77b Compare June 19, 2024 17:18

RihanArfan force-pushed the feat/vectorize branch from 0f12411 to 8672867 Compare June 20, 2024 10:02

RihanArfan force-pushed the feat/vectorize branch 6 times, most recently from 6173476 to aedb7c2 Compare June 25, 2024 15:11

RihanArfan commented Jun 27, 2024

View reviewed changes

src/runtime/vectorize/server/api/_hub/vectorize/query.post.ts Outdated Show resolved Hide resolved

RihanArfan force-pushed the feat/vectorize branch from afb114f to 95cf814 Compare August 15, 2024 22:21

RihanArfan and others added 10 commits August 15, 2024 23:24

feat: support specifying account id to wrangler

56fdea3

feat: vectorize

94b30b7

fix: typo in error message

544cd0f

feat: vectorize via proxy

16a363a

feat: include vectorize in build hook

6847c0d

feat: include vectorize and ai in manifest

6b7813a

fix: put server routes in correct directory

1b1a2c7

[autofix.ci] apply automated fixes

aeb96f0

fix: remove unused param from vectorize.getByIds proxy wrapper

79a488e

feat: match namespace length limits in proxy wrapper validator

b890979

RihanArfan force-pushed the feat/vectorize branch from 95cf814 to b890979 Compare August 15, 2024 22:24

RihanArfan added 11 commits September 30, 2024 20:12

refactor: vectorize manifest check

10ffe49

chore: clean up code

34a67e3

docs: vectorize

bb30ac1

docs: add vectorize pricing

11c4418

docs: include free writes on docs

a5f3068

feat: vectorize playground

44100b1

docs: improve vectorize docs

dbbfd9d

docs: update changelog

ee4b12e

Merge branch 'main' into feat/vectorize

0049cb5

docs: add examples

2a2d0f4

Merge remote-tracking branch 'refs/remotes/origin/feat/vectorize' int…

743548e

…o feat/vectorize

RihanArfan marked this pull request as ready for review October 4, 2024 03:35

Merge branch 'main' into feat/vectorize

74500f6

atinux reviewed Oct 4, 2024

View reviewed changes

src/runtime/vectorize/server/utils/vectorize.ts Outdated Show resolved Hide resolved

atinux and others added 3 commits October 4, 2024 18:41

docs: update

1add42b

docs: remove unnecessary link in comment

2649810

Co-authored-by: Sébastien Chopin <[email protected]>

docs: small updates

6d38572

vercel bot had a problem deploying to Preview October 4, 2024 17:57 Failure

docs: improvements

768c4db

vercel bot had a problem deploying to Preview October 4, 2024 18:30 Failure

fix: add condition when data is not available yet

b1a25ae

vercel bot deployed to Preview October 4, 2024 18:35 View deployment

atinux added 2 commits October 4, 2024 20:40

update doc for self deployment

763a676

lint fix

c893a21

vercel bot deployed to Preview October 4, 2024 18:43 View deployment

up

5544f32

vercel bot deployed to Preview October 4, 2024 22:18 View deployment

atinux added 2 commits October 5, 2024 00:31

up

1a22a5e

docs: og-image

cf33615

atinux merged commit af4dc62 into nuxt-hub:main Oct 5, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: vectorize #177

feat: vectorize #177

RihanArfan commented Jun 19, 2024 •

edited

Loading

pkg-pr-new bot commented Jun 19, 2024 •

edited

Loading

RihanArfan commented Jun 19, 2024 •

edited

Loading

atinux commented Jun 20, 2024

RihanArfan commented Jun 20, 2024 •

edited

Loading

RihanArfan commented Jun 25, 2024 •

edited

Loading

RihanArfan commented Aug 15, 2024

feat: vectorize #177

feat: vectorize #177

Conversation

RihanArfan commented Jun 19, 2024 • edited Loading

How to use now?

1. Create Index

2. Use @nuxt-hub/core version built from this PR

3. Enable Vectorize

4. Deploy the website

5. Done!

Docs

What are vector databases for?

Usage

Usage example

Querying

Bulk vector generation and import

More

pkg-pr-new bot commented Jun 19, 2024 • edited Loading

RihanArfan commented Jun 19, 2024 • edited Loading

atinux commented Jun 20, 2024

RihanArfan commented Jun 20, 2024 • edited Loading

RihanArfan commented Jun 25, 2024 • edited Loading

RihanArfan commented Aug 15, 2024

RihanArfan commented Jun 19, 2024 •

edited

Loading

2. Use `@nuxt-hub/core` version built from this PR

pkg-pr-new bot commented Jun 19, 2024 •

edited

Loading

RihanArfan commented Jun 19, 2024 •

edited

Loading

RihanArfan commented Jun 20, 2024 •

edited

Loading

RihanArfan commented Jun 25, 2024 •

edited

Loading