Short answer is that you need to consider strategy and how to shape the data in your training phase. Similarity is exactly what the name suggests - mathematical similarity in terms of position, generally returned from a cosine (or dot product) calculation - giving you an approximation of similarity based on the positions, directions and magnitudes of the vectors involved. The values that make the calculation possible come from the embedding model you use, which is where the real magic takes place.
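
To make the "pure math" part concrete, here is a minimal sketch of the cosine similarity calculation itself. The two vectors below are made-up placeholders standing in for real embedding outputs, not anything a model actually produced:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Dot product divided by the product of magnitudes: the cosine of the angle between the vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Placeholder vectors standing in for real embedding outputs
query_vec = np.array([0.12, -0.48, 0.33, 0.07])
doc_vec = np.array([0.10, -0.51, 0.29, 0.15])

print(cosine_similarity(query_vec, doc_vec))  # ~0.99, because the two vectors point in nearly the same direction
```

There is no reasoning step anywhere in that function - just arithmetic over the numbers the embedding model gave you.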

The only way for similarity to be accurate is for the initial embedding pass to place the text in the vector space in a way that reflects the relationships between its tokens. So when you ask for a similarity score for 'What are the opening hours', the similarity of that ENTIRE sentence to the ENTIRE embedded sentence is what is being computed - you're comparing a 5-token query against a 33-token embedded passage, so you should expect a much lower score.
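
If it helps, here is a rough sketch of what that whole-sentence comparison looks like in practice. I'm assuming the OpenAI Python client and the text-embedding-3-small model purely for illustration (the `embed` helper and the store-hours chunk are made up); any other embedding model behaves the same way:

```python
import numpy as np
from openai import OpenAI  # assumption: OpenAI embeddings, but any embedding model works the same way

client = OpenAI()

def embed(text: str) -> np.ndarray:
    """Hypothetical helper: return one embedding vector for one piece of text."""
    resp = client.embeddings.create(model="text-embedding-3-small", input=text)
    return np.array(resp.data[0].embedding)

query = "What are the opening hours"
# A longer stored chunk - the WHOLE sentence becomes one vector, not its individual tokens
chunk = ("Our store is open Monday to Friday from 9am until 6pm, "
         "Saturdays from 10am until 4pm, and closed on Sundays and public holidays.")

q_vec, c_vec = embed(query), embed(chunk)
score = float(np.dot(q_vec, c_vec) / (np.linalg.norm(q_vec) * np.linalg.norm(c_vec)))
print(score)  # typically well below 1.0, even though the chunk clearly answers the query
```

That length and wording mismatch between query and chunk is exactly why well-matched results often come back with scores that look "low" at first glance.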

The 'similarity' is not some magical logic that is reasoning over what you have asked - it is a pure math calculation.