Hi, why can't we use BGE-M3 embedding model (@cf/baai/bge-m3) to create vector index? Only these mo
Hi, why can't we use BGE-M3 embedding model (@cf/baai/bge-m3) to create vector index?
Only these model are available to choose from: [ "@cf/baai/bge-small-en-v1.5", "@cf/baai/bge-base-en-v1.5", "@cf/baai/bge-large-en-v1.5", "openai/text-embedding-ada-002", "cohere/embed-multilingual-v2.0" ]
14 Replies
When should we consider encapsulating the official SDK of C #?
please ask in #workers-ai
Has anyone seen this Vectorize error?
VECTOR_QUERY_ERROR (code = 40006): invalid query vector, expected 1024 dimensions, and got 0 dimensions
My index is definitely set to 1024 dimensions, and Iām querying with a plain JS array of length 1024. Even a hardcoded array fails. Storage is under 600 vectors. Is this a known issue or am I missing something obvious?
I adjusted the query method call, and it worked. š
I'm getting the following error locally
ā [ERROR] Uncaught (in promise) TypeError: Cannot read properties of undefined (reading 'insert')I'm running
npm run dev --experimental-vectorize-bind-to-prod
and wrangler is showing
on start up?
no
3072
š¦Is there a way to use CF's AI to extract image embeddings?
Hey CF Team, is something off with vectorize indexing latencies? I am continously getting vectors not found via getByIds. Can anyone update the status?
@yevgen @garvitg
Can each vector database have a maximum of 5 million pieces of data?
What type of data is ndjson insertion?
{'result': None, 'success': False, 'errors': [{'code': 1005, 'message': 'vectorize.unknown_content_type'}], 'messages': []}

Why can't I upload it?
Not sure if this is your specific issue, but the ID must be a string not a number: https://developers.cloudflare.com/vectorize/best-practices/insert-vectors/
I want to know a few questions, can an index only store a maximum of 5 million data?
My translation is not quite accurate. It seems to mean that there is no limit on the frequency of HTTP requests?
Error code: 400 - {'result': None, 'success': False, 'errors': [{'code': 1005, 'message': 'vectorize.unknown_content_type'}], 'messages': []}