So the idea with RAG is this: - you embed all your documents and store them in a vector DB (vectoriz
So the idea with RAG is this:
- you embed all your documents and store them in a vector DB (vectorize or otherwise)
- when a user question comes in, you embed the user question
- you do a search in the vector DB for "top <n> items" where
nis 3, or 5, or 10, or higher depending on your use-case - you give the results to the LLM along with the query
