Need help to query external vector DB

@ab2308 @nathaniel I found another way to achieve this. When I posted this thread, the requirement was a continuously synced DB. I use Vectorize.io as a service. Vectorize lets you connect Google Drive, Sharepoint, Onedrive and a host of other Source connectors with a Vector DB of choice (Pinecone, AstraDB, etc.) and then provide a retrieval API for fetching the response. The good thing is you can schedule the pipeline sync (which ingests the documents at a specific time from the connected sources). I was able to set it up as a pickaxe action. Here is the code:


Ensure that the Authorization Header DOESN’T contain ‘Bearer’ else you’ll get a 401 error.

Pros: 1. Extremely simple to set-up specifically if you want realtime data sync. 2. They have inbuilt RAG eval on the platform.
Cons: 1. Adds to the cost (although I find vectorize’s pricing to be lower than others) and the free tier is good enough for small businesses.
2. Currently, their RAG system is rather naive with fixed chunking. One of our use cases is a multimodal RAG (for example, say a geometry problem in a textbook which requires both an image and text to be embedded together and then retrieved). We are still figuring out how to do it with pickaxe.

For those who ABSOLUTELY need continuous sync or need to use a specific VectorDB like Pinecone, this ticks most of the boxes!

3 Likes

Thank you mate! That’s a great solution.

You might also want to have a look at unstructured.io if you don’t know it already

1 Like

I’ll check it out for sure. Btw, are you using it with pinecone assistant? One limitation I have found is that Pinecone assistant has a completely different setup from regular Pinecone DBs. You can’t connect a DB to Pinecone Assistant. Llamaparse allows for (Image+Text) embeddings and storing them in Pinecone but the retrieval service needs to be built and one cannot connect it via pinecone assistant in my understanding. My next objective is to make and solve this issue.

1 Like

Both great solutions.

Would love to see some videos on the implementation.

Thanks AB - i signed up so am straight into the 14 day trial.

I see they have an API.

One thing I could not see was how they charge.

1 Like

Yes correct. Unfortunately there is not a way to connect the assistant to an existing vector at the moment

1 Like