I run a vector DB from a Docker image.
For debugging and benchmarking local RAG retrieval, I've been building
a CLI tool that shows what's actually being retrieved:
ragtune explain "your query" --collection prod
It shows scores, sources, and diagnostics. That helps catch cases where your chunking
or embeddings are silently failing, and gives you numbers to base your judgments on instead of guesses.
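To make "silently failing" concrete, here is a rough sketch of the kind of check I mean. This is not ragtune's actual code: the `explain` function, the `(source, vector)` chunk format, and both thresholds are made up for illustration, and it assumes plain cosine similarity over already-computed embeddings.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def explain(query_vec, chunks, top_k=3, low_score=0.5, min_spread=0.05):
    """Rank chunks by similarity and flag suspicious score patterns.

    chunks: list of (source, vector) pairs.
    low_score and min_spread are illustrative defaults (assumptions),
    not values taken from any real tool.
    """
    ranked = sorted(
        ((cosine(query_vec, vec), source) for source, vec in chunks),
        reverse=True,
    )[:top_k]
    scores = [score for score, _ in ranked]

    warnings = []
    # A weak best match suggests the query has no good chunk at all.
    if scores and scores[0] < low_score:
        warnings.append("top score is low: query may not match any chunk well")
    # Near-identical scores suggest the embeddings are not discriminating.
    if len(scores) > 1 and scores[0] - scores[-1] < min_spread:
        warnings.append("scores are nearly flat: embeddings may not discriminate")
    return ranked, warnings


if __name__ == "__main__":
    query = [1.0, 0.0]
    chunks = [("a.md", [1.0, 0.0]), ("b.md", [0.0, 1.0]), ("c.md", [1.0, 1.0])]
    ranked, warnings = explain(query, chunks)
    for score, source in ranked:
        print(f"{score:.3f}  {source}")
    print(warnings or "no warnings")
```

Retrieval still returns results in both failure cases, so nothing errors out. You only notice the problem when you look at the scores themselves.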
Open source: https://github.com/metawake/ragtune