RAG is great for glossary-like, structured information. Arbitrarily chunking prose feels like turning otherwise good-quality text into garbage. Ideally, prose would be used to generate glossary-like documents (LLM-aided). Performance plays a role here (you'd need to process a lot of text up front), so maybe smaller models could be used?
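A minimal sketch of that preprocessing pass, assuming a small model can be prompted to emit "term: definition" lines (the prompt format, the parsing, and the function names here are all my assumptions, not any particular library's API; the model call itself is left out):

```python
# Sketch: turn prose into glossary-style entries before indexing.
# The actual small-model call is omitted; build_glossary_prompt() produces
# the input you'd send it, and parse_glossary() handles the assumed
# "term: definition" output format. Hypothetical names throughout.

def build_glossary_prompt(text: str) -> str:
    """Ask the model for one 'term: definition' line per key term."""
    return (
        "Extract the key terms from the text below and define each in one "
        "sentence. Output exactly one 'term: definition' line per term.\n\n"
        + text
    )

def parse_glossary(raw: str) -> dict[str, str]:
    """Turn the model's 'term: definition' lines into a dict of entries."""
    entries: dict[str, str] = {}
    for line in raw.splitlines():
        if ":" in line:
            term, _, definition = line.partition(":")
            entries[term.strip()] = definition.strip()
    return entries
```

Each parsed entry then becomes its own chunk, so retrieval hits self-contained definitions instead of arbitrary slices of prose, and a cheap model is plausible since the task is extraction rather than reasoning.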
This is something that has always bothered me about RAG. It seems fine for first-order relevance, like a search engine, but for knowledge there needs to be some kind of rumination stage where it revisits the entire corpus to find information with second-order relevance to round out its 'understanding'.
You might be able to approximate this by chunking, globbing the chunks together, and searching over those, as well as having the LLM summarize and extract data and indexing those items too.
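A toy sketch of that multi-granularity idea: index the raw chunks alongside globbed (merged) chunks and any LLM-produced summaries, then search all of them together. The keyword-overlap scoring below is a stand-in for embeddings, and every name is made up for illustration:

```python
# Index raw chunks, globbed chunks, and summaries side by side,
# then search across all granularities at once. Toy scoring only.

def chunk(text: str, size: int = 5) -> list[str]:
    """Split text into fixed-size word chunks (the 'arbitrary' step)."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def glob_chunks(chunks: list[str], window: int = 2) -> list[str]:
    """Merge neighbouring chunks so wider context survives chunking."""
    return [" ".join(chunks[i:i + window])
            for i in range(0, len(chunks), window)]

def score(query: str, doc: str) -> int:
    """Toy relevance: count shared lowercase words (use embeddings for real)."""
    return len(set(query.lower().split()) & set(doc.lower().split()))

def search(query: str, corpus: list[str], k: int = 3) -> list[str]:
    """Return the top-k corpus items by the toy score."""
    return sorted(corpus, key=lambda d: score(query, d), reverse=True)[:k]
```

Usage would be something like `index = chunks + glob_chunks(chunks) + summaries`, then `search(query, index)`, so a hit can come from whichever granularity happens to capture the second-order connection.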