You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Great work so far. It would be nice if we could enhance retrieval performance be prefixing context in the document. Recent work by Anthropic shows notable gains by doing so. This appears to be relatively simple to implement (link shared below)
Basic Example
Suppose the document is of a SEC filing,
original_chunk = "The company's revenue grew by 3% over the previous quarter."
contextualized_chunk = "This chunk is from an SEC filing on ACME corp's performance in Q2 2023; the previous quarter's revenue was $314 million. The company's revenue grew by 3% over the previous quarter."
Drawbacks
While uploading a file into the rag database, it will take more time now (as the context has to be estimated for all chunks)
Reference Issues
No response
Summary
Dear Kotaemon team
Great work so far. It would be nice if we could enhance retrieval performance be prefixing context in the document. Recent work by Anthropic shows notable gains by doing so. This appears to be relatively simple to implement (link shared below)
Basic Example
Suppose the document is of a SEC filing,
Drawbacks
While uploading a file into the rag database, it will take more time now (as the context has to be estimated for all chunks)
Additional information
Reference links:
The text was updated successfully, but these errors were encountered: