[REQUEST] Improve RAG performance by Contextual Retrieval #682

rsnk96 · 2025-02-28T20:21:45Z

Reference Issues

No response

Summary

Dear Kotaemon team,

Great work so far. It would be nice to enhance retrieval performance by prefixing each chunk with context from its source document. Recent work by Anthropic shows notable gains from doing so, and it appears relatively simple to implement (links shared below).

Basic Example

Suppose the document is an SEC filing:

original_chunk = "The company's revenue grew by 3% over the previous quarter."

contextualized_chunk = "This chunk is from an SEC filing on ACME corp's performance in Q2 2023; the previous quarter's revenue was $314 million. The company's revenue grew by 3% over the previous quarter."
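A minimal sketch of how this could look at indexing time, assuming a pluggable `generate_context` callable that would wrap an LLM call in practice. The names `contextualize_chunks` and `stub_generate_context` are hypothetical and are not part of Kotaemon or of Anthropic's reference implementation; the stub simply returns the context string from the example above so the snippet runs without any API access.

```python
from typing import Callable, List


def contextualize_chunks(
    document: str,
    chunks: List[str],
    generate_context: Callable[[str, str], str],
) -> List[str]:
    """Prefix each chunk with a short context derived from the whole document.

    `generate_context(document, chunk)` is an assumed hook: in a real
    pipeline it would call an LLM once per chunk.
    """
    contextualized = []
    for chunk in chunks:
        context = generate_context(document, chunk).strip()
        # Fall back to the raw chunk if no context could be produced.
        contextualized.append(f"{context} {chunk}" if context else chunk)
    return contextualized


def stub_generate_context(document: str, chunk: str) -> str:
    # Hypothetical stand-in for the LLM call; returns a fixed context string.
    return (
        "This chunk is from an SEC filing on ACME corp's performance in "
        "Q2 2023; the previous quarter's revenue was $314 million."
    )


original_chunk = "The company's revenue grew by 3% over the previous quarter."
document = "(full text of the filing would go here)"  # placeholder input

contextualized = contextualize_chunks(
    document, [original_chunk], stub_generate_context
)
print(contextualized[0])
```

The contextualized strings, rather than the raw chunks, would then be passed to the embedding and indexing steps.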

Drawbacks

Uploading a file into the RAG database will take longer, since the context has to be generated for every chunk.

Additional information

Reference links:

Blog Post: https://www.anthropic.com/news/contextual-retrieval
Reference implementation by Anthropic: https://github.com/anthropics/anthropic-cookbook/blob/main/skills/contextual-embeddings/guide.ipynb


rsnk96 added the enhancement (New feature or request) label on Feb 28, 2025
