Fix/summarizer #16

Merged
merged 61 commits into from
Jun 11, 2025

a-klos
Member

@a-klos a-klos commented Jun 6, 2025

This pull request updates the PageSummaryEnhancer class in admin-api-lib/src/admin_api_lib/impl/information_enhancer/page_summary_enhancer.py. It restructures how information pieces are grouped by page and adds a new method that summarizes page content asynchronously.

Enhancements to page summarization:

  • Added a new method _asummarize_page to asynchronously summarize the content of grouped page pieces. This method combines page content, invokes the summarizer, and constructs metadata with additional attributes such as a unique ID and related document IDs.
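The flow described for _asummarize_page can be sketched roughly as follows. This is only an illustration under assumptions, not the code from the PR: the `Piece` dataclass, the `EchoSummarizer` stand-in, the free function `asummarize_page`, and the metadata keys `id`, `related`, and `page` are all hypothetical names chosen for the example.

```python
import asyncio
from dataclasses import dataclass, field
from uuid import uuid4


@dataclass
class Piece:
    """Hypothetical stand-in for an information piece (not the library's real type)."""
    page_content: str
    metadata: dict = field(default_factory=dict)


class EchoSummarizer:
    """Hypothetical summarizer: returns the first 40 characters of its input."""

    async def ainvoke(self, text: str) -> str:
        return text[:40]


async def asummarize_page(pieces: list[Piece], summarizer) -> Piece:
    # Combine the content of all pieces that belong to one page.
    full_text = "\n".join(p.page_content for p in pieces)
    # Invoke the summarizer without blocking the event loop.
    summary = await summarizer.ainvoke(full_text)
    # Start from the first piece's metadata, then add summary-specific fields.
    metadata = dict(pieces[0].metadata)
    metadata["related"] = [p.metadata["id"] for p in pieces]  # assumed key name
    metadata["id"] = str(uuid4())  # fresh unique ID for the summary itself
    return Piece(page_content=summary, metadata=metadata)
```

Calling `asyncio.run(asummarize_page(pieces, EchoSummarizer()))` with the pieces of one page would return a single summary piece whose metadata points back at the source pieces.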

Refactoring of page grouping logic:

  • Replaced the previous grouping logic in _acreate_summary with a new approach that explicitly identifies distinct pages and groups information pieces based on their page metadata. The new logic also filters out groups that do not meet the chunk size threshold when self._chunker_settings is enabled.
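A minimal sketch of the grouping idea described above, under stated assumptions: the `Piece` type, the `page` metadata key, the `group_by_page` function, and the `min_chars` parameter are hypothetical. In particular, measuring the chunk size threshold as a total character count per group is an assumption made for illustration; the PR description does not say how the threshold is computed.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Piece:
    """Hypothetical stand-in for an information piece (not the library's real type)."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def group_by_page(pieces: list[Piece], min_chars: Optional[int] = None) -> list[list[Piece]]:
    # Identify the distinct pages, preserving first-seen order.
    pages = list(dict.fromkeys(p.metadata.get("page") for p in pieces))
    # Collect the pieces that share each page value.
    groups = [[p for p in pieces if p.metadata.get("page") == page] for page in pages]
    # Only filter when a threshold is configured (assumed analogue of
    # self._chunker_settings being set).
    if min_chars is not None:
        groups = [g for g in groups if sum(len(p.page_content) for p in g) >= min_chars]
    return groups
```

With `min_chars=None` every page yields a group; with a threshold set, pages whose combined content is too short are dropped before summarization.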

a-klos added 23 commits June 2, 2025 08:24
…e unused managed_page_summary_enhancer module
…r DefaultSourceUploader to use it

refactor: update JSON serialization in ExtractionParameters, ExtractionRequest, InformationPiece, and KeyValuePair models
refactor: remove unused test files for confluence and thread management integration
@a-klos a-klos merged commit bac170a into main Jun 11, 2025
6 checks passed
@a-klos a-klos deleted the fix/summarizer branch June 11, 2025 12:51

4 participants