refactor: document processing interface #419
Code Coverage Summary (diff against main)
Results for commit: 9716f64
Trivy scanning results
uv.lock (uv): Total: 21 (MEDIUM: 11, HIGH: 9, CRITICAL: 1)
bdb0e54 to e1a8d59
d08ffb6 to 9d9622a
closes #402
This PR focuses solely on reducing complexity and unifying the interface of the document ingest API. The unstructured parser has been heavily refactored to make better use of the library's API. Intermediate handlers now operate on raw elements rather than on intermediate representations; after a few iterations, that abstraction no longer seems necessary.
The docs will be updated in follow-up PRs. They require a heavy rewrite, and I don't want to push this PR past 2k lines. For now I have updated only the API reference, since that was straightforward.
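As a rough illustration of handlers working directly on raw elements, the pattern could look something like the sketch below. This is not the code from this PR: `RawElement`, `DocumentChunk`, the `html` metadata key, and the handler names are all hypothetical stand-ins chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Callable


# Hypothetical stand-in for a raw element emitted by the parsing library.
@dataclass
class RawElement:
    category: str  # e.g. "Title", "NarrativeText", "Table"
    text: str
    metadata: dict = field(default_factory=dict)


# Hypothetical output type produced by the handlers.
@dataclass
class DocumentChunk:
    kind: str
    content: str


Handler = Callable[[RawElement], DocumentChunk]


def handle_text(element: RawElement) -> DocumentChunk:
    # Default handler: consume the raw element's text directly.
    return DocumentChunk(kind="text", content=element.text.strip())


def handle_table(element: RawElement) -> DocumentChunk:
    # Prefer an HTML rendering from the metadata (assumed key), else plain text.
    html = element.metadata.get("html")
    return DocumentChunk(kind="table", content=html or element.text)


# Handlers are keyed by the raw element's category, so no intermediate
# representation sits between the parser output and the handler.
HANDLERS: dict[str, Handler] = {"Table": handle_table}


def process(elements: list[RawElement]) -> list[DocumentChunk]:
    # Dispatch each raw element to its handler, falling back to plain text.
    return [HANDLERS.get(e.category, handle_text)(e) for e in elements]


chunks = process(
    [
        RawElement("Title", "  Quarterly report  "),
        RawElement("Table", "a b", {"html": "<table></table>"}),
    ]
)
```

Dispatching on the element's own category keeps each handler a plain function of one raw element, which is one way to drop the extra conversion layer.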