-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: evaluate testing #390
base: main
Are you sure you want to change the base?
Conversation
Code Coverage Summary
Diff against main
Results for commit: 9c7af53 Minimum allowed coverage is ♻️ This comment has been updated with latest results |
Trivy scanning results. Report Summary ┌─────────┬──────┬─────────────────┬─────────┐
For OSS Maintainers: VEX NoticeIf you're an OSS maintainer and Trivy has detected vulnerabilities in your project that you believe are not actually exploitable, consider issuing a VEX (Vulnerability Exploitability eXchange) statement. To disable this notice, set the TRIVY_DISABLE_VEX_NOTICE environment variable. uv.lock (uv)Total: 19 (MEDIUM: 10, HIGH: 8, CRITICAL: 1) ┌──────────────────┬────────────────┬──────────┬────────┬───────────────────┬───────────────┬──────────────────────────────────────────────────────────────┐ |
Compute the evaluation results for the given pipeline and data. | ||
|
||
Args: | ||
pipeline: The pipeline to be evaluated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't see an argument pipeline
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
added
|
||
class EvaluationPipeline(Generic[EvaluationTargetT], WithConstructionConfig, ABC): | ||
""" | ||
Collection evaluation pipeline. | ||
""" | ||
|
||
CONCURRENCY: int = 10 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wouldn't it be better to have this as a part of EvaluationConfig so it could be changed if needed?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good point. also renamed the parameter to batch_size
- should be more intuitive. probably we should also support multiple batching strategies here as we do withing document processing
""" | ||
model = EvaluationConfig.model_validate(config) | ||
dataloader: DataLoader = DataLoader.subclass_from_config(model.dataloader) | ||
pipeline: EvaluationPipeline = EvaluationPipeline.subclass_from_config(model.pipeline) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
After checking the code, I am no longer sure that moving Evaluator
to EvaluationPipeline
is a good move. Here you are creating an EvaluationPipeline
object inside EvaluationPipeline
. This is confusing for the user and mixes two different abstractions that imho should be separated - the code being evaluated and the evaluation orchestration. I think we should undo this change and bring back Evaluator
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
changed it to be a class method now. would not say it is confusing - as well one can say that all class methods are confusing. To me stateless object is more confusing than this and actually the code which is being evaluated is the object evaluation_target
(eg. DocumentSearch) and EvaluationPipeline
is already kind of orchestration for eval execution
4957491
to
175c3eb
Compare
No description provided.