Add GenAi-PA error handling #483

Merged
dyastremsky merged 5 commits into feature-genai-pa from dyas-pa-llm-error-handling on Mar 1, 2024

Conversation

dyastremsky
Contributor

This PR adds exception handling to the core logic of GenAi-PA and prints an error when perf_analyzer returns a non-zero exit code. It also adds a test to make sure that run() raises this exception.
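For readers following along, here is a minimal sketch of the pattern described above: run a subprocess, raise on a non-zero exit code, and print the error at the top level. It is not the code from this PR. The `GenAiPAException` name comes from the commit list; `run_command`, `main`, and the message format are hypothetical names chosen for illustration.

```python
import subprocess
import sys


class GenAiPAException(Exception):
    """Raised when the wrapped command fails (illustrative definition)."""


def run_command(cmd):
    """Run cmd (a list of strings) and raise on a non-zero exit code.

    Hypothetical helper; the PR's actual run() may look different.
    """
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        raise GenAiPAException(
            f"Command exited with code {result.returncode}: {result.stderr.strip()}"
        )
    return result.stdout


def main(cmd):
    """Print the error and return a non-zero status if the command failed."""
    try:
        run_command(cmd)
    except GenAiPAException as e:
        print(f"ERROR: {e}", file=sys.stderr)
        return 1
    return 0
```

Raising inside the helper and catching once in `main` keeps the error message in one place and lets tests assert on the exception directly.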

This also moves llm_inputs.py underneath the genai_pa directory to help start integrating the components of GenAi-PA. This organization seems logical to me and consistent with the way the Model Analyzer code is currently written, but I am open to feedback here. (CC: @nv-braf, let me know if you think this should be reverted.)

Passing tests:
image
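A test for the non-zero exit path could look roughly like the sketch below. This is an assumption-laden illustration, not the test added in this PR: the `run` function here is a stand-in defined inline, the `["perf_analyzer"]` command is only a placeholder that is never executed, and `subprocess.run` is mocked so the test does not need the real binary.

```python
import subprocess
from unittest.mock import patch


class GenAiPAException(Exception):
    """Illustrative stand-in for the exception named in the commit list."""


def run(cmd):
    """Stand-in for the function under test: raise on a non-zero exit code."""
    result = subprocess.run(cmd)
    if result.returncode != 0:
        raise GenAiPAException(f"Command exited with code {result.returncode}")


def test_nonzero_exit_raises():
    # Simulate a failed process without launching anything.
    failed = subprocess.CompletedProcess(args=["perf_analyzer"], returncode=1)
    with patch("subprocess.run", return_value=failed):
        try:
            run(["perf_analyzer"])
        except GenAiPAException as e:
            assert "1" in str(e)
        else:
            raise AssertionError("expected GenAiPAException")
```

Under pytest, the `try`/`except`/`else` could be replaced with `pytest.raises(GenAiPAException)`; the plain form is used here to keep the sketch dependency-free.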

Additional error log when there is a failure (in this case, a non-zero exit code from Perf Analyzer):
image

@dyastremsky dyastremsky self-assigned this Mar 1, 2024
@dyastremsky dyastremsky changed the base branch from main to feature-genai-pa March 1, 2024 21:37
Contributor

@nv-braf nv-braf left a comment

That's fine by me.

@debermudez
Contributor

@dyastremsky @nv-braf I think the logic for the separate directory was that there might be more input-specific files coming. The separate directory was anticipating organization that was going to be necessary in the short to mid term.
Not gating, but we may end up reverting this later.

Contributor

@debermudez debermudez left a comment

Agree with @rmccorm4's suggestions. Update those sections and this is good to go in.

dyastremsky and others added 2 commits March 1, 2024 14:29
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
@dyastremsky
Contributor Author

> Agree with @rmccorm4 suggestions. Update those sections and this is good to go in.

Thanks for the corrections, @rmccorm4! All fixed.

Understood on the structure, @nv-braf @debermudez. Thanks for the clarification, that sounds good!

@dyastremsky dyastremsky merged commit 1c56a46 into feature-genai-pa Mar 1, 2024
3 checks passed
@dyastremsky dyastremsky deleted the dyas-pa-llm-error-handling branch March 1, 2024 22:39
debermudez pushed a commit that referenced this pull request Mar 2, 2024
* Print exception when non-zero error code

* Add test for GenAiPAException for nonzero exit

* Revert format/comment changes.

* Fix typo

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

* Remove unnecessary return value

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

---------

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

WIP clean up and link up llm_inputs
debermudez pushed a commit that referenced this pull request Mar 12, 2024
debermudez pushed a commit that referenced this pull request Mar 13, 2024
mc-nv pushed a commit that referenced this pull request Mar 13, 2024

4 participants