Add response outputs to profile export #478

matthewkotila · 2024-02-28T23:27:59Z

Adds response outputs to the profile export. Here's an example new profile_export.json:

{
    "experiments": [
        {
            "experiment": {
                "mode": "request_rate",
                "value": 1.0
            },
            "requests": [
                {
                    "timestamp": 1709164981471989921,
                    "response_timestamps": [
                        1709164981512336466,
                        1709164981513018359
                    ],
                    "response_outputs": [
                        {"text_output": "<\u0000\u0000\u0000machine learning is a very important part of the business.\n\n"},
                        {"text_output": ""}
                    ]
                },
                {
                    "timestamp": 1709164982471945536,
                    "response_timestamps": [
                        1709164982496197074,
                        1709164982497025684
                    ],
                    "response_outputs": [
                        {"text_output": "=\u0000\u0000\u0000Hello! What is your name?\n\nI'm a student at the University of"},
                        {"text_output": ""}
                    ]
                }
            ],
            "window_boundaries": [
                ...
            ]
        }
    ],
    "version": "0.0.0"
}

src/c++/library/common.h

src/c++/library/grpc_client.cc

src/c++/perf_analyzer/infer_context.cc

src/c++/library/http_client.cc

rmccorm4

Overall this LGTM for short term solution, but a few high level comments.

The implementation of Output() being something that only supports string outputs named "text_output" felt not ideal, but I understand the reasoning for doing it this way in terms of supporting LLM-specific behavior right now.
- More generically down the line, I might expect the report to include all outputs or something.
The example report you posted looks great! However, it looks like it should also include some information to associate the responses with their requests. Are there plans to include either the corresponding inputs or request_ids etc. to correlate them later? Do we think that would be useful?
Do we have any unit tests for a model that doesn't contain an output named "text_output", just to assert the expected behavior?

I expect some of the other folks may have some feedback/changes requested, so I'll save the approval for now.

NOTE: If you want to respond to any of these, please start a thread by commenting on some arbitrary part of the file and we can discuss in the thread.

debermudez · 2024-02-29T01:58:08Z

@rmccorm4 the way the json is structured now, we have the responses contained within their respective request. Originally we thought this would be sufficient to enable correlation of 1 request to N responses. Do you think we should have a different approach going forward?

src/c++/perf_analyzer/test_profile_data_exporter.cc

@tgerdes

should wait on @tgerdes to look at it

debermudez · 2024-02-29T02:26:18Z

Removed approval so @tgerdesnv can get eyes on it too.
LGTM.

src/c++/perf_analyzer/request_record.h

tgerdesnv

Changes requested. Let me know if you think we need to stick to your current plan in order to meet the deadline

nnshah1 · 2024-03-01T16:03:08Z

I do think we want to consider moving to a file format with https://jsonlines.org/ - this is probably out of scope for this PR and MVP. but json lines will allow us to support large files easier and without keeping things in memory. It's become quite popular for that reason as a way to use for logging.

It would also allow PA to "stream" results one record at a time to other scripts for real time processing.

debermudez · 2024-03-01T16:39:02Z

I do think we want to consider moving to a file format with https://jsonlines.org/ - this is probably out of scope for this PR and MVP. but json lines will allow us to support large files easier and without keeping things in memory. It's become quite popular for that reason as a way to use for logging.

It would also allow PA to "stream" results one record at a time to other scripts for real time processing.

Took a quick look at the examples you linked. Functionally I think its a small change but I agree out of scope for this MVP.
If I understand the example, I think you still need to hold the complete experiment json object in memory instead of the list of experiments until it is complete. It would allow for us to cut our memory imprint down but makes the outputting of the file a bit more gnarly.

nnshah1 · 2024-03-01T18:42:29Z

I do think we want to consider moving to a file format with https://jsonlines.org/ - this is probably out of scope for this PR and MVP. but json lines will allow us to support large files easier and without keeping things in memory. It's become quite popular for that reason as a way to use for logging.
It would also allow PA to "stream" results one record at a time to other scripts for real time processing.

Took a quick look at the examples you linked. Functionally I think its a small change but I agree out of scope for this MVP. If I understand the example, I think you still need to hold the complete experiment json object in memory instead of the list of experiments until it is complete. It would allow for us to cut our memory imprint down but makes the outputting of the file a bit more gnarly.

What I'm imagining is rather that we have a line at the top with any experiment settings, like a header, and then each line after that is a record. Probably should put a proposal together.

debermudez · 2024-03-01T18:47:50Z

I do think we want to consider moving to a file format with https://jsonlines.org/ - this is probably out of scope for this PR and MVP. but json lines will allow us to support large files easier and without keeping things in memory. It's become quite popular for that reason as a way to use for logging.
It would also allow PA to "stream" results one record at a time to other scripts for real time processing.

Took a quick look at the examples you linked. Functionally I think its a small change but I agree out of scope for this MVP. If I understand the example, I think you still need to hold the complete experiment json object in memory instead of the list of experiments until it is complete. It would allow for us to cut our memory imprint down but makes the outputting of the file a bit more gnarly.

What I'm imagining is rather that we have a line at the top with any experiment settings, like a header, and then each line after that is a record. Probably should put a proposal together.

Can this be verified via a json schema?
I want the reports locked in with verification via schemas soon.

tgerdesnv

Yes! I like this much better.
I didn't do a full thorough review, so please make sure to follow up on anyone else's feedback before merging

rmccorm4

🚀

src/c++/perf_analyzer/infer_context.cc

matthewkotila requested review from debermudez, rmccorm4, tgerdesnv and nv-hwoo February 28, 2024 23:27

matthewkotila commented Feb 28, 2024

View reviewed changes

src/c++/library/common.h Outdated Show resolved Hide resolved

matthewkotila commented Feb 28, 2024

View reviewed changes

src/c++/library/grpc_client.cc Outdated Show resolved Hide resolved

matthewkotila commented Feb 28, 2024

View reviewed changes

src/c++/library/grpc_client.cc Outdated Show resolved Hide resolved

matthewkotila commented Feb 29, 2024

View reviewed changes

src/c++/perf_analyzer/infer_context.cc Outdated Show resolved Hide resolved

debermudez reviewed Feb 29, 2024

View reviewed changes

src/c++/library/http_client.cc Outdated Show resolved Hide resolved

rmccorm4 reviewed Feb 29, 2024

View reviewed changes

src/c++/perf_analyzer/test_profile_data_exporter.cc Outdated Show resolved Hide resolved

debermudez previously approved these changes Feb 29, 2024

View reviewed changes

tgerdesnv reviewed Feb 29, 2024

View reviewed changes

src/c++/perf_analyzer/request_record.h Outdated Show resolved Hide resolved

tgerdesnv requested changes Feb 29, 2024

View reviewed changes

matthewkotila added 2 commits March 1, 2024 16:54

Add response outputs to profile export

8a88cf3

Address feedback

2eed0dc

matthewkotila force-pushed the matthewkotila-response-output branch from 8601412 to 2eed0dc Compare March 2, 2024 02:39

matthewkotila requested review from rmccorm4, tgerdesnv and debermudez March 2, 2024 02:39

tgerdesnv approved these changes Mar 4, 2024

View reviewed changes

rmccorm4 approved these changes Mar 4, 2024

View reviewed changes

debermudez approved these changes Mar 4, 2024

View reviewed changes

Fix bug

5ab1b1e

matthewkotila commented Mar 5, 2024

View reviewed changes

src/c++/perf_analyzer/infer_context.cc Show resolved Hide resolved

matthewkotila commented Mar 5, 2024

View reviewed changes

src/c++/perf_analyzer/infer_context.cc Show resolved Hide resolved

matthewkotila commented Mar 5, 2024

View reviewed changes

src/c++/perf_analyzer/infer_context.cc Show resolved Hide resolved

matthewkotila merged commit ae5d5b6 into feature-genai-pa Mar 5, 2024
3 checks passed

matthewkotila deleted the matthewkotila-response-output branch March 5, 2024 16:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add response outputs to profile export #478

Add response outputs to profile export #478

matthewkotila commented Feb 28, 2024 •

edited

Loading

rmccorm4 left a comment •

edited

Loading

debermudez commented Feb 29, 2024

debermudez commented Feb 29, 2024

tgerdesnv left a comment

nnshah1 commented Mar 1, 2024

debermudez commented Mar 1, 2024

nnshah1 commented Mar 1, 2024 •

edited

Loading

debermudez commented Mar 1, 2024

tgerdesnv left a comment

rmccorm4 left a comment

Add response outputs to profile export #478

Add response outputs to profile export #478

Conversation

matthewkotila commented Feb 28, 2024 • edited Loading

rmccorm4 left a comment • edited Loading

Choose a reason for hiding this comment

debermudez commented Feb 29, 2024

debermudez commented Feb 29, 2024

tgerdesnv left a comment

Choose a reason for hiding this comment

nnshah1 commented Mar 1, 2024

debermudez commented Mar 1, 2024

nnshah1 commented Mar 1, 2024 • edited Loading

debermudez commented Mar 1, 2024

tgerdesnv left a comment

Choose a reason for hiding this comment

rmccorm4 left a comment

Choose a reason for hiding this comment

matthewkotila commented Feb 28, 2024 •

edited

Loading

rmccorm4 left a comment •

edited

Loading

nnshah1 commented Mar 1, 2024 •

edited

Loading