-
Notifications
You must be signed in to change notification settings - Fork 234
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add response outputs to profile export #478
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall this LGTM for short term solution, but a few high level comments.
- The implementation of
Output()
being something that only supports string outputs named "text_output" felt not ideal, but I understand the reasoning for doing it this way in terms of supporting LLM-specific behavior right now.- More generically down the line, I might expect the report to include all outputs or something.
- The example report you posted looks great! However, it looks like it should also include some information to associate the responses with their requests. Are there plans to include either the corresponding inputs or request_ids etc. to correlate them later? Do we think that would be useful?
- Do we have any unit tests for a model that doesn't contain an output named "text_output", just to assert the expected behavior?
I expect some of the other folks may have some feedback/changes requested, so I'll save the approval for now.
NOTE: If you want to respond to any of these, please start a thread by commenting on some arbitrary part of the file and we can discuss in the thread.
@rmccorm4 the way the json is structured now, we have the responses contained within their respective request. Originally we thought this would be sufficient to enable correlation of 1 request to N responses. Do you think we should have a different approach going forward? |
should wait on @tgerdes to look at it
Removed approval so @tgerdesnv can get eyes on it too. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes requested. Let me know if you think we need to stick to your current plan in order to meet the deadline
I do think we want to consider moving to a file format with https://jsonlines.org/ - this is probably out of scope for this PR and MVP. but json lines will allow us to support large files easier and without keeping things in memory. It's become quite popular for that reason as a way to use for logging. It would also allow PA to "stream" results one record at a time to other scripts for real time processing. |
Took a quick look at the examples you linked. Functionally I think its a small change but I agree out of scope for this MVP. |
What I'm imagining is rather that we have a line at the top with any experiment settings, like a header, and then each line after that is a record. Probably should put a proposal together. |
Can this be verified via a json schema? |
8601412
to
2eed0dc
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes! I like this much better.
I didn't do a full thorough review, so please make sure to follow up on anyone else's feedback before merging
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚀
Adds response outputs to the profile export. Here's an example new profile_export.json: