-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#27 Fix obj serialization for saving #29
Conversation
Codecov ReportAttention: Patch coverage is ✅ All tests successful. No failed tests found.
|
…eten extras & pyproject build ignore img dir
…eten extras & pyproject build ignore img dir
Added stronger, more defensive input checks for the result_writer and unit tests. |
- Ensures non-ASCII characters are preserved in the output. | ||
""" | ||
if len(eval_inputs) != len(eval_outputs): | ||
raise ValueError("eval_inputs and eval_outputs must have the same length") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This will likely raise an error if there have been downstream errors with outputs. Eval outputs can be <= eval_inputs
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great updates! |
I've pushed 'append' style result_writing for batches and end-to-end test for result writing with Llama-Index. I still need to test a bit more and understand what makes the most sense. |
flow_judge/flow_judge.py
Outdated
def _handle_batch_result( | ||
self, batch_result: BatchResult, batch_len: int, fail_on_parse_error: bool | ||
) -> list[EvalOutput]: | ||
"""Handle output parsing for batched results from Baseten. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is probably for all model instances not just for baseten right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
True, updated!
@sariola Pushed the update for downstream errors! |
Fixes #27