#27 Fix obj serialization for saving #29

minaamshahid · 2024-10-25T06:44:58Z

Fixes #27

codecov · 2024-10-25T06:52:23Z

Codecov Report

Attention: Patch coverage is 57.91246% with 125 lines in your changes missing coverage. Please review.

✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
flow_judge/utils/result_writer.py	0.00%	58 Missing ⚠️
flow_judge/integrations/llama_index.py	0.00%	40 Missing ⚠️
flow_judge/flow_judge.py	0.00%	23 Missing ⚠️
flow_judge/models/vllm.py	0.00%	3 Missing ⚠️
flow_judge/eval_data_types.py	0.00%	1 Missing ⚠️

Files with missing lines	Coverage Δ
...sts/e2e-local/integrations/test_llama_index_e2e.py	`91.26% <100.00%> (-0.40%)`	⬇️
tests/unit/models/test_baseten.py	`90.09% <100.00%> (+0.04%)`	⬆️
tests/unit/utils/test_result_writer.py	`100.00% <100.00%> (ø)`
flow_judge/eval_data_types.py	`0.00% <0.00%> (ø)`
flow_judge/models/vllm.py	`0.00% <0.00%> (ø)`
flow_judge/flow_judge.py	`0.00% <0.00%> (ø)`
flow_judge/integrations/llama_index.py	`0.00% <0.00%> (ø)`
flow_judge/utils/result_writer.py	`0.00% <0.00%> (ø)`

…eten extras & pyproject build ignore img dir

sariola · 2024-10-25T11:06:35Z

Added stronger, more defensive input checks for the result_writer and unit tests.
Robustness improvements to the file name encoding.

minaamshahid · 2024-10-25T13:36:57Z

flow_judge/utils/result_writer.py

+        - Ensures non-ASCII characters are preserved in the output.
+    """
+    if len(eval_inputs) != len(eval_outputs):
+        raise ValueError("eval_inputs and eval_outputs must have the same length")


This will likely raise an error if there have been downstream errors with outputs. Eval outputs can be <= eval_inputs

Thanks Minaam, it's the zip function below that needs them to be the same lengths. This check is just to throw the right error type.

I'll see what I could do.

minaamshahid · 2024-10-25T13:38:12Z

Great updates!

sariola · 2024-10-25T14:41:03Z

I've pushed 'append' style result_writing for batches and end-to-end test for result writing with Llama-Index.

I still need to test a bit more and understand what makes the most sense.

bergr7 · 2024-10-28T14:20:23Z

flow_judge/flow_judge.py

+    def _handle_batch_result(
+        self, batch_result: BatchResult, batch_len: int, fail_on_parse_error: bool
+    ) -> list[EvalOutput]:
+        """Handle output parsing for batched results from Baseten.


This is probably for all model instances not just for baseten right?

True, updated!

minaamshahid · 2024-10-28T14:55:32Z

@sariola Pushed the update for downstream errors!

fix: serialize obj for saving

afbe27f

minaamshahid requested a review from bergr7 October 25, 2024 06:46

fix: add log warning back to steam

b5cf856

sariola self-requested a review October 25, 2024 09:22

sariola added 3 commits October 25, 2024 12:36

fix: quality improvements to result_writer & change to readme for bas…

64bf870

…eten extras & pyproject build ignore img dir

fix: quality improvements to result_writer & change to readme for bas…

5782b7d

…eten extras & pyproject build ignore img dir

fix: rm non-allowed exclude key from pyproject.toml

56c086d

minaamshahid commented Oct 25, 2024

View reviewed changes

feat: vllm pinned & test result writing unit + e2e

6b3fcf8

fix: equate eval outputs and inputs

4837d7a

bergr7 reviewed Oct 28, 2024

View reviewed changes

fix: handle_batch_results docstring

27413b4

fix: failing baseten test

1fd7e6a

sariola approved these changes Oct 29, 2024

View reviewed changes

sariola merged commit f143379 into main Oct 29, 2024
14 of 15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#27 Fix obj serialization for saving #29

#27 Fix obj serialization for saving #29

minaamshahid commented Oct 25, 2024

codecov bot commented Oct 25, 2024 •

edited

Loading

sariola commented Oct 25, 2024

minaamshahid Oct 25, 2024

sariola Oct 25, 2024

minaamshahid commented Oct 25, 2024

sariola commented Oct 25, 2024 •

edited

Loading

bergr7 Oct 28, 2024

minaamshahid Oct 28, 2024

minaamshahid commented Oct 28, 2024

#27 Fix obj serialization for saving #29

#27 Fix obj serialization for saving #29

Conversation

minaamshahid commented Oct 25, 2024

codecov bot commented Oct 25, 2024 • edited Loading

Codecov Report

sariola commented Oct 25, 2024

minaamshahid Oct 25, 2024

Choose a reason for hiding this comment

sariola Oct 25, 2024

Choose a reason for hiding this comment

minaamshahid commented Oct 25, 2024

sariola commented Oct 25, 2024 • edited Loading

bergr7 Oct 28, 2024

Choose a reason for hiding this comment

minaamshahid Oct 28, 2024

Choose a reason for hiding this comment

minaamshahid commented Oct 28, 2024

codecov bot commented Oct 25, 2024 •

edited

Loading

sariola commented Oct 25, 2024 •

edited

Loading