[GSK-4033] Fix correctness aggregation error #2090

henchaves · 2024-12-19T12:53:53Z

Description

Related Issue

Type of Change

📚 Examples / docs / tutorials / dependencies update
🔧 Bug fix (non-breaking change which fixes an issue)
🥂 Improvement (non-breaking change which improves an existing feature)
🚀 New feature (non-breaking change which adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to change)
🔐 Security fix

Checklist

I've read the CODE_OF_CONDUCT.md document.
I've read the CONTRIBUTING.md guide.
I've written tests for all new methods and classes that I created.
I've written the docstring in Google format for all the methods and classes that I used.
I've updated the pdm.lock running pdm update-lock (only applicable when pyproject.toml has been
modified)

linear · 2024-12-19T12:53:56Z

GSK-4033 Fix correctness aggregation error

…-error

sonarqubecloud · 2024-12-19T13:14:15Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
75.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

kevinmessiaen

LGTM, however I'm thinking it would be better to add those check inside the parse_json_output since it has a builtin feature to ask the llm to correct the json output

kevinmessiaen · 2025-01-07T08:43:18Z

giskard/rag/metrics/correctness.py

                out.content,
                llm_client=llm_client,
                keys=["correctness", "correctness_reason"],
                caller_id=self.__class__.__name__,
            )

+            if "correctness" in json_output and not isinstance(json_output["correctness"], bool):
+                raise LLMGenerationError(
+                    f"Error in correctness evaluation: {json_output['correctness']}. Please make sure the agent answer is correctly formatted."


Switching to repr is better so we can differentiate "True" from True, also

Suggested change

f"Error in correctness evaluation: {json_output['correctness']}. Please make sure the agent answer is correctly formatted."

f"Error in correctness evaluation: {repr(json_output['correctness'])}. Please make sure the agent answer is correctly formatted. Correctness value should be of boolean type."

henchaves added 2 commits December 13, 2024 19:52

Raise exception if correctness is string

481f91d

Update the condition to raise LLMGenerationError

2ce7fca

Merge branch 'main' into feature/gsk-4033-fix-correctness-aggregation…

effa6a0

…-error

henchaves self-assigned this Jan 7, 2025

henchaves requested a review from kevinmessiaen January 7, 2025 01:51

henchaves changed the title ~~[WIP][GSK-4033] Fix correctness aggregation error~~ [GSK-4033] Fix correctness aggregation error Jan 7, 2025

henchaves marked this pull request as ready for review January 7, 2025 01:51

kevinmessiaen approved these changes Jan 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GSK-4033] Fix correctness aggregation error #2090

[GSK-4033] Fix correctness aggregation error #2090

henchaves commented Dec 19, 2024

linear bot commented Dec 19, 2024

sonarqubecloud bot commented Dec 19, 2024

kevinmessiaen left a comment

kevinmessiaen Jan 7, 2025

	f"Error in correctness evaluation: {json_output['correctness']}. Please make sure the agent answer is correctly formatted."
	f"Error in correctness evaluation: {repr(json_output['correctness'])}. Please make sure the agent answer is correctly formatted. Correctness value should be of boolean type."

[GSK-4033] Fix correctness aggregation error #2090

Are you sure you want to change the base?

[GSK-4033] Fix correctness aggregation error #2090

Conversation

henchaves commented Dec 19, 2024

Description

Related Issue

Type of Change

Checklist

linear bot commented Dec 19, 2024

sonarqubecloud bot commented Dec 19, 2024

Quality Gate passed

kevinmessiaen left a comment

Choose a reason for hiding this comment

kevinmessiaen Jan 7, 2025

Choose a reason for hiding this comment