Verticapy 1.0.1 - Unit test failure for three regression metrics (Aic_score, Bic_score, R2_score) #1207
-
Hello, While conducting unit tests with Titanic data on Verticapy 1.0.1, I noticed a discrepancy between the AIC, BIC, and R2 scores calculated by Verticapy and those calculated using scikit-learn. Here are the steps to reproduce the issue: 1.Use Titanic data from Verticapy:
2.Train a linear regression model with Verticapy and Python:
3. Calculate AIC, BIC, and R2 scores with Verticapy:
4. Calculate AIC, BIC, and R2 scores with Scikit-learn:
Results obtained: Verticapy AIC Score: -1863.62512466986 Scikit-learn AIC Score: -1488.3492662576539 The AIC, BIC, and R2 scores obtained with Verticapy are significantly different from those calculated using scikit-learn. Verticapy Test 0.12: Scores are identical between Verticapy and scikit-learn. Please examine this issue and keep me informed of any updates or solutions that may be provided. Best regards, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
Hi @okankcb we will need to investigate why the differences are so big. I've created an issue to track this bug. #1236 |
Beta Was this translation helpful? Give feedback.
-
The problem is due to missing values. It seems, the new version is sensitive to missing values. It will be fixed in the next PR. |
Beta Was this translation helpful? Give feedback.
Solved here: #1254