Correlations stopped working #1527
Comments
I looked into this because I was running into the same issue. It appears to be tied to the pandas upgrade from 2.0.3 to 2.1.x: ydata-profiling v4.6.4 works fine with pandas v2.0.3, but once you upgrade to pandas v2.1.x the autocorrelation stops working. I won't claim to know what in pandas is causing the break, but if you downgrade to pandas 2.0.3 it works again.
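For anyone who wants to guard against this in their own scripts, here is a minimal sketch of a version check. The 2.1 threshold comes only from the reports in this thread, and the helper names `parse_version` and `pandas_may_break_correlations` are made up for illustration; they are not part of ydata-profiling or pandas.

```python
def parse_version(text: str) -> tuple:
    """Turn a version string such as '2.1.4' into (2, 1, 4).

    Only the leading digits of the first three dot-separated pieces are
    kept, so a suffix like 'rc0' is ignored.
    """
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def pandas_may_break_correlations(pandas_version: str) -> bool:
    """Assumption from this thread: pandas 2.1 and later trigger the failure."""
    return parse_version(pandas_version) >= (2, 1)


# Typical use would pass pandas.__version__; plain strings are used here
# so the sketch runs without pandas installed.
for candidate in ("2.0.3", "2.1.0", "2.1.4"):
    print(candidate, pandas_may_break_correlations(candidate))
```

If the check returns True, the workaround reported above is to install pandas 2.0.3.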
@driscoll42 good catch. I can confirm that the reason 4.6.2 works is that it pins pandas < 2.1. The PR below relaxed the pandas pin and thus broke correlations. @aquemy @ricardodcpereira any idea what could be wrong?
@aquemy @ricardodcpereira is there anything I can help with?
Could it be due to the newer pandas datatypes? There are now nullable datatypes for string, float, etc., with pandas.NA as the missing value. I get many errors where the data is being converted from string to float:
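If nullable dtypes are indeed involved (nothing in this thread confirms that), one thing to try is casting numeric columns back to plain NumPy float64 before profiling, so that missing values become NaN instead of pandas.NA. The sketch below is only a possible workaround, not the fix applied in the library; `to_numpy_floats` and the sample column names are hypothetical.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_bool_dtype, is_numeric_dtype


def to_numpy_floats(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy whose numeric columns are plain float64 with NaN for missing values.

    Non-numeric columns (strings, booleans, ...) are left untouched.
    """
    out = df.copy()
    for name in out.columns:
        dtype = out[name].dtype
        if is_numeric_dtype(dtype) and not is_bool_dtype(dtype):
            # to_numpy with na_value maps pandas.NA to NaN during the cast.
            out[name] = out[name].to_numpy(dtype="float64", na_value=np.nan)
    return out


# Made-up sample data using the nullable extension dtypes.
df = pd.DataFrame(
    {
        "rooms": pd.array([6, 5, None, 7], dtype="Int64"),
        "price": pd.array([24.0, 21.6, 34.7, None], dtype="Float64"),
        "town": pd.array(["a", "b", "a", "c"], dtype="string"),
    }
)
plain = to_numpy_floats(df)
print(plain.dtypes)
```

The converted frame can then be passed to the profiler in place of the original to see whether the conversion errors go away.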
Maybe after pandas 2.0, we need to add
I believe this line will also have to be updated to this or its equivalent:
method = (
    _pairwise_spearman
    if col_1_name not in categorical_columns and col_2_name not in categorical_columns
    else _pairwise_cramers
)
Setting
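To make the selection rule in the snippet above concrete, here is a small self-contained sketch. The two stub functions are hypothetical stand-ins for the library's private `_pairwise_spearman` and `_pairwise_cramers` helpers and only return a label; `pick_method` and the column names are likewise invented for illustration.

```python
from typing import Callable, Set


def pairwise_spearman_stub(col_1, col_2) -> str:
    """Stand-in for the real Spearman helper; returns a label only."""
    return "spearman"


def pairwise_cramers_stub(col_1, col_2) -> str:
    """Stand-in for the real Cramer's V helper; returns a label only."""
    return "cramers"


def pick_method(col_1_name: str, col_2_name: str, categorical_columns: Set[str]) -> Callable:
    """Use Spearman only when neither column is categorical, else Cramer's V."""
    return (
        pairwise_spearman_stub
        if col_1_name not in categorical_columns and col_2_name not in categorical_columns
        else pairwise_cramers_stub
    )


categorical = {"town"}
for pair in (("rooms", "price"), ("rooms", "town"), ("town", "town")):
    method = pick_method(pair[0], pair[1], categorical)
    print(pair, method(None, None))
```

The point of the condition is that a single categorical column in the pair is enough to route it away from Spearman, which expects numeric input.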
this includes the fixes described in the issue
Current Behaviour
When creating a profile with default settings, correlations do not work for some relatively simple data sets and fail with the error below:
I think this issue started with 4.6.3 and is still present in 4.6.4.
EDIT: I can confirm that downgrading to 4.6.2 solves the issue.
Expected Behaviour
Correlations work
Data Description
Standard Boston data set
Code that reproduces the bug
pandas-profiling version
v4.6.4
Dependencies
OS
Ubuntu 22
Checklist