-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when joining dataframes with duplicate column names if dataframes generated from file #14147
Comments
|
You're right, it works with CSV-created dataframes, I missed header stuff But originally issue arised with parquet files. I tested with parquet this time, and problem is there:
|
i created an issue in data fusion python apache/datafusion-python#996 |
Datafusion currently requires all field names to be unique. Recently added was alias support for tables which might help in the future. Hopefully that will make it to the pythonlib #14127 |
Describe the bug
Encountered an issue joining dataframes with duplicate column names if they generated from file read (I tried csv and parquet).
Dataframes produced from python dict do join without problem.
I did my testing with latest version of Datafusion on Windows.
To Reproduce
Fine with dataframes from dict
Continue to file read
Expected behavior
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: