Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] DataFrames as a first class schema #505

Closed
1 of 2 tasks
mynameisvinn opened this issue Jan 28, 2021 · 2 comments
Closed
1 of 2 tasks

[FEATURE] DataFrames as a first class schema #505

mynameisvinn opened this issue Jan 28, 2021 · 2 comments
Labels
feature-discussion open discussion on feature request

Comments

@mynameisvinn
Copy link
Contributor

mynameisvinn commented Jan 28, 2021

🚨🚨 Feature Request

  • Related to an existing Issue
  • A new implementation (Improvement, Extension)

Should DataFrames be treated as a first-class schema (alongside Text, Audio, Images)?

If your feature will improve HUB

Even though Hub is designed for unstructured datasets, many data science projects still rely on DataFrames in some form or fashion. While there may be better solutions for large scale tabular data analysis, Hub should aim for feature parity.

Ultimately, the goal is to create a package familiar to data scientists, and supporting DataFrames is a big step towards that goal.

Description of the possible solution

Since a DataFrame is a collection of numpy arrays, a DataFrame schema could be assembled with existing Schemas (eg Tensors).

@mynameisvinn mynameisvinn added the feature-discussion open discussion on feature request label Jan 31, 2021
@mynameisvinn
Copy link
Contributor Author

Closing this feature request due to inactivity and lack of interest. Will revive it if more users request it.

@mynameisvinn
Copy link
Contributor Author

Closing this feature request due to inactivity and lack of interest. Will revive it if more users request it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature-discussion open discussion on feature request
Projects
None yet
Development

No branches or pull requests

1 participant