Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project: Google Data Commons #194

Open
2 tasks done
cmbz opened this issue Mar 15, 2024 · 8 comments
Open
2 tasks done

Project: Google Data Commons #194

cmbz opened this issue Mar 15, 2024 · 8 comments
Assignees
Labels
Dataverse Project Issues related to Dataverse Project software

Comments

@cmbz
Copy link
Contributor

cmbz commented Mar 15, 2024

Overview

Google-Harvard collaboration to share data with the Google Data Commons.

Issues

Resources

@cmbz cmbz self-assigned this Mar 15, 2024
@cmbz cmbz added the Dataverse Project Issues related to Dataverse Project software label Mar 19, 2024
@cmbz
Copy link
Contributor Author

cmbz commented May 9, 2024

Status: July 2024

  • Worked on design of a tool to allow data contributors to provide information on the data transformations required to adapt data tables to the format required by Google Data Commons.
  • Currently researching the use of AI tools in automating the process of transforming data.

@cmbz
Copy link
Contributor Author

cmbz commented Aug 22, 2024

Status: August 2024

  • No updates for August

@cmbz
Copy link
Contributor Author

cmbz commented Oct 8, 2024

Status: September 2024

@cmbz
Copy link
Contributor Author

cmbz commented Nov 3, 2024

Status: October 2024

  • Development of a tool to transform Dataverse tables into required Google Data Commons format is underway.

@cmbz
Copy link
Contributor Author

cmbz commented Nov 6, 2024

Status: November 2024

  • Continuing development of a tool for transforming Dataverse tables into the format required by Google Data Commons. The tool will also create and update the ancillary documentation required for GDC upload/presentation.

@cmbz
Copy link
Contributor Author

cmbz commented Jan 15, 2025

Status: December 2024

  • Working with Stefano to develop a plan to train LLM to help automate the tool for transforming Dataverse tables (recognizing Location and Dates and Statistical Variables.)
  • Jan. 2025 - met with developers at Google to establish a partnership in ongoing data conversion development.

@cmbz
Copy link
Contributor Author

cmbz commented Feb 10, 2025

Status: January 2025

  • Stefano and Stephen K. continuing work on training LLM to recognize Dataverse Tables that may fit the Google Data Commons requirements. (Time series data associated with a recognized geographic area.)

@cmbz
Copy link
Contributor Author

cmbz commented Mar 7, 2025

Status: February 2025

  • Pending

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Dataverse Project Issues related to Dataverse Project software
Projects
None yet
Development

No branches or pull requests

2 participants