Skip to content

alick-at-google/public-datasets-pipelines

 
 

Repository files navigation

Google Cloud Datasets: Data Pipelines and Documentation Set

This repository contains the followings:

  • Cloud-native, data pipeline architecture for onboarding public datasets to Google Cloud Datasets.
  • Documentation set for tutorials, samples, and other articles related to the datasets hosted by the program.

For detailed documentation, please see the Wiki Pages.

About

Cloud-native, data onboarding architecture for Google Cloud Datasets

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 78.7%
  • HCL 12.0%
  • Jupyter Notebook 6.9%
  • Dockerfile 2.0%
  • Jinja 0.4%