nlarge-google/public-datasets-pipelines

This branch is 1 commit ahead of, 139 commits behind GoogleCloudPlatform/public-datasets-pipelines:main.

Google Cloud Datasets: Data Pipelines and Documentation Set

public-datasets-pipelines

This repository contains the following:

  • A cloud-native data pipeline architecture for onboarding public datasets to Google Cloud Datasets.
  • A documentation set of tutorials, samples, and other articles that use the datasets hosted by the program.

For detailed documentation, please see the Wiki Pages.

About

Cloud-native, data onboarding architecture for Google Cloud Datasets

Languages

  • Python 77.8%
  • HCL 10.6%
  • Jupyter Notebook 9.6%
  • Dockerfile 1.8%
  • Jinja 0.2%