Skip to content
View ddelange's full-sized avatar
๐Ÿ’ฅ
["translatio", "imitatio", "aemulatio"]
๐Ÿ’ฅ
["translatio", "imitatio", "aemulatio"]

Block or report ddelange

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Stars

etl

Extract-Transform-Load, Data Wrangling, Data Mining, ...
255 repositories

A Python package to manage extremely large amounts of data

Python 1,323 275 Updated Feb 1, 2025

A curated list of analytics frameworks, software and other tools.

3,971 438 Updated May 9, 2024

Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.

Python 754 42 Updated Jan 15, 2025

Fast data store for Pandas time-series data

Python 568 101 Updated Jul 10, 2024

Apache DataFusion Ballista Distributed Query Engine

Rust 1,638 202 Updated Jan 24, 2025

DatenLord, Computing Defined Storage, an application-orientated, cloud-native distributed storage system

Rust 889 89 Updated Dec 2, 2024

An io_uring backed runtime for Rust

Rust 1,183 128 Updated Aug 2, 2024

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 845 122 Updated Jan 29, 2025

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data everโ€ฆ

Java 1,639 336 Updated Jan 1, 2024

A scikit-learn compatible neural network library that wraps PyTorch

Jupyter Notebook 5,955 394 Updated Jan 31, 2025

C++ library for value-oriented design using the unidirectional data-flow architecture โ€” Redux for C++

C++ 714 69 Updated Nov 13, 2024

Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.

C++ 298 47 Updated Jun 9, 2024

Content-Addressable Data Synchronization Tool

C 1,509 117 Updated Dec 21, 2023

Deploy a Prefect flow to serverless AWS Lambda function

Python 36 6 Updated Sep 27, 2022

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,872 349 Updated Aug 7, 2024

a high-performance, POSIX-ish Amazon S3 file system written in Go

Go 5,269 525 Updated Jul 18, 2024

Cache AnyThing filesystem written in Rust

Rust 869 54 Updated Oct 9, 2023
Mustache 16 12 Updated Jul 10, 2023

๐——๐—ฎ๐˜๐—ฎ, ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ & ๐—”๐—œ. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust 8,138 761 Updated Feb 1, 2025

LogAI - An open-source library for log analytics and intelligence

Python 494 70 Updated Nov 14, 2024

Scalable Python DS & ML, in an API compatible & lightning fast way.

Python 1,141 70 Updated Feb 3, 2025

Tools for running OCR against files stored in S3

Python 118 7 Updated Aug 10, 2022

The country converter (coco) - a Python package for converting country names between different classification schemes.

Python 221 75 Updated Nov 15, 2024

Trigger.dev is the open source background jobs platform.

TypeScript 10,116 627 Updated Feb 2, 2025

Temporal service

Go 12,812 890 Updated Feb 3, 2025

Fast NumPy array functions written in C

Python 1,089 105 Updated Oct 18, 2024

๐Ÿš€ (currently broken) Backup Google Takeout archives (YouTube channel and Google Photos) at 1GB/s+ to Azure Storage periodically with minimal human toil and financial cost

142 5 Updated Jan 4, 2025

Basic AWS S3 WebDAV interface implemented in Rust

Rust 52 10 Updated Sep 19, 2018

Python library and CLI you can use to move relational data from one place to another - DBs/CSV/gsheets/dataframes/...

Python 37 4 Updated Jun 18, 2024

Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring โ˜Ž๏ธ

C 1,176 43 Updated Jan 13, 2025