View ddelange's full-sized avatar
💥
["translatio", "imitatio", "aemulatio"]

Stars

etl

Extract-Transform-Load, Data Wrangling, Data Mining, ...
255 repositories

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 19,711 1,033 Updated Feb 3, 2025

Lean and mean distributed stream processing system written in Rust and WebAssembly. Alternative to Kafka + Flink in one.

Rust 4,000 486 Updated Feb 2, 2025

A database-backed work queue for Django

Python 26 16 Updated Jun 25, 2024

A JSON-like data structure (a CRDT) that can be modified concurrently by different users, and merged again automatically.

JavaScript 4,383 186 Updated Jan 30, 2025

Productive, portable, and performant GPU programming in Python.

C++ 26,657 2,329 Updated Jan 6, 2025

Data formats useful for API, Big Data, ML, Graph & co

41 4 Updated Dec 29, 2024

AMQP 0.9 client designed for asyncio and humans.

Python 1,299 196 Updated Dec 16, 2024

OpenFaaS - Serverless Functions Made Simple

Go 25,381 1,946 Updated Dec 9, 2024

BuildFlow is an open-source framework for building large-scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and Bui…

Python 194 7 Updated Jan 10, 2024

ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.

C++ 1,690 112 Updated Feb 3, 2025

Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale.

Rust 4,503 561 Updated Feb 2, 2025

👻 Experimental library for scraping websites using OpenAI's GPT API.

Python 1,426 86 Updated Oct 9, 2024

Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…

Python 3,439 629 Updated Feb 7, 2024

Integration layer between Requests and Selenium for automation of web actions.

Python 1,831 147 Updated Jan 18, 2025

🌀 Browse the whole web from a web page. Remote browser isolation. For compliance, integration, security, privacy and more! By https://dosyago.com

JavaScript 3,557 371 Updated Jan 30, 2025

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,883 273 Updated Dec 28, 2024

Library for reading and writing large multi-dimensional arrays.

C++ 1,377 125 Updated Feb 1, 2025

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,848 182 Updated Jan 31, 2025

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,642 356 Updated Jan 28, 2025

Mount S3 buckets into pods in Kubernetes (k8s)

Dockerfile 39 31 Updated May 6, 2024

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 2,434 161 Updated Jan 23, 2025

YouTube Full Text Search - Search all of a YouTube channel from the command line

Python 1,658 83 Updated Sep 13, 2024

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

Go 7,158 284 Updated Jan 28, 2025

A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥

Python 3,442 257 Updated Aug 11, 2024

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.

Jupyter Notebook 2,005 134 Updated Jan 13, 2025

PostgreSQL-based Task Queue for Python

Python 916 62 Updated Feb 3, 2025

The Memory layer for AI Agents

Python 24,329 2,254 Updated Feb 2, 2025

Find web directories without bruteforce

Python 1,796 256 Updated Oct 29, 2023

Upload and download files from Telegram up to 4 GiB using your account

Python 1,161 244 Updated Jun 18, 2024

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

Python 1,408 69 Updated Dec 9, 2024