- Digital Nomad
etl
Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
Lean and mean distributed stream processing system written in Rust and WebAssembly. An alternative to Kafka + Flink in one.
A database-backed work queue for Django
A JSON-like data structure (a CRDT) that can be modified concurrently by different users, and merged again automatically.
Productive, portable, and performant GPU programming in Python.
Data formats useful for API, Big Data, ML, Graph & co
AMQP 0.9 client designed for asyncio and humans.
BuildFlow is an open source framework for building large-scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and Bui…
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale.
👻 Experimental library for scraping websites using OpenAI's GPT API.
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…
Integration layer between Requests and Selenium for automation of web actions.
🌀 Browse the whole web from a web page. Remote browser isolation. For compliance, integration, security, privacy and more! By https://dosyago.com
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Library for reading and writing large multi-dimensional arrays.
A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
YouTube Full Text Search - Search all of a YouTube channel from the command line
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
A high-performance, cross-platform video processing Python framework power-packed with unique trailblazing features 🔥
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
PostgreSQL-based Task Queue for Python
Upload and download files from Telegram up to 4 GiB using your account
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.