Skip to content
View ddelange's full-sized avatar
💥
["translatio", "imitatio", "aemulatio"]
💥
["translatio", "imitatio", "aemulatio"]

Block or report ddelange

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

etl

Extract-Transform-Load, Data Wrangling, Data Mining, ...
255 repositories

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,116 813 Updated Feb 1, 2025

A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning

584 190 Updated Oct 26, 2018

Translators is a library that aims to bring free, multiple, enjoyable translations to individuals and students in Python. 「翻译官」是一个旨在用Python为个人和学生带来免费、多样、愉快翻译的库。

Python 1,800 202 Updated Jan 27, 2025

Access other storage backends via the S3 API

Java 1,840 236 Updated Feb 2, 2025

Library for exploring and validating machine learning data

Python 767 175 Updated Jan 30, 2025

Python library of 60+ commonly-used validator functions

Python 127 12 Updated Dec 8, 2022

A Python library to provide functions to handle, parse and validate standard numbers.

Python 519 211 Updated Jan 11, 2025

Python Data Validation for Humans™.

Python 1,006 159 Updated Oct 7, 2024

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

Python 512 44 Updated Jan 5, 2025

🚤 Label data at scale. Fun and precision included.

Python 322 19 Updated Jan 28, 2025

Mirror of the Xapian repository. You're welcome to open pull requests on github (they'll just get merged indirectly).

C++ 815 282 Updated Jan 23, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 18,168 1,703 Updated Feb 3, 2025

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 15,702 1,320 Updated Feb 2, 2025

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,067 225 Updated Jan 30, 2025

A Python Library for Graph Outlier Detection (Anomaly Detection)

Python 1,374 130 Updated Nov 14, 2024

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Rust 1,387 134 Updated Jan 31, 2025

SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.

Python 15,636 1,596 Updated Feb 2, 2025

Small, dependency-free, fast Python package to infer binary file types checking the magic numbers signature

Python 680 112 Updated Sep 9, 2024

Truly universal encoding detector in pure Python

Python 610 52 Updated Feb 1, 2025

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 266 40 Updated Jan 22, 2025

A library of Reversible Data Transforms

Python 123 25 Updated Feb 3, 2025

Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators

Python 5,718 1,095 Updated Jul 28, 2024

An s3 datastore implementation

Go 244 67 Updated Jan 28, 2025

ExifLooter finds geolocation on all image urls and directories also integrates with OpenStreetMap

Go 431 25 Updated Jul 14, 2024

Postgresql capture data change software in Rust to allow realtime websockets

Rust 12 1 Updated Sep 24, 2024

Apache DataFusion SQL Query Engine

Rust 6,705 1,294 Updated Feb 2, 2025

Rasterio reads and writes geospatial raster datasets

Python 2,302 537 Updated Jan 27, 2025

A powerful and user-friendly binary analysis platform!

Python 7,730 1,094 Updated Feb 2, 2025

The leading workflow orchestration platform. Run stateful step functions and AI workflows on serverless, servers, or the edge.

Go 2,445 95 Updated Jan 31, 2025

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,773 1,387 Updated Jan 13, 2025