All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Added: Code formatting using Black
- Added: Added PyPi Trusted Publisher with a dedicated Github action workflow (publish_pypi) in dedicated Github environment
- Changed: Minimal Python version: 3.11
- Changed: Minimal dependency versions: Scrapy 2.11, fastavro 1.9, fastparquet 2024.02, pandas 2.2, pyorc 0.9
- Added: Migrate to Pandas concat instead of append due to deprecation #4
- Added: Add basic Github Actions workflow #5
- Added: Add support for pyproject.toml #6
- Added: Add a proper changelog based on https://keepachangelog.com/
- Added: Support for Parquet reader to define a custom schema as an alternative to inferring it
- Added: Initial version with support for parquet, orc and avro export