Releases: dacort/faker-cli
v0.7.0 - Iceberg support!
With the introduction of PyIceberg, we can now create Iceberg tables.
Iceberg requires a catalog to store the table definition, so with this initial release, we support Iceberg tables in AWS Glue with the data on S3.
fake -n 10 pyint,user_name,date_this_year -f iceberg -C glue://default.iceberg_sample -o s3://YOUR_BUCEKT/iceberg/
With that command, faker-cli
will generate a 10-row dataset, upload it to S3 and create the table in the Glue Data Catalog. It assumes your AWS credentials are already setup with the necessary permissions.
v0.6.0 - Mimesis support!
Adds a new CLI option -p
that allows you to specify your provider of choice. Faker and Mimesis are currently supported.
Note that the column names will need to change to match your provider of choice and templates are only supported with Faker.
Below is an a timed example of generating 100,000 usernames with Faker vs. Mimesis.
❯ time poetry run fake -n 100000 user_name > /dev/null
11.18s user 0.20s system 98% cpu 11.570 total
vs.
❯ time poetry run fake -p mimesis -n 100000 username > /dev/null
1.58s user 0.19s system 93% cpu 1.902 total
v0.5.0
v0.4.0
v0.3.0 - DeltaLake Support!
v0.2.0 - Parquet, parquet, S3!
What's Changed
You can now write to Parquet files by specifying a combination of format (-f
) and output filename (-o
). Also, simply because we use the awesome pyarrow library for Parquet support, you can just specify a filename on S3 (-o s3://bucket/data.parquet
) and we'll write the Parquet data there. 🙌
New Contributors
Full Changelog: v0.1.1...v0.2.0
v0.1.1 - Fix default usage command
Update packages-dir
v0.1.0 - CloudFront Support
- Added CloudFront support:
fake -t cloudfront