Can using Windmill to ETL/ELT data? #4000

nguyenthanhtien · 2024-06-28T09:24:30Z

nguyenthanhtien
Jun 28, 2024

I'm looking to set up a new data pipeline using Windmill, and I'd like to get some feedback from the community. What are the key advantages and considerations when using Windmill for ETL/ELT workflows compared to other popular data engineering tools like Apache Airflow or Prefect? How does Windmill's architecture and feature set align with common data integration use cases? Any tips or best practices for designing robust and scalable Windmill-based data pipelines would be greatly appreciated.

rubenfiszel · 2024-06-28T09:27:39Z

rubenfiszel
Jun 28, 2024
Maintainer

Hi @nguyenthanhtien , I recommend taking a look at: https://www.windmill.dev/docs/core_concepts/data_pipelines

We're much faster than Airflow and Prefect. However, the only meaningful way to larger scale data pipeline is to use our S3 + parquet integration which is limited to 50MB uploads on the community edition. Windmill allow to use parquet (or any other blob format) files stored in s3 as input and outputs easily and then will even be able to cache the steps based on if the etags of those have changed or not.

3 replies

nguyenthanhtien Jun 28, 2024
Author

Any limitation with the Community version? Still faster than the Airflow & Perfect tool?

nguyenthanhtien Aug 13, 2024
Author

@rubenfiszel Any limitation with the Community version? Still faster than the Airflow & Perfect tool?

rubenfiszel Aug 14, 2024
Maintainer

There are some limitation but not with performance or number of workers. However, the native s3 integration is limited to 50MB

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can using Windmill to ETL/ELT data? #4000

{{title}}

Replies: 1 comment 3 replies

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Can using Windmill to ETL/ELT data? #4000

nguyenthanhtien Jun 28, 2024

Replies: 1 comment · 3 replies

rubenfiszel Jun 28, 2024 Maintainer

nguyenthanhtien Jun 28, 2024 Author

nguyenthanhtien Aug 13, 2024 Author

rubenfiszel Aug 14, 2024 Maintainer

nguyenthanhtien
Jun 28, 2024

Replies: 1 comment 3 replies

rubenfiszel
Jun 28, 2024
Maintainer

nguyenthanhtien Jun 28, 2024
Author

nguyenthanhtien Aug 13, 2024
Author

rubenfiszel Aug 14, 2024
Maintainer