A collection of sample Python scripts using Apache Beam.
- Install Python with Anaconda
- Create and activate a new Conda 2.7 environment (Apache Beam only supports Python 2.7)
- Clone the Github repo and cd into the directory
- Install the required Python libraries:
pip install -r requirements.txt
- Run the script of your choice!
Run the unit tests in the root directory:
python -m unittest discover -s tests -t tests
Want to be sure I'm following the Python style guide? Run this command:
flake8 scripts
hello_beam.py
: creates a file from another file.wordcount_minimal.py
: a minimalist word-counting workflow that counts words in a file.streaming_wordcount.py
: a streaming word-counting workflow.