diff --git a/README.md b/README.md index 7458327ce0344cad54043d774e1b9196101ed60a..a79a420097e26519bc3fed7aacab8198f49c4f7f 100644 --- a/README.md +++ b/README.md @@ -1,19 +1,38 @@ # Code samples for Data Stream Processing -## Install dependencies +## Preparation: Install dependencies ``` pip3 -r requirements.txt ``` -## Run the Spark application +## Word counts +Start the Spark application: +```sh +python3 wordcount.py ``` -python3 average.py + +In another terminal session, execute +```sh +touch text-data/data.txt ``` -## Feed the example data into the application +## Averages +Start the Spark application: +```sh +python3 average.py ``` + +In another terminal session, execute +```sh ./send-data.sh ``` + +## Averages (SQL) + +Start the Spark application: +```sh +python3 average-sql.py +```