- Build the code with this command:
mvn package
- Get your keys from Twitter
- Fill in the twitter4j.properties file with your Twitter keys
- Go to your Spark directory
- Place the twitter4j.properties file in your working directory
- Run the program with this command:
./bin/spark-submit --class "arman.spark.twitterTopHashtags.TwitterTopHashtags" --master MASTER JAR_FILE REFRESH_TIME CHECKPOINT_DIR OUTPUT_DIR SORT_OUTPUT RUNNING_TIME
- MASTER: Master setting, as described in the Spark configuration documentation
- JAR_FILE: Jar file
- REFRESH_TIME: Refresh interval (seconds)
- CHECKPOINT_DIR: Checkpoint directory
- OUTPUT_DIR: Output directory
- SORT_OUTPUT: Sort output by hashtag count (true|false)
- RUNNING_TIME: Time to run the program (seconds)
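
As a rough illustration of the steps above, the sketch below writes a placeholder twitter4j.properties file and prints a filled-in spark-submit command. Every value in it is an assumption chosen for the example: the `local[2]` master, the JAR path, the directories, and the timings are hypothetical and should be replaced with your own. The four `oauth.*` keys are the standard twitter4j property names; the key values shown are placeholders, not real credentials.

```shell
#!/bin/sh
# Sketch only: all values below are hypothetical placeholders.
set -eu

# Write twitter4j.properties into the current working directory.
# Replace the placeholder values with the keys you got from Twitter.
cat > twitter4j.properties <<'EOF'
oauth.consumerKey=YOUR_CONSUMER_KEY
oauth.consumerSecret=YOUR_CONSUMER_SECRET
oauth.accessToken=YOUR_ACCESS_TOKEN
oauth.accessTokenSecret=YOUR_ACCESS_TOKEN_SECRET
EOF

MASTER="local[2]"                        # assumed: local mode with 2 threads
JAR_FILE="/path/to/your-built-jar.jar"   # assumed: JAR produced by mvn package
REFRESH_TIME=10                          # refresh interval in seconds
CHECKPOINT_DIR="/tmp/hashtags-checkpoint"
OUTPUT_DIR="/tmp/hashtags-output"
SORT_OUTPUT=true                         # sort output by hashtag count
RUNNING_TIME=300                         # total running time in seconds

# Print the assembled command for review instead of running it.
echo ./bin/spark-submit \
  --class "arman.spark.twitterTopHashtags.TwitterTopHashtags" \
  --master "$MASTER" "$JAR_FILE" "$REFRESH_TIME" \
  "$CHECKPOINT_DIR" "$OUTPUT_DIR" "$SORT_OUTPUT" "$RUNNING_TIME"
```

Once the printed command looks right, remove the leading `echo` and run the script from your Spark directory to submit the job.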