Big Data

Analyzing 400k+ Tweets Using PySpark

Implementation of Spark context, Spark SQL context on Amazon Tweets data set with 400k Tweets. Analyzed the tweets on the busiest day to find the words that were repeated the most in the selected tweets.