To study Spark performance and application tuning, we built a Spark application with the RDD, DataFrame, Spark SQL, and Dataset APIs that answers the following questions about the San Francisco Fire Department service call dataset: How many different types of calls were made to the Fire Department? How many incidents of each call type were there? How many years of Fire Service Calls are in the data file? How many service calls were logged in the past 7 days? Which neighborhood in SF generated the most calls last year?