Apache Spark Project

Analysis of Chicago crime data done using Apache Spark core APIs

Suggestions are welcome

1 Like

@Varun_Upadhyay1,

That was a nice work Varun. Your documentation skills with presentation is superb. This is what expecting by most of the employers.

By the way, try to incorporate ML with this data to extract interesting insights. Because without ML whenever we use Hadoop/Spark we are doing is just data cleaning, data model, data extraction & aggregations. If you’re not interested in ML, then try building data pipelines, data modeling in Hive, HBase or some other NoSQL & streaming analytics.

Thanks, I am interested in the field of data engineering and exploring how to build a data pipeline using Spark, Can you please elaborate on requirements for a data pipeline project?

1 Like

@Varun_Upadhyay1,

Please follow below links:

Another session:

Other resources be like:

1 Like

@Varun_Upadhyay1,

If you’re interested in Hadoop also, then let me know i can help you with few Data Engineering activities as well.

1 Like