Missing data sets in the data.git repo following the udemy cca certification course

cca-175

#1

Hello,

I had recently enrolled in the udemy course for the cca certification preparation and I realized that there
were some missing data sets in the ‘data’ repo mentioned in the course which are being used in the lecture series. For instance, I came across a Hadoop fs example using a 4GB crimes data set but I could not find any in the GitHub repo. Could some body please point to me an appropriate place for finding all of the data sets needed for a good spark practice.

Thanks