Assignment 2
Objective:
This assignment is intended to introduce the spark-ml library. As part of this assignment, student need to train/build, test and evaluate the movie recommendation model.
Use the built-in spark ALS recommendation algorithm to recommend the user the movie titles. The main tasks involves:
- converting the raw data to RDDs that are compactable with ALS algorithm
- training the recommendation model using ALS algorithm
- evaluate the recommendation model performance
- evaluate the performance of the recomendation model by varying rank(1,2,3,4,5) and iteration(25,50,100,125,150)
Data
Please download the MovieLens ratings dataset (small-dataset) here and find the readme help here
Also try with full dataset of 20 Million ratings here