The netflixdata.mat
contains training and test data for 20000 users and
5000 movies.
data_train
-- sparse matrix that contains user ratings from 1 to 5 for
20000 users and 5000 movies. Users and movies were randomly chosen from
the full Netflix dataset.
data_test -- sparse matrix of the same form.
There are some users that have no ratings and some movies that have no
ratings. You will need to figure out how to deal with those.