The netflixdata.mat contains training and test data for 20000 users and 5000 movies.

data_train -- sparse matrix that contains user ratings from 1 to 5 for 20000 users and 5000 movies. Users and movies were randomly chosen from the full Netflix dataset.

data_test -- sparse matrix of the same form.

There are some users that have no ratings and some movies that have no ratings. You will need to figure out how to deal with those.