The initial distribution of feature values affects the results of matrix factorization (SVD) algorithm (this implementation). In this post, let's have a look at performance of SVD algorithm with different distributions of initial values. To conduct experiments, I used Lenskit framework and MovieLens100K dataset. The experiments includes three distributions:
- Fixed values (0.1) (Fixed)
- Random values (Random)
- Popularity distribution for item features and random for user features (POP)