- 1,000 most active users
- 9 attributes (userID, title)
- CF --> Explicit Matrix Factorization (MF)
- Content-based --> TF-IDF + Cosine similarity
- Recall (hit rate) --> positive instances that are correctly predicted
- CTR (Click- through rate) --> recommendations clicked over total rec
- ARHR (average reciprocal hit rate)
- MSE
- Motivation
- overview of existing news recommender algorithms and approaches
- evaluation results
- conclusion
- train test split --> scikit-learn
- Implement a Collaborative filtering RS
- Implement a Content-based RS
- Merge them and build a Hybrid RS
- Evaluation Recall, ARHR)
Implement the article from https://medium.com/@bindhubalu/content-based-recommender-system-4db1b3de03e7 or from https://towardsdatascience.com/content-based-recommender-systems-28a1dbd858f5
-
Only work with single user file:
-
TF*IDF for keywords compared to the keywords of all their articles
-
Build matrix from the article
doc0 doc1 doc2 doc3 w0 tfidf00 w1 ... w2 tfidf23
-
Compute cosine similarity between items
-
Recommend from the article