Javascript must be enabled to continue!
User-based Collaborative Filtering: Sparsity and Performance
View through CrossRef
It is generally assumed that all users in a dataset are equally adversely affected by data sparsity and hence addressing this problem should result in improved performance. However, although all users may be members of a sparse dataset, they do not all suffer equally from the data sparsity problem. This indicates that there is some ambiguity as to which users should be identified as suffering from data sparsity, referred to as sparse users throughout this paper, and targeted with new recommendation improvement strategies. This paper defines sparsity in terms of number of item ratings and average similarity with nearest neighbours and then goes on to look at the impact of sparsity so defined on performance. Counterintuitively, it is found that in top-N recommendations sparse users actually perform better than some other categories of users when a standard approach is used. These results are explained, and empirically verified, in terms of a bias towards users with a low number of ratings. The link between sparsity and performance is also considered in the case of predictions rather than top-N recommendations. This work provides the motivation for targeting improvement approaches towards distinct groups of users as opposed to the entire dataset.
Title: User-based Collaborative Filtering: Sparsity and Performance
Description:
It is generally assumed that all users in a dataset are equally adversely affected by data sparsity and hence addressing this problem should result in improved performance.
However, although all users may be members of a sparse dataset, they do not all suffer equally from the data sparsity problem.
This indicates that there is some ambiguity as to which users should be identified as suffering from data sparsity, referred to as sparse users throughout this paper, and targeted with new recommendation improvement strategies.
This paper defines sparsity in terms of number of item ratings and average similarity with nearest neighbours and then goes on to look at the impact of sparsity so defined on performance.
Counterintuitively, it is found that in top-N recommendations sparse users actually perform better than some other categories of users when a standard approach is used.
These results are explained, and empirically verified, in terms of a bias towards users with a low number of ratings.
The link between sparsity and performance is also considered in the case of predictions rather than top-N recommendations.
This work provides the motivation for targeting improvement approaches towards distinct groups of users as opposed to the entire dataset.
Related Results
EVALUATION OF HYBRID MOVIE RECOMMENDATION SYSTEM BASED ON NEURAL NETWORKS
EVALUATION OF HYBRID MOVIE RECOMMENDATION SYSTEM BASED ON NEURAL NETWORKS
Abstract: Recommendation systems are becoming increasingly important with the growth of streaming platforms. The purpose of this study is to compare the performance of Content-Base...
An adaptive spatiotemporal filtering method for GNSS coordinate time series in CMONOC
An adaptive spatiotemporal filtering method for GNSS coordinate time series in CMONOC
Abstract
Common mode errors (CMEs) are a persistent challenge in regional GNSS coordinate time series, becoming more difficult to extract as distance increases. Thi...
Improvised Collaborative Filtering for Recommendation System
Improvised Collaborative Filtering for Recommendation System
Collaborative filtering (CF) is one of the most important techniques of recommendation system and has been utilized by many e-commerce businesses to provide recommendation to its u...
Filtering forbidden content
Filtering forbidden content
The relevance of this study lies in the need to filter content with high accuracy due to the creation of optimal variations of neural network architectures. The solutions available...
A Collaborative Filtering Recommendation Model Based on Fusion of Correlation-Weighted and Item Optimal-Weighted
A Collaborative Filtering Recommendation Model Based on Fusion of Correlation-Weighted and Item Optimal-Weighted
Traditional collaborative filtering algorithm has a shortcoming—it assigns all items with equal importance, which can result in excessive frequency in recommending hot it...
Divide and conquer method for sparsity estimation within compressed sensing framework
Divide and conquer method for sparsity estimation within compressed sensing framework
A novel method for sparsity estimation by means of the divide and conquer method is presented. Also, the underestimation and overestimation criteria for signal sparsity is proposed...
PocketAID: The Pocket Atlas of Infectious Diseases Mobile Application
PocketAID: The Pocket Atlas of Infectious Diseases Mobile Application
ObjectiveThe Pocket Atlas of Infectious Diseases (PocketAID) mobile application developed at Pacific Northwest National Laboratory (PNNL) provides infectious disease education and ...
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
BACKGROUND
Mental health has become one of the most urgent global health issues of the twenty-first century. The World Health Organization (WHO) reports tha...

