Abstract
Similarity-based algorithms, often referred to as memory-based collaborative filtering techniques, are one of the most successful methods in recommendation systems. When explicit ratings are available, similarity is usually defined using similarity functions, such as the Pearson correlation coefficient, cosine similarity or mean square difference. These metrics assume similarity is a symmetric criterion. Therefore, two users have equal impact on each other in recommending new items. In this paper, we introduce new weighting schemes that allow us to consider new features in finding similarities between users. These weighting schemes, first, transform symmetric similarity to asymmetric similarity by considering the number of ratings given by users on non-common items. Second, they take into account the habit effects of users are regarded on rating items by measuring the proximity of the number of repetitions for each rate on common rated items. Experiments on two datasets were implemented and compared to other similarity measures. The results show that adding weighted schemes to traditional similarity measures significantly improve the results obtained from traditional similarity measures.






Similar content being viewed by others
References
Liu H, Hu Z, Mian AU, Tian H, Zhu X (2014) A new user similarity model to improve the accuracy of collaborative filtering. Knowl-Based Syst 56:156–166
Sanchez F, Barrilero M, Uribe S, Alvarez F, Tena A, Menendez JM (2012) Social and content hybrid image recommender system for mobile social networks. Mobile Netw Appl 17(6):782– 795
Ahn HJ, Kang H, Lee J (2010) Selecting a small number of products for effective user profiling in collaborative filtering. Expert Syst Appl 37(4):3055–3062
de Sousa EPM, Traina Caetano J, Traina AJM, Wu L, Faloutsos C (2007) A fast and effective method to find correlations among attributes in databases. Data Min Knowl Disc 14(3):367–407
Pham XH, Jung JJ (2014) Recommendation system based on multilingual entity matching on linked open data. J Intell Fuzzy Syst 27(2):589–599
Jung JJ (2012) Attribute selection-based recommendation framework for short-head user group: An empirical study by MovieLens and IMDB. Expert Syst Appl 39(4):4049–4054
Rodgers JL, Nicewander WA (1988) Thirteen ways to look at the correlation coefficient. Am Stat 42(1):59–66
Adomavicius G, Tuzhilin A (2005) Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Trans Knowl Data Eng 17(6):734–749
Herlocker JL, Konstan JA, Terveen LG, Riedl J (2004) Evaluating collaborative filtering recommender systems. IEEE Trans Knowl Data Eng 22(1):5–53
Jung JJ (2013) Cross-lingual query expansion in multilingual folksonomies: a case study on Flickr. Knowl-Based Syst 42:60–67
Cai Z, Hu C, Kang Z, Liu Y (2010) An improved similarity algorithm based on hesitation degree for user-based collaborative filtering. In: Proceedings ISICA 2010, 5th international symposium on advances in computation and intelligence. Springer, pp 261–271
Bobadilla J, Serradilla F, Bernal J (2010) A new collaborative filtering metric that improves the behavior of recommender systems. Knowl-Based Syst 23(6):520–528
Choi K, Suh Y (2013) A new similarity function for selecting neighbors for each target item in collaborative filtering. Knowl-Based Syst 37:146–153
Sarwar BM, Karypis G, Konstan J, Riedl J (2002) Incremental singular value decomposition algorithms for highly scalable recommender systems. In: Proceedings of the 5th international conference on computer and information science. Springer, pp 27–28
Koren Y, Bell RM, Volinsky C (2009) Matrix factorization techniques for recommender systems. IEEE Comput 42(8):30–37
Breese JS, Heckerman D, Kadie CM (1998) Empirical analysis of predictive algorithms for collaborative filtering. In Proceedings of the 14th conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers, pp 43–52
Wang J, Pouwelse JA, Fokker J, de Vries AP, Reinders MJT (2008) Personalization on a peer-to-peer television system. Multimed Tools Appl 36(1–2):89–113
Bobadilla J, Ortega F, Hernando A, Bernal J (2012) A collaborative filtering approach to mitigate the new user cold start problem. Knowl-Based Syst 26(1–2):225–238
Jamali M, Ester M (2009) Trustwalker: a random walk model for combining trust-based and item-based recommendation. In: Proceedings ACM SIGKDD (2009), 15th international conference on knowledge discovery and data mining. ACM, pp 397–406
Massa P, Avesani P (2009) Trust metrics in recommender systems. In: Computing with social trust. Springer, pp 259–285
Papagelis M, Plexousakis D, Kutsuras T (2005) Alleviating the sparsity problem of collaborative filtering using trust inferences. In: Proceedings of trust management. Springer, Berlin Heidelberg, pp 224–239
Neal Lathia Stephen Hailes LC (2008) Trust-based collaborative filtering. In: Proceedings of trust management II. Springer, Boston, pp 119–134
Luis M, de Campos Juan M, Fernandez-Luna JFH (2010) Combining content-based and collaborative recommendations: A hybrid approach based on bayesian networks. Int J Approx Reason 51(7):785–799
Shinde SK, Kulkarni UV (2012) Hybrid personalized recommender system using centering-bunching based clustering algorithm. Expert Syst Appl 39(1):1381–1387
Maneeroj S, Takasu A (2009) Hybrid recommender system using latent features. In: Proceedings AINA 2009, 23rd international conference on advanced information networking and applications. IEEE Computer Society, pp 661–666
Candillier L, Meyer F, Fessant F (2008) Designing specific weighted similarity measures to improve collaborative filtering systems. In: Proceedings ICDM (2008), 8th industrial conference on advances in data mining, medical applications, e-commerce, marketing, and theoretical aspects. Springer, pp 242–255
Desrosiers C, Karypis G (2011) A comprehensive survey of neighborhood-based recommendation methods. In: Recommender systems handbook. Springer, pp 107–144
Ricci F, Rokach L, Shapira B (2011) Introduction to recommender systems handbook. In: Recommender systems handbook. Springer, pp 1–35
Schein AI, Popescul A, Ungar LH, Pennock DM (2002) Methods and metrics for cold-start recommendations. In: Proceedings SIGIR (2002) 25th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, pp 253–260
Barragáns-Martínez AB, Costa-Montenegro E, Burguillo-Rial JC, Rey-López M, Mikic-Fonte FA, Peleteiro-Ramallo A (2010) A hybrid content-based and item-based collaborative filtering approach to recommend tv programs enhanced with singular value decomposition. Inf Sci 180(22):4290–4311
Kannan R, Ishteva M, Park H (2014) Bounded matrix factorization for recommender system. Knowl Inf Syst 39(3):491–511
YZhang YC, Blattner M, Yu YK (2007) Heat conduction process on community networks as a recommendation model. PRL 99(15):154–301
Huang Z, Zeng DD (2011) Why does collaborative filtering work? transaction-based recommendation model validation and selection by analyzing bipartite random graphs. Informs J Comput 23(1):138–152
Liu JG, Wang BH, Guo Q (2009) Improved collaborative filtering algorithm via information transformation. Int J Mod Phys C 20(02):285–293
Liu JG, Zhou T, Xuan ZG, Che HA, Wang BH, Zhang YC (2010) Degree correlation effect of bipartite network on personalized recommendation. CoRR 21(01):137–147
Liu JG, Zhou T, Guo Q, Wang BH, Zhang YC (2009) Effect of user tastes on personalized recommendation. CoRR 20(12):1925–1932
Qiu T, Chen G, Zhang ZK, Zhou T (2011) An item-oriented recommendation algorithm on cold-start problem. Europhys Lett 95(5):580003
Liu C, Zhou WX (2012) An improved heats+probs hybrid recommendation algorithm based on heterogeneous initial resource configurations. CoRR 391(22):5704–5711
Baselga A, Jiménez-Valverde A, Niccolini G (2007) A multiple-site similarity measure independent of richness. Biol Lett 3(6):642–645
Massa P, Avesani P (2007) Trust metrics on controversial users: balancing between tyranny of the majority and echo chambers. IJSWIS 3(1):39–64
Sarwar BM, Karypis G, Konstan JA, Riedl J (2001) Item-based collaborative filtering recommendation algorithms. In: Proceedings WWW (2001) 10th international conference on World Wide Web. ACM, pp 285–295
Zhang Bin SH (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol 4(1)
Francois F, Pirotte A, Renders JM, Saerens M (2007) Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Trans Knowl Data Eng 19(3):355–369
Buckland MK, Gey FC (1994) The relationship between recall and precision. JASIS 45(1):12–19
Acknowledgments
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. NRF-2014R1A2A2A05007154).
Author information
Authors and Affiliations
Corresponding author
Additional information
These authors contributed equally to this work as the first author.
Rights and permissions
About this article
Cite this article
Pirasteh, P., Hwang, D. & Jung, J.E. Weighted Similarity Schemes for High Scalability in User-Based Collaborative Filtering. Mobile Netw Appl 20, 497–507 (2015). https://doi.org/10.1007/s11036-014-0544-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-014-0544-5