基于Spark的混合协同过滤算法改进与实现 (Improvement and Implementation of a Hybrid Collaborative Filtering Algorithm Based on Spark).
- Author(s): 王源龙1; 孙卫真1; 向勇2
- Source:
Application Research of Computers / Jisuanji Yingyong Yanjiu. Mar2019, Vol. 36 Issue 3, p855-860. 6p.
- Subject Terms:
- Additional Information
- Alternate Title:
New improvement and implementation of hybrid collaborative filtering algorithm based on Spark platform.
- Abstract:
To address the sparsity, scalability, and personalization problems of collaborative filtering, this paper applies algorithm integration to optimize and improve a hybrid collaborative filtering algorithm on the Spark platform. Following the Stacking integration model, it combines multiple weak recommender units by linear weighting into one comprehensive recommender. First, the algorithm optimizes nearest-neighbor collaborative filtering by presorting and by adjusting the similarity calculation strategy with popularity and praise degree, which improves both the rationality and the computational complexity of the similarity calculation and alleviates rating sparsity to some extent. Second, the algorithm is tightly integrated with a distributed computing platform: it uses Spark Streaming and a distributed storage structure to design and implement an incremental iterative model of the recommendation algorithm, which addresses the poor scalability and real-time performance of collaborative filtering. The experiments use the UCI public data set MovieLens and Netflix movie rating data. The results show that the improved algorithm performs well and improves on previous algorithms in recommendation personalization, accuracy, and scalability. It provides a feasible algorithm integration scheme for recommender system applications. [ABSTRACT FROM AUTHOR]
- Abstract:
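To address the sparsity, scalability, and personalization problems of traditional collaborative filtering in the recommendation process, this paper introduces the idea of algorithm integration to optimize and improve a new hybrid collaborative filtering algorithm on the Spark platform. Drawing on the Stacking ensemble learning approach, multiple weak recommenders are combined by linear weighting into a strong comprehensive recommender. The algorithm builds on nearest-neighbor collaborative filtering and optimizes the neighbor similarity calculation strategy using classification, popularity, and praise degree, aiming to improve both the rationality and the computational complexity of the similarity calculation; this alleviates the rating sparsity problem to some extent. The algorithm also builds on the Spark distributed computing platform, drawing on the strengths of distributed platforms and using its stream processing and distributed storage structure to design and implement an incremental iterative model of the recommendation algorithm, which addresses the scalability and real-time problems of collaborative filtering. The experimental data are the UCI public data set MovieLens and Netflix movie rating data. The experimental results show that the improved algorithm performs well in recommendation personalization, accuracy, and scalability, improving to varying degrees on previous algorithms of the same type, and provides a feasible algorithm integration scheme for recommender system applications. [ABSTRACT FROM AUTHOR]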
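The abstract describes combining several weak recommenders by linear weighting into one comprehensive recommender. The record gives no formulas or code, so the following is only a minimal sketch of that one idea in plain Python, not the authors' implementation. The names `blend`, `global_mean`, and `item_mean`, the sample ratings, and the hand-picked weights are all hypothetical; in a Stacking setup the weights would normally be learned by a meta-model rather than fixed by hand.

```python
from typing import Callable, Sequence

# A "weak recommender" here is any function (user, item) -> predicted rating.
Recommender = Callable[[str, str], float]


def blend(recommenders: Sequence[Recommender],
          weights: Sequence[float]) -> Recommender:
    """Combine weak recommenders into one by a linearly weighted sum.

    Weights are normalized to sum to 1 so the blended prediction stays
    on the same rating scale as the inputs.
    """
    if len(recommenders) != len(weights):
        raise ValueError("need exactly one weight per recommender")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    normalized = [w / total for w in weights]

    def predict(user: str, item: str) -> float:
        return sum(w * r(user, item)
                   for w, r in zip(normalized, recommenders))

    return predict


# Two hypothetical weak recommenders, for illustration only.
def global_mean(user: str, item: str) -> float:
    # Always predicts an assumed overall average rating.
    return 3.5


def item_mean(user: str, item: str) -> float:
    # Predicts an assumed per-item average, falling back to the global mean.
    return {"m1": 4.0, "m2": 2.0}.get(item, 3.5)


# Assumed weights 1:3, i.e. 0.25 and 0.75 after normalization.
combined = blend([global_mean, item_mean], weights=[1.0, 3.0])
print(combined("u1", "m1"))  # 0.25 * 3.5 + 0.75 * 4.0 = 3.875
```

Normalizing the weights is a design choice of this sketch; it keeps the blended score within the range spanned by the individual predictions.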