TY - JOUR
T1 - Artificial Intelligence Scientific Documentation Dataset for Recommender Systems
AU - Ortega, Fernando
AU - Bobadilla, Jesus
AU - Gutierrez, Abraham
AU - Hurtado, Remigio
AU - Li, Xin
PY - 2018/8/28
Y1 - 2018/8/28
N2 - The existing scientific documentation-based recommender systems focus on exploiting the citations and references information included in each research paper and also the lists of co-authors. In this way, it can be addressed the recommendation of related papers and even related authors. The approach we propose is original because instead of using each paper citations and co-authors, we relate each of the papers with their main research topics. This approach provides a semantic level superior to that currently used, which allows us to obtain useful results. We can use collaborative filtering recommender systems to recommend research topics related to each paper and also to recommend papers related to each research topic. In order to face this innovative proposal, we have solved a series of challenges that allow us to offer various resources and results in the paper. Our main contributions are: 1) making a data mining of scientific documentation; 2) creating and publishing an open database containing the data mining results; 3) extracting the research topics from the available scientific documentation; 4) creating and publishing a recommender system data set obtained from the database and the research topics; 5) testing the data set through a complete set of collaborative filtering methods and quality measures; and 6) selecting and showing the best methods and results, obtained using the open data set, in the context of scientific documentation recommendations. Results of the paper show the suitability of the provided data set in collaborative filtering processes, as well as the superiority of the model-based methods to face scientific documentation recommendations.
AB - The existing scientific documentation-based recommender systems focus on exploiting the citations and references information included in each research paper and also the lists of co-authors. In this way, it can be addressed the recommendation of related papers and even related authors. The approach we propose is original because instead of using each paper citations and co-authors, we relate each of the papers with their main research topics. This approach provides a semantic level superior to that currently used, which allows us to obtain useful results. We can use collaborative filtering recommender systems to recommend research topics related to each paper and also to recommend papers related to each research topic. In order to face this innovative proposal, we have solved a series of challenges that allow us to offer various resources and results in the paper. Our main contributions are: 1) making a data mining of scientific documentation; 2) creating and publishing an open database containing the data mining results; 3) extracting the research topics from the available scientific documentation; 4) creating and publishing a recommender system data set obtained from the database and the research topics; 5) testing the data set through a complete set of collaborative filtering methods and quality measures; and 6) selecting and showing the best methods and results, obtained using the open data set, in the context of scientific documentation recommendations. Results of the paper show the suitability of the provided data set in collaborative filtering processes, as well as the superiority of the model-based methods to face scientific documentation recommendations.
KW - Dataset
KW - Scopus
KW - artificial intelligence
KW - data mining
KW - machine learning
KW - recommender systems
KW - scientific documentation
KW - topics
UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85052641362&origin=inward
UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85052641362&origin=inward
UR - http://www.mendeley.com/research/artificial-intelligence-scientific-documentation-dataset-recommender-systems
U2 - 10.1109/ACCESS.2018.2867731
DO - 10.1109/ACCESS.2018.2867731
M3 - Article
VL - 6
SP - 48543
EP - 48555
JO - IEEE Access
JF - IEEE Access
M1 - 8449912
ER -