Artificial Intelligence Scientific Documentation Dataset for Recommender Systems

F. Ortega, J. Bobadilla, A. Gutiérrez, R. Hurtado, & X. Li

Keywords: Dataset, Scientific documentation, Recommender Systems, Machine learning, Data mining, Artificial Intelligence, Scopus, Topics

Existing recommender systems based on scientific documentation focus on exploiting the citation and reference information included in each research paper, together with the lists of co-authors. This makes it possible to recommend related papers and even related authors. The approach we propose is original because, instead of using each paper's citations and co-authors, we relate each paper to its main research topics. This approach works at a higher semantic level than the one currently used, which allows us to obtain useful results. We can use collaborative filtering recommender systems to recommend research topics related to each paper, and also to recommend papers related to each research topic. To realize this proposal, we have solved a series of challenges that allow us to offer various resources and results in the paper. Our main contributions are: 1) mining scientific documentation, 2) creating and publishing an open database containing the data mining results, 3) extracting the research topics from the available scientific documentation, 4) creating and publishing a recommender system dataset, obtained from the database and the research topics, 5) testing the dataset with a complete set of collaborative filtering methods and quality measures, and 6) selecting and showing the best methods and results, obtained using the open dataset, in the context of scientific documentation recommendations. The results of the paper show the suitability of the provided dataset for collaborative filtering, as well as the superiority of model-based methods for scientific documentation recommendations.
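To make the paper-to-topic idea concrete, the following is a minimal sketch of recommending research topics for a paper with collaborative filtering. It is an illustration only, not the authors' method: it uses a simple memory-based, topic-to-topic cosine similarity rather than the model-based methods the paper finds superior, and the toy data, the `paper_topics` mapping, and the function names `topic_similarity` and `recommend_topics` are all assumptions that do not come from the SD4AI dataset.

```python
from math import sqrt

# Hypothetical toy data (not taken from SD4AI): each paper maps to the
# set of research topics it is related to.
paper_topics = {
    "paper_a": {"neural networks", "computer vision"},
    "paper_b": {"neural networks", "computer vision", "robotics"},
    "paper_c": {"neural networks", "natural language processing"},
    "paper_d": {"robotics", "planning"},
}


def topic_similarity(t1, t2, data):
    """Cosine similarity between two topics over binary paper-topic vectors."""
    papers_1 = {p for p, topics in data.items() if t1 in topics}
    papers_2 = {p for p, topics in data.items() if t2 in topics}
    if not papers_1 or not papers_2:
        return 0.0
    return len(papers_1 & papers_2) / sqrt(len(papers_1) * len(papers_2))


def recommend_topics(paper, data, k=2):
    """Rank topics the paper lacks by their summed similarity to its own topics."""
    own = data[paper]
    all_topics = set().union(*data.values())
    scores = {
        candidate: sum(topic_similarity(candidate, t, data) for t in own)
        for candidate in all_topics - own
    }
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    # Keep at most k topics, and only those sharing at least one paper
    # with the target paper's topics.
    return [topic for topic, score in ranked[:k] if score > 0]


print(recommend_topics("paper_a", paper_topics))
```

The symmetric task mentioned in the abstract, recommending papers for a given research topic, could be sketched the same way by swapping the roles of papers and topics in the mapping.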

Downloadable material:

Papers SQL database

SD4AI dataset