We have relevant datasets, repositories, frameworks and tools of relevance for research and technology transfer initiatives related to knowledge extraction. This section provides an overview on a selection of them and links to download or contact details.

The MdM Strategic Research Program has its own community in Zenodo for material available in this repository  as well as at the UPF e-repository  . Below a non-exhaustive list of datasets representative of the research in the Department.

As part of the promotion of the availability of resources, the creation of specific communities in Zenodo has also been promoted, at level of research communities (for instance, MIR and Educational Data Analytics) or MSc programs (for instance, the Master in Sound and Music Computing)

 

 

Back AbuRa’ed A, Chiruzzo L, Saggion H. Experiments in detection of implicit citations. OSP 2018: 7th International Workshop on Mining Scientific Publications

 

AbuRa’ed A, Chiruzzo L, Saggion H. Experiments in detection of implicit citations. OSP 2018: 7th International Workshop on Mining Scientific Publications

The identification of explicit and implicit citations to a given reference paper is important for numerous scientific text mining activities such as citation purpose identification, scientific opinion mining, and scientific summarization. This paper presents experiments on the identification of implicit citations in scientific papers. As in previous work, and relying on an annotated dataset of explicit and implicit citation sentences, we cast the problem as classification, evaluating several machine learning algorithms trained on a set of task-motivated features. We compare our work with the state of the art on the annotated dataset obtaining improved performance. We also annotate a new dataset which we make publicly available to validate our approach. The results on the new dataset confirm our set of features outperforms previously published research

OA version at UPF e-repository: http://hdl.handle.net/10230/35180