We have relevant datasets, repositories, frameworks and tools of relevance for research and technology transfer initiatives related to knowledge extraction. This section provides an overview on a selection of them and links to download or contact details.

The MdM Strategic Research Program has its own community in Zenodo for material available in this repository  as well as at the UPF e-repository  . Below a non-exhaustive list of datasets representative of the research in the Department.

As part of the promotion of the availability of resources, the creation of specific communities in Zenodo has also been promoted, at level of research communities (for instance, MIR and Educational Data Analytics) or MSc programs (for instance, the Master in Sound and Music Computing)

 

 

Back Accuosto P, Ronzano F, Ferrés D, Saggion H. Multi-level mining and visualization of scientific text collections. 6th International Workshop on mining scientific publications, Proceedings of The 6st International Workshop on Mining Scientific Publications. Joint Conference on Digital Libraries (JCDL’17)

Accuosto P, Ronzano F, Ferrés D, Saggion H. Multi-level mining and visualization of scientific text collections. 6th International Workshop on mining scientific publications, Proceedings of The 6st International Workshop on Mining Scientific Publications. Joint Conference on Digital Libraries, Toronto, Canada, June 2017 (JCDL’17).

We present a system to mine and visualize collections of scientific documents by semantically browsing information extracted from single publications or aggregated throughout corpora of articles. The text mining tool performs deep analysis of document collections allowing the extraction and interpretation of research paper’s contents. In addition to the extraction and enrichment of documents with metadata (titles, authors, affiliations, etc), the deep analysis performed comprises semantic interpretation, rhetorical analysis of sentences, triple-based information extraction, and text summarization. The visualization components allow geographicalbased exploration of collections, topic-evolution interpretation, and collaborative network analysis among others. The paper presents a case study of a bilingual collection in the field of Natural Language Processing (NLP)

Additional details: