We have relevant datasets, repositories, frameworks and tools of relevance for research and technology transfer initiatives related to knowledge extraction. This section provides an overview on a selection of them and links to download or contact details.

The MdM Strategic Research Program has its own community in Zenodo for material available in this repository  as well as at the UPF e-repository  . Below a non-exhaustive list of datasets representative of the research in the Department.

As part of the promotion of the availability of resources, the creation of specific communities in Zenodo has also been promoted, at level of research communities (for instance, MIR and Educational Data Analytics) or MSc programs (for instance, the Master in Sound and Music Computing)

 

 

Back SUMMA - Text Summarization Toolkit

SUMMA is a text summarization toolkit which follows the architectural precepts of the GATE framework therefore providing much needed functionality in the form of language and processing resources for composition of practical summarization applications.

SUMMA main features

  • Resources for statistical text analysis
  • Resources for features computation
  • Resources for customization of summaries
  • Resources for exporting results
  • Single-document summarization
  • Multi-document summarization
  • Multilingual processing
  • Ready-made summarizers and baselines for research comparison
  • Easy to install and use
  • Easy to extend
  • Easy to customize