Praat on the Web

This system is an implementation of the widely used speech analysis tool Praat as a web application with extended functionality for feature annotation. In particular, Praat on the Web addresses some of the central limitations of the original Praat tool and provides (i) enhanced visualization of annotations in a dedicated window for feature annotation at interval and point segments, (ii) dynamic scripting composition, exemplified with a modular prosody tagger, and (iii) portability and an operational web interface. Speech annotation tools with such functionality are key for exploring large corpora and designing modular pipelines.

If you use this software, please cite:

Domínguez M, Latorre I, Farrús M, Codina-Filbà J, Wanner L. Praat on the Web: An Upgrade of Praat for Semi-Automatic Speech Annotation. 26th International Conference on Computational Linguistics (COLING 2016), Osaka, Japan, pp. 218-222.

Praat on the Web audio annotation tool

Text Simplification for English

This site is a Text Simplification demo for English texts (up to 384 characters). The YATS simplifier was designed for people with intellectual disabilities. The automatic Text Simplification process aims to aid reading by reducing the lexical and syntactic complexity of sentences. The text simplifier detects complex words and sentences and generates simpler vocabulary and shorter sentences. This technology has been investigated and developed by the TALN (Natural Language Processing) group of Universitat Pompeu Fabra (UPF).
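The two steps described above (replacing complex words and shortening long sentences) can be illustrated with a minimal Python sketch. This is not the YATS implementation: the toy lexicon `SIMPLER`, the `MAX_WORDS` threshold, and the rule of splitting at ", and" / ", but" are all assumptions made for illustration only.

```python
import re

# Hypothetical toy lexicon: complex word -> simpler synonym (not YATS data).
SIMPLER = {
    "utilize": "use",
    "purchase": "buy",
    "commence": "start",
}

MAX_WORDS = 12  # assumed threshold above which a sentence counts as "long"


def simplify_words(sentence: str) -> str:
    """Replace each word found in the toy lexicon with its simpler synonym."""
    def swap(match: re.Match) -> str:
        word = match.group(0)
        simple = SIMPLER.get(word.lower())
        if simple is None:
            return word
        # Preserve the capitalization of the original word.
        return simple.capitalize() if word[0].isupper() else simple
    return re.sub(r"[A-Za-z]+", swap, sentence)


def split_long(sentence: str) -> list[str]:
    """Split a long sentence in two at its first ', and ' or ', but '."""
    if len(sentence.split()) <= MAX_WORDS:
        return [sentence]
    parts = re.split(r",\s+(?:and|but)\s+", sentence, maxsplit=1)
    if len(parts) == 1 or not parts[1]:
        return [sentence]
    first = parts[0].rstrip(".") + "."
    second = parts[1][0].upper() + parts[1][1:]
    return [first, second]


def simplify(text: str) -> list[str]:
    """Apply lexical substitution, then sentence splitting, to each sentence."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    result = []
    for s in sentences:
        if s:
            result.extend(split_long(simplify_words(s)))
    return result


if __name__ == "__main__":
    text = ("We will utilize the new machine that arrived on Monday morning, "
            "and the team will commence training next week.")
    for line in simplify(text):
        print(line)
```

A real simplifier would rely on word frequency resources and syntactic parsing rather than a fixed word list and a single punctuation rule.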

Text Simplification technologies

Text simplification is the process of transforming a text into an equivalent that is more understandable for a target population. Simplified texts are appropriate for many groups of readers, such as language learners, elderly people, and people with other special reading and comprehension needs. The TALN-UPF group develops robust natural language processing technology to produce simplified versions of documents at both the syntactic and lexical levels.


Daniel Ferrés, Montserrat Marimon, Horacio Saggion, Ahmed AbuRa’ed
YATS: Yet Another Text Simplifier.
NLDB (2016).

Daniel Ferrés, Montserrat Marimon, Horacio Saggion
A Web-based Text Simplification System for English.
Procesamiento del Lenguaje Natural 55: 191-194 (2015).



Demo Page
Demo Page for text in Spanish

Online demo of the patent summarization system

The system performs a linguistic analysis of the patent document, selects the most important passages, and combines them into an abstractive summary.

The demo is the result of several European projects and our group's experience in patent processing.

To use the demo, please contact joan dot codina at upf dot edu to request a username and password.

TOPAS Engine demo

Online Patent Summarization Demo


A very easy-to-use web e-mail client adapted for people with IDD, so that they can use it at work.

1. Use of the ABLE accessibility layer: Text2Picto, Text2Speech and Simplext.
2. Very easy interface.
3. Few functionalities: New message, Sent, Inbox and Contacts.
4. Accessible for all kinds of disability.


Demo page (use the unsafe mode when connecting to Google)


Online tool for PDF-to-XML conversion

Free online tool for parsing PDF files. Our approach is based on the PDFdigest tool, a PDF textual content extraction system specially designed to extract the headings and logical structure of scientific articles (title, authors, abstract and so on) as well as their textual content. The result is provided in an XML file. Furthermore, PDFdigest also provides a structured HTML file as a clone of the original PDF file.

In addition, the pre-processing step implemented in DrInventor is applied to this XML file in order to mark token and sentence boundaries. As a result, we also provide an additional GATE document.

Demo Page

OpenMinted Summarization Services

In the current context of scientific information overload, in which new knowledge is created at a rapid pace, we propose to develop text summarization services that automatically identify the most important information in a research article. The work will be based on an adaptation of the scientific text mining and summarization technology developed at our LaSTUS/TALN lab. The summarization system will apply a natural language processing pipeline for deep analysis of scientific documents and compute a series of sentence relevance features based on the results of the text analysis.
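As a rough illustration of what sentence relevance features can look like, the following Python sketch scores sentences with two simple features, average term frequency and position in the document, and keeps the top-ranked ones. The feature set, the equal 0.5 weights, the stopword list, and the function names are assumptions chosen for this example; they do not describe the actual LaSTUS/TALN pipeline.

```python
import re
from collections import Counter

# Small assumed stopword list, for illustration only.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "to", "is", "we", "for", "on", "this"}


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens, minus stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def sentence_features(sentences):
    """Compute two toy relevance features for each sentence."""
    doc_freq = Counter(w for s in sentences for w in tokenize(s))
    features = []
    for i, s in enumerate(sentences):
        words = tokenize(s)
        # Average document-level frequency of the sentence's content words.
        tf = sum(doc_freq[w] for w in words) / len(words) if words else 0.0
        # Earlier sentences get a higher position score (1.0 for the first).
        position = 1.0 - i / len(sentences)
        features.append({"tf": tf, "position": position})
    return features


def top_sentences(sentences, k=1):
    """Return the k highest-scoring sentences in their original order."""
    feats = sentence_features(sentences)
    max_tf = max((f["tf"] for f in feats), default=0.0) or 1.0
    # Assumed equal weighting of the normalized features.
    scores = [0.5 * f["tf"] / max_tf + 0.5 * f["position"] for f in feats]
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]


if __name__ == "__main__":
    doc = [
        "Summarization selects important sentences.",
        "The weather was nice.",
        "Important sentences carry summarization content.",
    ]
    print(top_sentences(doc, k=1))
```

In a fuller system, features like these would typically be combined with richer linguistic analysis and the weights would be tuned or learned rather than fixed by hand.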

Demo Page


OpenMinted Babelnet extractor


UIMA component that detects BabelNet terms in text. It can be used with any of the 271 languages available in BabelNet 3.7.


Code on Github

Demo Page

Several applications from the computational analysis of language

This video presents the research carried out by the TALN group, Tractament Automàtic del Llenguatge Natural (Natural Language Processing), at Pompeu Fabra University in Barcelona.

Patent claims

Summarization and paraphrasing of patent claims.


Summa toolkit

This video shows how to create summarization applications with the SUMMA toolkit.

SUMMA is a set of language and processing resources implemented using the GATE NLP framework.