Making it Simplext: Implementation and Evaluation of a Text Simplification System for Spanish

  • Authors
  • Saggion H, Štajner S, Bott S, Mille S, Rello L, Drndarevic B
  • UPF authors
  • RELLO, LUZ; SAGGION, HORACIO; MILLE, SIMON
  • Type
  • Scholarly articles
  • Journal title
  • ACM Transactions on Accessible Computing
  • Publication year
  • 2015
  • Volume
  • 6
  • Number
  • 4
  • Pages
  • 1-36
  • ISSN
  • 1936-7228
  • Publication status
  • Published
  • Abstract
  • The way in which a text is written can be a barrier for many people. Automatic text simplification is a natural language processing technology that, when mature, could be used to produce texts that are adapted to the specific needs of particular users. Most research in the area of automatic text simplification has dealt with the English language. In this article, we present results from the Simplext project, which is dedicated to automatic text simplification for Spanish. We present a modular system with dedicated procedures for syntactic and lexical simplification that are grounded on the analysis of a corpus manually simplified for people with special needs. We carried out an automatic evaluation of the system's output, taking into account the interaction between three different modules dedicated to different simplification aspects. One evaluation is based on readability metrics for Spanish and shows that the system is able to reduce the lexical and syntactic complexity of the texts. We also show, by means of a human evaluation, that sentence meaning is preserved in most cases. Our results, even if our work represents the first automatic text simplification system for Spanish that addresses different linguistic aspects, are comparable to the state of the art in English Automatic Text Simplification.
  • Complete citation
  • Saggion H, Štajner S, Bott S, Mille S, Rello L, Drndarevic B. Making it Simplext: Implementation and Evaluation of a Text Simplification System for Spanish. ACM Transactions on Accessible Computing 2015; 6(4): 1-36.
Bibliometric indicators
  • Cited 71 times (Scopus)
  • Scimago index: 0.439 (2015)