Automatic selection of HPSG-parsed sentences for treebank construction

  • Authors
  • Marimon, Montserrat; Bel Rafecas, Núria; Padró, Lluís
  • UPF authors
  • BEL RAFECAS, NÚRIA; MARIMON FELIPE, MONTSERRAT;
  • Type
  • Scholarly articles
  • Journal títle
  • Computational linguistics
  • Publication year
  • 2014
  • Volume
  • 40
  • Number
  • 3
  • Pages
  • 523-531
  • ISSN
  • 0891-2017
  • Publication State
  • Published
  • Abstract
  • This article presents an ensemble parse approach to detecting and selecting high-quality linguistic analyses output by a hand-crafted HPSG grammar of Spanish implemented in the LKB system. The approach uses full agreement (i.e., exact syntactic match) along with a MaxEnt parse selection model and a statistical dependency parser trained on the same data. The ultimate goal is to develop a hybrid corpus annotation methodology that combines fully automatic annotation and manual parse selection, in order to make the annotation task more efficient while maintaining high accuracy and the high degree of consistency necessary for any foreseen uses of a treebank.
  • Complete citation
  • Marimon, Montserrat; Bel Rafecas, Núria; Padró, Lluís. Automatic selection of HPSG-parsed sentences for treebank construction. Computational linguistics 2014; 40(3): 523-531.
Bibliometric indicators
  • 4 times cited Scopus
  • 4 times cited WOS
  • Índex Scimago de 0.764 (2014)