Towards the automatic merging of language resources

  • Authors
  • Necsulescu, Silvia; Bel Rafecas, Núria; Padró, Muntsa; Marimon, Montserrat; Revilla, Eva
  • UPF authors
  • BEL RAFECAS, NÚRIA; MARIMON FELIPE, MONTSERRAT
  • Authors of the book
  • Sagot, Benoît
  • Book title
  • Proceedings of first international Workshop on Lexical Resources
  • Publisher
  • WoLeR
  • Publication year
  • 2011
  • Pages
  • 71-78
  • Abstract
  • Language Resources are a critical component for Natural Language Processing applications. Over the years, many resources were manually created for the same task, but with different granularity and coverage. To create richer resources for a broad range of potential reuses, information from all resources has to be joined into one. The high cost of comparing and merging different resources by hand has been a bottleneck for merging existing resources. With the objective of reducing human intervention, we present a new method for automating the merging of resources. We have addressed the merging of two verb subcategorization frame (SCF) lexica for Spanish. The results achieved, a new lexicon with enriched information and with conflicting information signalled, reinforce our idea that this approach can be applied to other NLP tasks.
  • Complete citation
  • Necsulescu, Silvia; Bel Rafecas, Núria; Padró, Muntsa; Marimon, Montserrat; Revilla, Eva. Towards the automatic merging of language resources. In: Sagot, Benoît. Proceedings of first international Workshop on Lexical Resources. 1 ed. Ljubljana: WoLeR; 2011. p. 71-78.