english | espaņol

Publications

Research Papers

  • pdf BEL, N.; MARIMON, M.; ESPEJA, S.; SEGHEZZI, N. "The Spanish Resource Grammar: Pre-processing Strategy and Lexical Acquisition"
    • This paper describes work on the development of an open-source HPSG grammar for Spanish implemented within the LKB system.

      Following a brief description of the main features of the grammar, we present our approach for pre-processing and ongoing research on automatic lexical acquisition
  • pdf BEL, N.; ESPEJA, S.; MARIMON, M. "Automatic Acquisition of Grammatical Types for Nouns"
    • The work we present here is concerned with the acquisition of deep grammatical information for nouns in Spanish.

      The aim is to build a learner that can handle noise, but, more interestingly, that is able to overcome the problem of sparse data, especially important in the case of nouns. We have based our work on two main points.

      Firstly, we have used distributional evidences as features. Secondly, we made the learner deal with all occurrences of a word as a single complex unit. The obtained results show that grammatical features of nouns is a level of generalization that can be successfully approached with a Decision Tree learner.
  • pdf BEL, N.; MARIMON, M.; ESPEJA, S. "New tools for the encoding of lexical data extracted from corpus"
    • This paper describes the methodology and tools that are the basis of our platform AAILE.

      AAILE has been built for supplying those working in the construction of lexicons for syntactic parsing with more efficient ways of visualizing and analyzing data extracted from corpus. The platform offers support using techniques such as similarity measures, clustering and pattern classification.