Espinosa-Anke L, Camacho-Collados J, Delli Bovi C, Saggion H. Supervised Distributional Hypernym Discovery via Domain Adaptation. 2016 Conference on Empirical Methods on Natural Language Processing (EMNLP 2016)
We develop a large number of software tools and hosting infrastructures to support the research developed at the Department. We will be detailing in this section the different tools available. You can take a look for the moment at the offer available within the UPF Knowledge Portal, the innovations created in the context of EU projects in the Innovation Radar and the software sections of some of our research groups:
Artificial Intelligence |
Nonlinear Time Series Analysis |
Web Research |
Music Technology |
Interactive Technologies |
Barcelona MedTech |
Natural Language Processing |
Nonlinear Time Series Analysis |
UbicaLab |
Wireless Networking |
Educational Technologies |
Espinosa-Anke L, Camacho-Collados J, Delli Bovi C, Saggion H. Supervised Distributional Hypernym Discovery via Domain Adaptation. 2016 Conference on Empirical Methods on Natural Language Processing (EMNLP 2016)
Espinosa-Anke L, Camacho-Collados J, Delli Bovi C, Saggion H. Supervised Distributional Hypernym Discovery via Domain Adaptation. 2016 Conference on Empirical Methods on Natural Language Processing (EMNLP 2016)
Lexical taxonomies are graph-like hierarchical structures that provide a formal representation of knowledge. Most knowledge graphs to date rely on is-a (hypernymic) relations as the backbone of their semantic structure. In this paper, we propose a supervised distributional framework for hypernym discovery which operates at the sense level, enabling large-scale automatic acquisition of disambiguated taxonomies. By exploiting semantic regularities between hyponyms and hypernyms in embeddings spaces, and integrating a domain clustering algorithm, our model becomes sensitive to the target data. We evaluate several configurations of our approach, training with information derived from a manually created knowledge base, along with hypernymic relations obtained from Open Information Extraction systems. The integration of both sources of knowledge yields the best overall results according to both automatic and manual evaluation on ten different domains.
Additional material
In this link the following information is available:
- Training Data: Wikidata, KB-Unify
- Nasari Domain Labels
- SensEmbed Sense Vectors
- Python API