Three different dependency annotations based on the AnCora corpus are presented here:
AnCora Surface Syntax Dependencies
This annotation represents a new resource available to the research community which offers a surface syntax-oriented annotation of AnCora dependencies. The annotation was done by an automatic conversion from AnCora constituents into the new dependency format.
The annotation process was widely covered by the article From constituents to syntax-oriented dependencies. The article describes the linguistic decisions taken for this annotation and presents the new syntactic function tagset which was applied. The multiword deconstruction process was not discussed yet in the former mentioned article, therefore this is now done in a technical report.
- Download the corpus here.
Changes (compared to the former dependency annotation of AnCora) imply:
· consistent surface syntax oriented head selection (instead of a semantic oriented one)
· new syntactic function tagset (only dependency related tags; reduced quantity)
· multiword deconstruction
· deletion of elliptic subjects and empty sentences
AnCora Temporal Dependencies
This annotation represents an adapted version of Surface Syntax Dependencies for time-related NLP tasks such as temporal parsing. All details are given in the upcoming PhD dissertation Tailoring Dependency Models to NLP Tasks.
Download the corpus here.
AnCora Discourse Dependencies
This annotation represents an adapted version of Surface Syntax Dependencies for discourse-related NLP tasks such as discourse parsing. All details are given in the upcoming PhD dissertation Tailoring Dependency Models to NLP Tasks.Download the corpus here.