Tutorial:

 

Tools:

  • Dr. Inventor Text Mining Framework: a self-contained java library that implements and integrates a wide range of NLP resources for the analysis of scientific publications. More info at: http://driframework.readthedocs.io/
  • TextDigester: an open-source text summarization java library that implements several extractive summarization approaches  More info at: https://github.com/fra82/textdigester
  • Twitter Crawlers: onpen-source fully-customizable java library to easily retrieve data exploiting Twitter REST and streaming APIs. More info at: https://github.com/fra82/twitter-crawler

More on GitHub at: https://github.com/fra82

 

Corpora:

  • Dr. Inventor Multi-layer Scientific Corpus: corpus of scientific publications manually annotated with respect to several semantic facets of scientific information (scientific discourse, citation purpose, sentence summary relevance). More info at: http://sempub.taln.upf.edu/dricorpus