SUMMA is a text summarization toolkit that follows the architectural precepts of the GATE framework. It provides much-needed functionality, in the form of language and processing resources, for composing practical summarization applications.
BACKGROUND
Our lives are saturated with information from multiple channels such as news agencies, e-mail, tweets, commentaries, and social networks. Automatic text summarization can help extract, merge, and condense what is important from a stream of textual information, reducing information overload. No existing summarizer can handle all types of documents, hence the need for a toolkit whose components can be composed into customized summarization applications.
THE TECHNOLOGY
SUMMA main features:
- Resources for statistical text analysis
- Resources for feature computation
- Resources for customization of summaries
- Resources for exporting results
- Single-document summarization
- Multi-document summarization
- Multilingual processing
- Ready-made summarizers and baselines for research comparison
- Easy to install and use
- Easy to extend
- Easy to customize
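
As an illustration of how statistical text analysis, feature computation, and single-document summarization can fit together, the following is a minimal sketch of a frequency-based extractive summarizer. It is not SUMMA's API or implementation: the function names, the stopword list, and the scoring rule (average term frequency per sentence) are all assumptions chosen for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "it",
             "that", "for", "on", "with", "as", "are", "was"}


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOPWORDS]


def summarize(text, max_sentences=1):
    """Return the highest-scoring sentences, in their original document order."""
    sentences = split_sentences(text)

    # Statistical analysis: term frequencies over the whole document.
    tf = Counter(w for s in sentences for w in tokenize(s))

    # Feature computation: one score per sentence (average term frequency).
    def score(sentence):
        tokens = tokenize(sentence)
        return sum(tf[w] for w in tokens) / len(tokens) if tokens else 0.0

    # Summary customization: keep the top-ranked sentences only.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:max_sentences])
    return [sentences[i] for i in chosen]


if __name__ == "__main__":
    doc = ("Summarization reduces information overload. "
           "Cats sleep a lot. "
           "Automatic summarization condenses information from text.")
    for line in summarize(doc, max_sentences=2):
        print(line)
```

Each stage in the sketch (analysis, scoring, selection) is a separate function so that any one of them could be swapped out, which is the kind of composition a toolkit approach is meant to support.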
ADVANTAGES
- Ready-to-use algorithm implementations for building summarization systems
- Multi-platform
- Multilingual
STATE OF DEVELOPMENT
Fully developed.
INTELLECTUAL PROPERTY
©UPF 2014, software registered.
MARKET OPPORTUNITY
Customizable summarization applications for government and private institutions. Software distribution and consultancy.
COMMERCIAL OPPORTUNITY
Software available for licensing with technical cooperation.
CONTACT
Marc Santandreu
Technology Transfer Unit
(+34) 93 542 2896
[email protected]
KEYWORDS
Text Processing, Summarization, Information Extraction.
Ref: TEC-0090/S-0008
Fact Sheet