Web Based Engine for Processing and Clustering of Polish Texts

The paper presents a service-oriented online engine for processing and clustering texts in Polish. The engine, designed according to the Web-Oriented Architecture paradigm, makes it possible to run a large number of language tools (such as a tagger, a named entity recognizer, or a feature extractor) and clustering tools (such as CLUTO or R) from almost any type of application, including HTML/JavaScript ones. It supports the construction of complex workflows, not only simple chains of tools. To meet high availability requirements, the engine is deployed in a private cloud.
Year:
2015
Type of Publication:
In Proceedings
Keywords:
natural language processing; clustering Polish texts; web application
Editor:
Wojciech Zamojski, Jacek Mazurkiewicz, Jarosław Sugier, Tomasz Walkowiak, Janusz Kacprzyk
Volume:
365
Book title:
Theory and Engineering of Complex Systems and Dependability
Series:
Advances in Intelligent Systems and Computing
Pages:
515-522
ISBN:
978-3-319-19216-1
ISSN:
2194-5357