WCCL Match – A Language for Text Annotati

In this paper we present a formalism for text annotation called WCCL Match. The need for a new formalism originates from our works related to Question Answering for Polish. We examined several existing formalisms to conclude that none of them fulfills our requirements. The new formalism was designed on top of an existing language for writing morphosyntactic functional expressions, namely WCCL. The major features of WCCL Match are: creation of new annotations, modification of existing ones, support for overlapping annotations, explicit access to tagset attributes and referring to context outside of captured annotation. We discuss three applications of the formalism: recognition of proper names, question analysis and question-to-query transformation. The implementation of WCCL Match is language-independent and can be used for almost any natural language.
Year:
2013
Type of Publication:
In Proceedings
Keywords:
WCCL Match; text annotation; rule-based framework
Editor:
Kłopotek, Mieczysław A. and Koronacki, Jacek and Marciniak, Małgorzata and Mykowiecka, Agnieszka and Wierzchoń, Sławomir T.
Volume:
7912
Book title:
Language Processing and Intelligent Information Systems
Series:
Lecture Notes in Computer Science
Pages:
131-144
ISBN:
978-3-642-38633-6
Hits: 918