User-driven Language Technology Infrastructure – the Case of CLARIN-PL

The paper discusses the user-driven development of CLARIN-PL, the Polish branch of the European language technology infrastructure for Humanities and Social Sciences. CLARIN-PL can serve as an exemplar of a bi-directional (i.e., top-down and bottom-up) approach to developing language resources and tools. The paper presents an overview of the state of the basic processing chain for Polish, the set of basic Polish language resources and tools, and the typical processing schemes emerging from the development of key applications. We also discuss the problem of the quality of services offered by language tools, which goes well beyond the typical measures used during testing. In conclusion, we try to envisage further user needs and further development of the language technology infrastructure, for which the 3-4 year construction phase is a good starting point towards a fully-fledged infrastructure.
Year:
2014
Type of Publication:
In Proceedings
Editor:
Tomaž Erjavec, Jerneja Žganec Gros
Volume:
G
Book title:
Proceedings of the 17th International Multiconference Information Society - IS 2014
Series:
Language technologies
Pages:
7-13
ISBN:
978-961-264-077-4