B. Zeller and J. Šnajder and S. Pado. DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German. Proceedings of ACL 2013. Sofia, Bulgaria.
Note: Data available here.
Derivational models are still an under-researched area in computational morphology. Even for German, a rather resource-rich language, there is a lack of large-coverage derivational knowledge. This paper describes a rule-based framework for inducing derivational families (i.e., clusters of lemmas in derivational relationships) and its application to create a high-coverage German resource, DERIVBASE, mapping over 280k lemmas into more than 17k non-singleton clusters. We focus on the rule component and a qualitative and quantitative evaluation. Our approach achieves up to 93% precision and 71% recall. We attribute the high precision to the fact that our rules are based on information from grammar books.
@InProceedings{zeller-vsnajder-pado:2013:ACL2013, author = {Zeller, Britta and \v{S}najder, Jan and Pad\'{o}, Sebastian}, title = {DErivBase: Inducing and Evaluating a Derivational Morphology Resource for German}, booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics}, year = {2013}, address = {Sofia, Bulgaria}, pages = {1201--1211}, url = {http://www.aclweb.org/anthology/P13-1118} }