Application of Weighted Voting Taggers to Languages Described with Large Tagsets
Keywords:
Part-of-speech tagging, combination tagger, weighted probability distribution voting tagger, TagPair tagger
Abstract
The paper presents baseline and complex part-of-speech taggers applied to the modified corpus of the Frequency Dictionary of Contemporary Polish, annotated with a large tagset. First, the paper examines the accuracy of 6 baseline part-of-speech taggers. The main part of the work presents simple weighted voting and complex voting taggers. Special attention is paid to lexical voting methods and to the issues of ties and fallbacks. The TagPair and WPDV voting methods achieve the top accuracy among all considered methods. The error reduction of 10.8 % with respect to the best baseline tagger for the large tagset is comparable with other authors' results for small tagsets.
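As a rough illustration of the simple weighted voting idea and of how an error reduction figure is usually computed, the sketch below may help. It is not the authors' implementation: the function names, the example tag labels, the choice of weights (for instance, each tagger's held-out accuracy), and the tie rule (defer to one designated fallback tagger) are all assumptions made here for illustration. The TagPair and WPDV methods are more involved and are not shown.

```python
from collections import defaultdict


def weighted_vote(proposals, weights, fallback):
    """Pick one tag for a single token by simple weighted voting.

    proposals: dict mapping tagger name -> tag it proposes (hypothetical names)
    weights:   dict mapping tagger name -> non-negative vote weight
    fallback:  name of the tagger trusted when the top scores tie (assumption)
    """
    scores = defaultdict(float)
    for tagger, tag in proposals.items():
        scores[tag] += weights[tagger]

    best = max(scores.values())
    winners = [tag for tag, score in scores.items() if score == best]
    if len(winners) == 1:
        return winners[0]
    # Tie between several tags: defer to the designated fallback tagger.
    return proposals[fallback]


def error_reduction(baseline_accuracy, combined_accuracy):
    """Relative error reduction of the combined tagger over the baseline."""
    baseline_error = 1.0 - baseline_accuracy
    combined_error = 1.0 - combined_accuracy
    return (baseline_error - combined_error) / baseline_error
```

With this definition, an error reduction of 10.8 % means that the combined tagger makes 10.8 % fewer tagging errors than the best baseline tagger; it does not mean a gain of 10.8 percentage points in accuracy.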
Published
2012-01-26
How to Cite
Kuta, M., Wojcik, W., Wrzeszcz, M., & Kitowski, J. (2012). Application of Weighted Voting Taggers to Languages Described with Large Tagsets. Computing and Informatics, 29(2), 203–225. Retrieved from http://147.213.75.17/ojs/index.php/cai/article/view/81
Section
Articles