Increasing Quality of the Corpus of Frequency Dictionary of Contemporary Polish for Morphosyntactic Tagging of the Polish Language

Authors

  • Marcin Kuta
  • Paweł Chrzaszcz
  • Jacek Kitowski

Abstract

The paper addresses the correction of the erroneous and ambiguous corpus of the Frequency Dictionary of Contemporary Polish (FDCP) and its application to morphosyntactic tagging of the Polish language. It presents several stages of corpus transformation and evaluates baseline part-of-speech tagging algorithms.

Published

2012-01-26

How to Cite

Kuta, M., Chrzaszcz, P., & Kitowski, J. (2012). Increasing Quality of the Corpus of Frequency Dictionary of Contemporary Polish for Morphosyntactic Tagging of the Polish Language. Computing and Informatics, 28(3), 319–338. Retrieved from http://147.213.75.17/ojs/index.php/cai/article/view/40