Rodrigues, Ricardo [Author];
Gonçalo Oliveira, Hugo [Author];
Gomes, Paulo [Author];
Ricardo Rodrigues and Hugo Gonçalo Oliveira and Paulo Gomes [Contributor]
LemPORT: a High-Accuracy Cross-Platform Lemmatizer for Portuguese
Notes:
This data source also contains holdings records that do not lead to a full text.
Description:
Although lemmatization is a common subtask in many natural language processing tasks, truly cross-platform lemmatization tools aimed specifically at Portuguese are scarce, particularly tools that can be integrated into projects developed in Java. To address this, we developed a lemmatizer, initially only for our own use, and have since decided to make it publicly available. The lemmatizer, presented in this document, achieves an overall accuracy above 98% when evaluated against a manually revised corpus.