SCAP-TT
PDF

Keywords

POS-tagging
lemmatisation
Spanish
TreeTagger
tourism discourse
SCAP
etiquetado gramatical
lematización
español
discurso turístico
SCAP-tur

How to Cite

Goethals, P., Lefever, E., & Macken, L. (2017). SCAP-TT: Tagging and lemmatising Spanish tourism discourse, and beyond . Ibérica, (33), 279–288. Retrieved from https://www.revistaiberica.org/index.php/iberica/article/view/170

Abstract

In this research note we report on the first results of SCAP, the Spanish Corpus Annotation Project, applied to tourism discourse. In particular, we present and assess a new TreeTagger parameter set for Spanish (SCAP-TT), which has been trained for the Part-of-Speech tagging (POS-tagging) and lemmatisation of Spanish promotional tourism texts. Although SCAP-TT has been trained for specialized tourism discourse, we also show promising results for the annotation of other text genres such as essays and literary text
PDF

Copyright (c) 2017 Patrick Goethals, Els Lefever, Lieve Macken

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Downloads

Download data is not yet available.