Corpus tools for parallel corpora of theatre plays: an introduction to TAligner and ACM-theatre
Ver/ Abrir
Registro completo
Mostrar el registro completo DCFecha
2022Derechos
© The Author(s), under exclusive licence to Springer Nature B.V. 2022. This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature's AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s10579-022-09585-5
Publicado en
Language Resources and Evaluation, 2022,
Editorial
Springer
Palabras clave
Corpus building
Corpus analysis
Software
Parallel corpora
Theatre translations
Resumen/Abstract
Software tools are of vital importance in corpus-based research, but they can also lead to restrictions on the type of supported corpora and the range of analyses that can be performed. For example, corpus analysis tools, as general purpose software, do not include specific features to process corpora of theatre plays. This situation is even worse for parallel corpora of theatrical texts, in that there is currently a lack of software that allows for both the alignment and analysis of parallel corpora here. In this contribution, we will first outline the peculiarities of theatre texts and suggest three software features to address them: annotation of the
structural units of plays, alignment at the utterance level, and concordances and statistics using the annotated units. Second, we will present the specific functionalities of TAligner and ACM to build and analyse parallel corpora of play texts, showing how new avenues of research are opening up with the development of these
tools.
Colecciones a las que pertenece
- D13 Artículos [225]
- D13 Proyectos de investigación [34]