Publications
Here is a list of my research papers, conference presentations and other research activities. This listing is up to date as of 24 February 2026.
Papers
2026
- Elo, K., Tarkka, O., Laine, J., Koljonen, J., Korhonen, M., & Martiskainen, K. (forthcoming). Exploring Emotions in Parliamentary Debates with a Sentiment Recognition Deep Learning Model: A Case Study of Finnish Plenary Debates on Economy and Environmental Issues 1990-2023.
- Tarkka, O., Elo, K., Ginter, F., & Laippala, V. (2026). Do all politicians sound the same? Comparing model explanations to human responses. Digital Humanities Quarterly 20(1). URL: https://dhq.digitalhumanities.org/vol/20/1/000839/000839.html
- Saarni, J., Tarkka, O., & Laippala, V. (2026). Evaluation in social media discourse: A corpus-assisted discourse study of evaluative images of the Covid-19 pandemic on the Finnish Twitter-sphere. Finnish Journal of Linguistics 38, pp. 139–165. URL: https://doi.org/10.61197/fjl.156423
- Ristilä, A., Tarkka, O., Laippala, V., & Elo, K. (2026, to appear). Hopes and Fears — Emotion Distribution in the Topic Landscape of Finnish Parliamentary Speech 2000-2020. ArXiv preprint. URL: https://arxiv.org/abs/2601.20424
2025
- Henriksson, E., Tarkka, O., & Ginter, F. (2025). FinerWeb-10BT: Refining web data with LLM-based line-level filtering. Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pp. 70–76. URL: https://aclanthology.org/2025.nodalida-1.27/
2024
- Tarkka, O., Koljonen, J., Korhonen, M., Laine, J., Martiskainen, K., Elo, K., & Laippala, V. (2024). Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4. ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora. URL: https://aclanthology.org/2024.parlaclarin-1.11/
- Kanerva, J., Ginter, F., Chang, L., Rastas, I., Skantsi, V., Kilpeläinen, J., Kupari, H.-M., Piirto, A., Saarni, J., Sevón, M., & Tarkka, O. (2024). Towards diverse and contextually anchored paraphrase modeling: A dataset and baselines for Finnish. Natural Language Engineering 30(2), pp. 319–353. URL: https://doi.org/10.1017/S1351324923000086
2023
- Kanerva, J., Ginter, F., Chang, L., Skantsi, V., Kilpeläinen, J., Kupari, H.-M., Piirto, A., Saarni, J., Sevón, M., & Tarkka, O. (2023). Textual Paraphrase Dataset for Deep Language Modelling. In G. Rehm (Ed.), European Language Grid. Cognitive Technologies (pp. 343–348). Springer, Cham. URL: https://doi.org/10.1007/978-3-031-17258-8_27
2021
- Kanerva, J., Ginter, F., Chang, L., Rastas, I., Skantsi, V., Kilpeläinen, J., Kupari, H.-M., Saarni, J., Sevón, M., & Tarkka, O. (2021). Finnish Paraphrase Corpus. Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa), pp. 288–298. URL: https://aclanthology.org/2021.nodalida-main.29/
Conference presentations
2025
- NoDaLiDa/Baltic-HLT, Tallinn, Estonia. FinerWeb-10BT: Refining web data with LLM-based line-level filtering (presentation).
- Corpus linguistics conference, Birmingham, UK. Automated corpus characterization using LLMs: A register analysis case study (poster).
2024
- ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora, Turin, Italy. Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4 (presentation).
2023
- Digital Humanities in the Nordic and Baltic Countries, online conference. Exploring the stability of political rhetoric in Finnish parliamentary debates using deep learning (presentation).
- Corpus linguistics conference, Lancaster, UK. Modelling political ideologies from parliamentary speeches (poster).
- AFinLa syyssymposium, Tampere, Finland. Millainen puhuja, sellainen puhe? Koneoppivat menetelmät keinoina tunnistaa eduskuntapuheiden ideologisia positioita (presentation, in Finnish).
Other
- Humanisti vastaa: Mitä tekoäly paljastaa kansanedustajien puheista, Otto Tarkka? (2024). Podcast interview (in Finnish). URL: https://www.ts.fi/uutiset/6520042