Language variation, automatic speech recognition and algorithmic bias

Markl, Nina

Language variation, automatic speech recognition and algorithmic bias

Simple item page

dc.contributor.advisor

Lai, Catherine

dc.contributor.advisor

Hall-Lew, Lauren

dc.contributor.author

Markl, Nina

dc.contributor.sponsor

Economic and Social Research Council (ESRC)

en

dc.date.accessioned

2023-12-12T12:16:33Z

dc.date.available

2023-12-12T12:16:33Z

dc.date.issued

2023-12-12

dc.description.abstract

In this thesis, I situate the impacts of automatic speech recognition systems in relation to sociolinguistic theory (in particular drawing on concepts of language variation, language ideology and language policy) and contemporary debates in AI ethics (especially regarding algorithmic bias and fairness). In recent years, automatic speech recognition systems, alongside other language technologies, have been adopted by a growing number of users and have been embedded in an increasing number of algorithmic systems. This expansion into new application domains and language varieties can be understood as an expansion into new sociolinguistic contexts. In this thesis, I am interested in how automatic speech recognition tools interact with this sociolinguistic context, and how they affect speakers, speech communities and their language varieties. Focussing on commercial automatic speech recognition systems for British Englishes, I first explore the extent and consequences of performance differences of these systems for different user groups depending on their linguistic background. When situating this predictive bias within the wider sociolinguistic context, it becomes apparent that these systems reproduce and potentially entrench existing linguistic discrimination and could therefore cause direct and indirect harms to already marginalised speaker groups. To understand the benefits and potentials of automatic transcription tools, I highlight two case studies: transcribing sociolinguistic data in English and transcribing personal voice messages in isiXhosa. The central role of the sociolinguistic context in developing these tools is emphasised in this comparison. Design choices, such as the choice of training data, are particularly consequential because they interact with existing processes of language standardisation. To understand the impacts of these choices, and the role of the developers making them better, I draw on theory from language policy research and critical data studies. These conceptual frameworks are intended to help practitioners and researchers in anticipating and mitigating predictive bias and other potential harms of speech technologies. Beyond looking at individual choices, I also investigate the discourses about language variation and linguistic diversity deployed in the context of language technologies. These discourses put forward by researchers, developers and commercial providers not only have a direct effect on the wider sociolinguistic context, but they also highlight how this context (e.g., existing beliefs about language(s)) affects technology development. Finally, I explore ways of building better automatic speech recognition tools, focussing in particular on well-documented, naturalistic and diverse benchmark datasets. However, inclusive datasets are not necessarily a panacea, as they still raise important questions about the nature of linguistic data and language variation (especially in relation to identity), and may not mitigate or prevent all potential harms of automatic speech recognition systems as embedded in larger algorithmic systems and sociolinguistic contexts.

en

dc.identifier.uri

https://hdl.handle.net/1842/41277

dc.identifier.uri

http://dx.doi.org/10.7488/era/4013

dc.language.iso

en

dc.publisher

The University of Edinburgh

en

dc.relation.hasversion

Nina Markl (2022b). “Language Variation and Algorithmic Bias: Understanding Algorithmic Bias in British English Automatic Speech Recognition”. In: 2022 ACM Conference on Fairness, Accountability, and Transparency. FAccT ’22. Seoul, Republic of Korea: Association for Computing Machinery, pp. 521–534. DOI: 10.1145/3531146.3533117

en

dc.relation.hasversion

Nina Markl (2022a). “(Commercial) Automatic Speech Recognition as a Tool in Sociolinguistic Research”. In: University of Pennsylvania Working Papers in Linguistics 28.2. URL: https://repository.upenn.edu/pwpl/vol28/iss2/11

en

dc.relation.hasversion

Nina Markl, Electra Wallington, Ondrej Klejch, Thomas Reitmaier, Gavin Bailey, Jennifer Pearson, Matt Jones, Simon Robinson, and Peter Bell (2023). “Automatic transcription and (de)standardisation”. English. In: Proceedings - SIGUL 2023, 2nd Annual Meeting of the Special Interest Group on Under-resourced Languages. URL: https://sigul-2023.ilc.cnr.it/wp-content/uploads/2023/08/9_Paper.pdf

en

dc.relation.hasversion

Nina Markl (2022c). “Mind the data gap(s): Investigating power in speech and language datasets”. In: Proceedings of the Second Workshop on Language Technology for Equality, Diversity and Inclusion. Dublin, Ireland: Association for Computational Linguistics, pp. 1–12. URL: https://aclanthology.org/2022.ltedi-1.1

en

dc.relation.hasversion

Nina Markl and Stephen Joseph McNulty (2022). “Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR”. in: Proceedings of the Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, pp. 6328–6339. URL: https : / / aclanthology . org/2022.lrec-1.680

en

dc.relation.hasversion

Nina Markl and Catherine Lai (2023). “Everyone has an accent”. In: Proc. INTERSPEECH 2023, pp. 4424–4427. DOI: 10.21437/Interspeech.2023-1847

en

dc.relation.hasversion

Nina Markl and Catherine Lai (2021). “Context-sensitive evaluation of automatic speech recognition: considering user experience & language variation”. In: Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing. Online: Association for Computational Linguistics, pp. 34–40. URL: https: //aclanthology.org/2021.hcinlp-1.6

en

dc.relation.hasversion

Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Klejch Ondřej, and Peter Bell (2023). “The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR”. in: ICASSP 2023, pp. 1–5. DOI: 10.1109/ ICASSP49357.2023.10095057

en

dc.relation.hasversion

Hall-Lew, Lauren, Claire Cowie, Catherine Lai, Nina Markl, Stephen Joseph McNulty, Shan-Jan Sarah Liu, Clare Llewellyn, Beatrice Alex, Zuzana Elliott, and Anita Klingler (2022). “The Lothian Diary Project: sociolinguistic methods during the COVID-19 lockdown”. In: Linguistics Vanguard 8.s3, pp. 321–330. DOI: doi:10.1515/lingvan-2021-0053

en

dc.relation.hasversion

Reitmaier, Thomas, Electra Wallington, Ondřej Klejch, Nina Markl, Lea-Marie Lam-Yee-Mui, Jennifer Pearson, Matt Jones, Peter Bell, and Simon Robinson (2023). “Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers”. In: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. CHI ’23. New York, NY, USA: Association for Computing Machinery. DOI: 10.1145/ 3544548.3581385.

en

dc.subject

automatic speech recognition

en

dc.subject

linguistic background

en

dc.subject

voice technology development

en

dc.subject

standard language

en

dc.subject

marginalised groups

en

dc.subject

representative speech datasets

en

dc.subject

inclusive design

en

dc.title

Language variation, automatic speech recognition and algorithmic bias

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

Doctoral

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Markl2023.pdf
Size:: 2.16 MB
Format:: Adobe Portable Document Format
Description:

Download

This item appears in the following Collection(s)

Informatics thesis and dissertation collection