Aditya Yadavalli

Ph.D. Student at UC San Diego


I am a first-year Ph.D. student in the LeM🍋N Lab at UC San Diego, where I am advised by Prof. Alex Warstadt. Here, I work on using spoken language models (SLMs) as cognitive models to study how information is conveyed through lexical (textual) and non-lexical (speech) streams of human communication.

Previously, I was a Speech/NLP Engineer at Karya. There, I worked on a range of topics: human-LLM agreement across various tasks, building indigenous/endangered language resources, quality estimation of crowdsourced datasets, and assistive tools for crowdsource workers.

I used to volunteer at Masakhane, where I explored how current SLMs fail to generalise well, especially to African languages and accents, and how we can make them better.

From 2021 to 2023, I was also a visiting researcher at Case Western Reserve University (CWRU). There, I collaborated with Prof. Vera Tobin to evaluate NLP models trained on Child Directed Speech (CDS) to establish common mistakes that humans and NLP models make when acquiring a new (second) language.

I completed my B.Tech. (Hons) in Computer Science & M.S. by Research in Computational Linguistics at IIIT Hyderabad. For my M.S. thesis, I explored how closely related languages can be used to improve performance on low-resource languages, working at the Speech Processing Lab (SPL) with Prof. Anil Kumar Vuppala.

When not at work, you can find me discussing cricket or picking up obscure trivia that no one cares about.

news

Nov 10, 2025 Paper titled ELR-1000: A Community-Generated Dataset for Endangered Indic Indigenous Languages accepted at IJCNLP-AACL 2025!
Oct 17, 2024 Paper titled PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data accepted at EMNLP 2024!
Mar 28, 2024 Paper titled Speaking in Terms of Money: Financial Knowledge Acquisition through Speech Data Generation accepted at COMPASS 2024!
Mar 25, 2024 Paper titled “Akal Badi ya Bias”: An Exploration Study of Gender Bias in Hindi accepted at FAccT 2024!
Jan 27, 2024 Paper titled MunTTS: A Text-to-Speech System for Mundari accepted at ComputEL-7 workshop
Jan 27, 2024 Paper titled AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents accepted at EACL 2024 Findings
Oct 4, 2023 Paper titled AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR accepted in TACL & will be presented at EMNLP 2023!
May 4, 2023 Paper titled SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT accepted at ACL 2023 ✨
May 4, 2023 Paper titled X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents accepted in ACL Findings!
Nov 11, 2022 Defended my MS thesis! Thanks to my panel – Dr. Kishore Prahallad and Prof. Chiranjeevi Yarra 🎉
Oct 3, 2022 Paper on “How Do Phonological Properties Affect Bilingual Automatic Speech Recognition?” accepted at IEEE SLT 2022 🎉
Sep 10, 2022 I will be visiting Incheon to attend Interspeech! Happy to meet you if you’ll be attending too.
Jul 1, 2022 I will be attending NAACL in person. If you are attending too, let’s catch up!
Jun 18, 2022 Submitted my MS Thesis for review 😄
Jun 14, 2022 Paper on “Multi-Task End-to-End Model for Telugu Dialect and Speech Recognition” accepted in Interspeech 2022 ✨
Jun 1, 2022 Started working at Karya under the mentorship of Dr. Vivek Seshadri as a Speech/NLP Engineer.

selected publications

  1. EMNLP
    PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data
    Watts, Ishaan, Gumma, Varun, Yadavalli, Aditya, Seshadri, Vivek, Swaminathan, Manohar, and Sitaram, Sunayana
    In Proc. EMNLP 2024
  2. EACL Findings
    AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents
    Owodunni, Abraham*, Yadavalli, Aditya*, Emezue, Chris*, Olatunji, Tobi*, and Mbataku, Clinton
    In Proc. EACL Findings 2024
  3. ACL
    SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT
    Yadavalli, Aditya*, Yadavalli, Alekhya*, and Tobin, Vera
    In Proc. ACL 2023
  4. Interspeech
    Multi-Task End-to-End Model for Telugu Dialect and Speech Recognition
    Yadavalli, Aditya, Mirishkar, Ganesh, and Vuppala, Anil Kumar
    In Proc. Interspeech 2022