publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. NAACL
    AMPS: ASR with Multimodal Paraphrase Supervision
    *Abhishek Gupta, *Amruta Parulekar, Sameep Chattopadhyay, and 1 more author
    In Proceedings of NAACL, 2025
  2. COLING
    CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving
    Bhavani Shankar P S V N, Preethi Jyothi, and Pushpak Bhattacharyya
    In Proceedings of COLING, 2025

2024

  1. NeurIPS
    WikiDO: A New Benchmark Evaluating Cross-Modal Retrieval for Vision-Language Models
    *Pavan Kalyan Tankala, *Piyush Pasi, Sahil Dharod, and 4 more authors
    In Proceedings of NeurIPS (Datasets and Benchmarks Track), 2024
  2. Interspeech
    SALSA: Speedy ASR-LLM Synchronous Aggregation
    Ashish Mittal, Darshan Prabhu, Sunita Sarawagi, and 1 more author
    In Proceedings of Interspeech
    This work was nominated for a Best Student Paper Award , 2024
  3. Interspeech
    Emotion arithmetic: Emotional speech synthesis via weight space interpolation
    Pavan Kalyan, Preeti Rao, Preethi Jyothi, and 1 more author
    In Proc. Interspeech 2024, 2024
  4. Interspeech
    Multi-Convformer: Extending Conformer with Multiple Convolution Kernels
    Darshan Prabhu, Yifan Peng, Preethi Jyothi, and 1 more author
    In Proceedings of Interspeech 2024, 2024
  5. Interspeech
    Improving Self-supervised Pre-training using Accent-Specific Codebooks
    Darshan Prabhu, Abhishek Gupta, Omkar Nitsure, and 2 more authors
    In Proc. Interspeech 2024, 2024
  6. ACL
    In-context mixing (ICM): Code-mixed prompts for multilingual LLMs
    Bhavani Shankar, Preethi Jyothi, and Pushpak Bhattacharyya
    In Proceedings of ACL, 2024
  7. ACL
    Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection
    *Barah Fazili, *Ashish Agrawal, and Preethi Jyothi
    In Proceedings of ACL (Findings), 2024
  8. ACL
    Part-of-speech Tagging for Extremely Low-resource Indian Languages
    Sanjeev Kumar, Preethi Jyothi, and Pushpak Bhattacharyya
    In Proceedings of ACL (Findings), 2024
  9. ACL
    DIMSIM: Distilled Multilingual Critics for Indic Text Simplification
    Sneha Mondal, Ritika Ritika, Ashish Agrawal, and 2 more authors
    In Proceedings of ACL (Findings), 2024
  10. EACL
    Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning
    Ashish* Agrawal, Barah* Fazili, and Preethi Jyothi
    In Proceedings of EACL, 2024
  11. EACL
    STORiCo: Storytelling TTS for Hindi with Character Voice Modulation
    Pavan Tankala, Preethi Jyothi, Preeti Rao, and 1 more author
    In Proceedings of EACL, 2024

2023

  1. ICLR
    In-situ text-only adaptation of speech models with low-overhead speech imputations
    Ashish Mittal, Sunita Sarawagi, and Preethi Jyothi
    In Proceedings of ICLR, 2023
  2. ICASSP
    Towards zero-shot code-switched speech recognition
    Brian Yan, Matthew Wiesner, Ondřej Klejch, and 2 more authors
    In Proceedings of ICASSP, 2023
  3. IJCAI
    Temporally aligning long audio interviews with questions: a case study in multimodal data integration
    Piyush Singh Pasi, Karthikeya Battepati, Preethi Jyothi, and 3 more authors
    In Proceedings of IJCAI, 2023
  4. ACL
    Improving pretraining techniques for code-switched NLP
    *Richeek Das, *Sahasra Ranjan, Shreya Pathak, and 1 more author
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
    This work received an Outstanding Paper Award , 2023
  5. ACL
    Zero-shot cross-lingual transfer with learned projections using unlabeled target-language data
    Ujan Deb, Ridayesh Parab, and Preethi Jyothi
    In Proceedings of ACL, 2023
  6. ACL
    DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation
    *Suraj Kothawade, *Anmol Mekala, D Chandra Sekhara Hetha Havya, and 4 more authors
    In Proceedings of ACL, 2023
  7. Interspeech
    Narrator or Character: Voice Modulation in an Expressive Multi-speaker TTS
    Tankala Pavan Kalyan, Preeti Rao, Preethi Jyothi, and 1 more author
    In Proceedings of Interspeech, 2023
  8. Interspeech
    Improving RNN-Transducers with Acoustic LookAhead
    Vinit S Unni, Ashish Mittal, Preethi Jyothi, and 1 more author
    In Proceedings of Interspeech, 2023
  9. Interspeech
    Unsupervised Code-switched Text Generation from Parallel Text
    Jie Chi, Brian Lu, Jason Eisner, and 3 more authors
    In Proc. Interspeech 2023, 2023
  10. EMNLP
    Accented Speech Recognition With Accent-specific Codebooks
    Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, and 1 more author
    In Proceedings of EMNLP, 2023
  11. EMNLP
    Speech-enriched memory for inference-time adaptation of asr models to word dictionaries
    Ashish Mittal, Sunita Sarawagi, Preethi Jyothi, and 2 more authors
    In Proceedings of EMNLP, 2023
  12. EMNLP
    DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages
    Vineet Bhat, Preethi Jyothi, and Pushpak Bhattacharyya
    In Proceedings of EMNLP (Findings), 2023
  13. ICLR (Workshop)
    Surprisingly Simple Adapter Ensembling for Zero-shot Cross-lingual Sequence Tagging
    Rohan Shah and Preethi Jyothi
    2023

2022

  1. ACL
    Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding
    Soumya Chatterjee, Sunita Sarawagi, and Preethi Jyothi
    In Proceedings of ACL, 2022
  2. COLING
    Aligning multilingual embeddings for improved code-switched natural language understanding
    Barah Fazili and Preethi Jyothi
    In Proceedings of COLING, 2022
  3. COLING
    Zero-shot disfluency detection for Indian languages
    Rohit Kundu, Preethi Jyothi, and Pushpak Bhattacharyya
    In Proceedings of COLING, 2022
  4. EMNLP
    CoCoa: An Encoder-Decoder Model for Controllable Code-switched Generation
    Sneha Mondal, Shreya Pathak, Preethi Jyothi, and 2 more authors
    In Proceedings of EMNLP, 2022
  5. EMNLP
    Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training
    Ashish Mittal, Durga Sivasubramanian, Rishabh Iyer, and 2 more authors
    In Proceedings of EMNLP (Findings), 2022
  6. ICASSP
    Adaptive discounting of implicit language models in rnn-transducers
    Vinit Unni, Shreya Khare, Ashish Mittal, and 3 more authors
    In Proceedings of ICASSP, 2022
  7. Interspeech
    SPLICEOUT: A Simple and Efficient Audio Augmentation Method
    Arjit Jain, Pranay Reddy Samala, Deepak Mittal, and 2 more authors
    In Proceedings of Interspeech
    Pseudocode in the arxiv version , 2022
  8. Interspeech
    Linguistically Informed Post-processing for ASR Error correction in Sanskrit.
    Rishabh Kumar, Devaraja Adiga, Rishav Ranjan, and 4 more authors
    In Proceedings of Interspeech, 2022

2021

  1. EMNLP (Workshop)
    The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding
    Archiki Prasad, Mohammad Ali Rehan, Shreya Pathak, and 1 more author
    In Proceedings of the 1st Workshop on Multilingual Representation Learning (MRL)
    This work received an Honorable Mention Award , 2021
  2. Interspeech
    Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
    Anuj Diwan and Preethi Jyothi
    In Proceedings of Interspeech
    This work was nominated for a Best Student Paper Award , 2021
  3. Interspeech
    Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.
    Shreya Khare, Ashish R Mittal, Anuj Diwan, and 3 more authors
    In Proceedings of Interspeech, 2021
  4. Interspeech
    Cross-Modal Learning for Audio-Visual Video Parsing
    Jatin Lamba, Jayaprakash Akula, Rishabh Dabral, and 3 more authors
    In Proceedings of Interspeech, 2021
  5. Interspeech
    MUCS 2021: Multilingual and code-switching ASR challenges for low resource Indian languages
    Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, and 5 more authors
    In
    Datasets are at link1 and link2 , 2021
  6. ACL
    From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text
    Ishan Tarunesh, Syamantak Kumar, and Preethi Jyothi
    In Proceedings of ACL, 2021
  7. ACL
    Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights
    Devaraja Adiga, Rishabh Kumar, Amrith Krishna, and 3 more authors
    In Proceedings of ACL (Findings), 2021
  8. IJCAI
    Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image Captioning.
    Arjit Jain, Pranay Reddy Samala, Preethi Jyothi, and 1 more author
    In Proceedings of IJCAI, 2021
  9. NAACL Workskop
    The effect of pretraining on extractive summarization for scientific documents
    Yash Gupta, Pawan Sasanka Ammanamanchi, Shikha Bordia, and 7 more authors
    In Proceedings of the Second Workshop on Scholarly Document Processing, 2021
  10. SIGIR
    Select, substitute, search: A new benchmark for knowledge-augmented visual question answering
    Aman Jain, Mayank Kothyari, Vishwajeet Kumar, and 3 more authors
    In Proceedings of SIGIR, 2021
  11. ICASSP
    An investigation of end-to-end models for robust speech recognition
    Archiki Prasad, Preethi Jyothi, and Rajbabu Velmurugan
    In Proceedings of ICASSP, 2021
  12. ICASSP
    Error-driven fixed-budget asr personalization for accented speakers
    Abhijeet Awasthi, Aman Kansal, Sunita Sarawagi, and 1 more author
    In Proceedings of ICASSP, 2021
  13. ICASSP
    Collaborative learning to generate audio-video jointly
    Vinod K Kurmi, Vipul Bajaj, Badri N Patro, and 3 more authors
    In Proceedings of ICASSP, 2021
  14. EACL
    Disfluency correction using unsupervised and semi-supervised learning
    Nikhil Saini, Drumil Trivedi, Shreya Khare, and 4 more authors
    In Proceedings of EACL, 2021
  15. EACL
    Meta-Learning for Effective Multi-task and Multilingual Modelling
    Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar, and 2 more authors
    In Proceedings of EACL, 2021

2020

  1. Interspeech
    Black-Box Adaptation of ASR for Accented Speech
    Kartik Khandelwal, Preethi Jyothi, Abhijeet Awasthi, and 1 more author
    In Proceedings of Interspeech, 2020
  2. Interspeech
    Improving Low Resource Code-Switched ASR Using Augmented Code-Switched TTS
    Yash Sharma, Basil Abraham, Karan Taneja, and 1 more author
    In Proceedings of Interspeech, 2020
  3. Interspeech
    Caption alignment for low resource audio-visual data
    Vighnesh Reddy Konda, Mayur Warialani, Rakesh Prasanth Achari, and 6 more authors
    In Proceedings of Interspeech, 2020
  4. ACL
    How accents confound: Probing for accent information in end-to-end speech recognition systems
    Archiki Prasad and Preethi Jyothi
    In Proceedings of ACL, 2020
  5. ICASSP
    Coupled training of sequence-to-sequence models for accented speech recognition
    Vinit Unni, Nitish Joshi, and Preethi Jyothi
    In Proceedings of ICASSP, 2020
  6. LREC
    Crowdsourcing speech data for low-resource languages from low-income workers
    Basil Abraham, Danish Goel, Divya Siddarth, and 7 more authors
    In Proceedings of LREC, 2020
  7. IWSLT
    Generating fluent translations from disfluent text without access to fluent references: IIT Bombay@ IWSLT2020
    Nikhil Saini, Jyotsana Khatri, Preethi Jyothi, and 1 more author
    In Proceedings of IWSLT, 2020

2019

  1. ACL
    Cross-Lingual Training for Automatic Question Generation
    Vishwajeet Kumar, Nitish Joshi, Arijit Mukherjee, and 2 more authors
    In Proceedings of ACL, 2019
  2. Interspeech
    Exploiting Monolingual Speech Corpora for Code-Mixed Speech Recognition.
    Karan Taneja, Satarupa Guha, Preethi Jyothi, and 1 more author
    In Proceedings of Interspeech, 2019

2018

  1. EMNLP
    Revisiting the Importance of Encoding Logic Rules in Sentiment Classification
    Kalpesh Krishna, Preethi Jyothi, and Mohit Iyyer
    In Proceedings of EMNLP, 2018
  2. EMNLP
    Code-switched Language Models Using Dual RNNs and Same-Source Pretraining
    Saurabh Garg, Tanmay Parekh, and Preethi Jyothi
    In Proceedings of EMNLP, 2018
  3. Interspeech
    Improved Accented Speech Recognition Using Accent Embeddings and Multi-task Learning.
    Abhinav Jain, Minali Upreti, and Preethi Jyothi
    In Proceedings of Interspeech, 2018
  4. Interspeech
    Dual Language Models for Code Switched Speech Recognition
    Saurabh Garg, Tanmay Parekh, and Preethi Jyothi
    In Proceedings of Interspeech, 2018
  5. Interspeech
    Time Aggregation Operators for Multi-label Audio Event Detection.
    Pankaj Joshi, Digvijaysingh Gautam, Ganesh Ramakrishnan, and 1 more author
    In Proceedings of Interspeech, 2018
  6. ICLR
    Generalizing Across Domains via Cross-Gradient Training
    Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, and 3 more authors
    In Proceedings of ICLR, 2018

2017

  1. ASRU
    Leveraging native language speech for accent identification using deep siamese networks
    Aditya Siddhant, Preethi Jyothi, and Sriram Ganapathy
    In Proceedings of ASRU, 2017
  2. Asilomar
    Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions
    Mark Hasegawa-Johnson, Preethi Jyothi, Wenda Chen, and 1 more author
    In Proceedings of Asilomar, 2017
  3. ICASSP
    Low-resource grapheme-to-phoneme conversion using recurrent neural networks
    Preethi Jyothi and Mark Hasegawa-Johnson
    In Proceedings of ICASSP, 2017

2016

  1. TASL
    ASR for under-resourced languages from probabilistic transcription
    Mark A Hasegawa-Johnson, Preethi Jyothi, Daniel McCloy, and 8 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016
  2. COLING Workshop
    Clustering-based phonetic projection in mismatched crowdsourcing channels for low-resourced ASR
    Wenda Chen, Mark Hasegawa-Johnson, Nancy Chen, and 2 more authors
    In Proceedings of the Workshop on South and Southeast Asian Natural Language Processing (WSSANLP2016), COLING, 2016
  3. Interspeech
    Automatic Speech Recognition Using Probabilistic Transcriptions in Swahili, Amharic, and Dinka.
    Amit Das, Preethi Jyothi, and Mark Hasegawa-Johnson
    In Proceedings of Interspeech, 2016
  4. SLTU
    Performance improvement of probabilistic transcriptions with language-specific constraints
    Xiang Kong, Preethi Jyothi, and Mark Hasegawa-Johnson
    Proceedings of SLTU Workshop, 2016
  5. CSL
    Articulatory feature-based pronunciation modeling
    Karen Livescu, Preethi Jyothi, and Eric Fosler-Lussier
    Computer Speech & Language, 2016
  6. ICASSP
    Adapting ASR for under-resourced languages using mismatched transcriptions
    *Chunxi Liu, *Preethi Jyothi, Hao Tang, and 5 more authors
    In Proceedings of ICASSP
    This work received an Speech and Language Processing Student Paper Award , 2016
  7. ITA
    Language coverage for mismatched crowdsourcing
    Lav R Varshney, Preethi Jyothi, and Mark Hasegawa-Johnson
    In 2016 Information Theory and Applications Workshop (ITA), 2016

2015

  1. Interspeech
    Transcribing continuous speech using mismatched crowdsourcing.
    Preethi Jyothi and Mark Hasegawa-Johnson
    In Proceedings of Interspeech, 2015
  2. Interspeech
    Improved hindi broadcast ASR by adapting the language model and pronunciation model using a priori syntactic and morphophonemic knowledge.
    Preethi Jyothi and Mark Hasegawa-Johnson
    In Proceedings of Interspeech, 2015
  3. AAAI
    Acquiring speech transcriptions using mismatched crowdsourcing
    Preethi Jyothi and Mark Hasegawa-Johnson
    In Proceedings of AAAI, 2015
  4. LabPhon
    Models of dataset size, question design, and cross-language speech perception for speech crowdsourcing applications
    Mark Hasegawa-Johnson, Jennifer Cole, Preethi Jyothi, and 1 more author
    Laboratory Phonology, 2015
  5. ICPhS
    Prosodic and structural correlates of perceived prominence in Russian and Hindi.
    Tatiana Luchkina, Jennifer S Cole, Preethi Jyothi, and 1 more author
    In Proceedings of ICPhS, 2015

2014

  1. SIGMORPHON
    Revisiting word neighborhoods for speech recognition
    Preethi Jyothi and Karen Livescu
    In Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014
  2. SpeechProsody
    An investigation of prosody in Hindi narrative speech
    Preethi Jyothi, Jennifer Cole, Mark Hasegawa-Johnson, and 1 more author
    In Proceedings of Speech Prosody, 2014

2013

  1. Interspeech
    Discriminative training of WFST factors with application to pronunciation modeling.
    Preethi Jyothi, Eric Fosler-Lussier, and Karen Livescu
    In Proceedings of Interspeech, 2013
  2. IEEE
    Conditional random fields in speech, audio, and language processing
    Eric Fosler-Lussier, Yanzhang He, Preethi Jyothi, and 1 more author
    Proceedings of the IEEE, 2013

2012

  1. Interspeech
    Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.
    Preethi Jyothi, Eric Fosler-Lussier, and Karen Livescu
    In Proceedings of Interspeech
    This work received a Best Student Paper Award , 2012
  2. NAACL Workshop
    Large-scale discriminative language model reranking for voice-search
    Preethi Jyothi, Leif Johnson, Ciprian Chelba, and 1 more author
    In Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model?, 2012
  3. ICASSP
    Distributed discriminative language models for Google voice-search
    Preethi Jyothi, Leif Johnson, Ciprian Chelba, and 1 more author
    In Proceedings of ICASSP, 2012

2011

  1. ICASSP
    Lexical access experiments with context-dependent articulatory feature-based models
    Preethi Jyothi, Karen Livescu, and Eric Fosler-Lussier
    In Proceedings of ICASSP, 2011

2010

  1. Interspeech
    Discriminative language modeling using simulated ASR errors.
    Preethi Jyothi and Eric Fosler-Lussier
    In Proceedings of Interspeech, 2010
  2. NAACL
    Investigations into the Crandem approach to word recognition
    Rohit Prabhavalkar, Preethi Jyothi, William Hartmann, and 2 more authors
    In Proceedings of NAACL, 2010

2009

  1. Interspeech
    A comparison of audio-free speech recognition error prediction methods.
    Preethi Jyothi and Eric Fosler-Lussier
    In Proceedings of Interspeech, 2009