Authors

Venue

Date

Share

In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

Pratik Rakesh Singh

Kritarth Prasad

Mohammadi Zaki

Pankaj Wasnik

ACL-25

2025

Abstract

Neural Machine Translation (NMT) systems face significant challenges when working with low-resource languages, particularly in domain adaptation tasks. These difficulties arise due to limited training data and suboptimal model generalization, As a result, selecting an opti- mal model for translation is crucial for achiev- ing strong performance on in-domain data, par- ticularly in scenarios where fine-tuning is not feasible or practical. In this paper, we investi- gate strategies for selecting the most suitable NMT model for a given domain using bandit- based algorithms, including Upper Confidence Bound, Linear UCB, Neural Linear Bandit, and Thompson Sampling. Our method effectively addresses the resource constraints by facilitat- ing optimal model selection with high confi- dence. We evaluate the approach across three African languages and domains, demonstrating its robustness and effectiveness in both scenar- ios where target data is available and where it is absent.

Related Publications

REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion

Interspeech, 2026
Ishan Biyani, Nirmesh Shah*, Ashishkumar Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah

Speech time reversal refers to the process of reversing the entire speech signal in time, causing it to play backward. Such signals are completely unintelligible since the fundamental structures of phonemes and syllables are destroyed. However, they still retain tonal patter…

Listen Like a Teacher: Mitigating Whisper Hallucinations using Adaptive Layer Attention and Knowledge Distillation

AAAI, 2025
Kumud Tripathi, Aditya Srinivas Menon, Aman Gupta, Raj Prakash Gohil, Pankaj Wasnik

The Whisper model, an open-source automatic speech recognition system, is widely adopted for its strong performance across multilingual and zero-shot settings. However, it frequently suffers from hallucination errors, especially under noisy acoustic conditions. Previous work…

Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic languages

ACL, 2025
Pratik Rakesh Singh, Kritarth Prasad, Mohammadi Zaki, Pankaj Wasnik

Translating multi-word expressions (MWEs) and idioms requires a deep understanding of the cultural nuances of both the source and target languages. This challenge is further amplified by the one-to-many nature of idiomatic translations, where a single source idiom can have m…

  • HOME
  • Publications
  • In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

JOIN US

Shape the Future of AI with Sony AI

We want to hear from those of you who have a strong desire
to shape the future of AI.