Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs

Authors: Mohammadi Zaki, Pankaj Wasnik, Pratik Rakesh Singh

Venue: AAAI-25

Date: 2025

Abstract

We address the challenging task of neural machine translation (NMT) in the entertainment domain, where the objective is to automatically translate a given dialogue from source-language content into a target language. This task has various applications, particularly in automatic dubbing, subtitling, and other content localization tasks, enabling source content to reach a wider audience. Traditional NMT systems typically translate individual sentences in isolation, without transferring crucial elements such as the context and style of previously encountered sentences. In this work, we emphasize the importance of these two aspects in producing pertinent and captivating translations. We illustrate their importance through several examples and propose a novel framework for entertainment translation, which, to our knowledge, is the first of its kind. Furthermore, we introduce an algorithm that estimates the context and style of the current session and uses these estimates to generate a prompt that guides a Large Language Model (LLM) toward high-quality translations. Our method is both language- and LLM-agnostic, making it a general-purpose tool. We demonstrate the effectiveness of our algorithm through various numerical studies and observe significant improvements in COMET scores over various state-of-the-art LLMs. Moreover, our proposed method consistently outperforms baseline LLMs in terms of win-ratio.
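The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea: keep a rolling window of previously translated lines as context, attach a style estimate, and assemble a prompt for an LLM. Every name here (`SessionState`, `estimate_style`, `build_prompt`, `win_ratio`) is hypothetical, the punctuation-based style heuristic is a toy stand-in for a real estimator, and `win_ratio` assumes the metric means the fraction of segments on which one system's score strictly exceeds the baseline's.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class SessionState:
    """Hypothetical rolling state for one dialogue session (not from the paper)."""
    max_context: int = 3
    context: deque = field(default_factory=deque)
    style: str = "neutral"

    def update(self, source_line: str, translation: str) -> None:
        # Store the newest (source, translation) pair and drop the oldest ones.
        self.context.append((source_line, translation))
        while len(self.context) > self.max_context:
            self.context.popleft()


def estimate_style(line: str) -> str:
    """Toy stand-in for a style estimator; a real system would use a model."""
    stripped = line.rstrip()
    if stripped.endswith("!"):
        return "excited"
    if stripped.endswith("?"):
        return "inquisitive"
    return "neutral"


def build_prompt(state: SessionState, line: str, src_lang: str, tgt_lang: str) -> str:
    """Assemble a translation prompt from the stored context and style."""
    history = "\n".join(
        f"{src_lang}: {src}\n{tgt_lang}: {tgt}" for src, tgt in state.context
    )
    return (
        f"Translate the next dialogue line from {src_lang} to {tgt_lang}.\n"
        f"Estimated style of the session: {state.style}.\n"
        f"Previous lines and their translations:\n{history or '(none)'}\n"
        f"Line to translate: {line}"
    )


def win_ratio(ours: list, baseline: list) -> float:
    """Assumed definition: share of segments where our score beats the baseline."""
    if not ours or len(ours) != len(baseline):
        raise ValueError("score lists must be non-empty and of equal length")
    wins = sum(a > b for a, b in zip(ours, baseline))
    return wins / len(ours)
```

In use, one would call `estimate_style` on each incoming line, store the result in `state.style`, pass the output of `build_prompt` to whichever LLM is being used, and then call `state.update` with the returned translation. The per-segment scores passed to `win_ratio` would come from a metric such as COMET, computed separately.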

