Proceedings of the First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages

PROGRAM

 8.45–9.00 Opening Remarks
 9.00–10.00 Keynote Speech
 Theme: Language Processing and Evaluation
10.00–10.15 Crossing Language Boundaries: Evaluation of Large Language Models on Urdu-English Question Answering
Samreen Kazi, Maria Rahim and Shakeel Ahmed Khoja
10.15–10.30 Hindi Reading Comprehension: Do Large Language Models Exhibit Semantic Understanding?
Daisy Monika Lal, Paul Rayson and Mo El-Haj
 Coffee Break
11.00–11.15 Machine Translation and Transliteration for Indo-Aryan Languages: A Systematic Review
Sandun Sameera Perera and Deshan Koshala Sumanathilaka
11.15–11.30 Investigating the Effect of Backtranslation for Indic Languages
Sudhansu Bala Das, Samujjal Choudhury, Dr Tapas Kumar Mishra and Dr Bidyut Kr Patra
11.30–11.45 BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study
Atharva Mutsaddi, Anvi Jamkhande, Aryan Shirish Thakre and Yashodhara Haribhakta
11.45–12.00 Evaluating Structural and Linguistic Quality in Urdu DRS Parsing and Generation through Bidirectional Evaluation
Muhammad Saad Amin, Luca Anselma and Alessandro Mazzei
12.00–12.15 Studying the Effect of Hindi Tokenizer Performance on Downstream Tasks
Rashi Goel and Fatiha Sadat
12.15–12.30 Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus: A Case Study for Hindi LLMs
Raviraj Joshi, Kanishk Singla, Anusha Kamath, Raunak Kalani, Rakesh Paul, Utkarsh Vaidya, Sanjay Singh Chauhan, Niranjan Wartikar and Eileen Long
12.30–12.45 OVQA: A Dataset for Visual Question Answering and Multimodal Research in Odia Language
Shantipriya Parida, Shashikanta Sahoo, Sambit Sekhar, Kalyanamalini Sahoo, Ketan Kotwal, Sonal Khosla, Satya Ranjan Dash, Aneesh Bose, Guneet Singh Kohli, Smruti Smita Lenka and Ondřej Bojar
12.45–13.00 Advancing Multilingual Speaker Identification and Verification for Indo-Aryan and Dravidian Languages
Braveenan Sritharan and Uthayasanker Thayasivam
 13.00–14.00 Lunch Break
 Theme: Applications and Societal Impact: Applying NLP to Real-World Problems and Societal Challenges
14.00–14.15 Sentiment Analysis of Sinhala News Comments Using Transformers
Isuru Bandaranayake and Hakim Usoof
14.15–14.30 ExMute: A Context-Enriched Multimodal Dataset for Hateful Memes
Riddhiman Swanan Debnath, Nahian Beente Firuj, Abdul Wadud Shakib, Sadia Sultana and Md Saiful Islam
14.30–14.45 Studying the capabilities of Large Language Models in solving Combinatorics Problems posed in Hindi
Yash Kumar and Subhajit Roy
14.45–15.00 From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs
Hrithik Majumdar Shibu, Shrestha Datta, Md. Sumon Miah, Nasrullah Sami, Mahruba Sharmin Chowdhury and Md Saiful Islam
15.00–15.15 Enhancing Participatory Development Research in South Asia through LLM Agents System: An Empirically-Grounded Methodological Initiative from Field Evidence in Sri Lankan
Xinjie Zhao, Hao Wang, Shyaman Maduranga Sriwarnasinghe, Jiacheng Tang, Shiyun Wang, Sayaka Sugiyama and So Morikawa
15.15–15.30 Identifying Aggression and Offensive Language in Code-Mixed Tweets: A Multi-Task Transfer Learning Approach
Bharath Kancharla, Prabhjot Singh, Lohith Bhagavan Kancharla, Yashita Chama and Raksha Sharma
 15.30–16.00 Coffee Break
 Shared Task Discussion
 Team IndiDataMiner at IndoNLP 2025: Hindi Back Transliteration - Roman to Devanagari using LLaMa
Saurabh Kumar, Dhruvkumar Babubhai Kakadiya and Sanasam Ranbir Singh
 IndoNLP 2025 Shared Task: Romanized Sinhala to Sinhala Reverse Transliteration Using BERT
Sandun Sameera Perera, Lahiru Prabhath Jayakodi, Deshan Koshala Sumanathilaka and Isuri Anuradha
 Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
Widanalage Mario Yomal De Mel, Kasun Imesha Wickramasinghe, Nisansa de Silva and Surangika Dayani Ranathunga
 Romanized to Native Malayalam Script Transliteration Using an Encoder-Decoder Framework
Bajiyo Baiju, Kavya Manohar, Leena G. Pillai and Elizabeth Sherly
 Final Remarks