Workshop on Challenges in Processing South Asian Languages

PROGRAM

Sunday, January 19, 2025

 8:30–10:30 Morning Oral Presentations
8:30–8:50A Brief Overview of the First Workshop on Challenges in Processing South Asian Languages (CHiPSAL)
Kengatharaiyer Sarveswaran, Surendrabikram Thapa, Sana Shams, Ashwini Vaidya and Bal Krishna Bal
8:50–9:10Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa, Jinu Nyachhyon, Mridul Sharma and Bal Krishna Bal
9:10–9:30Benchmarking the Performance of Pre-trained LLMs across Urdu NLP Tasks
Munief Hassan Tahir, Sana Shams, Layba Fiaz, Farah Adeeba and Sarmad Hussain
9:30–9:50Bengali ChartSumm: A Benchmark Dataset and study on feasibility of Large Language Models on Bengali Chart to Text Summarization
Nahida Akter Tanjila, Afrin Sultana Poushi, Sazid Abdullah Farhan, Abu Raihan Mostofa Kamal, Md. Azam Hossain and Md. Hamjajul Ashmafee
9:50–10:10DweshVaani: An LLM for Detecting Religious Hate Speech in Code-Mixed Hindi-English
Varad Srivastava
10:10–10:30Improving Accuracy of Low-resource ASR using Rule-Based Character Constituency Loss (RBCCL)
Rupak Raj Ghimire, Prakash Poudyal and Bal Krishna Bal
 10:30–12:00 Break
 12:00–12:30 Poster spotlights
 12:30–13:30 Poster Presentations
 13:30–15:50 Afternoon Oral Presentations
14:30–14:50Natural Language Understanding of Devanagari Script Languages: Language Identification, Hate Speech and its Target Detection
Surendrabikram Thapa, Kritesh Rauniyar, Farhan Ahmad Jafri, Surabhi Adhikari, Kengatharaiyer Sarveswaran, Bal Krishna Bal, Hariram Veeramani and Usman Naseem
14:50–15:10SiTa - Sinhala and Tamil Speaker Diarization Dataset in the Wild
Uthayasanker Thayasivam, Thulasithan Gnanenthiram, Shamila Jeewantha and Upeksha Jayawickrama
15:10–15:30Sandhi Splitting in Tamil and Telugu: A Sequence-to-Sequence Approach Leveraging Transformer Models
Priyanka Dasari, Mupparapu Sohan Gupta, Nagaraju Vuppala, Pruthwik Mishra and Parameswari Krishnamurthy
15:30–15:50Bridge the GAP: Multi-lingual Models For Ambiguous Pronominal Coreference Resolution in South Asian Languages
Rahothvarman P, Adith John Rajeev, Kaveri Anuranjana and Radhika Mamidi
 Poster papers
 A Dual Contrastive Learning Framework for Enhanced Hate Speech Detection in Low-Resource Languages
Krishan Chavinda and Uthayasanker Thayasivam
 Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers
Prakash Dhakal and Daya Sagar Baral
 Structured Information Extraction from Nepali Scanned Documents using Layout Transformer and LLMs
Aayush Neupane, Aayush Lamichhane, Ankit Paudel and Aman Shakya
 Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Sharad Duwal, Suraj Prasai and Suresh Manandhar
 POS-Aware Neural Approaches for Word Alignment in Dravidian Languages
Antony Alexander James and Parameswari Krishnamurthy
 neDIOM: Dataset and Analysis of Nepali Idioms
Rhitabrat Pokharel and Ameeta Agrawal
 Bridging the Bandwidth Gap: A Mixed Band Telephonic Urdu ASR Approach with Domain Adaptation for Banking Applications
Ayesha Khalid, Farah Adeeba, Najm Ul Sehar and Sarmad Hussain
 Impacts of Vocoder Selection on Tacotron-based Nepali Text-To-Speech Synthesis
Ganesh Dhakal Chhetri, Kiran Chandra Dahal and Prakash Poudyal
 EmoTa: A Tamil Emotional Speech Dataset
Jubeerathan Thevakumar, Luxshan Thavarasa, Thanikan Sivatheepan, Sajeev Kugarajah and Uthayasanker Thayasivam
 Benchmarking Whisper for Low-Resource Speech Recognition: An N-Shot Evaluation on Pashto, Punjabi, and Urdu
Najm Ul Sehar, Ayesha Khalid, Farah Adeeba and Sarmad Hussain
 Leveraging Machine-Generated Data for Joint Intent Detection and Slot Filling in Bangla: A Resource-Efficient Approach
A H M Rezaul Karim and Özlem Uzuner
 Challenges in Adapting Multilingual LLMs to Low-Resource Languages using LoRA PEFT Tuning
Omkar Khade, Shruti Jagdale, Abhishek Phaltankar, Gauri Takalikar and Raviraj Joshi
 Shared-task papers
 1-800-SHARED-TASKS@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech, and Targets using LLMs
Jebish Purbey, Siddartha Pullakhandam, Kanwal Mehreen, Muhammad Arham, Drishti Sharma, Ashay Srivastava and Ram Mohan Rao Kadiyala
 AniSan@NLU of Devanagari Script Languages 2025: Optimizing Language Identification with Ensemble Learning
Anik Mahmud Shanto, Mst. Sanjida Jamal Priya and Mohammad Shamsul Arefin
 byteSizedLLM@NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification Using Customized Attention BiLSTM and XLM-RoBERTa Base Embeddings
Rohith Gowtham Kodali, Durga Prasad Manukonda and Daniel Iglesias
 byteSizedLLM@NLU of Devanagari Script Languages 2025: Language Identification Using Customized Attention BiLSTM and XLM-RoBERTa base Embeddings
Durga Prasad Manukonda and Rohith Gowtham Kodali
 CUET_Big_O@NLU of Devanagari Script Languages 2025: Identifying Script Language and Detecting Hate Speech Using Deep Learning and Transformer Model
Md. Refaj Hossan, Nazmus Sakib, Md. Alam Miah, Jawad Hossain and Mohammed Moshiul Hoque
 CUET_HateShield@NLU of Devanagari Script Languages 2025: Transformer-Based Hate Speech Detection in Devanagari Script Languages
Sumaiya Rahman Aodhora, Shawly Ahsan and Mohammed Moshiul Hoque
 CUET_INSights@NLU of Devanagari Script Languages 2025: Leveraging Transformer-based Models for Target Identification in Hate Speech
Farjana Alam Tofa, Lorin Tasnim Zeba, Md Osama and Ashim Dey
 CUFE@NLU of Devanagari Script Languages 2025: Language Identification using fastText
Michael Ibrahim
 Dll5143A@NLU of Devanagari Script Languages 2025: Detection of Hate Speech and Targets Using Hierarchical Attention Network
Ashok Yadav and Vrijendra Singh
 DSLNLP@NLU of Devanagari Script Languages 2025: Leveraging BERT-based Architectures for Language Identification, Hate Speech Detection and Target Classification
Shraddha Chauhan and Abhinav Kumar
 IITR-CIOL@NLU of Devanagari Script Languages 2025: Multilingual Hate Speech Detection and Target Identification in Devanagari-Scripted Languages
Siddhant Gupta, Siddh Singhal and Azmine Toushik Wasi
 LLMsAgainstHate@NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs
Rushendra Sidibomma, Pransh Patwa, Parth Patwa, Aman Chadha, Vinija Jain and Amitava Das
 MDSBots@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech, and Targets using MURTweet
Prabhat Ale, Anish Thapaliya and Suman Paudel
 Nepali Transformers@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech and Targets
Pilot Khadka, Ankit BK, Ashish Acharya, Bikram K.C., Sandesh Shrestha and Rabin Thapa
 NLPineers@ NLU of Devanagari Script Languages 2025: Hate Speech Detection using Ensembling of BERT-based models
Nadika Poudel, Anmol Guragain, Rajesh Piryani and Bishesh Khanal
 One_by_zero@ NLU of Devanagari Script Languages 2025: Target Identification for Hate Speech Leveraging Transformer-based Approach
Dola Chakraborty, Jawad Hossain and Mohammed Moshiul Hoque
 Paramananda@NLU of Devanagari Script Languages 2025: Detection of Language, Hate Speech and Targets using FastText and BERT
Darwin Acharya, Sundeep Dawadi, Shivram Saud and Sunil Regmi
 SKPD Emergency @ NLU of Devanagari Script Languages 2025: Devanagari Script Classification using CBOW Embeddings with Attention-Enhanced BiLSTM
Shubham Shakya, Saral Sainju, Subham Krishna Shrestha, Prekshya Dawadi and Shreya Khatiwada