The 4th Workshop on Arabic Corpus Linguistics

PROGRAM

Monday, January 20, 2025

9:00–9:10Welcome and Opening Remarks
9:10–9:509:10
9:10
 Session 1
9:50–10:10ArabicSense: A Benchmark for Evaluating Commonsense Reasoning in Arabic with Large Language Models
salima lamsiyah, Kamyar Zeinalipour, Samir El amrany, Matthias Brust, Marco Maggini, Pascal Bouvry and Christoph Schommer
10:10–10:30Lahjawi: Arabic Cross-Dialect Translator
Mohamed Motasim Hamed, Muhammad Hreden, Khalil Hennara, Zeina Aldallal, Sara Chrouf and Safwan AlModhayan
10:30–11:00Coffee Break
 Session 2
11:00–11:20Lost in Variation: An Unsupervised Methodology for Mining Lexico-syntactic Patterns in Middle Arabic Texts
Julien JB Bezançon, Rimane Karam and Gaël Lejeune
11:20–11:40SADSLyC: A Corpus for Saudi Arabian Multi-dialect Identification through Song Lyrics
Salwa Saad Alahmari
11:40–12:00Enhancing Dialectal Arabic Intent Detection through Cross-Dialect Multilingual Input Augmentation
Shehenaz Hossain, Fouad Shammary, Bahaulddin Shammary and Haithem Afli
12:00–12:20Dial2MSA-Verified: A Multi-Dialect Arabic Social Media Dataset for Neural Machine Translation to Modern Standard Arabic
Abdullah Salem Khered, Youcef Benkhedda and Riza Batista-Navarro
12:20–13:20Lunch Break
 Session 3
13:20–13:40Web-Based Corpus Compilation of the Emirati Arabic Dialect
Yousra A. El-Ghawi
13:40–14:00Evaluating Calibration of Arabic Pre-trained Language Models on Dialectal Text
Ali Al-Laith and RACHIDA KEBDANI
14:00–14:20Empirical Evaluation of Pre-trained Language Models for Summarizing Moroccan Darija News Articles
Azzedine Aftiss, Salima Lamsiyah, Christoph Schommer and Said Ouatik El Alaoui
14:20–14:40Dialect2SQL: A Novel Text-to-SQL Dataset for Arabic Dialects with a Focus on Moroccan Darija
salmane chafik, Saad Ezzini and Ismail Berrada
14:40–15:00AraSim: Optimizing Arabic Dialect Translation in Children’s Literature with LLMs and Similarity Scores
Alaa Hassan Bouomar and Noorhan Abbas
15:00–15:20Navigating Dialectal Bias and Ethical Complexities in Levantine Arabic Hate Speech Detection
Ahmed Haj Ahmed, Rui-Jie Yew, Xerxes Minocher and Suresh Venkatasubramanian
15:20–16:00Coffee Break
16:00–16:30Best Paper Award, Closing Remarks, and Wrap-Up by Dr Saad Ezzini