| PreAct: Prediction Enhances Agent’s Planning Ability
Dayuan Fu, Jianzhao Huang, Siyuan Lu, Guanting Dong, Yejie Wang, Keqing He and Weiran Xu |
| The PRECOM-SM Corpus: Gambling in Spanish Social Media
Pablo Álvarez-Ojeda, María Victoria Cantero-Romero, Anastasia Semikozova and Arturo Montejo-Raez |
| How Well Can a Long Sequence Model Model Long Sequences? Comparing Architectural Inductive Biases on Long-Context Abilities
Jerry Huang |
| Sequential Fusion of Text-close and Text-far Representations for Multimodal Sentiment Analysis
Kaiwei Sun and Mi Tian |
| PoemBERT: A Dynamic Masking Content and Ratio Based Semantic Language Model For Chinese Poem Generation
Chihan Huang and Xiaobo Shen |
| CDA^2: Counterfactual Diffusion Augmentation for Cross-Domain Adaptation in Low-Resource Sentiment Analysis
Dancheng Xin, Kaiqi Zhao, Jingyun Sun and Yang Li |
| CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
Yuwei Zhao, Ziyang Luo, Yuchen Tian, Hongzhan Lin, Weixiang Yan, Annan Li and Jing Ma |
| Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching
Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xuanang Chen, Xianpei Han, Le Sun, Hao Wang and Zhenyu Zeng |
| InstructGEC: Enhancing Unsupervised Grammatical Error Correction with Instruction Tuning
Jiayi Deng, Chen Chen, Chunyan Hou and Xiaojie Yuan |
| Sibyl: Empowering Empathetic Dialogue Generation in Large Language Models via Sensible and Visionary Commonsense Inference
Lanrui Wang, Jiangnan Li, Chenxu Yang, Zheng Lin, Hongyin Tang, Huan Liu, Yanan Cao, Jingang Wang and Weiping Wang |
| Noise-powered Multi-modal Knowledge Graph Representation Framework
Zhuo Chen, Yin Fang, Yichi Zhang, Lingbing Guo, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen and Wen Zhang |
| ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
Junjie Ye, Guanyu Li, SongYang Gao, Caishuang Huang, Yilong Wu, Sixian Li, Xiaoran Fan, Shihan Dou, Tao Ji, Qi Zhang, Tao Gui and Xuanjing Huang |
| Federated Incremental Named Entity Recognition
Zesheng Liu, Qiannan Zhu, Cuiping Li and Hong Chen |
| Large Language Models are Good Annotators for Type-aware Data Augmentation in Grammatical Error Correction
Xinyuan Li and Yunshi Lan |
| Looks can be Deceptive: Distinguishing Repetition Disfluency from Reduplication
Arif A. Ahmad, Khyathi Gayathri Mothika and Pushpak Bhattacharyya |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback
Jihwan Oh, Jeonghwan Choi, Nicole Hee-Yoen Kim, Taewon Yun and Hwanjun Song |
| FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Tao Fan, Guoqiang Ma, Yan Kang, Hanlin Gu, Yuanfeng Song, Lixin Fan, Kai Chen and Qiang Yang |
| Dynamic Graph Neural ODE Network for Multi-modal Emotion Recognition in Conversation
Yuntao Shou, tao meng, wei ai and KEQIN LI |
| HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
Peng Xia, Xingtong Yu, Ming Hu, Lie Ju, Zhiyong Wang, Peibo Duan and Zongyuan Ge |
| Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement
Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi Fung, Hou Pong Chan, Kevin Small, ChengXiang Zhai and Heng Ji |
| Style Over Substance: Evaluation Biases for Large Language Models
Minghao Wu and Alham Fikri Aji |
| Multimodal Aspect-Based Sentiment Analysis under Conditional Relation
Xinjing Liu, Ruifan Li, Shuqin Ye, guangwei zhang and Xiaojie WANG |
| Semantic Role Labeling of NomBank Partitives
Adam Meyers, Advait Pravin Savant and John E. Ortega |
| MCS-SQL: Leveraging Multiple Prompts and Multiple-Choice Selection For Text-to-SQL Generation
Dongjun Lee, Choongwon Park, Jaehyuk Kim and Heesoo Park |
| InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery
He Cao, Zijing Liu, Xingyu Lu, Yuan Yao and Yu Li |
| Ambiguity-aware Multi-level Incongruity Fusion Network for Multi-Modal Sarcasm Detection
Kuntao Li, Yifan Chen, Qiaofeng Wu, Weixing Mai, Fenghuan Li and Yun Xue |
| AdminSet and AdminBERT: a Dataset and a Pre-trained Language Model to Explore the Unstructured Maze of French Administrative Documents
Thomas Sebbag, Solen Quiniou, Nicolas Stucky and Emmanuel Morin |
| ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models
Thibaut Thonet, Laurent Besacier and Jos Rozen |
| Positive Text Reframing under Multi-strategy Optimization
Shutong Jia, Biwei Cao, Qingqing Gao, Jiuxin Cao and Bo Liu |
| RAM2C: A Liberal Arts Educational Chatbot based on Retrieval-augmented Multi-role Multi-expert Collaboration
Haoyu Huang, Tong Niu, Rui Yang and Luping Shi |
| SURE: Mutually Visible Objects and Self-generated Candidate Labels For Relation Extraction
Yuxuan Feng, Qian Chen, Qianyou Wu, Xin GUO and Suge Wang |
| TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
Yihong Liu, Chunlan Ma, Haotian Ye and Hinrich Schütze |
| Two-stage Incomplete Utterance Rewriting on Editing Operation
Zhiyu Cao, Peifeng Li, Qiaoming Zhu and Yaxin Fan |
| QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
Jingyao Li, Han Shi, Sitong Wu, Chuanyang Zheng, Zhenguo Li, Xin Jiang, Hong Xu and Jiaya Jia |
| SVD-GCL: A Noise-Augmented Hybrid Graph Contrastive Learning Framework for Recommendation
Liping Wang, Shichao Li, Hui Wang, Yuyan Gao and Mingyao Wei |
| MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL
Bing Wang, Changyu Ren, Jian Yang, Xinnian Liang, Jiaqi Bai, LinZheng Chai, Zhao Yan, Qian-Wen Zhang, di yin, Xing Sun and Zhoujun Li |
| Exploring Concept Depth: How Large Language Models Acquire Knowledge and Concept at Different Layers?
Mingyu Jin, Qinkai Yu, Jingyuan Huang, Qingcheng Zeng, Zhenting Wang, Wenyue Hua, Haiyan Zhao, Kai Mei, Yanda Meng, Kaize Ding, Fan Yang, Mengnan Du and Yongfeng Zhang |
| Knowledge Graph Entity Typing with Curriculum Contrastive Learning
hao wang, Minghua Nuo and shan jiang |
| The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
Zihui Wu, Haichang Gao, Jianping He and Ping Wang |
| Adapters Selector: Cross-domains and Multi-tasks LoRA Modules Integration Usage Method
Yimin Tian, Bolin Zhang, Zhiying Tu and Dianhui Chu |
| XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser
Xianfu Cheng, hang zhang, Jian Yang, Xiang Li, Weixiao Zhou, Fei Liu, Kui Wu, Xiangyuan Guan, Tao Sun, Xianjie Wu, Tongliang Li and Zhoujun Li |
| Debiasing by obfuscating with 007-classifiers promotes fairness in multi-community settings
Ingroj Shrestha and Padmini Srinivasan |
| Graph Representation Learning in Hyperbolic Space via Dual-Masked
rui gong, zuyun jiang and daren zha |
| Perturbation-driven Dual Auxiliary Contrastive Learning for Collaborative Filtering Recommendation
Caihong Mu, Keyang Zhang, Jialiang Zhou and Yi Liu |
| Enhancing Reranking for Recommendation with LLMs through User Preference Retrieval
Haobo Zhang, Qiannan Zhu and Zhicheng Dou |
| SyntheT2C: Generating Synthetic Data for Fine-Tuning Large Language Models on the Text2Cypher Task
Zijie Zhong, Linqing Zhong, Zhaoze Sun, Qingyun Jin, Zengchang Qin and Xiaofan Zhang |
| Language Models Encode the Value of Numbers Linearly
Fangwei Zhu, Damai Dai and Zhifang Sui |
| FinDABench: Benchmarking Financial Data Analysis Ability of Large Language Models
Shu Liu, Shangqing Zhao, Chenghao Jia, Xinlin Zhuang, Zhaoguang Long, Jie Zhou, Aimin Zhou, Man Lan and yang chong |
| Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Nguyen Binh Nguyen and Yang He |
| SLARD: A Chinese Superior Legal Article Retrieval Dataset
Zhe Chen, Pengjie Ren, Fuhui Sun, Xiaoyan Wang, Yujun Li, Siwen Zhao and Tengyi Yang |
| Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations
Nuo Chen, Hongguang Li, Jianhui Chang, juhua huang, Baoyuan Wang and Jia Li |
| Refined Evaluation for End-to-End Grammatical Error Correction Using an Alignment-Based Approach
Junrui Wang, Mengyang Qiu, Yang Gu, Zihao Huang and Jungyeul Park |
| LLMs on interactive feature collections with implicit dynamic decision strategy
Juyeon Heo, Vihari Piratla, Kyunghyun Lee, Hyonkeun Joh and Adrian Weller |
| Pre-trained Semantic Interaction based Inductive Graph Neural Networks for Text Classification
Shiyu Wang, Gang Zhou, Jicang Lu, Jing Chen and Ningbo Huang |
| From Superficial to Deep: Integrating External Knowledge for Follow-up Question Generation Using Knowledge Graph and LLM
Jianyu Liu, Yi Huang, Sheng Bi, Junlan Feng and Guilin Qi |
| AGCL: Aspect Graph Construction and Learning for Aspect-level Sentiment Classification
Zhongquan Jian, Daihang Wu, Shaopan Wang, Yancheng Wang, Junfeng Yao, Meihong Wang and Qingqiang Wu |
| TaCIE: Enhancing Instruction Comprehension in Large Language Models through Task-Centred Instruction Evolution
Jiuding Yang, Shengyao Lu, Weidong Guo, Xiangyang Li, Kaitong Yang, Yu Xu and Di Niu |
| LLaMA-E: Empowering E-commerce Authoring with Object-Interleaved Instruction Following
Kaize Shi, Xueyao Sun, Dingxian Wang, Yinlin Fu, Guandong Xu and Qing Li |
| LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations
Wenlin Zhang, Chuhan Wu, Xiangyang Li, Yuhao Wang, Kuicai Dong, Yichao Wang, Xinyi Dai, Xiangyu Zhao, Huifeng Guo and Ruiming Tang |
| Collaborative Document Simplification Using Multi-Agent Systems
Dengzhao Fang, Jipeng Qiang, Xiaoye Ouyang, Yi Zhu, Yunhao Yuan and Yun Li |
| Distilling Rule-based Knowledge into Large Language Models
Wenkai Yang, Yankai Lin, Jie Zhou and Ji-Rong Wen |
| Exploring Backdoor Vulnerabilities of Chat Models
Wenkai Yang, Yunzhuo Hao and Yankai Lin |
| Towards the Machine Translation of Scientific Neologisms
Paul Lerner and François Yvon |
| HyperIDP: Customizing Temporal Hypergraph Neural Networks for Multi-Scale Information Diffusion Prediction
Haowei Xu, Chao Gao, Xianghua Li and Zhen Wang |
| Enhancing multi-modal Relation Extraction with Reinforcement Learning Guided Graph Diffusion Framework
Rui Yang and Rajiv Gupta |
| Non-Emotion-Centric Empathetic Dialogue Generation
Yuanxiang Huangfu, Peifeng Li, Yaxin Fan and Qiaoming Zhu |
| Aligning Retrieval with Reader Needs: Reader-Centered Passage Selection for Open-Domain Question Answering
Chunlei Xin, Shuheng Zhou, Xuanang Chen, Yaojie Lu, Huijia Zhu, weiqiang wang, Zhongyi Liu, Xianpei Han and Le Sun |
| Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding
Cheng Wang, Yiwei Wang, Bryan Hooi, Yujun Cai, Nanyun Peng and Kai-Wei Chang |
| Citation Amnesia: On The Recency Bias of NLP and Other Academic Fields
Jan Philip Wahle, Terry Lima Ruas, Mohamed Abdalla, Bela Gipp and Saif M. Mohammad |
| Low-Resource Fast Text Classification Based on Intra-Class and Inter-Class Distance Calculation
Yanxu Mao, Peipei Liu, Tiehan Cui, Congying Liu and Datao You |
| Monte Carlo Tree Search Based Prompt Autogeneration for Jailbreak Attacks against LLMs
Suhuang WU, Huimin Wang, Yutian Zhao, Xian Wu, yefeng zheng, Wei Li, Hui Li and rongrong ji |
| LogiGraph: Logical Reasoning with Contrastive Learning and Lightweight Graph Networks
Xiang Li, Chen Shi, Yong Xu and jun huang |
| Explaining Relationships Among Research Papers
Xiangci Li and Jessica Ouyang |
| From Generalist to Specialist: A Survey of Large Language Models for Chemistry
Yang Han, Ziping Wan, Lu Chen, Kai Yu and Xin Chen |
| Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution
Milad Alshomary, Narutatsu Ri, Marianna Apidianaki, Ajay Patel, Smaranda Muresan and Kathleen McKeown |
| Read Before Grounding: Scene Knowledge Visual Grounding via Multi-step Parsing
HaiXiang Zhu, Lixian Su, ShuangMing Mao and Jing Ye |
| Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem
Qianli Wang, Tatiana Anikina, Nils Feldhus, Simon Ostermann, Sebastian Möller and Vera Schmitt |
| BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation
Minchong Li, Feng Zhou and Xiaohui Song |
| Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz, Satak Kumar Dey, Ruwad Naswan, Hasnaen Adil, Khondker Salman Sayeed and Haz Sameen Shahgir |
| Do language models practice what they preach? Examining language ideologies about gendered language reform encoded in LLMs
Julia Watson, Sophia S. Lee, Barend Beekhuizen and Suzanne Stevenson |
| T-MES: Trait-Aware Mix-of-Experts Representation Learning for Multi-trait Essay Scoring
Jiong Wang and Jie Liu |
| A Graph Interaction Framework on Relevance for Multimodal Named Entity Recognition with Multiple Images
Jiachen Zhao, Shizhou Huang and xin Lin |
| Mining Word Boundaries from Speech-Text Parallel Data for Cross-domain Chinese Word Segmentation
Xuebin Wang, Lei Zhang, Zhenghua Li, Shilin Zhou, Chen Gong and Yang Hou |
| RoBGuard: Enhancing LLMs to Assess Risk of Bias in Clinical Trial Documents
Changkai Ji, Bowen Zhao, Zhuoyao Wang, Yingwen Wang, Yuejie Zhang, Ying Cheng, Rui Feng and Xiaobo Zhang |
| A Compressive Memory-based Retrieval Approach for Event Argument Extraction
Wanlong Liu, Enqi Zhang, shaohuan cheng, Dingyi Zeng, Li Zhou, Chen Zhang, Malu Zhang and Wenyu Chen |
| FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics
Yupei Du, Albert Gatt and Dong Nguyen |
| PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation
Hour Kaing, Raj Dabre, Haiyue Song, Van-Hien Tran, Hideki Tanaka and Masao Utiyama |
| Relation Logical Reasoning and Relation-aware Entity Encoding for Temporal Knowledge Graph Reasoning
Longzhou Liu, Chenglong Xiao, Shanshan Wang and Tingwen Liu |
| Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question Answering
Huanxuan Liao, Shizhu He, Yao Xu, Yuanzhe Zhang, Shengping Liu, Kang Liu and Jun Zhao |
| Dying or Departing? Euphemism Detection for Death Discourse in Historical Texts
Ali Al-Laith, Alexander Conroy, Jens Bjerring-Hansen, Bolette Pedersen, Carsten Levisen and Daniel Hershcovich |
| ITERATE: Image-Text Enhancement, Retrieval, and Alignment for Transmodal Evolution with LLMs
Chenhan Fu, Guoming Wang, Juncheng Li, Wenqiao Zhang, Rongxing Lu and Siliang Tang |
| Multi-Graph Co-Training for Capturing User Intent in Session-based Recommendation
zhe yang and Tiantian Liang |
| CAST: Cross-modal Alignment Similarity Test for Vision Language Models
Gautier Dagan, Olga Loginova and Anil Batra |
| Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models
Chengkai Huang, Yu Xia, Rui Wang, Kaige Xie, Tong Yu, Julian McAuley and Lina Yao |
| Investigating the Contextualised Word Embedding Dimensions Specified for Contextual and Temporal Semantic Changes
Taichi Aida and Danushka Bollegala |
| Uncertainty Modelling in Under-Represented Languages with Bayesian Deep Gaussian Processes
Ubaid Azam, Imran Razzak, Shelly Vishwakarma and Shoaib Jameel |
| Cross-lingual Text Classification Transfer: The Case of Ukrainian
Daryna Dementieva, Valeriia Khylenko and Georg Groh |
| LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
Dongge Han, Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Peter Bell and Amos Storkey |
| CEHA: A Dataset of Conflict Events in the Horn of Africa
Rui Bai, Di Lu, Shihao Ran, Elizabeth M. Olson, Hemank Lamba, Aoife Cahill, Joel Tetreault and Alejandro Jaimes |
| QABISAR: Query-Article Bipartite Interactions for Statutory Article Retrieval
Santosh T.Y.S.S, Hassan Sarwat and Matthias Grabmair |
| Partial Order-centered Hyperbolic Representation Learning for Few-shot Relation Extraction
Biao Hu, Zhen Huang, Minghao Hu, Pinglv Yang, Peng Qiao, Yong Dou and Zhilin Wang |
| Taxonomy-Guided Zero-Shot Recommendations with LLMs
Yueqing Liang, Liangwei Yang, Chen Wang, Xiongxiao Xu, Philip S. Yu and Kai Shu |
| Enhancing Multi-party Dialogue Discourse Parsing with Explanation Generation
Shannan Liu, Peifeng Li, Yaxin Fan and Qiaoming Zhu |
| MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples
Shuo Xie, Fangzhi Zhu, Jiahui Wang, Lulu Wen, Wei Dai, Xiaowei Chen, Junxiong Zhu, Kai Zhou and Bo Zheng |
| Polysemy Interpretation and Transformer Language Models: A Case of Korean Adverbial Postposition -(u)lo
Seongmin Mun and Gyu-Ho Shin |
| A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation
Ekai Hashimoto, Mikio Nakano, Takayoshi Sakurai, Shun Shiramatsu, Toshitake Komazaki and Shiho Tsuchiya |
| A Simple-Yet-Efficient Instruction Augmentation Method for Zero-Shot Sentiment Classification
Yang Zhao, Masayasu Muraoka, Issei Yoshida, Bishwaranjan Bhattacharjee and Hiroshi Kanayama |
| Improving Explainable Fact-Checking with Claim-Evidence Correlations
Xin Tan, Bowei Zou and Ai Ti Aw |
| Analyzing Continuous Semantic Shifts with Diachronic Word Similarity Matrices
Hajime Kiyama, Taichi Aida, Mamoru Komachi, Toshinobu Ogiso, Hiroya Takamura and Daichi Mochihashi |
| A Testset for Context-Aware LLM Translation in Korean-to-English Discourse Level Translation
Minjae Lee, Youngbin Noh and Seung Jin Lee |
| MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning
Lulu Zhao, Weihao Zeng, shi xiaofeng and Hua Zhou |
| A Combinatorial Approach to Neural Emergent Communication
Zheyuan Zhang |
| Multi-perspective Preference Alignment of LLMs for Programming-Community Question Answering
Hongyu Yang, Jiahui Hou, Liyang He and rui li |
| Learning to Refuse: Towards Mitigating Privacy Risks in LLMs
Zhenhua Liu, Tong Zhu, Chuanyuan Tan and Wenliang Chen |
| Exploring Unified Training Framework for Multimodal User Profiling
Minjie Qiang, Zhongqing Wang, Shoushan Li and Guodong Zhou |
| Acquiring Bidirectionality via Large and Small Language Models
Takumi Goto, Hiroyoshi Nagao and Yuta Koreeda |
| Enhancing One-Shot Pruned Pre-trained Language Models through Sparse-Dense-Sparse Mechanism
Guanchen Li, Xiandong Zhao, Lian Liu, Zeping Li, Yixing Xu, Dong Li, Lu Tian, Jie He, Ashish Sirasao and Emad Barsoum |
| Language Models over Large-Scale Knowledge Base: on Capacity, Flexibility and Reasoning for New Facts
Qiyuan He, Yizhong Wang, Jianfei Yu and Wenya Wang |
| Multi-View Incongruity Learning for Multimodal Sarcasm Detection
Diandian Guo, Cong Cao, Fangfang Yuan, Yanbing Liu, Guangjie Zeng, Xiaoyan Yu, Hao Peng and Philip S. Yu |
| Cognitive Biases, Task Complexity, and Result Intepretability in Large Language Models
Mario Mina, Valle Ruiz-Fernández, Júlia Falcão, Luis Vasquez-Reina and Aitor Gonzalez-Agirre |
| Robustness Evaluation of the German Extractive Question Answering Task
Shalaka Satheesh, Katharina Beckh, Katrin Klug, Héctor Allende-Cid, Sebastian Houben and Teena Hassan |
| Enhancing Multimodal Named Entity Recognition through Adaptive Mixup Image Augmentation
Bo Xu, Haiqi Jiang, Jie Wei, Hongyu Jing, Ming Du, Hui Song, Hongya Wang and Yanghua Xiao |
| Bridging Modality Gap for Effective Multimodal Sentiment Analysis in Fashion-related Social Media
Zheyu Zhao, Zhongqing Wang, Shichen Li, Hongling Wang and Guodong Zhou |
| Quality Beyond A Glance: Revealing Large Quality Differences Between Web-Crawled Parallel Corpora
Rik van Noord, Miquel Esplà-Gomis, Malina Chichirau, Gema Ramírez-Sánchez and Antonio Toral |
| MLLM-I2W: Harnessing Multimodal Large Language Model for Zero-Shot Composed Image Retrieval
Tong Bao, Che Liu, Derong Xu, Zhi Zheng and Tong Xu |
| Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech
Jonathan Heitz, Gerold Schneider and Nicolas Langer |
| Does Vision Accelerate Hierarchical Generalization in Neural Language Learners?
Tatsuki Kuribayashi and Timothy Baldwin |
| Efficient Solutions For An Intriguing Failure of LLMs: Long Context Window Does Not Mean LLMs Can Analyze Long Sequences Flawlessly
Peyman Hosseini, Ignacio Castro, Iacopo Ghinassi and Matthew Purver |
| MLD-EA: Check and Complete Narrative Coherence by Introducing Emotions and Actions
Jinming Zhang and Yunfei Long |
| SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization
Kohei Tsuji, Tatsuya Hiraoka, yuchang cheng and Tomoya Iwakura |
| Rethinking Long Context Generation from the Continual Learning Perspective
Zeyuan Yang, Fangzhou Xiong, Peng Li and Yang Liu |
| LTRS: Improving Word Sense Disambiguation via Learning to Rank Senses
Hansi Wang, Yue Wang, Qiliang Liang and Yang Liu |
| Are Your Keywords Like My Queries? A Corpus-Wide Evaluation of Keyword Extractors with Real Searches
Martina Galletti, Giulio Prevedello, Emanuele Brugnoli, Donald Ruggiero Lo Sardo and Pietro Gravino |
| NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers
Angel Yahir Loredo Lopez, Tyler McDonald and Ali Emami |
| How Well Can Large Language Models Reflect? A Human Evaluation of LLM-generated Reflections for Motivational Interviewing Dialogues
Erkan Basar, Xin Sun, Iris Hendrickx, Jan de Wit, Tibor Bosse, Gert-Jan De Bruijn, Jos A. Bosch and Emiel Krahmer |
| Rethinking the Alignment of Psychotherapy Dialogue Generation with Motivational Interviewing Strategies
Xin Sun, Xiao Tang, Abdallah El Ali, Zhuying Li, Pengjie Ren, Jan de Wit, Jiahuan Pei and Jos A.Bosch |
| Enhancing Zero-shot Chain of Thought Prompting via Uncertainty-Guided Strategy Selection
Shanu Kumar, Saish Mendke, Karody Lubna Abdul Rahman, Santosh Kurasa, Parag Agrawal and Sandipan Dandapat |
| Word-level Cross-lingual Structure in Large Language Models
Zihao Feng, Hailong Cao, Wang Xu and Tiejun Zhao |
| Trucidator: Document-level Event Factuality Identification via Hallucination Enhancement and Cross-Document Inference
Zihao Zhang, Zhong Qian, Xiaoxu Zhu, Peifeng Li and Qiaoming Zhu |
| RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword Generation
Andrei-Marius Avram, Mircea Timpuriu, Andreea Iuga, Vlad-Cristian Matei, Iulian-Marius Taiatu, Tudor Găină, Dumitru-Clementin Cercel, Mihaela-Claudia Cercel and Florin Pop |
| From Detection to Explanation: Effective Learning Strategies for LLMs in Online Abusive Language Research
Chiara Di Bonaventura, Lucia Siciliani, Pierpaolo Basile, Albert Merono Penuela and Barbara McGillivray |
| TEEMIL : Towards Educational MCQ Difficulty Estimation in Indic Languages
Manikandan Ravikiran, Siddharth Vohra, Rajat Verma, Rohit Saluja and Arnav Bhavsar |
| What’s Wrong? Refining Meeting Summaries with LLM Feedback
Frederic Thomas Kirstein, Terry Lima Ruas and Bela Gipp |
| Scene Graph and Dependency Grammar Enhanced Remote Sensing Change Caption Network (SGD-RSCCN)
Qiaoli Sun, Yan Wang and Xiaoyu Song |
| Looking at the Unseen: Effective Sampling of Non-Related Propositions for Argument Mining
Ramon Ruiz-Dolz, Debela Gemechu, Zlata Kikteva and Chris Reed |
| "Not Aligned” is Not "Malicious”: Being Careful about Hallucinations of Large Language Models’ Jailbreak
Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Jiayi Mao and Xueqi Cheng |
| From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme
Marie Mikulova, Barbora Štěpánková and Jan Štěpánek |
| Enhancing Long-range Dependency with State Space Model and Kolmogorov-Arnold Networks for Aspect-based Sentiment Analysis
Adamu Lawan, Juhua Pu, Haruna Yunusa, Aliyu Umar and Muhammad Lawan |
| ROUGE-SciQFS: A ROUGE-based Method to Automatically Create Datasets for Scientific Query-Focused Summarization
Juan Ramirez-Orta, Ana Maguitman, Axel J. Soto and Evangelos Milios |
| Commonsense Subgraph for Inductive Relation Reasoning with Meta-learning
Feng Zhao, Zhilu Zhang, Cheng Yan and Xianggan Liu |
| Clear Up Confusion: Iterative Differential Generation for Fine-grained Intent Detection with Contrastive Feedback
Feng Zhang, Wei Chen, Meng Gao, Fei Ding, Tengjiao Wang, jiahui Yao and jiabin zheng |
| Leveraging Explicit Reasoning for Inference Integration in Commonsense-Augmented Dialogue Models
Sarah E. Finch and Jinho D. Choi |
| Integrating Group-based Preferences from Coarse to Fine for Cold-start Users Recommendation
Siyu Wang, Jianhui Jiang, Jiangtao Qiu and Shengran Dai |
| Automatic Multiple-Choice Question Generation and Evaluation Systems Based on LLM: A Study Case With University Resolutions
Sérgio Silva Mucciaccia, Thiago Meireles Paixão, Filipe Wall Mutz, Claudine Santos Badue, Alberto Ferreira de Souza and Thiago Oliveira-Santos |
| Generating Commonsense Reasoning Questions with Controllable Complexity through Multi-step Structural Composition
Jianxing Yu, Shiqi Wang, Hanjiang Lai, Wenqing Chen, Yanghui Rao, Qinliang Su and Jian Yin |
| DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation
Minzhi Li, Zhengyuan Liu, Shumin Deng, Shafiq Joty, Nancy Chen and Min-Yen Kan |
| Towards Faithful Multi-step Reasoning through Fine-Grained Causal-aware Attribution Reasoning Distillation
Zheng Chu, Jingchang Chen, Zhongjie Wang, Guo Tang, Qianglong Chen, ming liu and Bing Qin |
| AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations
Qian Tao, Wenyuan Yu and Jingren Zhou |
| E-Bench: Towards Evaluating the Ease-of-Use of Large Language Models
Zhenyu Zhang, Bingguang Hao, Jinpeng Li, Zekai Zhang and Dongyan Zhao |
| Enhancing Online Grooming Detection via Backtranslation Augmentation
Hamed Waezi and Hossein Fani |
| CausalScore: An Automatic Reference-Free Metric for Assessing Response Relevance in Open-Domain Dialogue Systems
Tao Feng, Lizhen Qu, Xiaoxi Kang and Gholamreza Haffari |
| Exploring the Impact of Language Switching on Personality Traits in LLMs
Jacopo Amidei, Jose Gregorio Ferreira De Sá, Rubén Nieto Luna and Andreas Kaltenbrunner |
| LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation
Keheng Wang, Feiyu Duan, Peiguang Li, Sirui Wang and Xunliang Cai |
| Chain-of-Specificity: Enhancing Task-Specific Constraint Adherence in Large Language Models
Kaiwen Wei, Jiang Zhong, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, li jin, Yue Yu and Jingyuan Zhang |
| How Transliterations Improve Crosslingual Alignment
Yihong Liu, Mingyang Wang, Amir Hossein Kargaran, Ayyoob ImaniGooghari, Orgest Xhelili, Haotian Ye, Chunlan Ma, François Yvon and Hinrich Schütze |
| GL-GAN: Perceiving and Integrating Global and Local Styles for Handwritten Text Generation with Mamba
Yiming Wang, Hongxi Wei, Heng Wang, Shiwen Sun and Chao He |
| Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering
Pascal Tilli and Ngoc Thang Vu |
| From Multiple-Choice to Extractive QA: A Case Study for English and Arabic
Teresa Lynn, Malik H. Altakrori, Samar M. Magdy, Rocktim Jyoti Das, Chenyang Lyu, Mohamed Nasr, Younes Samih, Kirill Chirkunov, Alham Fikri Aji, Preslav Nakov, Shantanu Godbole, Salim Roukos, Radu Florian and Nizar Habash |
| Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
Tianyu Peng and Jiajun Zhang |
| DialogueMMT: Dialogue Scenes Understanding Enhanced Multi-modal Multi-task Tuning for Emotion Recognition in Conversations
ChenYuan He, Senbin Zhu, Hongde Liu, Fei Gao, Yuxiang Jia, Hongying Zan and Min Peng |
| Learning Transition Patterns by Large Language Models for Sequential Recommendation
Jianyang Zhai, Zi-Feng Mai, Dongyi Zheng, Chang-Dong Wang, Xiawu Zheng, Hui Li, Feidiao Yang and Yonghong Tian |
| Aligning Large Language Models with Human Opinions through Persona Selection and Value–Belief–Norm Reasoning
Xuan Long Do, Kenji Kawaguchi, Min-Yen Kan and Nancy Chen |
| MiMoTable: A Multi-scale Spreadsheet Benchmark with Meta Operations for Table Reasoning
Zheng Li, Yang Du, Mao Zheng and Mingyang Song |
| Implicit Discourse Relation Classification For Nigerian Pidgin
Muhammed Yahia Gaffar Saeed Saeed, Peter Bourgonje and Vera Demberg |
| How Many Languages Make Good Multilingual Instruction Tuning? A Case Study on BLOOM
Shaoxiong Ji and Pinzhen Chen |
| Gradient Inversion Attack in Federated Learning: Exposing Text Data through Discrete Optimization
Ying Gao, Yuxin Xie, Huanghao Deng and Zukun Zhu |
| Simulating Dual-Process Thinking in Dialogue Topic Shift Detection
huiyao wang, Peifeng Li, Yaxin Fan and Qiaoming Zhu |
| A Compliance Checking Framework Based on Retrieval Augmented Generation
Jingyun Sun, Zhongze Luo and Yang Li |
| MIDLM: Multi-Intent Detection with Bidirectional Large Language Models
Shangjian Yin, Peijie Huang and Yuhong Xu |
| ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
Chenyang Song, Xu Han, Zhengyan Zhang, Shengding Hu, Xiyu Shi, Kuai Li, Chen Chen, Zhiyuan Liu, Guangli Li, Tao Yang and Maosong Sun |
| Reasoning-Oriented and Analogy-Based Methods for Locating and Editing in Zero-Shot Event-Relational Reasoning
Jingyao Tang, Lishuang Li, Liteng Mi, Haiming Wu and Hongbin Lu |
| Leveraging Language Models for Summarizing Mental State Examinations: A Comprehensive Evaluation and Dataset Release
Nilesh Kumar Sahu, Manjeet Yadav, Mudita Chaturvedi, Snehil Gupta and Haroon R. Lone |
| Oddballness: universal anomaly detection with language models
Filip Gralinski, Ryszard Staruch and Krzysztof Jurkiewicz |
| CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models
Zhongzhi Li, Ming-Liang Zhang, Pei-Jie Wang, Jian Xu, Rui-Song Zhang, Yin Fei, Zhi-Long Ji, Jin-Feng Bai, Zhen-Ru Pan, Jiaxin Zhang and Cheng-Lin Liu |
| Efficient Tool Use with Chain-of-Abstraction Reasoning
Silin Gao, Jane Dwivedi-Yu, Ping Yu, Xiaoqing Ellen Tan, Ramakanth Pasunuru, Olga Golovneva, Koustuv Sinha, Asli Celikyilmaz, Antoine Bosselut and Tianlu Wang |
| Enhancing Arabic NLP Tasks through Character-Level Models and Data Augmentation
Mohanad Mohamed and Sadam Al-Azani |
| The Gaps between Fine Tuning and In-context Learning in Bias Evaluation and Debiasing
Masahiro Kaneko, Danushka Bollegala and Timothy Baldwin |
| LLM Sensitivity Challenges in Abusive Language Detection: Instruction-Tuned vs. Human Feedback
Yaqi Zhang, Viktor Hangya and Alexander Fraser |
| Improving Automatic Grammatical Error Annotation for Chinese Through Linguistically-Informed Error Typology
Yang Gu, Zihao Huang, Min Zeng, Mengyang Qiu and Jungyeul Park |
| Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach
Daiki Shirafuji, Makoto Takenaka and Shinya Taguchi |
| Topology-of-Question-Decomposition: Enhancing Large Language Models with Information Retrieval for Knowledge-Intensive Tasks
Weijie Li, Jin Wang, Liang-Chih Yu and Xuejie Zhang |
| t-HNE: A Text-guided Hierarchical Noise Eliminator for Multimodal Sentiment Analysis
Zuocheng Li and Lishuang Li |
| ALYMPICS: LLM Agents Meet Game Theory
Shaoguang Mao, Yuzhe Cai, Yan Xia, Wenshan Wu, Xun Wang, Fengyi Wang, Qiang Guan, Tao Ge and Furu Wei |
| Towards Adaptive Mechanism Activation in Language Agent
Ziyang Huang, Jun Zhao and Kang Liu |
| Scaffolding Coordinates to Promote Vision-Language Coordination in Large Multi-Modal Models
Xuanyu Lei, Zonghan Yang, Xinrui Chen, Peng Li and Yang Liu |
| Retrieval Augmented Instruction Tuning for Open NER with Large Language Models
Tingyu Xie, Jian Zhang, Yan Zhang, Yuanyuan Liang, Qi Li and Hongwei Wang |
| Rethinking Vocabulary Augmentation: Addressing the Challenges of Low-Resource Languages in Multilingual Models
Nankai Lin, Peijian Zeng, Weixiong Zheng, Shengyi JIANG, Dong Zhou and Aimin Yang |
| Hawkes based Representation Learning for Reasoning over Scale-free Community-structured Temporal Knowledge Graphs
Yuwei Du, Xinyue Liu, Wenxin Liang, Linlin Zong and Xianchao Zhang |
| Intention Analysis Makes LLMs A Good Jailbreak Defender
Yuqi Zhang, Liang Ding, Lefei Zhang and Dacheng Tao |
| Towards Understanding Multi-Task Learning (Generalization) of LLMs via Detecting and Exploring Task-Specific Neurons
Yongqi Leng and Deyi Xiong |
| Do Large Language Models Mirror Cognitive Language Processing?
Yuqi Ren, Renren Jin, Tongxuan Zhang and Deyi Xiong |
| SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models with Customisable Fairness Calibration
Xin Guan, Nate Demchak, Saloni Gupta, Ze Wang, Ediz Ertekin Jr., Adriano Koshiyama, Emre Kazim and Zekun Wu |
| Learning to Reason via Self-Iterative Process Feedback for Small Language Models
Kaiyuan Chen, Jin Wang and Xuejie Zhang |
| Rethinking-based Code Summarization with Chain of Comments
Liuwen Cao, Hongkui He, Hailin Huang, Jiexin Wang and Yi Cai |
| RGR-KBQA: Generating Logical Forms for Question Answering Using Knowledge-Graph-Enhanced Large Language Model
Tengfei Feng and Liang He |
| To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation
Abdul Hameed Azeemi, Ihsan Ayyub Qazi and Agha Ali Raza |
| LLM Sensitivity Evaluation Framework for Clinical Diagnosis
Chenwei Yan, Xiangling Fu, Yuxuan Xiong, Tianyi Wang, Siu Cheung Hui, Ji Wu and Xien Liu |
| Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Zijun Chen, Wenbo Hu, Guande He, Zhijie Deng, ZHeng ZHang and Richang Hong |
| Unifying Dual-Space Embedding for Entity Alignment via Contrastive Learning
Cunda Wang, Weihua Wang, Qiuyu Liang, Feilong Bao and Guanglai Gao |
| Aspect-Based Sentiment Analysis with Syntax-Opinion-Sentiment Reasoning Chain
Rui Fan, Shu Li, Tingting He and Yu Liu |
| Reasoning with Trees: Faithful Question Answering over Knowledge Graph
Tiesunlong Shen, Jin Wang, Xuejie Zhang and Erik Cambria |
| Revisiting Jailbreaking for Large Language Models: A Representation Engineering Perspective
Tianlong Li, Zhenghua Wang, Wenhao Liu, Muling Wu, Shihan Dou, Changze Lv, Xiaohua Wang, Xiaoqing Zheng and Xuanjing Huang |
| Lexicography Saves Lives (LSL): Automatically Translating Suicide-Related Language
Annika Marie Schoene, John E. Ortega, Rodolfo Joel Zevallos and Laura Haaber Ihle |
| Enhancing Emotional Support Conversations: A Framework for Dynamic Knowledge Filtering and Persona Extraction
Jiawang Hao and Fang Kong |
| SKIntern: Internalizing Symbolic Knowledge for Distilling Better CoT Capabilities into Small Language Models
Huanxuan Liao, Shizhu He, Yupu Hao, Xiang Li, Yuanzhe Zhang, Jun Zhao and Kang Liu |
| TermDiffuSum: A Term-guided Diffusion Model for Extractive Summarization of Legal Documents
Xiangyun Dong, Wei Li, Yuquan Le, Zhangyue Jiang, Junxi Zhong and Zhong Wang |
| COF: Adaptive Chain of Feedback for Comparative Opinion Quintuple Extraction
Qingting Xu, Kaisong Song, Chaoqun Liu, Yangyang Kang, Xiabing Zhou, Jun Lin and Yu Hong |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity
Xiaqiang Tang, Qiang Gao, Jian Li, Nan Du, Qi Li and Sihong Xie |
| Improvement in Sign Language Translation Using Text CTC Alignment
Sihan Tan, Taro Miyazaki, Nabeela Khan and Kazuhiro Nakadai |
| Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Zongru Wu, Pengzhou Cheng, Lingyong Fang, Zhuosheng Zhang and Gongshen Liu |
| MQM-Chat: Multidimensional Quality Metrics for Chat Translation
Yunmeng Li, Jun Suzuki, Makoto Morishita, Kaori Abe and Kentaro Inui |
| Intent Contrastive Learning Based on Multi-view Augmentation for Sequential Recommendation
Bo Pei, Yingzheng Zhu, Guangjin Wang, Huajuan Duan, Wenya Wu, Fuyong Xu, Yizhao Zhu, Peiyu Liu and Ran Lu |
| Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation
Siyuan Wang, Zhuohan Long, Zhihao Fan, Xuanjing Huang and Zhongyu Wei |
| Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection
Dmitri Roussinov, Serge Sharoff and Nadezhda Puchnina |
| Finetuning LLMs for Comparative Assessment Tasks
Vatsal Raina, Adian Liusie and Mark Gales |
| Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea
Eunjung Cho, Won Ik Cho and Soomin Seo |
| CycleOIE: A Low-Resource Training Framework For Open Information Extraction
Zhihong Jin, Chunhong Zhang, Zheng Hu, Jibin Yu, Ruiqi Ma, Qingyun Chen, Xiaohao Liao and Yanxing Zhang |
| AHVE-CNER: Aligned Hanzi Visual Encoding Enhance Chinese Named Entity Recognition with Multi-Information
Xuhui Zheng, Zhiyuan Min, Bin Shi and Hao Wang |
| Edit-Wise Preference Optimization for Grammatical Error Correction
Jiehao Liang, Haihui Yang, Shiping Gao and Xiaojun Quan |
| You Only Query Twice: Multimodal Rumor Detection via Evidential Evaluation from Dual Perspectives
Junyi Chen, Leyuan Liu, Tian Lan, Fan Zhou and Xiaosong Zhang |
| On Evaluation Protocols for Data Augmentation in a Limited Data Scenario
Frédéric Piedboeuf and Philippe Langlais |
| Context-Informed Machine Translation of Manga using Multimodal Large Language Models
Philip Lippmann, Konrad Skublicki, Joshua Tanner, Shonosuke Ishiwatari and Jie Yang |
| Large Language Model as a Teacher for Zero-shot Tagging at Extreme Scales
Jinbin Zhang, Nasib Ullah and Rohit Babbar |
| NovAScore: A New Automated Metric for Evaluating Document Level Novelty
Lin Ai, Ziwei Gong, Harshsaiprasad Deshpande, Alexander Johnson, Emmy Phung, Ahmad Emami and Julia Hirschberg |
| HLU: Human Vs LLM Generated Text Detection Dataset for Urdu at Multiple Granularities
Iqra Ali, Jesse Atuhurra, Hidetaka Kamigaito and Taro Watanabe |
| Embedding Style Beyond Topics: Analyzing Dispersion Effects Across Different Language Models
Benjamin Icard, Evangelia Zve, Lila Sainero, Alice Breton and Jean-Gabriel Ganascia |
| Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele, Grigori Sidorov, Dietrich Klakow, Philip Slusallek, Olga Kolesnikova and Seid Muhie Yimam |
| Knowledge Graph Unlearning with Schema
Yang Xiao, Ruimeng Ye and Bo Hui |
| Assessing the Human Likeness of AI-Generated Counterspeech
Xiaoying Song, Sujana Mamidisetty, Eduardo Blanco and Lingzi Hong |
| Discarding the Crutches: Adaptive Parameter-Efficient Expert Meta-Learning for Continual Semantic Parsing
Ruiheng Liu, Jinyu Zhang, Yanqi Song, Yu Zhang and Bailong Yang |
| Improving Multilingual Sign Language Translation with Automatically Clustered Language Family Information
Ruiquan Zhang, Cong Hu, Pei Yu and Yidong Chen |
| Is Peer-Reviewing Worth the Effort?
Kenneth Ward Church, Raman Chandrasekar, John E. Ortega and Ibrahim Said Ahmad |
| OptiPrune: Effective Pruning Approach for Every Target Sparsity
Khang Nguyen Le, Ryo Sato, Dai Nakashima, Takeshi Suzuki and Minh Le Nguyen |
| ChatCite: LLM Agent with Human Workflow Guidance for Comparative Literature Summary
Yutong Li, Lu Chen, Aiwei Liu, Kai Yu and Lijie Wen |
| Paraphrase Makes Perfect: Leveraging Expression Paraphrase to Improve Implicit Sentiment Learning
Xia Li, Junlang Wang, Yongqiang Zheng, Yuan Chen and Yangjia Zheng |
| Not Every Metric is Equal: Cognitive Models for Predicting N400 and P600 Components During Reading Comprehension
Lavinia Salicchi and Yu-Yin Hsu |
| Multilingual Supervision Improves Semantic Disambiguation of Adpositions
Wesley Scivetti, Lauren Levine and Nathan Schneider |
| Empirical Study of Zero-shot Keyphrase Extraction with Large Language Models
Byungha Kang and Youhyun Shin |
| Investigating the Impact of Incremental Processing and Voice Activity Projection on Spoken Dialogue Systems
Yuya Chiba and Ryuichiro Higashinaka |
| Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
Ruiyang Ren, Yuhao Wang, Yingqi Qu, Wayne Xin Zhao, Jing Liu, Hua Wu, Ji-Rong Wen and Haifeng Wang |
| Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels
Chaoqun Liu, Qin Chao, Wenxuan Zhang, Xiaobao Wu, Boyang Li, Anh Tuan Luu and Lidong Bing |
| Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models
Anmol Reddy Mekala, Vineeth Dorna, Shreya Dubey, Abhishek Lalwani, David Koleczek, Mukund Rungta, Sadid A. Hasan and Elita A.A Lobo |
| Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models
Mingyang Song, Mao Zheng and Xuan Luo |
| Personalized Large Language Model Assistant with Evolving Conditional Memory
Ruifeng Yuan, Shichao Sun, Yongqi Li, zili Wang, Ziqiang Cao and Wenjie Li |
| ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training
Zhouqiang Jiang, Bowen Wang, Junhao Chen and Yuta Nakashima |
| Gen-SQL: Efficient Text-to-SQL By Bridging Natural Language Question And Database Schema With Pseudo-Schema
Jie Shi, Bo Xu, Jiaqing Liang, Yanghua Xiao, Jia Chen, Chenhao Xie, Peng Wang and Wei Wang |
| Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive Ziji
Xiulin Yang |
| HyperHatePrompt: A Hypergraph-based Prompting Fusion Model for Multimodal Hate Detection
Bo Xu, Erchen Yu, Jiahui Zhou, Hongfei LIN and Linlin Zong |
| GenWebNovel: A Genre-oriented Corpus of Entities in Chinese Web Novels
Hanjie Zhao, Yuchen Yan, Senbin Zhu, Hongde Liu, Yuxiang Jia, Hongying Zan and Min Peng |
| Automated Progressive Red Teaming
Bojian Jiang, Yi Jing, Tong Wu, Tianhao Shen, Deyi Xiong and Qing Yang |
| Rumor Detection on Social Media with Temporal Propagation Structure Optimization
Xingyu Peng, Junran Wu, Ruomei Liu and Ke Xu |
| Revisiting Implicitly Abusive Language Detection: Evaluating LLMs in Zero-Shot and Few-Shot Settings
Julia Jaremko, Dagmar Gromann and Michael Wiegand |
| Grading Massive Open Online Courses Using Large Language Models
Shahriar Golchin, Nikhil Garuda, Christopher Impey and Matthew Wenger |
| Decoding Echo Chambers: LLM-Powered Simulations Revealing Polarization in Social Networks
Chenxi Wang, Zongfang Liu, Dequan Yang and Xiuying Chen |
| Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace
Jia-Chen Zhang, Yu-Jie Xiong, Chun-Ming Xia, Dong-Hai Zhu and Xi-He Qiu |
| StoryLLaVA: Enhancing Visual Storytelling with Multi-Modal Large Language Models
Li Yang, Zhiding Xiao, Wenxin Huang and Xian Zhong |
| Aligning Complex Knowledge Graph Question Answering as Knowledge-Aware Constrained Code Generation
Prerna Agarwal, Nishant Kumar and Srikanta Bedathur Jagannath |
| KnowledgePrompts: Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting
Thilini Wijesiriwardene, Ruwan Wickramarachchi, Sreeram Reddy Vennam, Vinija Jain, Aman Chadha, Amitava Das, Ponnurangam Kumaraguru and Amit Sheth |
| Unified Grid Tagging Scheme for Aspect Sentiment Quad Prediction
Guixin Su, Yongcheng Zhang, Tongguan Wang, Mingmin Wu and Ying Sha |
| Claim veracity assessment for explainable fake news detection
Bassamtiano Renaufalgi Irnawan, Sheng Xu, Noriko Tomuro, Fumiyo Fukumoto and Yoshimi Suzuki |
| ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models
Xiechi Zhang, Shunfan Zheng, Linlin Wang, Gerard de Melo, Zhu Cao, xiaoling Wang and Liang He |
| A Dual Contrastive Learning Framework for Enhanced Multimodal Conversational Emotion Recognition
Yunhe XIE, Chengjie Sun, Ziyi Cao, Bingquan Liu, zhenzhou Ji, Yuanchao Liu and Lili Shan |
| Can LLMs Clarify? Investigation and Enhancement of Large Language Models on Argument Claim Optimization
Yiran Wang, Ben He, Xuanang Chen and Le Sun |
| Generation-Augmented and Embedding Fusion in Document-Level Event Argument Extraction
Xingjian Lin, Shengfei Lyu, Xin Wang, Qiuju Chen and Huanhuan Chen |
| C3LRSO: A Chinese Corpus for Complex Logical Reasoning in Sentence Ordering
Xiaotao Guo, Jiang Li, Xiangdong Su and Fujun Zhang |
| KIA: Knowledge-Guided Implicit Vision-Language Alignment for Chest X-Ray Report Generation
Heng Yin, Shanlin Zhou, Pandong Wang, Zirui Wu and Yongtao Hao |
| On the Human-level Performance of Visual Question Answering
Chenlian Zhou, Guanyi Chen, Xin Bai and Ming Dong |
| Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models
Dahyun Kim, Sukyung Lee, Yungi Kim, Attapol Rutherford and Chanjun Park |
| CONTRANS: Weak-to-Strong Alignment Engineering via Concept Transplantation
Weilong Dong, Xinwei Wu, Renren Jin, Shaoyang Xu and Deyi Xiong |
| Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen, Xiang Li, Xiaojun Ye, Chao Li, Zhaoxin Fan and Hao Zhao |
| Learning from Impairment: Leveraging Insights from Clinical Linguistics in Language Modelling Research
Dominique Brunato |
| Efficient Cross-modal Prompt Learning with Semantic Enhancement for Domain-robust Fake News Detection
Fei Wu, Hao Jin, Changhui Hu, Yimu Ji, Xiao-Yuan Jing and Guo-Ping Jiang |
| AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
Basel Mousi, Nadir Durrani, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain, Tameem Kabbani, Fahim Dalvi, Shammur Absar Chowdhury and Firoj Alam |
| Distance-Adaptive Quaternion Knowledge Graph Embedding with Bidirectional Rotation
Weihua Wang, Qiuyu Liang, Feilong Bao and Guanglai Gao |
| How Credible Is an Answer From Retrieval-Augmented LLMs? Investigation and Evaluation With Multi-Hop QA
Yujia Zhou, Zheng Liu and Zhicheng Dou |
| Is Parameter Collision Hindering Continual Learning in LLMs?
Shuo Yang, Kun-Peng Ning, Yu-Yang Liu, Jia-Yu Yao, Yong-Hong Tian, Yi-Bing Song and Li Yuan |
| Jump To Hyperspace: Comparing Euclidean and Hyperbolic Loss Functions for Hierarchical Multi-Label Text Classification
Jens Van Nooten and Walter Daelemans |
| Exploring the Limitations of Detecting Machine-Generated Text
Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov and Zeerak Talat |
| Boosting Text-to-SQL through Multi-grained Error Identification
Bo Xu, Shufei Li, Hongyu Jing, Ming Du, Hui Song, Hongya Wang and Yanghua Xiao |
| Know When to Fuse: Investigating Non-English Hybrid Retrieval in the Legal Domain
Antoine Louis, Gijs van Dijck and Gerasimos Spanakis |
| MPID: A Modality-Preserving and Interaction-Driven Fusion Network for Multimodal Sentiment Analysis
Tianyi Li and Daming Liu |
| Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models
Patrick Amadeus Irawan, Genta Indra Winata, Samuel Cahyawijaya and Ayu Purwarianti |
| DefVerify: Do Hate Speech Models Reflect Their Dataset’s Definition?
Urja Khurana, Eric Nalisnick and Antske Fokkens |
| Fusion meets Function: The Adaptive Selection-Generation Approach in Event Argument Extraction
Guoxuan Ding, Xiaobo Guo, Xin Wang, Lei Wang, Tianshu Fu, Nan Mu and Daren Zha |
| ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval
Antoine Louis, Vageesh Kumar Saxena, Gijs van Dijck and Gerasimos Spanakis |
| TEXT-CAKE: Challenging Language Models on Local Text Coherence
Luca Dini, Dominique Brunato, Felice Dell’Orletta and Tommaso Caselli |
| KVFKT: A New Horizon in Knowledge Tracing with Attention-Based Embedding and Forgetting Curve Integration
Quanlong Guan, Xiuliang Duan, Kaiquan Bian, Guanliang Chen, Jianbo Huang, Zhiguo Gong and Liangda Fang |
| Fine-tuning Large Language Models for Improving Factuality in Legal Question Answering
Yinghao Hu, Leilei Gan, Wenyi Xiao, Kun Kuang and Fei Wu |
| Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
Xiaoye Qu, Jiashuo Sun, Wei Wei, Daizong Liu, Jianfeng Dong and Yu Cheng |
| Large Language Models are good multi-lingual learners : When LLMs meet cross-lingual prompts
Teng Wang, Zhenqi He, Wing-Yin Yu, Xiaojin Fu and Xiongwei Han |
| MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models
Zihao Wei, Jingcheng Deng, Liang Pang, Hanxing Ding, Huawei Shen and Xueqi Cheng |
| Factual Dialogue Summarization via Learning from Large Language Models
Rongxin Zhu, Jey Han Lau and Jianzhong Qi |
| QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
Mohammad Aflah Khan, Neemesh Yadav, Sarah Masud and Md. Shad Akhtar |
| GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Sacha Muller, Antonio Loison, Bilel Omrani and Gautier Viaud |
| Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models
Jiahui Li, Yongchang Hao, Haoyu Xu, Xing Wang and Yu Hong |
| Conditional Semantic Textual Similarity via Conditional Contrastive Learning
Xinyue Liu, Zeyang Qin, Zeyu Wang, Wenxin Liang, Linlin Zong and Bo Xu |
| A Survey of Code-switched Arabic NLP: Progress, Challenges, and Future Directions
Injy Hamed, Caroline Sabty, Slim Abdennadher, Ngoc Thang Vu, Thamar Solorio and Nizar Habash |
| Towards Database-Free Text-to-SQL Evaluation: A Graph-Based Metric for Functional Correctness
Yi Zhan, Longjie Cui, Han Weng, Guifeng Wang, Yu Tian, Boyi Liu, Yingxiang Yang, Xiaoming Yin, Jiajun Xie and Yang Sun |
| Modal Feature Optimization Network with Prompt for Multimodal Sentiment Analysis
xiangmin zhang, Wei Wei and Shihao Zou |
| Multimodal Fact-Checking with Vision Language Models: A Probing Classifier based Solution with Embedding Strategies
Recep Firat Cekinel, Pinar Karagoz and Çağrı Çöltekin |
| Faithful Inference Chains Extraction for Fact Verification over Multi-view Heterogeneous Graph with Causal Intervention
Daoqi Chen, Yaxin Li, Zizhong Zhu, Xiaowang Zhang and Zhiyong Feng |
| SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent
Jing Ye, Lu Xiang, Yaping Zhang and Chengqing Zong |
| ELAINE-medLLM: Lightweight English Japanese Chinese Trilingual Large Language Model for Bio-medical Domain
Ken Yano, Zheheng Luo, Jimin Huang, Qianqian Xie, Masaki Asada, Chenhan Yuan, Kailai Yang, Makoto Miwa, Sophia Ananiadou and Jun’ichi Tsujii |
| Debate-to-Write: A Persona-Driven Multi-Agent Framework for Diverse Argument Generation
Zhe Hu, Hou Pong Chan, Jing Li and Yu Yin |
| Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy
Min Zeng, Caiquan Liu, Shiqi Zhang, Li Xie, Chen Sang and Xiaoxin Chen |
| Slender-Mamba: Fully Quantized Mamba in 1.58 Bits From Head to Toe
Zhenxuan Yu, Takeshi Kojima, Yutaka Matsuo and Yusuke Iwasawa |
| What’s the most important value? INVP: INvestigating the Value Priorities of LLMs through Decision-making in Social Scenarios
Xuelin Liu, pengyuan liu and Dong Yu |
| BasqBBQ: A QA Benchmark for Assessing Social Biases in LLMs for Basque, a Low-Resource Language
Xabier Saralegi and Muitze Zulaika |
| DynRank: Improve Passage Retrieval with Dynamic Zero-Shot Prompting Based on Question Classification
Abdelrahman Elsayed Mahmoud Abdallah, Jamshid Mozafari, Bhawna Piryani, Mohammed M.Abdelgwad and Adam Jatowt |
| Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting
Ananya Mukherjee, Saumitra Yadav and Manish Shrivastava |
| The Shift from Logic to Dialectic in Argumentation Theory: Implications for Computational Argument Quality Assessment
Rositsa V Ivanova and Reto Gubelmann |
| Task-Oriented Dialog Systems for the Senegalese Wolof Language
Derguene Mbaye and Moussa Diallo |
| Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment
Jianfei Zhang, Jun Bai, Bei Li, Yanmeng Wang, Rumei Li, Chenghua Lin and Wenge Rong |
| A Survey of Generative Information Extraction
Zikang Zhang, Wangjie You, Tianci Wu, Xinrui Wang, Juntao Li and Min Zhang |
| Interactive Evaluation for Medical LLMs via Task-oriented Dialogue System
Ruoyu Liu, Kui Xue, Xiaofan Zhang and Shaoting Zhang |
| Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models
Haoran Lian, Junmin Chen, Wei Huang, Yizhe Xiong, Wenping Hu, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang and Di Zhang |
| ACL-rlg: A Dataset for Reading List Generation
Julien Aubert-Béduchaud, Florian Boudin, Béatrice Daille and Richard Dufour |
| SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding
Zhenglin Wang, Jialong Wu, Yilong Lai, Congzhi Zhang and Deyu Zhou |
| Extracting structure from an LLM - how to improve on surprisal-based models of Human Language Processing
Daphne P. Wang, Mehrnoosh Sadrzadeh, Miloš Stanojević, Wing-Yee Chow and Richard Breheny |
| Evaluating Generalization Capability of Language Models across Abductive, Deductive and Inductive Logical Reasoning
Yu Sheng, Wanting Wen, Linjing Li and Daniel Zeng |
| Measuring the Robustness of Reference-Free Dialogue Evaluation Systems
Justin Vasselli, Adam Nohejl and Taro Watanabe |
| Towards Robust Comparisons of NLP Models: A Case Study
Vicente Ivan Sanchez Carmona, Shanshan Jiang and Bin Dong |
| SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis
Senbin Zhu, ChenYuan He, Hongde Liu, Pengcheng Dong, Hanjie Zhao, Yuchen Yan, Yuxiang Jia, Hongying Zan and Min Peng |
| Enhancing Criminal Investigation Analysis with Summarization and Memory-based Retrieval-Augmented Generation: A Comprehensive Evaluation of Real Case Data
Mads Skipanes, Tollef Emil JÃ,rgensen, Kyle Porter, Gianluca Demartini and Sule Yildirim Yayilgan |
| Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction
Erwin Daniel Lopez Zapata, Cheng Tang and Atsushi Shimada |
| Evaluating Open-Source ASR Systems: Performance Across Diverse Audio Conditions and Error Correction Methods
Saki Imai, Tahiya Chowdhury and Amanda J. Stent |
| Large Language Models as an Indirect Reasoner: Contrapositive and Contradiction for Automated Reasoning
Yanfang Zhang, Yiliu Sun, Yibing Zhan, Dapeng Tao, Dacheng Tao and Chen Gong |
| Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges
Vinay Samuel, Yue Zhou and Henry Peng Zou |
| Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits
Bohan Li, Jiannan Guan, Longxu Dou, Yunlong Feng, Dingzirui Wang, Yang Xu, Enbo Wang, Qiguang Chen, Bichen Wang, Xiao Xu, Yimeng Zhang, Libo Qin, Yanyan Zhao, Qingfu Zhu and Wanxiang Che |
| TMATH A Dataset for Evaluating Large Language Models in Generating Educational Hints for Math Word Problems
Changyong Qi, Yuang Wei, Haoxin Xu, Longwei Zheng, Peiji Chen and Xiaoqing Gu |
| A Benchmark of French ASR Systems Based on Error Severity
Antoine Tholly, Jane Wottawa, Mickael Rouvier and Richard Dufour |
| What Makes Cryptic Crosswords Challenging for LLMs?
Abdelrahman Sadallah, Daria Kotova and Ekaterina Kochmar |
| Improving the Efficiency of Visually Augmented Language Models
Paula Ontalvilla, Aitor Ormazabal and Gorka Azkune |
| Refer to the Reference: Reference-focused Synthetic Automatic Post-Editing Data Generation
Sourabh Dattatray Deoghare, Diptesh Kanojia and Pushpak Bhattacharyya |
| EvoPrompt: Evolving Prompts for Enhanced Zero-Shot Named Entity Recognition with Large Language Models
Zeliang Tong, Zhuojun Ding and Wei Wei |
| MIT-10M: A Large Scale Parallel Corpus of Multilingual Image Translation
Bo Li, Shaolin Zhu and Lijie Wen |
| Synthetic Paths to Integral Truth: Mitigating Hallucinations Caused by Confirmation Bias with Synthetic Data
Changwon Ok, Eunkyeong Lee and Dongsuk Oh |
| Unlike "Likely", "Unlike" is Unlikely: BPE-based Segmentation hurts Morphological Derivations in LLMs
Paul Lerner and François Yvon |
| WIKIGENBENCH:Exploring Full-length Wikipedia Generation under Real-World Scenario
Jiebin Zhang, Eugene J. Yu, Qinyu Chen, Chenhao Xiong, Dawei Zhu, Han Qian, Mingbo Song, Weimin Xiong, Xiaoguang Li, Qun Liu and Sujian Li |
| LLMs meet Bloom’s Taxonomy: A Cognitive View on Large Language Model Evaluations
Thomas Huber and Christina Niklaus |
| Exploring Fine-Grained Human Motion Video Captioning
Bingchan Zhao, Xinyi Liu, Zhuocheng Yu, Tongchen Yang, Yifan Song, Mingyu Jin, Sujian Li and Yizhou Wang |
| DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
Jiaxuan Liu, Zhaoci Liu, Yajun Hu, Yingying Gao, Shilei Zhang and Zhenhua Ling |
| OpenForecast: A Large-Scale Open-Ended Event Forecasting Dataset
Zhen Wang, Xi Zhou, Yating Yang, Bo Ma, Lei Wang, Rui Dong and Azmat Anwar |
| A Knowledge Graph Reasoning-Based Model for Computerized Adaptive Testing
Xinyi Qiu and Zhiyun Chen |
| TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLM
Huiying Cao, Yiqun Zhang, Shi Feng, Xiaocui Yang, Daling Wang and Yifei Zhang |
| Annotating the French Wiktionary with supersenses for large scale lexical analysis: a use case to assess form-meaning relationships within the nominal lexicon
Nicolas Angleraud, Lucie Barque and Marie Candito |
| When Evolution Strategy Meets Language Models Tuning
Bo Huang, Yuxin Jiang, Mingyang Chen, Yi Wang, Hongyang Chen and Wei Wang |
| Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis
weitao ma, Xiaocheng Feng, weihong zhong, Lei Huang, yangfan ye, Xiachong Feng and bing qin |
| Knowledge Graph Pooling and Unpooling for Concept Abstraction
Juan Li, Wen Zhang, Zhiqiang Liu, Mingchen Tu, Mingyang Chen, Ningyu Zhang and Shijian Li |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation
Jia Gu, Liang Pang, Huawei Shen and Xueqi Cheng |
| Pseudo-label Data Construction Method and Syntax-enhanced Model for Chinese Semantic Error Recognition
Hongyan Wu, Nankai Lin, Shengyi JIANG, Lianxi Wang and Aimin Yang |
| An Active Learning Framework for Inclusive Generation by Large Language Models
Sabit Hassan, Anthony B. Sicilia and Malihe Alikhani |
| Multimodal Extraction and Recognition of Arabic Implicit Discourse Relations
Ahmed Ruby, Christian Hardmeier and Sara Stymne |
| Post-Hoc Watermarking for Robust Detection in Text Generated by Large Language Models
Jifei Hao, Jipeng Qiang, Yi Zhu, Yun Li, Yunhao Yuan and Xiaoye Ouyang |
| RA-MTR: A Retrieval Augmented Multi-Task Reader based Approach for Inspirational Quote Extraction from Long Documents
Sayantan Adak and Animesh Mukherjee |
| VeritasQA: A Truthfulness Benchmark Aimed at Multilingual Transferability
Javier Aula-Blasco, Júlia Falcão, Susana Sotelo, Silvia Paniagua, Aitor Gonzalez-Agirre and Marta Villegas |
| ECC: Synergizing Emotion, Cause and Commonsense for Empathetic Dialogue Generation
Xu Wang, Bo Wang, Yihong Tang, Dongming Zhao, jing liu, Ruifang He and Yuexian Hou |
| GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question Answering
Qianlong Li, Chen Huang, Shuai Li, Yuanxin Xiang, Deng Xiong and Wenqiang Lei |
| Persona-Consistent Dialogue Generation via Pseudo Preference Tuning
Junya Takayama, Masaya Ohagi, Tomoya Mizumoto and Katsumasa Yoshikawa |
| Montague semantics and modifier consistency measurement in neural language models
Danilo Silva de Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro and André Freitas |
| LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation
Hongyun Zhou, Xiangyu Lu, Wang Xu, Conghui Zhu, Tiejun Zhao and Muyun Yang |
| Leveraging Language-based Representations for Better Solving Symbol-related Problems with Large Language Models
Yile Wang, Sijie Cheng, Zixin Sun, Peng Li and Yang Liu |
| Towards Cross-Lingual Audio Abuse Detection in Low-Resource Settings with Few-Shot Learning
Aditya Narayan Sankaran, Reza Farahbakhsh and Noel Crespi |
| MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators
Qingyu Lu, Liang Ding, Kanjian Zhang, Jinxia Zhang and Dacheng Tao |
| MOPO: Multi-Objective Prompt Optimization for Affective Text Generation
Yarik Menchaca Resendiz and Roman Klinger |
| PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent
Jiateng Liu, Lin Ai, Zizhou Liu, Payam Karisani, zheng hui, Yi Fung, Preslav Nakov, Julia Hirschberg and Heng Ji |
| MQA-KEAL: Multi-hop Question Answering under Knowledge Editing for Arabic Language
Muhammad Asif Ali, Nawal Daftardar, Mutayyba Waheed, Jianbin Qin and Di Wang |
| A Novel Negative Sample Generation Method for Contrastive Learning in Hierarchical Text Classification
Juncheng Zhou, Lijuan Zhang, Yachen He, Rongli Fan, Lei Zhang and Jian Wan |
| Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs
Taiqiang Wu, Zhe Zhao, Jiahao Wang, Xingyu Bai, Lei Wang, Ngai Wong and Yujiu Yang |
| A Context-Aware Approach for Enhancing Data Imputation with Pre-trained Language Models
Ahatsham Hayat and Mohammad R. Hasan |
| Using Game Play to Investigate Multimodal and Conversational Grounding in Large Multimodal Models
Sherzod Hakimov, Yerkezhan Abdullayeva, Kushal Koshti, Antonia Schmidt, Yan Weiser, Anne Beyer and David Schlangen |
| PADO: Personality-induced multi-Agents for Detecting OCEAN in human-generated texts
Haein Yeo, Taehyeong Noh, Seungwan Jin and Kyungsik Han |
| Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
Taiqiang Wu, Chaofan Tao, Jiahao Wang, Runming Yang, Zhe Zhao and Ngai Wong |
| Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation
Zijie Zhong, Hanwen Liu, Xiaoya Cui, Xiaofan Zhang and Zengchang Qin |
| Multilingual Knowledge Editing with Language-Agnostic Factual Neurons
Xue Zhang, Yunlong Liang, Fandong Meng, Songming Zhang, Yufeng Chen, Jinan Xu and Jie Zhou |
| MURRE: Multi-Hop Table Retrieval with Removal for Open-Domain Text-to-SQL
xuanliang zhang, Dingzirui Wang, Longxu Dou, Qingfu Zhu and Wanxiang Che |
| Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election
Roberto Mondini, Neema Kotonya, Robert L Logan IV, Elizabeth M. Olson, Angela Oduor Lungati, Daniel Odongo, Tim Ombasa, Hemank Lamba, Aoife Cahill, Joel Tetreault and Alejandro Jaimes |
| On Evaluating LLMs’ Capabilities as Functional Approximators: A Bayesian Evaluation Framework
Shoaib Ahmed Siddiqui, Yanzhi Chen, Juyeon Heo, Menglin Xia and Adrian Weller |
| Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
Grace Proebsting and Adam Poliak |
| LLMs May Perform MCQA by Selecting the Least Incorrect Option
Haochun Wang, Sendong Zhao, Zewen Qiang, Nuwa Xi, Bing Qin and Ting Liu |
| Benchmark Creation for Aspect-Based Sentiment Analysis in Low-Resource Odia Language and Evaluation through Fine-Tuning of Multilingual Models
Lipika Dewangan, Zoyah Afsheen Sayeed and Chandresh Maurya |
| ADAPTIVE IE: Investigating the Complementarity of Human-AI Collaboration to Adaptively Extract Information on-the-fly
Ishani Mondal, Michelle Yuan, Anandhavelu N, Aparna Garimella, Francis Ferraro, Andrew Blair-Stanek, Benjamin Van Durme and Jordan Boyd-Graber |
| DAEA: Enhancing Entity Alignment in Real-World Knowledge Graphs Through Multi-Source Domain Adaptation
Linyan Yang, Shiqiao Zhou, Jingwei Cheng, Fu Zhang, Jizheng Wan, Shuo Wang and Mark Lee |
| CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialogues
Sebastian Steindl, Ulrich Schäfer and Bernd Ludwig |
| JMedBench: A Benchmark for Evaluating Japanese Biomedical Large Language Models
Junfeng Jiang, Jiahao Huang and Akiko Aizawa |
| Automated Detection of Tropes In Short Texts
Alessandra Flaccavento, Youri Peskine, Paolo Papotti, Riccardo Torlone and Raphael Troncy |
| WER We Stand: Benchmarking Urdu ASR Models
Samee Arif, Aamina Jamal Khan, Mustafa Abbas, Agha Ali Raza and Awais Athar |
| CHIFRAUD: A Long-term Web Text Dataset for Chinese Fraud Detection
Min Tang, Lixin Zou, Zhe Jin, ShuJie Cui, Shiuan Ni Liang and Weiqing Wang |
| CateEA: Enhancing Entity Alignment via Implicit Category Supervision
Guan Dong Feng, Tao Ren, Jun Hu and Dan dan Wang |
| Egalitarian Language Representation in Language Models: It All Begins with Tokenizers
Menan Velayuthan and Kengatharaiyer Sarveswaran |
| PIRsuader: A Persuasive Chatbot for Mitigating Psychological Insulin Resistance in Type-2 Diabetic Patients
Sujatha Das Gollapalli and See-Kiong Ng |
| Continual Learning Using Only Large Language Model Prompting
Jiabao Qiu, Zixuan Ke and Bing Liu |
| Empirical Study on Data Attributes Insufficiency of Evaluation Benchmarks for LLMs
Chuang Liu, Renren Jin, Zheng Yao, Tianyi Li, Liang Cheng, Mark Steedman and Deyi Xiong |
| Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas
Bastian Bunzeck, Daniel Duran, Leonie Schade and Sina Zarrieß |
| Evaluating Readability Metrics for German Medical Text Simplification
Karen Scholz and Markus Wenzel |
| Hi-GEC: Hindi Grammar Error Correction in Low Resource Scenario
Ujjwal Sharma and Pushpak Bhattacharyya |
| MuPe Life Stories Dataset: Spontaneous Speech in Brazilian Portuguese with a Case Study Evaluation on ASR Bias against Speakers Groups and Topic Modeling
Sidney Evaldo Leal, Arnaldo Candido Junior, Ricardo Marcacini, Edresson Casanova, Odilon Gonçalves, Anderson Silva Soares, Rodrigo Freitas Lima, Lucas Rafael Stefanel Gris and Sandra Aluísio |
| Multi-Layered Evaluation Using a Fusion of Metrics and LLMs as Judges in Open-Domain Question Answering
Rashin Rahnamoun and Mehrnoush Shamsfard |
| BERT-based Classical Arabic Poetry Authorship Attribution
Lama Alqurashi, Serge Sharoff, Janet Watson and Jacob Blakesley |
| It’s What You Say and How You Say It: Investigating the Effect of Linguistic vs. Behavioral Adaptation in Task-Oriented Chatbots
Lindsey Vanderlyn and Ngoc Thang Vu |
| VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation
Hyeonseok Lim, Dongjae Shin, Seohyun Song, Inho Won, Minjun Kim, Junghun Yuk, Haneol Jang and KyungTae Lim |
| LASS: A Novel and Economical Data Augmentation Framework Based on Language Models for Debiasing Opinion Summarization
Yanyue Zhang, Pengfei Li, Yilong Lai, Yulan He and Deyu Zhou |
| Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination
Eva Sánchez Salido, Roser Morante, Julio Gonzalo, Guillermo Marco, Jorge Carrillo-de-Albornoz, Laura Plaza, Enrique Amigo, Andrés Fernandez García, Alejandro Benito-Santos, Adrián Ghajari Espinosa and Victor Fresno |
| Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slips
Yingfa Chen, Chenlong Hu, Cong Feng, Chenyang Song, Shi Yu, Xu Han, Zhiyuan Liu and Maosong Sun |
| DROWN: Towards Tighter LiRPA-based Robustness Certification
Yunruo Zhang, Tianyu Du, Shouling Ji and Shanqing Guo |
| Large Language Models with Reinforcement Learning from Human Feedback Approach for Enhancing Explainable Sexism Detection
Ali Riahi Samani, Tianhao Wang, Kangshuo Li and Feng Chen |
| Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification
Shijing Chen, Mohamed Reda Bouadjenek, Usman Naseem, Basem Suleiman, Shoaib Jameel, Flora Salim, Hakim Hacid and Imran Razzak |
| Representation Purification for End-to-End Speech Translation
Chengwei Zhang, Yue Zhou, Rui Zhao, Yidong Chen and xiaodong shi |
| Semi-Automated Construction of Sense-Annotated Datasets for Practically Any Language
Jai Riley, Bradley M. Hauer, Nafisa Sadaf Hriti, Guoqing Luo, Amir Reza Mirzaei, Ali Rafiei, Hadi Sheikhi, Mahvash Siavashpour, Mohammad Tavakoli, Ning Shi and Grzegorz Kondrak |
| HYDEN: Hyperbolic Density Representations for Medical Images and Reports
Zhi Qiao, linbin han, Xiantong Zhen, Jiahong Gao and Zhen Qian |
| Towards Human Understanding of Paraphrase Types in Large Language Models
Dominik Meier, Jan Philip Wahle, Terry Lima Ruas and Bela Gipp |
| Just Read the Codebook! Make Use of Quality Codebooks in Zero-Shot Classification of Multilabel Frame Datasets
Mattes Ruckdeschel |
| NLP for preserving Torlak, a vulnerable low-resource Slavic language
Li Tang and Teodora Vuković |
| Analyzing the Attention Heads for Pronoun Disambiguation in Context-aware Machine Translation Models
Paweł Mąka, Yusuf Can Semerci, Jan Scholtes and Gerasimos Spanakis |
| ModaFact: Multi-paradigm Evaluation for Joint Event Modality and Factuality Detection
Marco Rovera, Serena Cristoforetti and Sara Tonelli |
| Why Does ChatGPT "Delve" So Much? Exploring the Sources of Lexical Overrepresentation in Large Language Models
Tom S Juzek and Zina B. Ward |
| Evaluating Pixel Language Models on Non-Standardized Languages
Alberto Muñoz-Ortiz, Verena Blaschke and Barbara Plank |
| LOLA – An Open-Source Massively Multilingual Large Language Model
Nikit Srivastava, Denis Kuchelev, Tatiana Moteu Ngoli, Kshitij Shetty, Michael Roeder, Hamada Zahera, Diego Moussallem and Axel-Cyrille Ngonga Ngomo |
| Cross-Lingual Sentence Compression for Length-Constrained Subtitles in Low-Resource Settings
Tollef Emil JÃ,rgensen and Ole Jakob Mengshoel |
| SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages
Gayane Ghazaryan, Erik Arakelyan, Isabelle Augenstein and Pasquale Minervini |
| Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models
Elie Antoine, Frederic Bechet and Phillippe Langlais |
| Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Yang Wang and Chenghua Lin |
| Acquired TASTE: Multimodal Stance Detection with Textual and Structural Embeddings
Guy Barel, Oren Tsur and Dan Vilenchik |
| IRUEX: A Study on Large Language Models Problem-Solving Skills in Iran’s University Entrance Exam
Hamed Khademi Khaledi and Heshaam Faili |
| data2lang2vec: Data Driven Typological Features Completion
Hamidreza Amirzadeh, sadegh jafari, Anika Harju and Rob van der Goot |
| Explanation Regularisation through the Lens of Attributions
Pedro Ferreira, Ivan Titov and Wilker Aziz |
| Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs
Guillermo Marco, Luz Rello and Julio Gonzalo |
| Generics are puzzling. Can language models find the missing piece?
Gustavo Cilleruelo, Emily Allaway, Barry Haddow and Alexandra Birch |
| Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models
Souvik Das, Lifeng Jin, Linfeng Song, Haitao Mi, Baolin Peng and Dong Yu |
| Iterative Structured Knowledge Distillation: Optimizing Language Models Through Layer-by-Layer Distillation
Malthe Have Musaeus and Rob van der Goot |
| Why do language models perform worse for morphologically complex languages?
Catherine Arnett and Benjamin Bergen |
| Argument Mining with Fine-Tuned Large Language Models
Jérémie Cabessa, Hugo Hernault and Umer Mushtaq |
| Beyond Surprisal: A Dual Metric Framework for Lexical Skill Acquisition in LLMs
Nazanin Shafiabadi and Guillaume Wisniewski |
| RUAccent: Advanced System for Stress Placement in Russian with Homograph Resolution
Denis Andreevich Petrov |
| On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
Mauricio Gruppi, Soham Dan, Keerthiram Murugesan and Subhajit Chaudhury |
| HateBRXplain: A Benchmark Dataset with Human-Annotated Rationales for Explainable Hate Speech Detection in Brazilian Portuguese
Isadora Salles, Francielle Vargas and Fabrício Benevenuto |
| LLM4RE: A Data-centric Feasibility Study for Relation Extraction
Anushka Swarup, Tianyu Pan, Ronald Wilson, Avanti Bhandarkar and Damon Woodard |
| Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation
Joanne Boisson, Zara Siddique, Hsuvas Borkakoty, Dimosthenis Antypas, Luis Espinosa Anke and Jose Camacho-Collados |
| Enhancing Retrieval-Augmented Generation: A Study of Best Practices
Siran Li, Linus Stenzel, Carsten Eickhoff and Seyed Ali Bahrainian |
| From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings
Aishik Rakshit, Smriti Singh, Shuvam Keshari, Arijit Ghosh Chowdhury, Vinija Jain and Aman Chadha |
| LaERC-S: Improving LLM-based Emotion Recognition in Conversation with Speaker Characteristics
Yumeng Fu, Junjie Wu, Zhongjie Wang, Meishan Zhang, Lili Shan, Yulin Wu and Bingquan Liu |
| Analysing Zero-Shot Readability-Controlled Sentence Simplification
Abdullah Barayan, Jose Camacho-Collados and Fernando Alva-Manchego |
| The Invalsi Benchmarks: measuring the Linguistic and Mathematical understanding of Large Language Models in Italian
Giovanni Puccetti, Maria Cassese and Andrea Esuli |
| RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback
Guoqing Chen, Fu Zhang, Jinghao Lin, Chenglong Lu and Jingwei Cheng |
| Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection
Beomseok Lee, Marco Gaido, Ioan Calapodescu, Laurent Besacier and Matteo Negri |
| Improving Accessibility of SCOTUS Opinions: A Benchmark Study and a New Dataset for Generic Heading Prediction and Specific Heading Generation
Malek Yaich and Nicolas Hernandez |
| SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts
Aihua Pei, Zehua Yang, Shunan Zhu, Ruoxi Cheng and Ju Jia |
| GLoCIM: Global-view Long Chain Interest Modeling for news recommendation
Zhen Yang, Wenhui Wang, Tao Qi, Peng Zhang, TianYun Zhang, Ru Zhang, Jianyi Liu and Yongfeng Huang |
| Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models
Xinyu Zhou, Delong Chen, Samuel Cahyawijaya, Xufeng Duan and Zhenguang Cai |
| MMD-ERE: Multi-Agent Multi-Sided Debate for Event Relation Extraction
Yong Guan, Hao Peng, Lei Hou and Juanzi Li |
| Cross Domain Classification of Education Talk Turns
Achyutarama R. Ganti, Steven R. Wilson and Geoffrey Louie Wing-Yue |
| Automated Molecular Concept Generation and Labeling with Large Language Models
Zimin Zhang, Qianli Wu, Botao Xia, Fang Sun, Ziniu Hu, Yizhou Sun and Shichang Zhang |
| URIEL+: Enhancing Linguistic Inclusion and Usability in a Typological and Multilingual Knowledge Base
Aditya Armaan Khan, Mason Stephen Shipton, David Anugraha, Kaiyao Duan, Phuong H. Hoang, Eric Khiu, A. Seza Doğruöz and Annie Lee |
| A Framework for Effective Invocation Methods of Various LLM Services
Can Wang, Dianbo Sui, Bolin Zhang, Xiaoyu Liu, Jiabao Kang, Zhidong Qiao and Zhiying Tu |
| DP-FROST: Differentially Private Fine-tuning of Pre-trained Models with Freezing Model Parameters
Daeyoung Hong, Woohwan Jung and Kyuseok Shim |
| Evaluating LLMs’ Capability to Identify Lexical Semantic Equivalence: Probing with the Word-in-Context Task
Yoshihiko Hayashi |
| Close or Cloze? Assessing the Robustness of Large Language Models to Adversarial Perturbations via Word Recovery
Luke Moffett and Bhuwan Dhingra |
| NüshuRescue: Reviving the Endangered Nüshu Language with AI
Ivory Yang, Weicheng Ma and Soroush Vosoughi |
| TOP-Training: Target-Oriented Pretraining for Medical Extractive Question Answering
Saptarshi Sengupta, Connor Heaton, Shreya Ghosh, Wenpeng Yin, Preslav Nakov and Suhang Wang |
| Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversations
Sayantan Pal, Souvik Das and Rohini K. Srihari |
| Can We Afford The Perfect Prompt? Balancing Cost and Accuracy with the Economical Prompting Index
Tyler McDonald, Anthony Colosimo, Yifeng Li and Ali Emami |
| From Priest to Doctor: Domain Adaptation for Low-Resource Neural Machine Translation
Ali Marashian, Enora Rice, Luke Gessler, Alexis Palmer and Katharina von der Wense |
| Improving Relation Extraction by Sequence-to-sequence-based Dependency Parsing Pre-training
Masaki Asada and Makoto Miwa |
| Exploring Language Model Generalization in Low-Resource Extractive QA
Saptarshi Sengupta, Wenpeng Yin, Preslav Nakov, Shreya Ghosh and Suhang Wang |
| Explain-Analyze-Generate: A Sequential Multi-Agent Collaboration Method for Complex Reasoning
WenYuan Gu, JiaLe Han, HaoWen Wang, Xiang Li and Bo Cheng |
| Towards Real-World Rumor Detection: Anomaly Detection Framework with Graph Supervised Contrastive Learning
Chaoqun Cui and caiyan jia |
| Addressing the Training-Inference Discrepancy in Discrete Diffusion for Text Generation
Masaki Asada and Makoto Miwa |
| Enhancing Rumor Detection Methods with Propagation Structure Infused Language Model
Chaoqun Cui, Siyuan Li, Kunkun Ma and caiyan jia |
| EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs
Zixuan Dong, Baoyun Peng, Yufei Wang, Jia Fu, Xiaodong Wang, Xin Zhou, Yongxue Shan, Kangchen Zhu and Weiguo Chen |
| Language Adaptation of Large Language Models: An Empirical Study on LLaMA2
Shumin Wang, Yuexiang Xie, Bolin Ding, Jinyang Gao and Yanyong Zhang |
| Dialectal and Low Resource Machine Translation for Aromanian
Alexandru-Iulius Jerpelea, Alina Radoi and Sergiu Nisioi |
| Fine-Grained Features-based Code Search for Precise Query-Code Matching
Xinting Zhang, Mengqiu Cheng, Mengzhen Wang, Songwen Gong, Jiayuan Xie, Yi Cai and Qing Li |
| VideoQA-TA: Temporal-Aware Multi-Modal Video Question Answering
Zhixuan Wu, Bo Cheng, Jiale Han, Jiabao Ma, Shuhao Zhang, Yuli Chen and Changbo Li |
| Cross-lingual Social Misinformation Detector based on Hierarchical Mixture-of-Experts Adapter
Haofang Fan, Xiran Hu and Geng Zhao |
| Unveiling Performance Challenges of Large Language Models in Low-Resource Healthcare: A Demographic Fairness Perspective
Yue Zhou, Barbara Di Eugenio and Lu Cheng |
| A Text Embedding Model with Contrastive Example Mining for Point-of-Interest Geocoding
Hibiki Nakatani, Hiroki Teranishi, Shohei Higashiyama, Yuya Sawada, Hiroki Ouchi and Taro Watanabe |
| In-context Continual Learning Assisted by an External Continual Learner
Saleh Momeni, Sahisnu Mazumder, Zixuan Ke and Bing Liu |
| VaeDiff-DocRE: End-to-end Data Augmentation Framework for Document-level Relation Extraction
Khai Phan Tran, Wen Hua and Xue Li |
| Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
Jinfa Huang, Jinsheng Pan, Zhongwei Wan, Hanjia Lyu and Jiebo Luo |
| An Efficient Dialogue Policy Agent with Model-Based Causal Reinforcement Learning
Kai Xu, Zhenyu Wang, Yangyang Zhao and Bopeng Fang |
| Re-Cent: A Relation-Centric Framework for Joint Zero-Shot Relation Triplet Extraction
Zehan Li, Fu Zhang, Kailun Lyu, Jingwei Cheng and Tianyue Peng |
| CoMIF: Modeling of Complex Multiple Interaction Factors for Conversation Generation
yuxuan chen, Wei Wei, Shixuan Fan, kaihe xu and Dangyang Chen |
| Courtroom-LLM: A Legal-Inspired Multi-LLM Framework for Resolving Ambiguous Text Classifications
Sangkeun Jung and Jeesu Jung |
| RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems
Yihong Tang, Bo Wang, Xu Wang, Dongming Zhao, jing liu, Ruifang He and Yuexian Hou |
| Enhancing Event Causality Identification with LLM Knowledge and Concept-Level Event Relations
Ya Su, Hu Zhang, Guangjun Zhang, Yujie Wang, Yue Fan, Ru Li and Yuanlong Wang |
| Cognate Detection for Historical Language Reconstruction of Proto-Sabean Languages: the Case of Ge’ez, Tigrinya, and Amharic
Elleni Sisay Temesgen, Hellina Hailu Nigatu and Fitsum Assamnew Andargie |
| Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings
Hiroaki Yamagiwa, Momose Oyama and Hidetoshi Shimodaira |
| Piecing It All Together: Verifying Multi-Hop Multimodal Claims
Haoran Wang, Aman Rangapur, Xiongxiao Xu, Yueqing Liang, Haroon Gharwi, Carl Yang and Kai Shu |
| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation
Bhargav Shandilya and Alexis Palmer |
| Large Language Model-Based Event Relation Extraction with Rationales
Zhilei Hu, Zixuan Li, Xiaolong Jin, Long Bai, Jiafeng Guo and Xueqi Cheng |
| Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations
James Ford, Xingmeng Zhao, Dan Schumacher and Anthony Rios |
| Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study
Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng and Kai Chen |
| Making Large Language Models into World Models with Precondition and Effect Knowledge
Kaige Xie, Ian Yang, John Gunerli and Mark Riedl |
| DORA: Dynamic Optimization Prompt for Continuous Reflection of LLM-based Agent
Kun Li, Tingzhang Zhao, Wei Zhou and Songlin Hu |
| Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning
Yanda Chen, Chandan Singh, Xiaodong Liu, Simiao Zuo, Bin Yu, He He and Jianfeng Gao |
| Propulsion: Steering LLM with Tiny Fine-Tuning
Md Kowsher, Nusrat Jahan Prottasha and Prakash Bhat |
| DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction with Slot Querying
Guanghui Wang, Dexi Liu, Jian-Yun Nie, Qizhi Wan, Rong Hu, Xiping Liu, Wanlong Liu and Jiaming Liu |
| Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs
Dingjie Song, Wenjun Wang, Shunian Chen, Xidong Wang, Michael X. Guan and Benyou Wang |
| Leveraging Large Pre-trained Multilingual Models for High-Quality Speech-to-Text Translation on Industry Scenarios
Marko Avila and Josep Crego |
| SA-DETR:Span Aware Detection Transformer for Moment Retrieval
Tianheng Xiong, Wei Wei, kaihe xu and Dangyang Chen |
| Aligning LLMs with Individual Preferences via Interaction
Shujin Wu, Yi R. Fung, Cheng Qian, Jeonghwan Kim, Dilek Hakkani-Tur and Heng Ji |
| Automatic Evaluation of Language Generation Technology Based on Structure Alignment
Katsuki Chousa and Tsutomu Hirao |
| Enhancing Talk Moves Analysis in Mathematics Tutoring through Classroom Teaching Discourse
Jie Cao, Abhijit Suresh, Jennifer Jacobs, Charis Clevenger, Amanda Howard, Chelsea Brown, Brent Milne, Tom Fischaber, Tamara Sumner and James H. Martin |
| How to Leverage Digit Embeddings to Represent Numbers?
Jasivan Alex Sivakumar and Nafise Sadat Moosavi |
| AdaCQR: Enhancing Query Reformulation for Conversational Search via Sparse and Dense Retrieval Alignment
Yilong Lai, Jialong Wu, Congzhi Zhang, Haowen Sun and Deyu Zhou |
| EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection
Zheng Li, Sujian Li, Dawei Zhu, Qilong Ma and Weimin Xiong |
| Linear Recency Bias During Training Improves Transformers’ Fit to Reading Times
Christian Clark, Byung-Doh Oh and William Schuler |
| ProsodyFlow: High-fidelity Text-to-Speech through Conditional Flow Matching and Prosody Modeling with Large Speech Language Models
Haoyu Wang, Sizhe Shan, Yinlin Guo and Yuehai Wang |
| Mitigating Out-of-Entity Errors in Named Entity Recognition: A Sentence-Level Strategy
Guochao Jiang, Ziqin Luo, Chengwei Hu, Zepeng Ding and Deqing Yang |
| Cross-lingual Evaluation of Multilingual Text Generation
Shamil Chollampatt, Minh Quang Pham, Sathish Reddy Indurthi and Marco Turchi |
| Norm of Mean Contextualized Embeddings Determines their Variance
Hiroaki Yamagiwa and Hidetoshi Shimodaira |
| Exploring the Impacts of Feature Fusion Strategy in Multi-modal Entity Alignment
Chenxiao Li, Jingwei Cheng, Qiang Tong and Fu Zhang |
| Extrapolating to Unknown Opinions Using LLMs
Kexun Zhang, Jane Dwivedi-Yu, Zhaojiang Lin, Yuning Mao, William Yang Wang, Lei Li and Yi-Chia Wang |
| How Likely Do LLMs with CoT Mimic Human Reasoning?
Guangsheng Bao, Hongbo Zhang, Cunxiang Wang, Linyi Yang and Yue Zhang |
| SGMEA: Structure-Guided Multimodal Entity Alignment
Jingwei Cheng, Mingxiao Guo and Fu Zhang |
| Unveiling Fake News with Adversarial Arguments Generated by Multimodal Large Language Models
Xiaofan Zheng, Minnan Luo and Xinghao Wang |
| Incorporating Review-missing Interactions for Generative Explainable Recommendation
Xi Li, Xiaohe Bo, Chen Ma and Xu Chen |
| Transformer-based Speech Model Learns Well as Infants and Encodes Abstractions through Exemplars in the Poverty of the Stimulus Environment
Yi Yang, Yiming Wang and Jiahong Yuan |
| Hire Me or Not? Examining Language Model’s Behavior with Occupation Attributes
Damin Zhang, Yi Zhang, Geetanjali Bihani and Julia Rayz |
| Enhancing Factual Consistency in Text Summarization via Counterfactual Debiasing
Zhenqing Ling, Yuexiang Xie, Chenhe Dong and Ying Shen |
| GraCoRe: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models
zike yuan, ming liu, Hui Wang and Bing Qin |
| Exploring Content Predictability in Turn-Taking Through Different Computer-Mediated Communications
Wanqing He, Calen C. MacDonald, Yejoon Yoo, Marcos Eizayaga, Ryun Shim, Lev D. Katreczko and Susan R. Fussell |
| VEEF-Multi-LLM: Effective Vocabulary Expansion and Parameter Efficient Finetuning Towards Multilingual Large Language Models
jiu sha, Mengxiao Zhu, Chong Feng and Yuming Shang |
| PERC: Plan-As-Query Example Retrieval for Underrepresented Code Generation
Jaeseok Yoo, Hojae Han, Youngwon Lee, Jaejin Kim and Seung-won Hwang |
| Multilingual and Explainable Text Detoxification with Parallel Corpora
Daryna Dementieva, Nikolay Babakov, Amit Ronen, Abinew Ali Ayele, Naquee Rizwan, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Daniil Alekhseevich Moskovskiy, Elisei Stakovskii, Eran Kaufman, Ashraf Elnagar, Animesh Mukherjee and Alexander Panchenko |
| Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text
Ali Al Lawati, Jason Lucas and Prasenjit Mitra |
| Factual Knowledge Assessment of Language Models Using Distractors
Hichem Ammar Khodja, Abderrahmane Ait gueni ssaid, Frederic Bechet, Quentin Brabant, Alexis Nasr and Gwénolé Lecorvé |
| Paraphrase Generation Evaluation Powered by an LLM: A Semantic Metric, Not a Lexical One
Quentin Lemesle, Jonathan Chevelu, Philippe Martin, Damien Lolive, Arnaud Delhay and Nelly Barbot |
| Summarization of Opinionated Political Documents with Varied Perspectives
Nicholas Deas and Kathleen McKeown |
| Measuring Contextual Informativeness in Child-Directed Text
Maria R. Valentini, Téa Y. Wright, Ali Marashian, Jennifer M. Ellis, Eliana Colunga and Katharina von der Wense |
| Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring
Hongjin Kim, Jeonghyun Kang and Harksoo Kim |
| Zero-Shot Entailment Learning for Ontology-Based Biomedical Annotation Without Explicit Mentions
Rumana Ferdous Munne, Noriki Nishida, Shanshan Liu, Narumi Tokunaga, Yuki Yamagata, Kouji Kozaki and Yuji Matsumoto |
| Mitigating Shortcut Learning via Smart Data Augmentation based on Large Language Model
Xinyi Sun, Hongye Tan, Yaxin Guo, pengpeng Qiang, Ru Li and Hu Zhang |
| DeTriever: Decoder-representation-based Retriever for Improving NL2SQL In-Context Learning
Raymond Li, Yuxi Feng, Zhenan Fan, Giuseppe Carenini, Weiwei Zhang, Mohammadreza Pourreza and Yong Zhang |
| Improving NMT Models by Retrofitting Quality Estimators into Trainable Energy Loss
Gahyun Yoo and Jay Yoon Lee |
| What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
Yifan Du, Hangyu Guo, Kun Zhou, Wayne Xin Zhao, Jinpeng Wang, Chuyuan Wang, Mingchen Cai, Ruihua Song and Ji-Rong Wen |
| TriFine: A Large-Scale Dataset of Vision-Audio-Subtitle for Tri-Modal Machine Translation and Benchmark with Fine-Grained Annotated Tags
Boyu Guan, Yining Zhang, Yang Zhao and Chengqing Zong |
| Can Many-Shot In-Context Learning Help LLMs as Evaluators? A Preliminary Empirical Study
Mingyang Song, Mao Zheng and Xuan Luo |
| GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary
Fatemah Yousef Almeman and Luis Espinosa Anke |
| Momentum Posterior Regularization for Multi-hop Dense Retrieval
Zehua Xia, Yuyang Wu, Yiyun Xia and Cam Tu Nguyen |
| CaDRL: Document-level Relation Extraction via Context-aware Differentiable Rule Learning
Kunli Zhang, Pengcheng Wu, Bohan Yu, Kejun Wu, Aoze Zheng, Xiyang Huang, Chenkang Zhu, Min Peng, Hongying Zan and Yu Song |
| TEF: Causality-Aware Taxonomy Expansion via Front-Door Criterion
Yuan Meng, Songlin Zhai, Yuxin Zhang, Zhongjian Hu and Guilin Qi |
| Inside-Outside Algorithm for Probabilistic Product-Free Lambek Categorial Grammar
Jinman Zhao and Gerald Penn |
| Perceive the Passage of Time: A Systematic Evaluation of Large Language Model in Temporal Relativity
Shuang chen, Yining Zheng, Shimin Li, Qinyuan Cheng and Xipeng Qiu |
| Hit the Sweet Spot! Span-Level Ensemble for Large Language Models
Yangyifan Xu, Jianghao Chen, Junhong Wu and Jiajun Zhang |
| PToco: Prefix-based Token-level Collaboration Enhances Reasoning for Multi-LLMs
Yuang Bian, Yupian Lin, Jingping Liu and Tong Ruan |
| MAGRET: Machine-generated Text Detection with Rewritten Texts
Yifei Huang, Jiuxin Cao, Hanyu Luo, Xin Guan and Bo Liu |
| Structured List-Grounded Question Answering
Mujeen Sung, Song Feng, James Gung, Raphael Shu, Yi Zhang and Saab Mansour |
| Low-Resource Language Expansion and Translation Capacity Enhancement for LLM: A Study on the Uyghur
Kaiwen Lu, Yating Yang, Fengyi Yang, Rui Dong, Bo Ma, Aihetamujiang Aihemaiti, Abibilla Atawulla, Lei Wang and Xi Zhou |
| Unraveling the Mystery: Defending Against Jailbreak Attacks Via Unearthing Real Intention
Yanhao Li, Hongshen Chen, Heng Zhang, Zhiwei Ge, Tianhao Li, Sulong Xu and Guibo Luo |
| A Flash in the Pan: Better Prompting Strategies to Deploy Out-of-the-Box LLMs as Conversational Recommendation Systems
Gustavo Adolpho Lucas de Carvalho, Simon Ben Igeri, Jennifer Healey, Victor Bursztyn, David Demeter and Lawrence A. Birnbaum |
| Rule-KBQA: Rule-Guided Reasoning for Complex Knowledge Base Question Answering with Large Language Models
Zhiqiang Zhang, Liqiang Wen and Wen Zhao |
| Mitigating Language Confusion through Inference-time Intervention
Xie Yunfan, Lixin Zou, Dan Luo, Min Tang, Chenliang Li, Xiangyang Luo and Liming Dong |
| Detecting deepfakes and false ads through analysis of text and social engineering techniques
Alicja Martinek and Ewelina Bartuzi-Trokielewicz |
| Indigenous Languages Spoken in Argentina: A Survey of NLP and Speech Resources
Belu Ticona, Fernando Martín Carranza and Viviana Cotik |
| The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction
Arthur Amalvy, Vincent Labatut and Richard Dufour |
| Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede’s Cultural Dimensions
Reem Masoud, Ziquan Liu, Martin Ferianc, Philip C. Treleaven and Miguel Rodrigues Rodrigues |
| META-LORA: Memory-Efficient Sample Reweighting for Fine-Tuning Large Language Models
Weicheng Li, Lixin Zou, Min Tang, Qing Yu, Wanli Li and Chenliang Li |
| Can Large Language Models perform Relation-based Argument Mining?
Deniz Gorur, Antonio Rago and Francesca Toni |
| Contextual Augmentation for Entity Linking using Large Language Models
Daniel Vollmers, Hamada Zahera, Diego Moussallem and Axel-Cyrille Ngonga Ngomo |
| CmEAA: Cross-modal Enhancement and Alignment Adapter for Radiology Report Generation
Xiyang Huang, Yingjie Han, YX L, Runzhi Li, Pengcheng Wu and Kunli Zhang |
| Semantic Reshuffling with LLM and Heterogeneous Graph Auto-Encoder for Enhanced Rumor Detection
Guoyi Li, Die Hu, Zongzhen Liu, Xiaodan Zhang and Honglei Lyu |
| Extracting, Detecting, and Generating Research Questions for Scientific Articles
Sina Taslimi, Artemis Capari, Hosein Azarbonyad, Zi Long Zhu, Zubair Afzal, Evangelos Kanoulas and George Tsatsaronis |
| Confront Insider Threat: Precise Anomaly Detection in Behavior Logs Based on LLM Fine-Tuning
Shuang Song, Yifei Zhang and Neng Gao |
| Flashback: Memory Mechanism for Enhancing Memory Efficiency and Speed in Deep Sequential Models
Taiki Sekii |
| Engagement-driven Persona Prompting for Rewriting News Tweets
Reshmi Gopalakrishna Pillai, Antske Fokkens and Wouter van Atteveldt |
| A Chain-of-Task Framework for Instruction Tuning of LLMs Based on Chinese Grammatical Error Correction
Xinpeng Liu, Bing Xu, Muyun Yang, Hailong Cao, Conghui Zhu, Tiejun Zhao and Wenpeng Lu |
| Beyond Dataset Creation: Critical View of Annotation Variation and Bias Probing of a Dataset for Online Radical Content Detection
Arij Riabi, Virginie Mouilleron, Menel Mahamdi, Wissam Antoun and Djamé Seddah |
| AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic
Emad A. Alghamdi, Reem Masoud, Deema Alnuhait, Afnan Y. Alomairi, Ahmed Ashraf and Mohamed Zaytoon |
| Comparative Study of Multilingual Idioms and Similes in Large Language Models
Paria Khoshtab, Danial Namazifard, Mostafa Masoudi, Ali Akhgary, Samin Mahdizadeh Sani and Yadollah Yaghoobzadeh |
| FedCSR: A Federated Framework for Multi-Platform Cross-Domain Sequential Recommendation with Dual Contrastive Learning
Dongyi Zheng, Hongyu Zhang, Jianyang Zhai, Lin Zhong, Lingzhi Wang, Jiyuan Feng, Xiangke Liao, Yonghong Tian, Nong Xiao and Qing Liao |
| Multi-Modal Entities Matter: Benchmarking Multi-Modal Entity Alignment
GuanChen Xiao, WeiXin Zeng, ShiQi Zhang, MingRui Lao and Xiang Zhao |
| Enhancing Extractive Question Answering in Multiparty Dialogues with Logical Inference Memory Network
Shu Zhou, Rui Zhao, Zhengda Zhou, Haohan Yi, Xuhui Zheng and Hao Wang |
| Enhancing Discourse Parsing for Local Structures from Social Media with LLM-Generated Data
Martial Pastor, Nelleke Oostdijk, Patricia Martin-Rodilla and Javier Parapar |
| PARAPHRASUS: A Comprehensive Benchmark for Evaluating Paraphrase Detection Models
Andrianos Michail, Simon Clematide and Juri Opitz |
| Dynamic-prototype Contrastive Fine-tuning for Continual Few-shot Relation Extraction with Unseen Relation Detection
Si Miao Zhao, Zhen Tan, Ning Pang, Wei Dong Xiao and Xiang Zhao |
| Enhancing Rhetorical Figure Annotation: An Ontology-Based Web Application with RAG Integration
Ramona Kühn, Jelena Mitrović and Michael Granitzer |
| Quantifying the Influence of Evaluation Aspects on Long-Form Response Assessment
Go Kamoda, Akari Asai, Ana Brassard and Keisuke Sakaguchi |
| CharMoral: A Character Morality Dataset for Morally Dynamic Character Analysis in Long-Form Narratives
Suyoung Bae, Gunhee Cho, Yun-Gyung Cheong and Boyang Li |
| Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks
Weisheng Li, Yuechen Wang, Jiaxin Shi, Wengang Zhou, Qi Tian and Houqiang Li |
| Enhancing Large Language Models for Document-Level Translation Post-Editing Using Monolingual Data
Zongyao Li, Zhiqiang Rao, Hengchao Shang, Jiaxin GUO, Shaojun Li, Daimeng Wei and Hao Yang |
| PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning
Qibin Wang, Xiaolin Hu, Weikai Xu, Wei Liu, Jian Luan and Bin Wang |
| Learn from Failure: Causality-guided Contrastive Learning for Generalizable Implicit Hate Speech Detection
Tianming Jiang |
| Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation
Samin Mahdizadeh Sani, Pouya Sadeghi, Thuy-Trang Vu, Yadollah Yaghoobzadeh and Gholamreza Haffari |
| Inductive Link Prediction in N-ary Knowledge Graphs
Jiyao Wei, Saiping Guan, Xiaolong Jin, Jiafeng Guo and Xueqi Cheng |
| ZigZagKV: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty
Meizhi Zhong, Xikai Liu, Chen Zhang, Yikun Lei, Yan Gao, Yao Hu, Kehai Chen and Min Zhang |
| Automatic Mathematic In-Context Example Generation for LLM Using Multi-Modal Consistency
jaeseong lee, Wei Yang, Gopal Gupta and Shiyi Wei |
| From Traits to Empathy: Personality-Aware Multimodal Empathetic Response Generation
jiaqiang wu, Xuandong Huang, Zhouan Zhu and Shangfei Wang |
| Integrating Visual Modalities with Large Language Models for Mental Health Support
Zhouan Zhu, Shangfei Wang, Yuxin Wang and jiaqiang wu |
| Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Meizhi Zhong, Chen Zhang, Yikun Lei, Xikai Liu, Yan Gao, Yao Hu, Kehai Chen and Min Zhang |
| Selected Languages are All You Need for Cross-lingual Truthfulness Transfer
Weihao Liu, Ning Wu, Wenbiao Ding, Shining Liang, Ming Gong and Dongmei Zhang |
| OVEL: Online Video Entity Linking
Haiquan Zhao, Xuwu Wang, Shisong Chen, Zhixu Li, Xin Zheng and Yanghua Xiao |
| The Only Way is Ethics: A Guide to Ethical Research with Large Language Models
Eddie L. Ungless, Nikolas Vitsakis, Zeerak Talat, James Garforth, Bjorn Ross, Arno Onken, Atoosa Kasirzadeh and Alexandra Birch |
| Should We Use a Fixed Embedding Size? Customized Dimension Sizes for Knowledge Graph Embedding
Zhanpeng Guan, Zhao Zhang, Yiqing Wu, Fuwei Zhang and Yongjun Xu |
| Chinese Automatic Readability Assessment Using Adaptive Pre-training and Linguistic Feature Fusion
Xusheng Yang, Jincai Yang and Xiao Li |
| Multitask-Bench: Unveiling and Mitigating Safety Gaps in LLMs Fine-tuning
Essa Jan, Nouar Aldahoul, Moiz Ali, Faizan Ahmad, Fareed Zaffar and Yasir Zaki |
| Unmasking the Imposters: How Censorship and Domain Adaptation Affect the Detection of Machine-Generated Tweets
Bryan E. Tuck and Rakesh Verma |
| Detecting Emotional Incongruity of Sarcasm by Commonsense Reasoning
Ziqi Qiu, Jianxing Yu, Yufeng Zhang, Hanjiang Lai, Yanghui Rao, Qinliang Su and Jian Yin |
| Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning
Jing Bi, Yuting Wu, Weiwei Xing and Zhenjie Wei |
| LOG: A Local-to-Global Optimization Approach for Retrieval-based Explainable Multi-Hop Question Answering
Hao Xu, Yunxiao Zhao, Jiayang Zhang, Zhiqiang Wang and Ru Li |
| KG-TRICK: Unifying Textual and Relational Information Completion of Knowledge for Multilingual Knowledge Graphs
Zelin Zhou, Simone Conia, Daniel Lee, Min Li, Shenglei Huang, Umar Farooq Minhas, Saloni Potdar, Henry Xiao and Yunyao Li |
| Impromptu Cybercrime Euphemism Detection
Xiang Li, Yucheng Zhou, Laiping Zhao, Jing Li and Fangming Liu |
| ALIS: Aligned LLM Instruction Security Strategy for Unsafe Input Prompt
Xinhao Song, Sufeng Duan and Gongshen Liu |
| ProTOD: Proactive Task-oriented Dialogue System Based on Large Language Model
Wenjie Dong, Sirong Chen and Yan Yang |
| Towards Multilingual spoken Visual Question Answering system using Cross-Attention
Amartya Roy Chowdhury, Tonmoy Rajkhowa and Sanjeev Sharma |
| Detecting Conversational Mental Manipulation with Intent-Aware Prompting
Jiayuan Ma, Hongbin Na, Zimu Wang, Yining Hua, Yue Liu, Wei Wang and Ling Chen |
| MIGRATE: Cross-Lingual Adaptation of Domain-Specific LLMs through Code-Switching and Embedding Transfer
Seongtae Hong, Seungyoon Lee, Hyeonseok Moon and Heuiseok Lim |
| CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving
Bhavani Shankar P S V N, Preethi Jyothi and Pushpak Bhattacharyya |
| Bridging the Language Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs
Somnath Kumar, Vaibhav Balloli, Mercy Ranjit, Kabir Ahuja, Sunayana Sitaram, Kalika Bali, Tanuja Ganu and Akshay Nambi |
| Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
Sofia Jamil, Bollampalli Areen Reddy, Raghvendra Kumar, Sriparna Saha, Joseph K. J and Koustava Goswami |
| Argumentation and Domain Discourse in Scholarly Articles on the Theory of International Relations
Magdalena Wolska, Sassan Gholiagha, Mitja Sienknecht, Dora Kiesel, Irene Lopez Garcia, Patrick Riehmann, Matti Wiegmann, Bernd Froehlich, Katrin Girgensohn, Jürgen Neyer and Benno Stein |
| Semantic and Sentiment Dual-Enhanced Generative Model for Script Event Prediction
Feiyang Wu, Peixin Huang, Yanli Hu, Zhen Tan and Xiang Zhao |
| Generation-Based and Emotion-Reflected Memory Update: Creating the KEEM Dataset for Better Long-Term Conversation
Jeonghyun Kang, Hongjin Kim and Harksoo Kim |
| medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs
Mingyi Jia, Junwen Duan, Yan Song and Jianxin Wang |
| AIDER: a Robust and Topic-Independent Framework for Detecting AI-Generated Text
Jiayi Gui, Baitong Cui, Xiaolian Guo, Ke Yu and Xiaofei Wu |
| CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information
Yuxin Wang, MingHua Ma, Zekun Wang, Jingchang Chen, Shan Liping, Qing Yang, Dongliang Xu, ming liu and Bing Qin |
| Do LLMs Know When to NOT Answer? Investigating Abstention Abilities of Large Language Models
Nishanth Madhusudhan, Sathwik Tejaswi Madhusudhan, Vikas Yadav and Masoud Hashemi |
| Dr.ECI: Infusing Large Language Models with Causal Knowledge for Decomposed Reasoning in Event Causality Identification
Ruichu Cai, Shengyin Yu, Jiahao Zhang, Wei Chen, Boyan Xu and Keli Zhang |
| InternLM-Law: An Open-Sourced Chinese Legal Large Language Model
Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Jidong Ge and Vincent Ng |
| Let’s Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model
Haoyun Xu, Runzhe Zhan, Yingpeng Ma, Derek F. Wong and Lidia S. Chao |
| Cross-Domain Fake News Detection based on Dual-Granularity Adversarial Training
Wenjie Wei, Yanyue Zhang, Jinyan Li, Panfei Liu and Deyu Zhou |
| Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings
Chunsheng Zuo, Pavel Guerzhoy and Michael Guerzhoy |
| RISCORE: Enhancing In-Context Riddle Solving in Language Models through Context-Reconstructed Example Augmentation
Ioannis Panagiotopoulos, George Filandrianos, Maria Lymperaiou and Giorgos Stamou |
| Ranking Over Scoring: Towards Reliable and Robust Automated Evaluation of LLM-Generated Medical Explanatory Arguments
Iker De la Iglesia, Iakes Goenaga, Johanna Ramirez-Romero, Jose Maria Villa-Gonzalez, Josu Goikoetxea and Ander Barrena |
| CACA: Context-Aware Cross-Attention Network for Extractive Aspect Sentiment Quad Prediction
Bingfeng Chen, Haoran Xu, Yongqi Luo, Boyan Xu, Ruichu Cai and Zhifeng Hao |
| Improved Sparse Upcycling for Instruction Tuning
Wangyi Jiang, Yaojie Lu, Hongyu Lin, Xianpei Han and Le Sun |
| SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment
Yuchun Fan, Yongyu Mu, YiLin Wang, Lei Huang, Junhao Ruan, Bei Li, Tong Xiao, Shujian Huang, Xiaocheng Feng and Jingbo Zhu |
| ME2-BERT: Are Events and Emotions what you need for Moral Foundation Prediction?
Lorenzo Zangari, Candida M. Greco, Davide Picca and Andrea Tagarelli |
| SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
Qingpo Yang, Yakai Chen, Zihui Xu, Yu-ming Shang, Sanchuan Guo and Xi Zhang |
| Hands-off Image Editing: Language-guided Editing without any Task-specific Labeling, Masking or even Training
Rodrigo Santos, António Branco, João Ricardo Silva and Joao Rodrigues |
| Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary?
Adam Nohejl, Frederikus Hudi, Eunike Andriani Kardinata, Shintaro Ozaki, Maria Angelica Riera Machin, Hongyu Sun, Justin Vasselli and Taro Watanabe |
| RealSafe: Quantifying Safety Risks of Language Agents in Real-World
Yingning Ma |
| Voice synthesis in Polish and English - analyzing prediction differences in speaker verification systems
Joanna Gajewska, Alicja Martinek, Michał J. Ołowski and Ewelina Bartuzi-Trokielewicz |
| AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment
Umair Nawaz, Awais Muhammad, Hanan Gani, Muzammal Naseer, Fahad Shahbaz Khan, Salman Khan and Rao Anwer |
| RUIE: Retrieval-based Unified Information Extraction using Large Language Model
Xincheng Liao, Junwen Duan, Yixi Huang and Jianxin Wang |
| It is not a piece of cake for GPT: Explaining Textual Entailment Recognition in the presence of Figurative Language
Giuseppe Gallipoli and Luca Cagliero |
| MuKA: Multimodal Knowledge Augmented Visual Information-Seeking
Lianghao Deng, Yuchong Sun, Shizhe Chen, Ning Yang, Yunfeng Wang and Ruihua Song |
| MSG-LLM: A Multi-scale Interactive Framework for Graph-enhanced Large Language Models
Jiayu Ding, Zhangkai Zheng, Benshuo Lin, Yun Xue and YIPING SONG |
| MedEx: Enhancing Medical Question-Answering with First-Order Logic based Reasoning and Knowledge Injection
Aizan Zafar, Kshitij Mishra and Asif Ekbal |
| Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking
Dina Pisarevskaya and Arkaitz Zubiaga |
| Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning
Yukang Lin, Bingchen Zhong, Shuoran Jiang, Joanna Siebert and Qingcai Chen |
| A Review of Prominent Paradigms for LLM-Based Agents: Tool Use, Planning (Including RAG), and Feedback Learning
Xinzhe Li |
| Analyzing Offensive Language Dataset Insights from Training Dynamics and Human Agreement Level
DO KYUNG KIM, Hyeseon Ahn, Youngwook Kim and Yo-Sub Han |
| Solid-SQL: Enhanced Schema-linking based In-context Learning for Robust Text-to-SQL
Geling Liu, Yunzhi Tan, Ruichao Zhong, Yuanzhen Xie, Lingchen Zhao, Qian Wang, Bo Hu and Zang Li |
| Mitigating the Discrepancy Between Video and Text Temporal Sequences: A Time-Perception Enhanced Video Grounding method for LLM
Xuefen Li, bo wang, Ge Shi, Chong Feng and Jiahao Teng |
| CE-DA: Custom Embedding and Dynamic Aggregation for Zero-Shot Relation Extraction
Fu Zhang, He Liu, Zehan Li and Jingwei Cheng |
| NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models
Han Han, Tong Zhu, xiang zhang, MengSong Wu, Xiong Hao and Wenliang Chen |
| A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection
Simon Hachmeier and Robert Jäschke |
| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study
Yulin Fei, Yuhui Gao, Xingyuan Xian, Xiaojin Zhang, Tao Wu and Wei Chen |
| Disentangle to Decay: Linear Attention with Trainable Decay Factor
Haibo Tong, Chenyang Zhang, Jiayi Lin, Bingxuan Hou, Qingqing Hong and Junli Wang |
| GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for Interpretable Text Classification
Ximing Wen, Wenjuan Tan and Rosina Weber |
| Few-shot domain adaptation for named-entity recognition via joint constrained k-means and subspace selection
Ayoub Hammal, Benno Uthayasooriyar and Caio Corro |
| An Efficient Retrieval-Based Method for Tabular Prediction with LLM
Jie Wu and Mengshu Hou |
| AIGT: AI Generative Table Based on Prompt
Mingming Zhang, Zhiqing Xiao, Guoshan Lu, Sai Wu, Weiqiang Wang, Xing Fu, Can Yi and Junbo Zhao |
| IRR: Image Review Ranking Framework for Evaluating Vision-Language Models
Kazuki Hayashi, Kazuma Onishi, Toma Suzuki, Yusuke Ide, Seiji Gobara, Shigeki Saito, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe |
| Development of Numerical Error Detection Tasks to Analyze the Numerical Capabilities of Language Models
Taku Sakamoto, Saku Sugawara and Akiko Aizawa |
| Searching for Structure: Investigating Emergent Communication with Large Language Models
Tom Kouwenhoven, Max Peeperkorn and Tessa Verhoef |
| Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation
Esteban Garces Arias, Meimingwei Li, Christian Heumann and Matthias Assenmacher |
| Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems
Xuyang Wu, Shuowei Li, Hsin-Tai Wu, Zhiqiang Tao and Yi Fang |
| CUTE: A Multilingual Dataset for Enhancing Cross-Lingual Knowledge Transfer in Low-Resource Languages
Wenhao Zhuang and Yuan Sun |
| How Ambiguous Are the Rationales for Natural Language Reasoning? A Simple Approach to Handling Rationale Uncertainty
Hazel H. Kim |
| Planning with Multi-Constraints via Collaborative Language Agents
Cong Zhang, Xin Deik Goh, Dexun Li, Hao Zhang and Yong Liu |
| Enhancing Nursing and Elderly Care with Large Language Models: An AI-Driven Framework
Qiao Sun, Jiexin Xie, Nanyang Ye, Qinying Gu and Shijie Guo |
| A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
Shijie Zhou, Ruiyi Zhang, Yufan Zhou and Changyou Chen |
| Cross-Lingual Knowledge Projection and Knowledge Enhancement for Zero-Shot Question Answering in Low-Resource Languages
Sello Ralethe and Jan Buys |
| FarExStance: Explainable Stance Detection for Farsi
Majid Zarharan, Maryam Hashemi, Malika Behroozrazegh, Sauleh Eetemadi, Mohammad Taher Pilehvar and Jennifer Foster |
| Unveiling Language Competence Neurons: A Psycholinguistic Approach to Model Interpretability
Xufeng Duan, Xinyu Zhou, Bei Xiao and Zhenguang Cai |
| Cross-Dialect Information Retrieval: Information Access in Low-Resource and High-Variance Languages
Robert Litschko, Oliver Kraus, Verena Blaschke and Barbara Plank |
| MoKA:Parameter Efficiency Fine-Tuning via Mixture of Kronecker Product Adaption
Beiming Yu, Zhenfei Yang and Xiushuang Yi |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator
Zhihao Fan, Lai Wei, Jialong Tang, Wei Chen, Wang Siyuan, Zhongyu Wei and Fei Huang |
| Can LLMs Help Create Grammar?: Automating Grammar Creation for Endangered Languages with In-Context Learning
Piyapath T. Spencer and Nanthipat Kongborrirak |
| Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition
Sneheel Sarangi, Maha Elgarf and Hanan Salam |
| Bridging Context Gaps: Enhancing Comprehension in Long-Form Social Conversations Through Contextualized Excerpts
Shrestha Mohanty, Sarah Xuan, Jacob Jobraeel, Anurag Kumar, Deb Roy and Jad Kabbara |
| Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xuebo Liu, Lidia S. Chao and Min Zhang |
| VoxpopuliTTS: a large-scale multilingual TTS corpus for zero-shot speech generation
Wenrui Liu, Jionghao Bai, Xize Cheng, Jialong Zuo, Ziyue Jiang, Shengpeng Ji, Minghui Fang, Xiaoda Yang, Qian Yang and Zhou Zhao |
| Self-Evolution Knowledge Distillation for LLM-based Machine Translation
yuncheng song, Liang Ding, Changtong Zan and Shujian Huang |
| On Weaponization-Resistant Large Language Models with Prospect Theoretic Alignment
Zehua Cheng, Manying Zhang, Jiahao Sun and Wei Dai |
| Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks
Qintong Li, Leyang Cui, Lingpeng Kong and Wei Bi |
| Dynamics of Instruction Fine-Tuning for Chinese Large Language Models
Chiyu Song, Zhanchao Zhou, Jianhao Yan, Yuejiao Fei, Zhenzhong Lan and Yue Zhang |
| Evaluating Transformers for OCR Post-Correction in Early Modern Dutch Theatre
Florian Debaene, Aaron Maladry, Els Lefever and Veronique Hoste |
| BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition
quanjiang guo, yihong dong, ling tian, zhao kang, yu zhang and Sijie Wang |
| In-Context Reinforcement Learning with Retrieval-Augmented Generation for Text-to-SQL
Rishit Toteja, Arindam Sarkar and Prakash Mandayam Comar |
| ICLEval: Evaluating In-Context Learning Ability of Large Language Models
Wentong Chen, Yankai Lin, ZhenHao Zhou, HongYun Huang, YanTao Jia, Zhao Cao and Ji-Rong Wen |
| VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models
Haowen Hou, Peigen Zeng, Fei Ma and Fei Richard Yu |
| Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark
Zhikun Xu, Yinghui Li, Ruixue Ding, Xinyu Wang, Boli Chen, Yong Jiang, Haitao Zheng, Wenlian Lu, Pengjun Xie and Fei Huang |
| Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests
Amogh Mannekote, Jinseok Nam, Ziming Li, Kristy Elizabeth Boyer and Bonnie J. Dorr |
| Consistency Rating of Semantic Transparency: an Evaluation Method for Metaphor Competence in Idiom Understanding Tasks
Hui Gao, Jing Zhang, Peng Zhang and Chang Yang |
| KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Yanxu Zhu, jinlin xiao, Yuhang Wang and Jitao Sang |
| IberoBench: A Benchmark for LLM Evaluation in Iberian Languages
Irene Baucells, Javier Aula-Blasco, Iria de-Dios-Flores, Silvia Paniagua Suárez, Naiara Perez, Anna Salles, Susana Sotelo Docio, Júlia Falcão, Jose Javier Saiz, ROBIERT SEPULVEDA TORRES, Jeremy Barnes, Pablo Gamallo, Aitor Gonzalez-Agirre, German Rigau and Marta Villegas |
| Efficient Architectures for High Resolution Vision-Language Models
Miguel Carvalho and Bruno Martins |
| NCRE: A Benchmark for Document-level Nominal Compound Relation Extraction
jincheng cao, Bobo Li, Jiang Liu and Donghong Ji |
| Comet: Dialog Context Fusion Mechanism for End-to-End Task-Oriented Dialog with Multi-task Learning
Haipeng Sun, Junwei Bao, Youzheng Wu and Xiaodong He |
| Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
Yi Fang, Moxin Li, Wenjie Wang, Lin Hui and Fuli Feng |
| Extracting the Essence and Discarding the Dross: Enhancing Code Generation with Contrastive Execution Feedback
Xuanyu Zhang and Qing Yang |
| From Facts to Insights: A Study on the Generation and Evaluation of Analytical Reports for Deciphering Earnings Calls
Tomas Goldsack, Yang Wang, Chenghua Lin and Chung-Chi Chen |
| Leveraging LLM-Generated Schema Descriptions for Unanswerable Question Detection in Clinical Data
donghee han, Seungjae Lim, Daeyoung Roh, Sangryul Kim, Sehyun Kim and Mun Yong Yi |
| Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models
Hongchuan Zeng, Senyu Han, Lu Chen and Kai Yu |
| Understanding Token Probability Encoding in Output Embeddings
Hakaze Cho, Yoshihiro Sakai, Kenshiro Tanaka, Mariko Kato and Naoya Inoue |
| Investigating Bias in LLM-Based Bias Detection: Disparities between LLMs and Human Perception
Luyang Lin, Lingzhi Wang, Jinsong Guo and Kam-Fai Wong |
| Evaluating the Consistency of LLM Evaluators
Noah Lee, Jiwoo Hong and James Thorne |
| MDPO: Customized Direct Preference Optimization with a Metric-based Sampler for Question and Answer Generation
Yihang Wang, Bowen Tian, Yueyang Su, Yixing Fan and Jiafeng Guo |
| A Collaborative Reasoning Framework Powered by Reinforcement Learning and Large Language Models for Complex Questions Answering over Knowledge Graph
Zhiqiang Zhang and Wen Zhao |
| Scalability of Bayesian Network Structure Elicitation with Large Language Models: a Novel Methodology and Comparative Analysis
Nikolay Babakov, Ehud Reiter and Alberto Bugarín-Diz |
| An LLM-based Framework for Biomedical Terminology Normalization in Social Media via Multi-Agent Collaboration
Yongqi Fan, Kui Xue, Zelin Li, Xiaofan Zhang and Tong Ruan |
| Driving Chinese Spelling Correction from a Fine-Grained Perspective
Linfeng Liu, Hongqiu Wu and Hai Zhao |
| LAiW: A Chinese Legal Large Language Models Benchmark
Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian and Hao Wang |
| Retrieval-Augmented Generation for Large Language Model based Few-shot Chinese Spell Checking
Ming Dong, Zhiwei Cheng, Changyin Luo and Tingting He |
| GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing Identification
Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao and Hsin-Hsi Chen |
| Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia, Rui Wang, Xu Liu, Mingyan Li, Tong Yu, Xiang Chen, Julian McAuley and Shuai Li |
| Interpreting Topic Models in Byte-Pair Encoding Space
Jia Peng Lim and Hady Lauw |
| SUMIE: A Synthetic Benchmark for Incremental Entity Summarization
Eunjeong Hwang, Yichao Zhou, Beliz Gunel, James Bradley Wendt and Sandeep Tata |
| Text-Attributed Graph Learning with Coupled Augmentations
Chuang Zhou, Jiahe Du, Huachi Zhou, Hao Chen, Feiran Huang and Xiao Huang |
| From Chaotic OCR Words to Coherent Document: A Fine-to-Coarse Zoom-Out Network for Complex-Layout Document Image Translation
Zhiyang Zhang, Yaping Zhang, Yupu Liang, Lu Xiang, Yang Zhao, Yu Zhou and Chengqing Zong |
| MESAQA: A Dataset for Multi-Span Contextual and Evidence-Grounded Question Answering
Jui-I Wang, Hen-Hsen Huang and Hsin-Hsi Chen |
| Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition
Yuming Yang, Wantong Zhao, Caishuang Huang, Junjie Ye, Xiao Wang, Huiyuan Zheng, Yang Nan, Yuran Wang, Xueying Xu, Kaixin Huang, Yunke Zhang, Tao Gui, Qi Zhang and Xuanjing Huang |
| Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
Zhaohan Zhang, Ziquan Liu and Ioannis Patras |
| Re-Examine Distantly Supervised NER: A New Benchmark and a Simple Approach
Yuepei Li, Kang Zhou, Qiao Qiao, Qing Wang and Qi Li |
| BinarySelect to Improve Accessibility of Black-Box Attack Research
Shatarupa Ghosh and Jonathan Rusert |
| Interaction Matters: An Evaluation Framework for Interactive Dialogue Assessment on English Second Language Conversations
Rena Gao, Carsten Roever and Jey Han Lau |
| Imposter: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
Divya Kothandaraman, Kuldeep Kulkarni, Sumit Shekhar, Balaji Vasan Srinivasan and Dinesh Manocha |
| FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Junru Lu, Siyu An, Min Zhang, Yulan He, di yin and Xing Sun |
| Context Filtering with Reward Modeling in Question Answering
Sangryul Kim and James Thorne |
| Case2Code: Scalable Synthetic Data for Code Generation
Yunfan Shao, Linyang Li, Yichuan Ma, Peiji Li, Demin Song, Qinyuan Cheng, Shimin Li, Xiaonan Li, Pengyu Wang, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang and Dahua Lin |
| Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering
Mingxu Tao, Dongyan Zhao and Yansong Feng |
| RAIDEN Benchmark: Evaluating Role-playing Conversational Agents with Measurement-Driven Custom Dialogues
Bowen Wu, kaili sun, Ziwei Bai, Ying Li and Baoxun Wang |
| CryptOpiQA: A new Opinion and Question Answering dataset on Cryptocurrency
Sougata Sarkar, aditya badwal, Amartya Roy, Koustav Rudra and Kripabandhu Ghosh |
| No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement
Mateusz Klimaszewski, Piotr Andruszkiewicz and Alexandra Birch |
| NYAYAANUMANA and INLEGALLLAMA: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis
Shubham Kumar Nigam, Deepak Patnaik Balaramamahanthi, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh and Arnab Bhattacharya |
| ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media
Kung-Hsiang Huang, Hou Pong Chan, Kathleen McKeown and Heng Ji |
| Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion
Ben Liu, Jihai Zhang, Fangquan Lin, Cheng Yang and Min Peng |
| FineRAG: Fine-grained Retrieval-Augmented Text-to-Image Generation
huaying yuan, ziliang zhao, Shuting Wang, shitao xiao, minheng ni, zheng liu and zhicheng dou |
| User Willingness-aware Sales Talk Dataset
Asahi Hentona, Jun Baba, Shiki Sato and Reina Akama |
| Return of EM: Entity-driven Answer Set Expansion for QA Evaluation
Dongryeol Lee, Minwoo Lee, Kyungmin Min, Joonsuk Park and Kyomin Jung |
| Data Augmentation for Cross-domain Parsing via Lightweight LLM Generation and Tree Hybridization
Ziyan Zhang, Yang Hou, Chen Gong and Zhenghua Li |
| CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations
Jiahao Zhao, Jingwei Zhu, Minghuan Tan, Min Yang, Renhao Li, Yang Di, Chenhao Zhang, Guancheng Ye, Chengming Li, Xiping Hu and Derek F. Wong |
| Optimizing Lifelong Fine-Tuning for Multiple Tasks via Dataless Distribution Replay
zhenxing wang |
| Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models
Xinyu Pang, Ruixin Hong, Zhanke Zhou, Fangrui Lv, Xinwei Yang, Zhilong Liang, Bo Han and Changshui Zhang |
| Efficient Data Labeling by Hierarchical Crowdsourcing with Large Language Models
Haodi Zhang, Junyu Yang, Jinyin Nie, Peirou Liang, Kaishun Wu, Defu Lian, Rui Mao and Yuanfeng Song |
| Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
Leonidas Zotos, Hedderik van Rijn and Malvina Nissim |
| RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation
Shuting Wang, Xin Yu, Mang Wang, Weipeng Chen, Yutao Zhu and Zhicheng Dou |
| LlmLink: Dual LLMs for Dynamic Entity Linking on Long Narratives with Collaborative Memorisation and Prompt Optimisation
Lixing Zhu, Jun Wang and Yulan He |
| PERSONA: A Reproducible Testbed for Pluralistic Alignment
Louis Castricato, Nathan Lile, Rafael Rafailov, Jan-Philipp Fränken and Chelsea Finn |
| LuxEmbedder: A Cross-Lingual Approach to Enhanced Luxembourgish Sentence Embeddings
Fred Philippy, Siwen Guo, Jacques Klein and TEGAWENDE BISSYANDE |
| Human Interest Framing across Cultures: A Case Study on Climate Change
Gisela Vallejo, Christine de Kock, Timothy Baldwin and Lea Frermann |
| OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
Yuxia Wang, Minghan Wang, Hasan Iqbal, Georgi N. Georgiev, Jiahui Geng, Iryna Gurevych and Preslav Nakov |
| A Dataset for Expert Reviewer Recommendation with Large Language Models as Zero-shot Rankers
Vanja M. Karan, Stephen McQuistin, Ryo Yanagida, Colin Perkins, Gareth Tyson, Ignacio Castro, Patrick G.T. Healey and Matthew Purver |
| Evaluating Model Alignment with Human Perception: A Study on Shitsukan in LLMs and LVLMs
Daiki Shiono, Ana Brassard, Yukiko Ishizuki and Jun Suzuki |