Awards

Best Paper

  • A Theory of Response Sampling in LLMs: Part Descriptive and Part Prescriptive
    Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz

  • Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs
    Angelina Wang, Michelle Phan, Daniel E. Ho, Sanmi Koyejo

  • Language Models Resist Alignment: Evidence From Data Compression
    Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Jiayi Zhou, Changye Li, Hantao Lou, Juntao Dai, Yunhuai Liu, Yaodong Yang

  • Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
    Jingyang Yuan, Huazuo Gao, Damai Dai, Junyu Luo, Liang Zhao, Zhengyan Zhang, Zhenda Xie, Y. X. Wei, Lean Wang, Zhiping Xiao, Yuqing Wang, Chong Ruan, Ming Zhang, Wenfeng Liang, Wangding Zeng

Best Social Impact Paper

  • AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset
    Charles Nimo, Tobi Olatunji, Abraham Toluwase Owodunni, Tassallah Abdullahi, Emmanuel Ayodele, Mardhiyah Sanni, Ezinwanne C. Aka, Folafunmi Omofoye, Foutse Yuehgoh, Timothy Faniran, Bonaventure F. P. Dossou, Moshood O. Yekini, Jonas Kemp, Katherine A Heller, Jude Chidubem Omeke, Chidi Asuzu MD, Naome A Etori, Aïmérou Ndiaye, Ifeoma Okoh, Evans Doe Ocansey, Wendy Kinara, Michael Best, Irfan Essa, Stephen Edward Moore, Chris Fourie, Mercy Nyamewaa Asiedu

  • The AI Gap: How Socioeconomic Status Affects Language Technology Interactions
    Elisa Bassignana, Amanda Cercas Curry, Dirk Hovy

Best Resource Paper

  • Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral
    Shivani Kumar, David Jurgens

  • BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
    Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, Jan Philip Wahle, Terry Ruas, Meriem Beloucif, Christine de Kock, Nirmal Surange, Daniela Teodorescu, Ibrahim Said Ahmad, David Ifeoluwa Adelani, Alham Fikri Aji, Felermino D. M. A. Ali, Ilseyar Alimova, Vladimir Araujo, Nikolay Babakov, Naomi Baes, Ana-Maria Bucur, Andiswa Bukula, Guanqun Cao, Rodrigo Tufiño, Rendi Chevi, Chiamaka Ijeoma Chukwuneke, Alexandra Ciobotaru, Daryna Dementieva, Murja Sani Gadanya, Robert Geislinger, Bela Gipp, Oumaima Hourrane, Oana Ignat, Falalu Ibrahim Lawan, Rooweither Mabuya, Rahmad Mahendra, Vukosi Marivate, Alexander Panchenko, Andrew Piper, Charles Henrique Porto Ferreira, Vitaly Protasov, Samuel Rutunda, Manish Shrivastava, Aura Cristina Udrea, Lilian Diana Awuor Wanzare, Sophie Wu, Florian Valentin Wunderlich, Hanif Muhammad Zhafran, Tianhui Zhang, Yi Zhou, Saif M. Mohammad

  • Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs
    Fakhraddin Alwajih, Abdellah EL MEKKI, Samar Mohamed Magdy, AbdelRahim A. Elmadany, OMER NACAR, El Moatez Billah Nagoudi, Reem Abdel-Salam, Hanin atwany, Youssef Nafea, Abdulfattah Mohammed Yahya, Rahaf Alhamouri, Hamzah A. Alsayadi, Hiba Zayed, Sara Shatnawi, Serry Sibaee, Yasir ECH-CHAMMAKHY, Walid Al-Dhabyani, Marwa Mohamed Ali, Imen JARRAYA, Ahmed Oumar El-Shangiti, Aisha Alraeesi, Mohammed Anwar AL-Ghrawi, Abdulrahman S. Al-Batati, Elgizouli Mohamed, Noha Taha Elgindi, Muhammed Saeed, Houdaifa Atou, Issam AIT YAHIA, Abdelhak Bouayad, Mohammed Machrouh, AMAL MAKOUAR, Dania Alkawi, Mukhtar Mohamed, Safaa Taher Abdelfadil, Amine Ziad Ounnoughene, Anfel ROUABHIA, Rwaa Assi, Ahmed Sorkatti, Mohamedou cheikh tourad, Anis Koubaa, Ismail Berrada, Mustafa Jarrar, Shady Shehata, Muhammad Abdul-Mageed

Best Theme Paper

  • MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
    Yixian Shen, Qi Bi, JIA-HONG HUANG, Hongyi Zhu, Andy D. Pimentel, Anuj Pathania

  • Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models
    Xinlin Zhuang, Jiahui Peng, Ren Ma, Yinfan Wang, Tianyi Bai, Xingjian Wei, Qiu Jiantao, Chi Zhang, Ying Qian, Conghui He

  • SubLIME: Subset Selection via Rank Correlation Prediction for Data-Efficient LLM Evaluation
    Gayathri Saranathan, Cong Xu, Mahammad Parwez Alam, Tarun Kumar, Martin Foltin, Soon Yee Wong, Suparna Bhattacharya

Outstanding Papers

  • A New Formulation of Zipf’s Meaning-Frequency Law through Contextual Diversity
    Ryo Nagata, Kumiko Tanaka-Ishii

  • All That Glitters is Not Novel: Plagiarism in AI Generated Research
    Tarun Gupta, Danish Pruthi

  • Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases
    Michael Y. Hu, Jackson Petty, Chuan Shi, William Merrill, Tal Linzen

  • Beyond N-Grams: Rethinking Evaluation Metrics and Strategies for Multilingual Abstractive Summarization
    Itai Mondshine, Tzuf Paz-Argaman, Reut Tsarfaty

  • Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
    Weixuan Wang, Minghao Wu, Barry Haddow, Alexandra Birch

  • Byte Latent Transformer: Patches Scale Better Than Tokens
    Artidoro Pagnoni, Ramakanth Pasunuru, Pedro Rodriguez, John Nguyen, Benjamin Muller, Margaret Li, Chunting Zhou, LILI YU, Jason E Weston, Luke Zettlemoyer, Gargi Ghosh, Mike Lewis, Ari Holtzman, Srini Iyer

  • Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law
    Qiming Ge, Shuhao Xing, Songyang Gao, Yunhua Zhou, Yicheng Zou, Songyang Zhang, Zhi Chen, Hang Yan, Qi Zhang, Qipeng Guo, Kai Chen

  • From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding
    Chiwei Zhu, Benfeng Xu, Xiaorui Wang, Zhendong Mao

  • HALOGEN: Fantastic LLM Hallucinations and Where to Find Them
    Abhilasha Ravichander, Shrusti Ghela, David Wadden, Yejin Choi

  • HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter
    Manuel Tonneau, Diyi Liu, Niyati Malhotra, Scott A. Hale, Samuel Fraiberger, Victor Orozco-Olvera, Paul Röttger

  • IOT: Embedding Standardization Method Towards Zero Modality Gap
    Na Min An, Eunki Kim, James Thorne, Hyunjung Shim

  • IndicSynth: A Large-Scale Multilingual Synthetic Speech Dataset for Low-Resource Indian Languages
    Divya V Sharma, Vijval Ekbote, Anubha Gupta

  • LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models
    Hugo Pitorro, Marcos Vinicius Treviso

  • Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
    Jingcheng Niu, Xingdi Yuan, Tong Wang, Hamidreza Saghir, Amir H. Abdi

  • LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts
    Qibing Ren, Hao Li, Dongrui Liu, Zhanxu Xie, Xiaoya Lu, Yu Qiao, Lei Sha, Junchi Yan, Lizhuang Ma, Jing Shao

  • Mapping 1,000+ Language Models via the Log-Likelihood Vector
    Momose Oyama, Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira

  • MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models
    Zhongzhan Huang, Guoming Ling, Shanshan Zhong, Hefeng Wu, Liang Lin

  • PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages
    Sina Ahmadi, Rico Sennrich, Erfan Karami, Ako Marani, Parviz Fekrazad, Gholamreza Akbarzadeh Baghban, Hanah Hadi, Semko Heidari, Mahîr Dogan, Pedram Asadi, Dashne Bashir, Mohammad Amin Ghodrati, Kourosh Amini, Zeynab Ashourinezhad, Mana Baladi, Farshid Ezzati, Alireza Ghasemifar, Daryoush Hosseinpour, Behrooz Abbaszadeh, Amin Hassanpour, Bahaddin jalal hamaamin, Saya Kamal Hama, Ardeshir Mousavi, Sarko Nazir Hussein, Isar Nejadgholi, Mehmet Ölmez, Horam Osmanpour, Rashid Roshan Ramezani, Aryan Sediq Aziz, Ali Salehi, Mohammadreza Yadegari, Kewyar Yadegari, Sedighe Zamani Roodsari

  • Past Meets Present: Creating Historical Analogy with Large Language Models
    Nianqi Li, Siyu Yuan, Jiangjie Chen, Jiaqing Liang, Feng Wei, Zujie Liang, Deqing Yang, Yanghua Xiao

  • Pre³: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation
    Junyi Chen, Shihao Bai, Zaijun Wang, Siyu Wu, Chuheng Du, Hailong Yang, Ruihao Gong, Shengzhong Liu, Fan Wu, Guihai Chen

  • Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory
    Yexiang Liu, Zekun Li, Zhi Fang, Nan Xu, Ran He, Tieniu Tan

  • Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability
    Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

  • Toward Automatic Discovery of a Canine Phonetic Alphabet
    Theron S. Wang, Xingyuan Li, Hridayesh Lekhak, Tuan Minh Dang, Mengyue Wu, Kenny Q. Zhu

  • Towards the Law of Capacity Gap in Distilling Language Models
    Chen Zhang, Qiuchi Li, Dawei Song, Zheyu Ye, Yan Gao, Yao Hu

  • Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling
    Xianzhen Luo, Yixuan Wang, Qingfu Zhu, Zhiming Zhang, Xuanyu Zhang, Qing Yang, Dongliang Xu

  • Typology-Guided Adaptation for African NLP
    Ndapa Nakashole

TACL Best Paper

  • The Test of Time Award (TACL in 2013 or 2014)
    Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions
    Yoav Artzi, Luke Zettlemoyer

  • The Best Paper Award (TACL in 2024)
    Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers
    Melanie Subbiah, Sean Zhang, Lydia B. Chilton, Kathleen McKeown

Best Demo

  • OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
    Jiacheng Liu, Taylor Blanton, Yanai Elazar, Sewon Min, Yen-Sung Chen, Arnavi Chheda-Kothary, Huy Tran, Byron Bischoff, Eric Marsh, Michael Schmitz, Cassidy Trier, Aaron Sarnat, Jenna James, Jon Borchardt, Bailey Kuehl, Evie Yu-Yen Cheng, Karen Farley, Taira Anderson, David Albright, Carissa Schoenick, Luca Soldaini, Dirk Groeneveld, Rock Yuren Pang, Pang Wei Koh, Noah A. Smith, Sophie Lebrecht, Yejin Choi, Hannaneh Hajishirzi, Ali Farhadi, Jesse Dodge

Best Student Research Workshop Paper

  • Towards Multi-Perspective NLP Systems: A Thesis Proposal
    Benedetta Muscato

SAC Highlights

  • A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
    Zhijie Nie, Richong Zhang, Zhanyu Wu

  • A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns
    Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao

  • Adapt Once, Thrive with Updates: Transferable Parameter-Efficient Fine-Tuning on Evolving Base Models
    Naibin Gu, Peng Fu, Xiyu Liu, Ke Ma, Zheng Lin, Weiping Wang

  • AntiLeakBench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
    Xiaobao Wu, Liangming Pan, Yuxi Xie, Ruiwen Zhou, Shuai Zhao, Yubo Ma, Mingzhe Du, Rui Mao, Anh Tuan Luu, William Yang Wang

  • Balancing the Budget: Understanding Trade-offs Between Supervised and Preference-Based Finetuning
    Mohit Raghavendra, Junmo Kang, Alan Ritter

  • Beyond Position: the emergence of wavelet-like properties in Transformers
    Valeria Ruscio, Umberto Nanni, Fabrizio Silvestri

  • BOOKCOREF: Coreference Resolution at Book Scale
    Giuliano Martinelli, Tommaso Bonomo, Pere-Lluís Huguet Cabot, Roberto Navigli

  • Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
    Gregor Geigle, Florian Schneider, Carolin Holtermann, Chris Biemann, Radu Timofte, Anne Lauscher, Goran Glavaš

  • CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games
    Shuhang Xu, Fangwei Zhong

  • Completing A Systematic Review in Hours instead of Months with Interactive AI Agents
    Rui Qiu, Shijie Chen, Yu Su, Po-Yin Yen, Han Wei Shen

  • Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems
    Myra Cheng, Su Lin Blodgett, Alicia DeVrio, Lisa Egede, Alexandra Olteanu

  • Diversity-oriented Data Augmentation with Large Language Models
    Zaitian Wang, Jinghan Zhang, Xinhao Zhang, Kunpeng Liu, pengfei wang, Yuanchun Zhou

  • Do LLMs Understand Dialogues? A Case Study on Dialogue Acts
    Ayesha Qamar, Jonathan Tong, Ruihong Huang

  • Efficient Knowledge Editing via Minimal Precomputation
    Akshat Gupta, Maochuan Lu, Thomas Hartvigsen, Gopala Anumanchipalli

  • Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users
    Antonia Karamolegkou, Malvina Nikandrou, Georgios Pantazopoulos, Danae Sanchez Villegas, Phillip Rust, Ruchira Dhar, Daniel Hershcovich, Anders Søgaard

  • Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
    Fan Zhang, Shulin Tian, Ziqi Huang, Yu Qiao, Ziwei Liu

  • FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation
    Qinggang Zhang, Zhishang Xiang, Yilin Xiao, Le Wang, Junhui Li, Xinrun Wang, Jinsong Su

  • Finding Needles in Images: Can Multi-modal LLMs Locate Fine Details?
    Parth Thakkar, Ankush Agarwal, Prasad Kasu, Pulkit Bansal, Chaitanya Devaguptapu

  • FlashAudio: Rectified Flow for Fast and High-Fidelity Text-to-Audio Generation
    Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Heng Lu, Zhou Zhao, Wei Xue

  • FloorPlan-LLaMa: Aligning Architects’ Feedback and Domain Knowledge in Architectural Floor Plan Generation
    Jun Yin, Pengyu Zeng, Haoyuan Sun, Yuqin Dai, Han Zheng, Miao Zhang, Yachao Zhang, Shuai Lu

  • Geometric Signatures of Compositionality Across a Language Model’s Lifetime
    Jin Hwa Lee, Thomas Jiralerspong, Lei Yu, Yoshua Bengio, Emily Cheng

  • Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights
    Célia Nouri, Chloé Clavel, Jean-Philippe Cointet

  • Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress
    Lorenzo Proietti, Stefano Perrella, Roberto Navigli

  • HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
    Yuhan Chen, Ang Lv, Jian Luan, Bin Wang, Wei Liu

  • Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization
    Lei Huang, Xiaocheng Feng, Weitao Ma, Yuchun Fan, Xiachong Feng, Yangfan Ye, Weihong Zhong, Yuxuan Gu, Baoxin Wang, Dayong Wu, Guoping Hu, Bing Qin

  • Incorporating Domain Knowledge into Materials Tokenization
    Yerim Oh, Jun-Hyung Park, Junho Kim, SungHo Kim, SangKeun Lee

  • Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles
    Yuxi Xia, Pedro Henrique Luz de Araujo, Klim Zaporojets, Benjamin Roth

  • Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization on Multi-party Conversation
    Luyao Cheng, Hui Wang, Chong Deng, Siqi Zheng, Yafeng Chen, Rongjie Huang, Qinglin Zhang, Qian Chen, Xihao Li, Wen Wang

  • Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models
    Mingyang Wang, Heike Adel, Lukas Lange, Yihong Liu, Ercong Nie, Jannik Strötgen, Hinrich Schuetze

  • Mamba Knockout for Unraveling Factual Information Flow
    Nir Endy, Idan Daniel Grosbard, Yuval Ran-Milo, Yonatan Slutzky, Itay Tshuva, Raja Giryes

  • Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
    Ziyou Jiang, Mingyang Li, Guowei Yang, Junjie Wang, Yuekai Huang, Zhiyuan Chang, Qing Wang

  • Mixtures of In-Context Learners
    Giwon Hong, Emile van Krieken, Edoardo Ponti, Nikolay Malkin, Pasquale Minervini

  • Pattern Recognition or Medical Knowledge? The Problem with Multiple-Choice Questions in Medicine
    Maxime Griot, Jean Vanderdonckt, Demet YUKSEL, Coralie Hemptinne

  • PlanGenLLMs: A Modern Survey of LLM Planning Capabilities
    Hui Wei, Zihao Zhang, Shenghua He, Tian Xia, Shijia Pan, Fei Liu

  • ProcessBench: Identifying Process Errors in Mathematical Reasoning
    Chujie Zheng, Zhenru Zhang, Beichen Zhang, Runji Lin, Keming Lu, Bowen Yu, Dayiheng Liu, Jingren Zhou, Junyang Lin

  • Retrofitting Large Language Models with Dynamic Tokenization
    Darius Feher, Ivan Vulić, Benjamin Minixhofer

  • Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
    Yue Yang, Ajay Patel, Matt Deitke, Tanmay Gupta, Luca Weihs, Andrew Head, Mark Yatskar, Chris Callison-Burch, Ranjay Krishna, Aniruddha Kembhavi, Christopher Clark

  • Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch
    Prarabdh Shukla, Wei Yin Chong, Yash Patel, Brennan Schaffner, Danish Pruthi, Arjun Bhagoji

  • SkillVerse: Assessing and Enhancing LLMs with Tree Evaluation
    Yufei Tian, Jiao Sun, Nanyun Peng, Zizhao Zhang

  • SPECTRA: Faster Large Language Model Inference with Optimized Internal and External Speculation
    Nguyen-Khang Le, Truong Dinh Do, Le-Minh Nguyen

  • Steering off Course: Reliability Challenges in Steering Language Models
    Patrick Queiroz Da Silva, Hari Sethuraman, Dheeraj Rajagopal, Hannaneh Hajishirzi, Sachin Kumar

  • Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual Questions
    Pu Jian, Donglei Yu, Wen Yang, Shuo Ren, Jiajun Zhang

  • The Cross-linguistic Role of Animacy in Grammar Structures
    Nina Gregorio, Matteo Gay, Sharon Goldwater, Edoardo Ponti

  • Tokenisation is NP-Complete
    Philip Whittington, Gregor Bachmann, Tiago Pimentel

  • Towards Better Value Principles for Large Language Model Alignment: A Systematic Evaluation and Enhancement
    Bingbing Xu, Jing Yao, Xiaoyuan Yi, Aishan Maoliniyazi, Xing Xie, Xiaofeng Meng

  • Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
    Ethan Wilcox, Cui Ding, Giovanni Acampa, Tiago Pimentel, Alex Warstadt, Tamar I Regev

  • What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health Stigma
    Han Meng, Yancan Chen, Yunan Li, YITIAN YANG, Jungup Lee, Renwen Zhang, Yi-Chieh Lee

Industry Track Awards

Best Paper

Speed Without Sacrifice: Fine-Tuning Language Models with Medusa and Knowledge Distillation in Travel Applications
Daniel Zagyva, Emmanouil Stergiadis, Laurens van der Maas, Aleksandra Dokic, Eran Fainman, Ilya Gusev, Moran Beladev

Honourable Mentions

Boosting Decision-Making Ability of LLMs with Speculative Reward Model Cost-Effectively
Jiawei Gu, Shangsong Liang

NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model
Yen-Ting Lin, Zhehuai Chen, Piotr Zelasko, Zhen Wan, Xuesong Yang, Zih-Ching Chen, Krishna C Puvvada, Ke Hu, Szu-Wei Fu, Jun Wei Chiu, Jagadeesh Balam, Boris Ginsburg, Yu-Chiang Frank Wang, Chao-Han Huck Yang