Awards

SAC Highlights

  • A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
    Zhijie Nie, Richong Zhang, Zhanyu Wu

  • A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns
    Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao

  • Adapt Once, Thrive with Updates: Transferable Parameter-Efficient Fine-Tuning on Evolving Base Models
    Naibin Gu, Peng Fu, Xiyu Liu, Ke Ma, Zheng Lin, Weiping Wang

  • AntiLeakBench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
    Xiaobao Wu, Liangming Pan, Yuxi Xie, Ruiwen Zhou, Shuai Zhao, Yubo Ma, Mingzhe Du, Rui Mao, Anh Tuan Luu, William Yang Wang

  • Balancing the Budget: Understanding Trade-offs Between Supervised and Preference-Based Finetuning
    Mohit Raghavendra, Junmo Kang, Alan Ritter

  • Beyond Position: the emergence of wavelet-like properties in Transformers
    Valeria Ruscio, Umberto Nanni, Fabrizio Silvestri

  • BOOKCOREF: Coreference Resolution at Book Scale
    Giuliano Martinelli, Tommaso Bonomo, Pere-Lluís Huguet Cabot, Roberto Navigli

  • Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
    Gregor Geigle, Florian Schneider, Carolin Holtermann, Chris Biemann, Radu Timofte, Anne Lauscher, Goran Glavaš

  • CoMet: Metaphor-Driven Covert Communication for Multi-Agent Language Games
    Shuhang Xu, Fangwei Zhong

  • Completing A Systematic Review in Hours instead of Months with Interactive AI Agents
    Rui Qiu, Shijie Chen, Yu Su, Po-Yin Yen, Han Wei Shen

  • Dehumanizing Machines: Mitigating Anthropomorphic Behaviors in Text Generation Systems
    Myra Cheng, Su Lin Blodgett, Alicia DeVrio, Lisa Egede, Alexandra Olteanu

  • Diversity-oriented Data Augmentation with Large Language Models
    Zaitian Wang, Jinghan Zhang, Xinhao Zhang, Kunpeng Liu, pengfei wang, Yuanchun Zhou

  • Do LLMs Understand Dialogues? A Case Study on Dialogue Acts
    Ayesha Qamar, Jonathan Tong, Ruihong Huang

  • Efficient Knowledge Editing via Minimal Precomputation
    Akshat Gupta, Maochuan Lu, Thomas Hartvigsen, Gopala Anumanchipalli

  • Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users
    Antonia Karamolegkou, Malvina Nikandrou, Georgios Pantazopoulos, Danae Sanchez Villegas, Phillip Rust, Ruchira Dhar, Daniel Hershcovich, Anders Søgaard

  • Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
    Fan Zhang, Shulin Tian, Ziqi Huang, Yu Qiao, Ziwei Liu

  • FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation
    Qinggang Zhang, Zhishang Xiang, Yilin Xiao, Le Wang, Junhui Li, Xinrun Wang, Jinsong Su

  • Finding Needles in Images: Can Multi-modal LLMs Locate Fine Details?
    Parth Thakkar, Ankush Agarwal, Prasad Kasu, Pulkit Bansal, Chaitanya Devaguptapu

  • FlashAudio: Rectified Flow for Fast and High-Fidelity Text-to-Audio Generation
    Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Heng Lu, Zhou Zhao, Wei Xue

  • FloorPlan-LLaMa: Aligning Architects’ Feedback and Domain Knowledge in Architectural Floor Plan Generation
    Jun Yin, Pengyu Zeng, Haoyuan Sun, Yuqin Dai, Han Zheng, Miao Zhang, Yachao Zhang, Shuai Lu

  • Geometric Signatures of Compositionality Across a Language Model’s Lifetime
    Jin Hwa Lee, Thomas Jiralerspong, Lei Yu, Yoshua Bengio, Emily Cheng

  • Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights
    Célia Nouri, Chloé Clavel, Jean-Philippe Cointet

  • Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress
    Lorenzo Proietti, Stefano Perrella, Roberto Navigli

  • HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
    Yuhan Chen, Ang Lv, Jian Luan, Bin Wang, Wei Liu

  • Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization
    Lei Huang, Xiaocheng Feng, Weitao Ma, Yuchun Fan, Xiachong Feng, Yangfan Ye, Weihong Zhong, Yuxuan Gu, Baoxin Wang, Dayong Wu, Guoping Hu, Bing Qin

  • Incorporating Domain Knowledge into Materials Tokenization
    Yerim Oh, Jun-Hyung Park, Junho Kim, SungHo Kim, SangKeun Lee

  • Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles
    Yuxi Xia, Pedro Henrique Luz de Araujo, Klim Zaporojets, Benjamin Roth

  • Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization on Multi-party Conversation
    Luyao Cheng, Hui Wang, Chong Deng, Siqi Zheng, Yafeng Chen, Rongjie Huang, Qinglin Zhang, Qian Chen, Xihao Li, Wen Wang

  • Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models
    Mingyang Wang, Heike Adel, Lukas Lange, Yihong Liu, Ercong Nie, Jannik Strötgen, Hinrich Schuetze

  • Mamba Knockout for Unraveling Factual Information Flow
    Nir Endy, Idan Daniel Grosbard, Yuval Ran-Milo, Yonatan Slutzky, Itay Tshuva, Raja Giryes

  • Mimicking the Familiar: Dynamic Command Generation for Information Theft Attacks in LLM Tool-Learning System
    Ziyou Jiang, Mingyang Li, Guowei Yang, Junjie Wang, Yuekai Huang, Zhiyuan Chang, Qing Wang

  • Mixtures of In-Context Learners
    Giwon Hong, Emile van Krieken, Edoardo Ponti, Nikolay Malkin, Pasquale Minervini

  • Pattern Recognition or Medical Knowledge? The Problem with Multiple-Choice Questions in Medicine
    Maxime Griot, Jean Vanderdonckt, Demet YUKSEL, Coralie Hemptinne

  • PlanGenLLMs: A Modern Survey of LLM Planning Capabilities
    Hui Wei, Zihao Zhang, Shenghua He, Tian Xia, Shijia Pan, Fei Liu

  • ProcessBench: Identifying Process Errors in Mathematical Reasoning
    Chujie Zheng, Zhenru Zhang, Beichen Zhang, Runji Lin, Keming Lu, Bowen Yu, Dayiheng Liu, Jingren Zhou, Junyang Lin

  • Retrofitting Large Language Models with Dynamic Tokenization
    Darius Feher, Ivan Vulić, Benjamin Minixhofer

  • Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
    Yue Yang, Ajay Patel, Matt Deitke, Tanmay Gupta, Luca Weihs, Andrew Head, Mark Yatskar, Chris Callison-Burch, Ranjay Krishna, Aniruddha Kembhavi, Christopher Clark

  • Silencing Empowerment, Allowing Bigotry: Auditing the Moderation of Hate Speech on Twitch
    Prarabdh Shukla, Wei Yin Chong, Yash Patel, Brennan Schaffner, Danish Pruthi, Arjun Bhagoji

  • SkillVerse : Assessing and Enhancing LLMs with Tree Evaluation
    Yufei Tian, Jiao Sun, Nanyun Peng, Zizhao Zhang

  • SPECTRA: Faster Large Language Model Inference with Optimized Internal and External Speculation
    Nguyen-Khang Le, Truong Dinh Do, Le-Minh Nguyen

  • Steering off Course: Reliability Challenges in Steering Language Models
    Patrick Queiroz Da Silva, Hari Sethuraman, Dheeraj Rajagopal, Hannaneh Hajishirzi, Sachin Kumar

  • Teaching Vision-Language Models to Ask: Resolving Ambiguity in Visual Questions
    Pu Jian, Donglei Yu, Wen Yang, Shuo Ren, Jiajun Zhang

  • The Cross-linguistic Role of Animacy in Grammar Structures
    Nina Gregorio, Matteo Gay, Sharon Goldwater, Edoardo Ponti

  • Tokenisation is NP-Complete
    Philip Whittington, Gregor Bachmann, Tiago Pimentel

  • Towards Better Value Principles for Large Language Model Alignment: A Systematic Evaluation and Enhancement
    Bingbing Xu, Jing Yao, Xiaoyuan Yi, Aishan Maoliniyazi, Xing Xie, Xiaofeng Meng

  • Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
    Ethan Wilcox, Cui Ding, Giovanni Acampa, Tiago Pimentel, Alex Warstadt, Tamar I Regev

  • What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health Stigma
    Han Meng, Yancan Chen, Yunan Li, YITIAN YANG, Jungup Lee, Renwen Zhang, Yi-Chieh Lee

Industry Track Awards

Best Paper

Speed Without Sacrifice: Fine-Tuning Language Models with Medusa and Knowledge Distillation in Travel Applications
Daniel Zagyva, Emmanouil Stergiadis, Laurens van der Maas, Aleksandra Dokic, Eran Fainman, Ilya Gusev, Moran Beladev

Honourable Mentions

Boosting Decision-Making Ability of LLMs with Speculative Reward Model Cost-Effectively
Jiawei Gu, Shangsong Liang

NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model
Yen-Ting Lin, Zhehuai Chen, Piotr Zelasko, Zhen Wan, Xuesong Yang, Zih-Ching Chen, Krishna C Puvvada, Ke Hu, Szu-Wei Fu, Jun Wei Chiu, Jagadeesh Balam, Boris Ginsburg, Yu-Chiang Frank Wang, Chao-Han Huck Yang