Accepted TACL Papers

  • Phonetic Reconstruction of the Consonant System of Middle Chinese via Mixed Integer Optimization
    Xiaoxi Luo, Weiwei Sun
  • Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
    Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Lyudmila Rvanova, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman “Boda” Sadallah, Kirill Grishchenkov, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov
  • LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
    Xiaohao Yang, He Zhao, Dinh Phung, Wray Buntine, Lan Du
  • LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
    Same paper above
  • NLP Security and Ethics, in the Wild
    Heather Lent, Erick Galinkin, Yiyi Chen, Jens Myrup Pedersen, Leon Derczynski, Johannes Bjerva
  • The Thai Universal Dependency Treebank
    Panyut Sriwirote, Wei Qi Leong, Charin Polpanumas, Santhawat Thanyawong, William Tjhi, Wirote Aroonmanakun, Attapol T Rutherford
  • Know Your Limits: A Survey of Abstention in Large Language Models
    Bingbing Wen, Jihan Yao, Shangbin Feng, Chenjun Xu, Yulia Tsvetkov, Bill Howe, Lucy Lu Wang
  • Few-shot Multilingual Open-domain QA from 5 Examples
    Fan Jiang, Tom Drummond, Trevor Cohn
  • How “Real” is Your Real-Time Simultaneous Speech-to-Text Translation System?
    Sara Papi, Peter Polak, Dominik Macháček, Ondřej Bojar
  • TaxoPro: A Plug-In LoRA-based Cross-Domain Method for Low-Resource Taxonomy Completion
    Hongyuan Xu, Yuhang Niu, Ciyi Liu, Yanlong Wen, Xiaojie Yuan
  • Patchwise Cooperative Game-based Interpretability Method for Large Vision-Language Models
    Yao Zhu, Yunjian Zhang, Zizhe Wang, Xiu Yan, Peng Sun, Xiangyang Ji
  • A Comparative Approach for Auditing Multilingual Phonetic Transcript Archives
    Farhan Samir, Emily, Shreya Prakash, Márton Sóskuthy, Vered Shwartz, Jian Zhu
  • TANQ: An open domain dataset of table answered questions
    Mubashara Akhtar, Chenxi Pang, Andreea Marzoca, Yasemin Altun, Julian Eisenschlos
  • Sense-Specific Historical Word Usage Generation
    Pierluigi Cassotti, Nina Tahmasebi
  • Navigating the Landscape of Hint Generation Research: From the Past to the Future
    Anubhav Jangra, Jamshid Mozafari, Adam Jatowt, Smaranda Muresan
  • Exploring Practical Gaps in Using Cross Entropy to Implement Maximum Mutual Information Criterion for Rationalization
    Wei Liu, Zhiying Deng, Zhongyu Niu, Jun Wang, Haozhao Wang, Ruixuan Li
  • Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models
    Shahriar Golchin, Mihai Surdeanu
  • MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions
    Abdullatif Köksal, Marion Thaler, Ayyoob Imani, Ahmet Üstün, Anna Korhonen, Hinrich Schütze
  • (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
    Minghao Wu, Jiahao Xu, Yulin Yuan, Gholamreza Haffari, Longyue Wang, Weihua Luo, Kaifu Zhang
  • A Context-aware Framework for Translation-mediated Conversations
    José Pombal, Sweta Agrawal, Emmanouil Zaranis, Patrick Fernandes, André Martins
  • Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art
    Chen Cecilia Liu, Iryna Gurevych, Anna Korhonen
  • Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification
    Huiyao Chen, Zulong Chen, Yu Zhao, Mengjia Wang, Liangyue Li, Meishan Zhang, Min Zhang
  • Hypernetworks for Personalizing ASR to Atypical Speech
    Dianna Yee, Max Müller-Eberstein, Karren Yang, Gautam Varma Mantena, Colin Lea
  • CAMELL: Confidence-based Acquisition Model for Efficient Self-supervised Active Learning with Label Validation
    Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Hsien-Chin Lin, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Milica Gašić
  • TabVer: Tabular Fact Verification with Natural Logic
    Rami Aly, Andreas Vlachos
  • OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
    Jikai Wang, Yi Su, Juntao Li, Qinrong Xia, Zi Ye, Xinyu Duan, Zhefeng Wang, Min Zhang
  • Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
    Jianhui Pang, Fanghua Ye, Derek Fai Wong, Dian Yu, Shuming Shi, Zhaopeng Tu, Longyue Wang
  • DEAR: Disentangled Event-Agnostic Representation Learning for Early Fake News Detection
    Xiao Pu, Hao Wu, Xiuli Bi, Yu Wu, Xinbo Gao
  • Supervised Neural Topic Modeling with Label Alignment
    Ruihao Chen, Hegang Chen, Yuyin Lu, Yanghui Rao, Chunjiang Zhu
  • Investigating Adversarial Trigger Transfer in Large Language Models
    Nicholas Carl Meade, Arkil Patel, Siva Reddy
  • Improving Text-to-SQL with Constrained Decoding of Satisfiable Plans and Faithful Queries
    Christopher Coleman, David Demeter, Doug Downey
  • Diverse AI Feedback For Large Language Model Alignment
    Tianshu Yu
  • BroadLLM: Wider and Deeper LLM Networks are Fairer LLM Evaluators
    Xinghua Zhang, Bowen Yu, Haiyang Yu, Yangyu Lv, Tingwen Liu, Fei Huang, Hongbo Xua, Yongbin Li
  • Transformers as Transducers
    Lena Strobl, Dana Angluin, David Chiang, Jonathan Rawski, Ashish Sabharwal
  • Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
    Jing Yang, Max Glockner, Anderson Rocha, Iryna Gurevych
  • STPar:AStructure-AwareTriaffineParserforScreenplay CharacterCoreferenceResolution
    Li Zheng, Hao Fei, Lei Chen, Bobo Li, Fei Li, Chong Teng, Liang Zhao, Donghong Ji
  • Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
    Chaitanya Malaviya, Joseph Chee Chang, Dan Roth, Mohit Iyyer, Mark Yatskar, Kyle Loa
  • MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems
    Yannis Katsis, Sara Rosenthal, Kshitij Fadnis, Chulaka Gunasekara, Young-Suk Lee, Lucian Popa, Vraj Shah, Huaiyu Zhu, Marina Danilevsky
  • Human Choice Prediction in Language-Based Persuasion Games: Simulation-Based Off-Policy Evaluation
    Eilam Shapira, Omer Madmon, Reut Apel, Moshe Tennenholtz, Roi Reichart
  • Continual Pre-training on Character-Level Noisy Texts Makes Decoder-based Language Models Robust Few-shot Learners
    Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa
  • DARE: Diverse Visual Question Answering with Robustness Evaluation
    Hannah Sterz, Jonas Pfeiffer, Ivan Vuli´c
  • Explanatory Summarization with Discourse-Driven Planning
    Dongqi Liu, Xi Yu, Vera Demberg, Mirella Lapata