- Phonetic Reconstruction of the Consonant System of Middle Chinese via Mixed Integer Optimization
Xiaoxi Luo, Weiwei Sun
- Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Lyudmila Rvanova, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman “Boda” Sadallah, Kirill Grishchenkov, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov
- LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
Xiaohao Yang, He Zhao, Dinh Phung, Wray Buntine, Lan Du
- NLP Security and Ethics, in the Wild
Heather Lent, Erick Galinkin, Yiyi Chen, Jens Myrup Pedersen, Leon Derczynski, Johannes Bjerva
- The Thai Universal Dependency Treebank
Panyut Sriwirote, Wei Qi Leong, Charin Polpanumas, Santhawat Thanyawong, William Tjhi, Wirote Aroonmanakun, Attapol T Rutherford
- Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen, Jihan Yao, Shangbin Feng, Chenjun Xu, Yulia Tsvetkov, Bill Howe, Lucy Lu Wang
- Few-shot Multilingual Open-domain QA from 5 Examples
Fan Jiang, Tom Drummond, Trevor Cohn
- How “Real” is Your Real-Time Simultaneous Speech-to-Text Translation System?
Sara Papi, Peter Polak, Dominik Macháček, Ondřej Bojar
- TaxoPro: A Plug-In LoRA-based Cross-Domain Method for Low-Resource Taxonomy Completion
Hongyuan Xu, Yuhang Niu, Ciyi Liu, Yanlong Wen, Xiaojie Yuan
- Patchwise Cooperative Game-based Interpretability Method for Large Vision-Language Models
Yao Zhu, Yunjian Zhang, Zizhe Wang, Xiu Yan, Peng Sun, Xiangyang Ji
- A Comparative Approach for Auditing Multilingual Phonetic Transcript Archives
Farhan Samir, Emily, Shreya Prakash, Márton Sóskuthy, Vered Shwartz, Jian Zhu
- TANQ: An open domain dataset of table answered questions
Mubashara Akhtar, Chenxi Pang, Andreea Marzoca, Yasemin Altun, Julian Eisenschlos
- Sense-Specific Historical Word Usage Generation
Pierluigi Cassotti, Nina Tahmasebi
- Navigating the Landscape of Hint Generation Research: From the Past to the Future
Anubhav Jangra, Jamshid Mozafari, Adam Jatowt, Smaranda Muresan
- Exploring Practical Gaps in Using Cross Entropy to Implement Maximum Mutual Information Criterion for Rationalization
Wei Liu, Zhiying Deng, Zhongyu Niu, Jun Wang, Haozhao Wang, Ruixuan Li
- Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models
Shahriar Golchin, Mihai Surdeanu
- MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions
Abdullatif Köksal, Marion Thaler, Ayyoob Imani, Ahmet Üstün, Anna Korhonen, Hinrich Schütze
- (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu, Jiahao Xu, Yulin Yuan, Gholamreza Haffari, Longyue Wang, Weihua Luo, Kaifu Zhang
- A Context-aware Framework for Translation-mediated Conversations
José Pombal, Sweta Agrawal, Emmanouil Zaranis, Patrick Fernandes, André Martins
- Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art
Chen Cecilia Liu, Iryna Gurevych, Anna Korhonen
- Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification
Huiyao Chen, Zulong Chen, Yu Zhao, Mengjia Wang, Liangyue Li, Meishan Zhang, Min Zhang
- Hypernetworks for Personalizing ASR to Atypical Speech
Dianna Yee, Max Müller-Eberstein, Karren Yang, Gautam Varma Mantena, Colin Lea
- CAMELL: Confidence-based Acquisition Model for Efficient Self-supervised Active Learning with Label Validation
Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Hsien-Chin Lin, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Milica Gašić
- TabVer: Tabular Fact Verification with Natural Logic
Rami Aly, Andreas Vlachos
- OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
Jikai Wang, Yi Su, Juntao Li, Qinrong Xia, Zi Ye, Xinyu Duan, Zhefeng Wang, Min Zhang
- Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
Jianhui Pang, Fanghua Ye, Derek Fai Wong, Dian Yu, Shuming Shi, Zhaopeng Tu, Longyue Wang
- DEAR: Disentangled Event-Agnostic Representation Learning for Early Fake News Detection
Xiao Pu, Hao Wu, Xiuli Bi, Yu Wu, Xinbo Gao
- Supervised Neural Topic Modeling with Label Alignment
Ruihao Chen, Hegang Chen, Yuyin Lu, Yanghui Rao, Chunjiang Zhu
- Investigating Adversarial Trigger Transfer in Large Language Models
Nicholas Carl Meade, Arkil Patel, Siva Reddy
- Improving Text-to-SQL with Constrained Decoding of Satisfiable Plans and Faithful Queries
Christopher Coleman, David Demeter, Doug Downey
- Diverse AI Feedback For Large Language Model Alignment
Tianshu Yu
- BroadLLM: Wider and Deeper LLM Networks are Fairer LLM Evaluators
Xinghua Zhang, Bowen Yu, Haiyang Yu, Yangyu Lv, Tingwen Liu, Fei Huang, Hongbo Xu, Yongbin Li
- Transformers as Transducers
Lena Strobl, Dana Angluin, David Chiang, Jonathan Rawski, Ashish Sabharwal
- Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Jing Yang, Max Glockner, Anderson Rocha, Iryna Gurevych
- STPar: A Structure-Aware Triaffine Parser for Screenplay Character Coreference Resolution
Li Zheng, Hao Fei, Lei Chen, Bobo Li, Fei Li, Chong Teng, Liang Zhao, Donghong Ji
- Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations
Chaitanya Malaviya, Joseph Chee Chang, Dan Roth, Mohit Iyyer, Mark Yatskar, Kyle Lo
- MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems
Yannis Katsis, Sara Rosenthal, Kshitij Fadnis, Chulaka Gunasekara, Young-Suk Lee, Lucian Popa, Vraj Shah, Huaiyu Zhu, Marina Danilevsky
- Human Choice Prediction in Language-Based Persuasion Games: Simulation-Based Off-Policy Evaluation
Eilam Shapira, Omer Madmon, Reut Apel, Moshe Tennenholtz, Roi Reichart
- Continual Pre-training on Character-Level Noisy Texts Makes Decoder-based Language Models Robust Few-shot Learners
Takeshi Kojima, Yutaka Matsuo, Yusuke Iwasawa
- DARE: Diverse Visual Question Answering with Robustness Evaluation
Hannah Sterz, Jonas Pfeiffer, Ivan Vulić
- Explanatory Summarization with Discourse-Driven Planning
Dongqi Liu, Xi Yu, Vera Demberg, Mirella Lapata