Publications

(* indicates equal contribution)

  1. Enabling Chatbots with Eyes and Ears: An Immersive Multimodal Conversation System for Dynamic Interactions
    Jihyoung Jang*, Minwook Bae*, Minji Kim, Dilek Hakkani-Tür, Hyounghun Kim
    The Annual Meeting of the Association for Computational Linguistics (ACL). 2025.

  2. Language Specific Knowledge: Do Models Know Better in X than in English?
    Ishika Agarwal, Nimet Beyza Bozdag, Dilek Hakkani-Tür
    Preprint. 2025.

  3. Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
    Sagnik Mukherjee, Lifan Yuan, Dilek Hakkani-Tür, Hao Peng
    Preprint. 2025.

  4. Must Read: A Systematic Survey of Computational Persuasion
    Nimet Beyza Bozdag, Shuhaib Mehri, Xiaocheng Yang, Hyeonjeong Ha, Zirui Cheng, Esin Durmus, Jiaxuan You, Heng Ji, Gokhan Tur, Dilek Hakkani-Tür
    Preprint. 2025.

  5. Uncovering Cross-Domain Recommendation Ability of Large Language Models
    Xinyi Liu, Ruijie Wang, Dachun Sun, Dilek Hakkani-Tür, Tarek Abdelzaher
    Companion Proceedings of the ACM on Web Conference (WWW) 2025. 2025.

  6. PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
    Takyoung Kim*, Janvijay Singh*, Shuhaib Mehri*, Emre Can Acikgoz, Sagnik Mukherjee, Nimet Beyza Bozdag, Sumuk Shashidhar, Gokhan Tur, Dilek Hakkani-Tür
    Preprint. 2025.

  7. TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
    Emre Can Acikgoz*, Carl Guo*, Suvodip Dey*, Akul Datta, Takyoung Kim, Gokhan Tur, Dilek Hakkani-Tür
    Preprint. 2025.

  8. Spark: A System for Scientifically Creative Idea Generation
    Aishik Sanyal, Samuel Schapiro, Sumuk Shashidhar, Royce Moon, Lav R. Varshney, Dilek Hakkani-Tür
    Preprint. 2025.

  9. ToolRL: Reward is All Tool Learning Needs
    Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek Hakkani-Tür, Gokhan Tur, Heng Ji
    Preprint. 2025.

  10. A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions
    Emre Can Acikgoz*, Cheng Qian*, Hongru Wang*, Vardhan Dongre, Xiusi Chen, Heng Ji, Dilek Hakkani-Tür, Gokhan Tur
    Preprint. 2025.

  11. YourBench: Easy Custom Evaluation Sets for Everyone
    Sumuk Shashidhar, Clémentine Fourrier, Alina Lozovskia, Thomas Wolf, Gokhan Tur, Dilek Hakkani-Tür
    Preprint. 2025.

  12. Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models
    Nimet Beyza Bozdag, Shuhaib Mehri, Gokhan Tur, Dilek Hakkani-Tür
    Preprint. 2025.

  13. Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
    Emre Can Acikgoz, Jeremiah Greer, Akul Datta, Ze Yang, William Zeng, Oussama Elachqar, Emmanouil Koukoumidis, Dilek Hakkani-Tür, Gokhan Tur
    The Annual Meeting of the Association for Computational Linguistics (ACL). 2025.

  14. SMART: Self-Aware Agent for Tool Overuse Mitigation
    Cheng Qian*, Emre Can Acikgoz*, Hongru Wang, Xiusi Chen, Avirup Sil, Dilek Hakkani-Tür, Gokhan Tur, Heng Ji
    The Annual Meeting of the Association for Computational Linguistics (ACL, Findings). 2025.

  15. LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language
    Yubin Ge*, Neeraja Kirtane*, Hao Peng, Dilek Hakkani-Tür
    Preprint. 2025.

  16. Data Valuation using Neural Networks for Efficient Instruction Fine-Tuning
    Ishika Agarwal, Dilek Hakkani-Tür
    Preprint. 2025.

  17. Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs
    Sagnik Mukherjee*, Abhinav Chinta*, Takyoung Kim, Tarun Anoop Sharma, Dilek Hakkani-Tür
    International Conference on Machine Learning (ICML). 2025.

  18. Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis
    Shuhaib Mehri, Xiusi Chen, Heng Ji, Dilek Hakkani-Tür
    Preprint. 2025.

  19. Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems
    Mert İnan, Anthony Sicilia, Suvodip Dey, Vardhan Dongre, Tejas Srinivasan, Jesse Thomason, Gökhan Tür, Dilek Hakkani-Tür, Malihe Alikhani
    Transactions of the Association for Computational Linguistics (TACL). 2025.

  20. Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling
    Suvodip Dey, Yi-Jyun Sun, Gokhan Tur, Dilek Hakkani-Tür
    The Annual Meeting of the Association for Computational Linguistics (ACL). 2025.

  21. ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
    Vardhan Dongre, Xiaocheng Yang, Emre Can Acikgoz, Suvodip Dey, Gokhan Tur, Dilek Hakkani-Tür
    International Workshop of Spoken Dialogue Systems (IWSDS). 2025.

  22. DELIFT: Data Efficient Language model Instruction Fine Tuning
    Ishika Agarwal, Krishnateja Killamsetty, Lucian Popa, Marina Danilevksy
    International Conference on Learning Representations (ICLR). 2025.

  23. Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
    Takyoung Kim, Kyungjae Lee, Young Rok Jang, Ji Yong Cho, Gangwoo Kim, Minseok Cho, Moontae Lee
    Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL, Findings). 2025.

  24. Infogent: An Agent-based Framework for Web Information Aggregation
    Revanth Gangi Reddy*, Sagnik Mukherjee*, Jeonghwan Kim*, Zhenhailong Wang*, Dilek Hakkani-Tür, Heng Ji
    Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL, Findings). 2025.

  25. From Context to Action: Analysis of the Impact of State Representation and Context on Generalizability of Multi-Turn Web Navigation Agents
    Nalin Tiwary*, Vardhan Dongre*, Sanil Chawala, Ashwin Lamani, Dilek Hakkani-Tür
    Neural Information Processing Systems (NeurIPS) Workshop on Open-World Agents. 2024.

  26. Simulating User Agents for Embodied Conversational AI
    Daniel Phillipov, Vardhan Dongre, Gokhan Tur, Dilek Hakkani-Tür
    Neural Information Processing Systems (NeurIPS) Workshop on Open-World Agents. 2024.

  27. Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging
    Priyanka Karagupta*, Ishika Agarwal*, Dilek Hakkani-Tür, Jiawei Han
    Empirical Methods in Natural Language Processing (EMNLP, Findings). 2024.

  28. Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting
    Sagnik Mukherjee*, Muhammad Farid Adilazuarda*, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury
    Empirical Methods in Natural Language Processing (EMNLP). 2024.

  29. Towards Measuring and Modeling “Culture” in LLMs: A Survey
    Muhammad Farid Adilazuarda*, Sagnik Mukherjee*, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, Monojit Choudhury
    Empirical Methods in Natural Language Processing (EMNLP). 2024.

  30. Unsupervised Human Preference Learning
    Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani-Tür
    Empirical Methods in Natural Language Processing (EMNLP). 2024.

  31. Large Language Models as User Agents for Evaluating Task-Oriented-Dialogue Systems
    Taaha Kazi, Ruiliang Lyu, Sizhe Zhou, Dilek Hakkani-Tür, Gokhan Tur
    IEEE Spoken Language Technology Workshop (IEEE SLT). 2024.

  32. Confidence Estimation for LLM-Based Dialogue State Tracking
    Yi-Jyun Sun, Suvodip Dey, Dilek Hakkani-Tür, Gokhan Tur
    IEEE Spoken Language Technology Workshop (IEEE SLT). 2024.

  33. Dialog Flow Induction for Constrainable LLM-Based Chatbots
    Stuti Agrawal, Nishi Uppuluri, Pranav Pillai, Revanth Gangi Reddy, Zoey Li, Gokhan Tur, Dilek Hakkani-Tür, Heng Ji
    Special Interest Group on Discourse and Dialogue (SIGDIAL). 2024.