publications

For an up-to-date list of my research papers, please see my Google Scholar profile. * denotes equal contribution.

2024

  1. EMNLP
    Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation
    Tu Vu*, Kalpesh Krishna*, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, and Yun-Hsuan Sung
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
    // The top-performing generative model on RewardBench trained solely on publicly available data
  2. ACL
    FreshLLMs: Refreshing large language models with search engine augmentation
    Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc Le, and Thang Luong
    In Findings of the Association for Computational Linguistics: ACL 2024, 2024
    // Our dataset and method have inspired or been used for the development of Google’s Gemini, Perplexity.AI’s Online LLMs, You.com, and Contextual AI’s RAG 2.0
  3. ICLR
    Mixture-of-experts meets instruction tuning: A winning combination for large language models
    Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, and Denny Zhou
    In Proceedings of the 12th International Conference on Learning Representations, 2024

2023

  1. Preprint
    Gemini: A Family of Highly Capable Multimodal Models
    Google Gemini Team: Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew Dai, Anja Hauth, and others including Tu Vu
    In arXiv preprint arXiv:2312.11805, 2023
    // Google AI Blog
  2. ICML
    The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
    Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V Le, Barret Zoph, Jason Wei, and Adam Roberts
    In Proceedings of the 40th International Conference on Machine Learning, 2023
    // Google Research Blog
  3. NeurIPS
    Self-Evaluation Improves Selective Generation in Large Language Models
    Jie Ren, Yao Zhao, Tu Vu, Peter J Liu, and Balaji Lakshminarayanan
    In Proceedings on "I Can’t Believe It’s Not Better! - Failure Modes in the Age of Foundation Models" at NeurIPS 2023 Workshops, 2023
  4. ACL
    Dialect-robust Evaluation of Generated Text
    Jiao Sun, Thibault Sellam, Elizabeth Clark, Tu Vu, Timothy Dozat, Dan Garrette, Aditya Siddhant, Jacob Eisenstein, and Sebastian Gehrmann
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

  1. ACL
    SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer
    Tu Vu, Brian Lester, Noah Constant, Rami Al-Rfou, and Daniel Cer
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
    // Headlines of Google AI’s Natural Language Accelerated Newsletter Q1, 2022
  2. EMNLP
    Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
    Tu Vu, Aditya Barua, Brian Lester, Daniel Cer, Mohit Iyyer, and Noah Constant
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  3. EMNLP
    Leveraging QA Datasets to Improve Generative Data Augmentation
    Dheeraj Mekala, Tu Vu, Timo Schick, and Jingbo Shang
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

  1. EMNLP
    STraTA: Self-Training with Task Augmentation for Better Few-shot Learning
    Tu Vu, Thang Luong, Quoc Le, Grady Simon, and Mohit Iyyer
    In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

  1. EMNLP
    Exploring and Predicting Transferability across NLP Tasks
    Tu Vu, Tong Wang, Tsendsuren Munkhdalai, Alessandro Sordoni, Adam Trischler, Andrew Mattarella-Micke, Subhransu Maji, and Mohit Iyyer
    In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

  1. ACL
    Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification
    Tu Vu and Mohit Iyyer
    In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019

2018

  1. NAACL
    Sentence Simplification with Memory-Augmented Neural Networks
    Tu Vu, Baotian Hu, Tsendsuren Munkhdalai, and Hong Yu
    In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), 2018
  2. *SEM@NAACL
    Integrating Multiplicative Features into Supervised Distributional Methods for Lexical Entailment
    Tu Vu and Vered Shwartz
    In Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics, 2018