Kaili Huang

Applied Scientist @ Microsoft. MSCS @ Stanford. BE @ Tsinghua.

prof_pic_3.jpg

I am an enthusiastic machine learning engineer and researcher. My interests span natural language processing, large language models, information retrieval, multi-modality, question answering, dialog systems, etc.

I am working at Microsoft as an Applied Scientist, where I work on natural language processing and multi-modality modeling. I graduated from Stanford University with a master’s degree in Computer Science. Before attending Stanford, I received my bachelor’s degree from Tsinghua University, and then worked at ByteDance as a Machine Learning Engineer for 1 year.

news

Apr 06, 2025 We are excited to announce that our latest research, ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring, was recently accepted and presented at the 47th European Conference on Information Retrieval (ECIR 2025). Our work introduces a new method for making state-of-the-art neural search more efficient and scalable.

selected publications

  1. ACL
    KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation
    Hao Zhou, Chujie Zheng, Kaili Huang, and 2 more authors
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), Jul 2020
  2. TACL
    CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
    Qi Zhu, Kaili Huang, Zheng Zhang, and 2 more authors
    Transactions of the Association for Computational Linguistics (TACL), Jun 2020
  3. NLPCC
    A Large-Scale Chinese Short-Text Conversation Dataset
    Yida Wang, Pei Ke, Yinhe Zheng, and 4 more authors
    In Natural Language Processing and Chinese Computing (NLPCC). Best Student Paper Award , Oct 2020
  4. ECIR
    ColBERT-Serve: Efficient Multi-stage Memory-Mapped Scoring
    Kaili Huang, Thejas Venkatesh, Uma Dingankar, and 9 more authors
    In Advances in Information Retrieval: 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings, Part IV, Lucca, Italy, Oct 2025