About Me

Hi! I’m Yaorui Shi, a second-year master student at the USTC Lab for Data Science, supervised by Professor Xiang Wang. In 2023, I’ve been visiting the Next++ Research Centre as a research intern supervised by An Zhang and Prof. Chua Tas Seng. My research interests include large language models, multimodal LM, and AI for Science. Currently, I am doing an internship at DPTechnology, focusing on developing LLMs for chemistry and biology.

I’m willing to advance my research and plan to pursue a PhD after my graduation in June 2026.

Publications

  • Multimodal Language Modelingfor High-Accuracy Single Cell Transcriptomics Analysis andGeneration. Under Review of ACL 2025.

    Yaorui Shi, Jiaqi Yang, Sihang Li, Junfeng Fang, Xiang Wang, Zhiyuan Liu, Yang Zhang

  • Intelligent System for Automated Molecular Patent Infringement Assessment. Under Review of Nature Mach Intell. [Arxiv]

    Yaorui Shi, Sihang Li, Taiyan Zhang, Xi Fang, Jiankun Wang, Zhiyuan Liu, Guojiang Zhao, Zhengdan Zhu, Zhifeng Gao, Renxin Zhong, Linfeng Zhang, Guolin Ke, Weinan E, Hengxing Cai, Xiang Wang

  • Scilitllm: How to adapt llms for scientific literature understanding. ICLR 2025. [Arxiv]

    Sihang Li*, Jin Huang*, Jiaxi Zhuang, Yaorui Shi, Xiaochen Cai, Mingjun Xu, Xiang Wang, Linfeng Zhang, Guolin Ke, Hengxing Cai.

  • ReactXT: Understanding Molecular” Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining. ACL 2024. [Website] [Arxiv] [Code] [Demo]

    Zhiyuan Liu*, Yaorui Shi*, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua.

  • Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules. NeurIPS 2023. [Arxiv] [Code]

    Zhiyuan Liu, Yaorui Shi, An Zhang, Enzhi Zhang, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua.

  • ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction. EMNLP 2023. [Arxiv] [Code]

    Yaorui Shi, An Zhang, Enzhi Zhang, Zhiyuan Liu, Xiang Wang.

Honors & Awards

  • Silver Medal at the 2021 ICPC Asia Regional Contest, 2022.
  • Participated in the online TV program Super Brain (最强大脑) in China, 2018.12