About Me
Hi! I’m Yaorui Shi, a second-year master student at the USTC Lab for Data Science, supervised by Professor Xiang Wang. In 2023, I’ve been visiting the Next++ Research Centre as a research intern supervised by An Zhang and Prof. Chua Tas Seng. My research interests include large language models, multimodal LM, and AI for Science. Currently, I am doing an internship at DPTechnology, focusing on developing LLMs for chemistry and biology.
I’m willing to advance my research and plan to pursue a PhD after my graduation in June 2026.
Publications
- Multimodal Language Modelingfor High-Accuracy Single Cell Transcriptomics Analysis andGeneration. Under Review of ACL 2025.
Yaorui Shi, Jiaqi Yang, Sihang Li, Junfeng Fang, Xiang Wang, Zhiyuan Liu, Yang Zhang
- Intelligent System for Automated Molecular Patent Infringement Assessment. Under Review of Nature Mach Intell. [Arxiv]
Yaorui Shi, Sihang Li, Taiyan Zhang, Xi Fang, Jiankun Wang, Zhiyuan Liu, Guojiang Zhao, Zhengdan Zhu, Zhifeng Gao, Renxin Zhong, Linfeng Zhang, Guolin Ke, Weinan E, Hengxing Cai, Xiang Wang
- Scilitllm: How to adapt llms for scientific literature understanding. ICLR 2025. [Arxiv]
Sihang Li*, Jin Huang*, Jiaxi Zhuang, Yaorui Shi, Xiaochen Cai, Mingjun Xu, Xiang Wang, Linfeng Zhang, Guolin Ke, Hengxing Cai.
- ReactXT: Understanding Molecular” Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining. ACL 2024. [Website] [Arxiv] [Code] [Demo]
Zhiyuan Liu*, Yaorui Shi*, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua.
- Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules. NeurIPS 2023. [Arxiv] [Code]
Zhiyuan Liu, Yaorui Shi, An Zhang, Enzhi Zhang, Kenji Kawaguchi, Xiang Wang, Tat-Seng Chua.
- ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction. EMNLP 2023. [Arxiv] [Code]
Yaorui Shi, An Zhang, Enzhi Zhang, Zhiyuan Liu, Xiang Wang.
Honors & Awards
- Silver Medal at the 2021 ICPC Asia Regional Contest, 2022.
- Participated in the online TV program Super Brain (最强大脑) in China, 2018.12