Mar 2025

We propose DAPO algorithm, outperforming DeepSeek GRPO!

Oct 2024

Our group and the ByteDance Doubao LLM team jointly established the SIA Lab.

Jun 2024

Ongoing openings for PhD students, postdocs, RAs and interns - applications welcome!

Generative Modeling of Scientific Data

We are focused on the following topics:

  • Biological Sequence Modeling
  • Deep Generative Models for Geometric Graph Generation
  • Scaled Structure-based Generative Models

Large Language Model

We propose the Decoupled Clip and Dynamic sAmpling Policy Optimization (DAPO) algorithm, and fully open-source a state-of-the-art large-scale RL system that achieves 50 points on AIME 2024 using Qwen2.5-32B base model. [code] [paper]

Foundation Models of AI4S

We aim to build the universal foundation model of science, some milestones are listed:

  • Steering Protein Family Design through Profile Bayesian Flow
  • ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling

GenSI根思
+ Follow
Apr 1, 2025
「GenSI深度」|俯瞰深度生成模型发展脉络!从扩散模型,到流匹配再到贝叶斯流网络。本文将帮助读者理解FMs和BFNs的工作原理,以及它们如何改进DMs。
我在读博(GenSI版)
+ Follow
Mar 27, 2025
请进,GenSI上新了科研机会。 如果你对AI前沿技术、生成模型或AI在科学领域的应用充满兴趣,热爱挑战,欢迎关注和联系我们!