Projects

Generative Modeling of Scientific Data

Generative Modeling of Scientific Data

Large Language Model

Large Language Model

Foundation Models of AI4S

Foundation Models of AI4S

Publications

Mol-AE: Auto-Encoder Based Molecular Representation Learning With 3D Cloze Test Objective

3D molecular representation learning has gained tremendous interest and achieved promising performance in various downstream tasks. A series of recent approaches follow a prevalent framework: an encoder-only model coupled with a coordinate denoising objective.

bib
2024

ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling

Protein language models have demonstrated significant potential in the field of protein engineering. However, current protein language models primarily operate at the residue scale, which limits their ability to provide information at the atom level.

bib
2024

MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space

Generative models for structure-based drug design (SBDD) have shown promising results in recent years. Existing works mainly focus on how to generate molecules with higher binding affinity, ignoring the feasibility prerequisites for generated 3D poses and resulting in false positives.

bib
2024

Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

Molecular representation learning bears promise in vast scientific domains. Capturing molecular expertise based on diverse views is of great significance in learning effective and generalizable molecular representations.

bib
2024

Diffusion Glancing Transformer for Parallel Sequence-to-Sequence Learning

Previously, non-autoregressive models were widely recognized as being superior in generation efficiency but inferior in generation quality due to the challenges of modeling multiple target modalities. To enhance the multi-modality modeling ability, we propose the diffusion glancing transformer, which employs a modality diffusion process and residual glancing sampling.

bib
2024

SHOW ALL