ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review
ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning
Su, J.; Zhou, X.; Zhang, X.; Yuan, F.
AbstractProTrek, a tri-modal protein language model, enables contrastive learning of protein sequence, structure, and function (SSF). Through its natural language search interface, users can navigate the vast protein universe in seconds, accessing nine distinct search tasks that cover all possible pairwise combinations of SSF. Additionally, ProTrek serves as a general-purpose protein representation model, excelling in various downstream prediction tasks through supervised transfer learning, thereby providing extensive support for protein research and analysis.