Deep Coverage and Extended Sequence Reads Obtained with a Single Archaeal Protease Expedite de novo Protein Sequencing by Mass Spectrometry
Deep Coverage and Extended Sequence Reads Obtained with a Single Archaeal Protease Expedite de novo Protein Sequencing by Mass Spectrometry
Shamorkina, T. M.; Paneda, L. P.; Kadava, T.; Schulte, D.; Pribil, P. A.; Heidelberger, S.; Narlock-Brand, A. M.; Yannone, S. M.; Snijder, J.; Heck, A. J. R.
AbstractThe ability to sequence proteins without reliance on a genomic template defines a critical frontier in modern proteomics. This approach, known as de novo protein sequencing, is essential for applications such as antibody sequencing, microbiome proteomics, and antigen discovery, which require accurate reconstruction of peptide and protein sequences. While trypsin remains the gold-standard protease in proteomics, its restricted cleavage specificity limits peptide diversity. This constraint is especially problematic in antibody sequencing, where the functionally critical regions often lack canonical tryptic sites. As a result, exclusively trypsin-based approaches yield sparse reads, leading to sequence gaps. Multi-protease and hybrid-fragmentation strategies can improve the sequence coverage, but they add complexity, compromise scalability and reproducibility. Here, we explore two HyperThermoacidic Archaeal (HTA)-proteases as single-enzyme solutions for de novo antibody sequencing. Each HTA-protease generated about five times more unique peptide reads than trypsin or chymotrypsin, providing high redundancy across all CDRs. Combined with EAciD fragmentation on a ZenoTOF 7600 system, this approach enabled complete, unambiguous antibody sequencing. De novo analysis using PEAKS/DeepNovo and Stitch showed up to fourfold higher alignment scores and reduced the sequence errors within the HTA-generated data. Additionally, the HTA-EAciD approach offers short digestion times, eliminates extensive cleanup, and enables analysis in a single LC-MS/MS run. This streamlined, single-protease approach delivers therefore performance comparable to multi-enzyme workflows, offering a scalable and efficient strategy for de novo protein sequencing across diverse applications.