OligoFormer: an accurate and robust prediction method for siRNA design

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

OligoFormer: an accurate and robust prediction method for siRNA design

Authors

Bai, Y.; Zhong, H.; Wang, T.; Lu, Z. J.

Abstract

Motivation: Small interfering RNA (siRNA) has become a widely used experimental approach for post-transcriptional regulation and is increasingly showing its potential as future targeted drugs. However, the prediction of highly efficient siRNAs is still hindered by dataset biases, the inadequacy of prediction methods, and the presence of offtarget effects. To overcome these limitations, we propose a novel model, OligoFormer, for the prediction of siRNA efficacy. Results: OligoFormer comprises three different modules including thermodynamic calculation, RNA-FM module, and Oligo encoder. Taking siRNA and mRNA sequences as input, OligoFormer obtains embeddings through Oligo encoder and pre-trained language model RNA-FM[1], and combines thermodynamic parameters to predict the efficacy of siRNAs. As far as we know, OligoFormer introduces transformer encoder and RNA-FM embedding into the siRNA design model for the first time. We carefully benchmark OligoFormer with 5 comparable methods on siRNA efficacy datasets. OligoFormer outperforms all the other methods, with an average improvement of 9% in AUC and 10.7% in F1 score in our inter-dataset validation. We also provide a whole pipeline with off-target effects using PITA score and TargetScan score as the application of OligoFormer. The ablation study shows RNA-FM module and thermodynamic parameters improved the model\'s performance and convergence speed. The saliency map by gradient backpropagation shows some base preferences in initial and terminal region of siRNAs.

Follow Us on

0 comments

Add comment