Science Cast

SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network

Tianlong LiOctober 11, 2023 11:10am

Views (47)
Comments (0)

Export Citation

Voices Powered by

Connected to paperThis paper is a preprint and has not been certified by peer review

SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network

arXivPDFOctober 10, 2023 12:00am

Authors

Tianlong Li, Wenhao Liu, Changze Lv, Jianhan Xu, Cenyuan Zhang, Muling Wu, Xiaoqing Zheng, Xuanjing Huang

Abstract

Spiking neural networks (SNNs) have demonstrated the capability to achieve comparable performance to deep neural networks (DNNs) in both visual and linguistic domains while offering the advantages of improved energy efficiency and adherence to biological plausibility. However, the extension of such single-modality SNNs into the realm of multimodal scenarios remains an unexplored territory. Drawing inspiration from the concept of contrastive language-image pre-training (CLIP), we introduce a novel framework, named SpikeCLIP, to address the gap between two modalities within the context of spike-based computing through a two-step recipe involving ``Alignment Pre-training + Dual-Loss Fine-tuning". Extensive experiments demonstrate that SNNs achieve comparable results to their DNN counterparts while significantly reducing energy consumption across a variety of datasets commonly used for multimodal model evaluation. Furthermore, SpikeCLIP maintains robust performance in image classification tasks that involve class labels not predefined within specific categories.

TwitterandLinkedIn

0 comments

Add comment

SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network

SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments