[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
audio review radar natural-language transformers point-cloud survey depth multi-modal thermal-infrared self-attention pre-training event-camera pengchenglab big-models anhui-university rgb-text-audio
-
Updated
Jan 3, 2025