😄 😄 😄 I am looking for jobs for 2027. My interests are video analysis, image restoration, multi-modal pretraining, MLLMs, computer vision, and other multi-modal fields! Feel free to chat with me on WeChat (ID: Alocus).
-
🌱 I’m currently learning Video Captioning, MLLMs, and (Multi-modal) Knowledge Graphs.
-
📫 How to reach me: sunhaoying97@163.com
-
💬 About me: CSDN
-
💬 About me: Homepage
-
📝 Publications
- Knowledge-Based Systems 2026 | Towards Generalized Video Captioning: An Effective Multi-modal Knowledge Graph Perspective
- Expert Systems with Applications 2025 | Scene Adaptive Dynamic Multi-Modal Knowledge for Video Captioning
- Information Fusion 2025 | Unified Hierarchical Contrastive Learning for Video Captioning
- ICMR 2025 | DSSM-KG: Dual-Stream State-Space Modeling with Adaptive Knowledge Injection for Video Captioning
- 信号处理 (Journal of Signal Processing) 2025 | Spatio-Temporal Enhanced Video Captioning Combining State-Space Models and Transformer