This is the official implementation of our paper MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation.
Authors: Zhiwei Hao, Zhongyu Xiao, Jianyuan Guo, Li Shen, Yong Luo, Han Hu, Dan Zeng
We present a prompt-based method for multimodal semantic segmentation built on top of a pretrained single-modality RGB model.
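As background for readers new to visual prompting, the sketch below shows one generic way to prompt a frozen RGB transformer block with tokens conditioned on an auxiliary modality such as depth/HHA. It is only a conceptual illustration with hypothetical names (`PromptedBlock`, `aux_proj`); the actual mixed-prompting design follows the paper and the code in `semseg/models`, not this snippet.

```python
# Conceptual sketch only (NOT the MixPrompt architecture): a frozen RGB block
# receives extra prompt tokens computed from the auxiliary modality.
import torch
import torch.nn as nn

class PromptedBlock(nn.Module):
    """Wrap a frozen transformer block and prepend prompts conditioned on the auxiliary modality."""
    def __init__(self, block: nn.Module, embed_dim: int, num_prompts: int = 8):
        super().__init__()
        self.block = block
        for p in self.block.parameters():        # keep the pretrained RGB block frozen
            p.requires_grad_(False)
        self.prompts = nn.Parameter(torch.zeros(1, num_prompts, embed_dim))
        self.aux_proj = nn.Linear(embed_dim, embed_dim)   # project auxiliary-modality features

    def forward(self, rgb_tokens: torch.Tensor, aux_tokens: torch.Tensor) -> torch.Tensor:
        b = rgb_tokens.size(0)
        # condition the learnable prompts on pooled auxiliary (e.g. depth/HHA) tokens
        prompts = self.prompts.expand(b, -1, -1) + self.aux_proj(aux_tokens.mean(dim=1, keepdim=True))
        x = torch.cat([prompts, rgb_tokens], dim=1)       # prepend prompts to the RGB token sequence
        x = self.block(x)
        return x[:, prompts.size(1):]                     # drop the prompt tokens afterwards

# Example with a plain TransformerEncoderLayer as a stand-in for a backbone block.
layer = nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)
wrapped = PromptedBlock(layer, embed_dim=64)
out = wrapped(torch.randn(2, 196, 64), torch.randn(2, 196, 64))   # -> (2, 196, 64)
```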
You can download the official NYU Depth V2 dataset here. After downloading, reorganize the data according to the directory structure shown below.
You can download the dataset from the official SUNRGBD website and preprocess it according to the requirements specified on the website.
For RGB-Depth semantic segmentation, HHA maps can be generated from depth maps following https://github.com/charlesCXK/Depth2HHA-python.
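A rough usage sketch of that repository is shown below; the `getHHA` import path, its argument order, the depth scaling, and the example intrinsics (commonly used NYU Depth V2 values) are assumptions on our part, so please check them against the upstream demo script.

```python
# Rough Depth2HHA-python usage sketch; verify the API against the upstream repo.
import cv2
import numpy as np
from getHHA import getHHA                      # provided by the Depth2HHA-python repository

# Commonly used NYU Depth V2 RGB intrinsics; substitute your dataset's camera matrix.
camera_matrix = np.array([[518.8579, 0.0, 325.5824],
                          [0.0, 519.4696, 253.7362],
                          [0.0, 0.0, 1.0]])

# Scale factor depends on how the depth PNGs were exported; here we assume millimeters.
depth = cv2.imread('depth.png', cv2.IMREAD_ANYDEPTH).astype(np.float64) / 1000.0
hha = getHHA(camera_matrix, depth, depth)      # (intrinsics, processed depth, raw depth)
cv2.imwrite('hha.png', hha)
```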
You can download the dataset from the official MFNet website.
You can download the dataset from the official DELIVER website.
- Clone this repo.
$ git clone https://github.com/xiaoshideta/MixPrompt.git
$ cd MixPrompt
- Install all dependencies.
$ conda create -n mixprompt python=3.8.11
$ conda activate mixprompt
$ pip install -r requirements.txt
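After installing, a quick sanity check that PyTorch can see your GPUs avoids surprises before launching distributed training:
$ python -c "import torch; print(torch.__version__, torch.cuda.is_available(), torch.cuda.device_count())"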
Your directory tree should look like this:
|-- <configs>
|-- <semseg>
    |-- <datasets>
    |-- <models>
    |-- <utils>
|-- <pretrained>
    |-- <pre>
    |-- <segformer>
|-- <dataset>
    |-- <NYUDepthv2>
        |-- <RGBFolder>
        |-- <HHAFolder>
        |-- <LabelFolder>
        |-- train.txt
        |-- test.txt
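If you want to verify the layout before training, a small check like the one below (a hypothetical helper, assuming file stems match across folders) will flag RGB images without a corresponding HHA map or label:

```python
# Hypothetical sanity check: every RGB image should have a matching HHA map and label.
import os

root = 'dataset/NYUDepthv2'

def stems(sub):
    return {os.path.splitext(f)[0] for f in os.listdir(os.path.join(root, sub))}

rgb, hha, lab = stems('RGBFolder'), stems('HHAFolder'), stems('LabelFolder')
print('RGB images missing HHA:', sorted(rgb - hha)[:5])
print('RGB images missing labels:', sorted(rgb - lab)[:5])
```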
Download the pretrained segformer here.
- Train the model.
$ bash train.sh
$ CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --master_port=25035 --nproc_per_node=4 --use_env train_mm.py --cfg configs/nyu.yaml --wandb 0
- Evaluate the model.
$ bash val.sh
$ CUDA_VISIBLE_DEVICES=5 python val_mm.py --cfg configs/nyu.yaml
Part of our code is based on CMNeXt and PrimKD, thanks for their excellent work!
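A note on the launcher: torch.distributed.launch is deprecated in recent PyTorch releases. If your installation warns about it, the equivalent torchrun invocation should work with the same arguments, e.g. for NYU Depth V2:
$ CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --master_port=25035 --nproc_per_node=4 train_mm.py --cfg configs/nyu.yaml --wandb 0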







