Maybe we could use upstream `candle`, which upgraded cudarc here: https://github.com/huggingface/candle/pull/3078
Maybe we could use upstream
candle, which upgraded cudarc here: huggingface#3078