llama: prefix MTP assistant tensors with 'mtp.' on load allowing use of -ot 'mtp..*=CUDA0' flag #7
+11
−1
background
wait
wait-all
cancel
Loading