Better GPU backend detection + flake variants by reedriley · Pull Request #244 · tobi/qmd

reedriley · 2026-02-22T02:02:37Z

In the original version - if CUDA libraries are installed the code will never try Metal or Vulkan as a fallback. This refactors the GPU backend selection to only try supported backends and to try all in order during fallbacks.

I also updated flake.nix and added a variant for Vulkan - because my system doesn't support CUDA. (I haven't tested the other variants - but flake.nix was broken to begin with...)

(I had to bump node-llama-cpp to 3.16 to fix a Vulkan header issue in llama/gpuInfo/vulkan-gpu-info.cpp fixed by withcatai/node-llama-cpp@57e8c22.)

Testing

bun src/qmd.ts status
bun src/qmd.ts embed
bun src/qmd.ts query

reedriley added 4 commits February 21, 2026 17:52

Add node and PATH tools for embedding builds

9e74e8e

Filter for supported GPU backends more intelligently

5fb71c2

Bump node-llama-cpp to 3.16

608014a

Update flake.nix for CUDA/Vulkan variants

1d9e78a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better GPU backend detection + flake variants#244

Better GPU backend detection + flake variants#244
reedriley wants to merge 4 commits intotobi:mainfrom
reedriley:better-gpu-backend-detection-fallback

reedriley commented Feb 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

reedriley commented Feb 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant