Better GPU backend detection + flake variants #244

Open
reedriley wants to merge 4 commits into tobi:main from reedriley:better-gpu-backend-detection-fallback

Conversation

@reedriley
In the original version, if CUDA libraries are installed, the code never tries Metal or Vulkan as a fallback. This PR refactors GPU backend selection so that it only tries supported backends and, when one fails, tries the remaining ones in order.
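
To illustrate the intended behaviour, here is a minimal sketch, not the code in this PR. All names (`Backend`, `PREFERENCE`, `candidateBackends`, `initWithFallback`) are hypothetical, and both the CUDA, Metal, Vulkan preference order and the final CPU fallback are assumptions made for the example.

```typescript
// Hypothetical sketch of "only try supported backends, in order".
// None of these names come from the PR itself.
type Backend = "cuda" | "metal" | "vulkan";
type Choice = Backend | false; // false stands for "no GPU, CPU only" (assumed)

// Assumed preference order for this example.
const PREFERENCE: readonly Backend[] = ["cuda", "metal", "vulkan"];

// Keep only the backends reported as supported, in preference order,
// and append the CPU fallback as the last candidate.
function candidateBackends(supported: readonly Backend[]): Choice[] {
  const usable: Choice[] = PREFERENCE.filter((b) => supported.includes(b));
  return [...usable, false];
}

// Try each candidate in turn and return the first one that initializes.
// `init` is supplied by the caller, so this sketch has no library dependency.
async function initWithFallback<T>(
  supported: readonly Backend[],
  init: (gpu: Choice) => Promise<T>,
): Promise<{ gpu: Choice; value: T }> {
  let lastError: unknown;
  for (const gpu of candidateBackends(supported)) {
    try {
      return { gpu, value: await init(gpu) };
    } catch (err) {
      lastError = err; // remember the failure and move on to the next backend
    }
  }
  throw lastError; // every candidate failed, including the CPU fallback
}
```

The key difference from the old behaviour is that a failed CUDA init no longer ends the search: the loop simply continues with the next supported backend. In a real integration, `init` would presumably wrap the library call that creates the llama instance for a given GPU type; that wiring is left out here because it is an assumption about the codebase.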

I also updated flake.nix and added a Vulkan variant, because my system doesn't support CUDA. (I haven't tested the other variants, but flake.nix was broken to begin with...)

(I had to bump node-llama-cpp to 3.16 to pick up the fix for a Vulkan header issue in llama/gpuInfo/vulkan-gpu-info.cpp; see withcatai/node-llama-cpp@57e8c22.)

Testing

  • bun src/qmd.ts status
  • bun src/qmd.ts embed
  • bun src/qmd.ts query
