[API] add string-id model catalog, multimodal composer, and QNN bring-up by dlwlzzero · Pull Request #36 · nntrainer/Quick.AI

dlwlzzero · 2026-06-04T04:04:50Z

Summary

Brings the Quick.AI public API up to the v0.4.0 surface in a single commit. Models are now identified by a string model id (not a C enum): each model self-registers its descriptor at load time and the catalog is exposed via getModelCatalogJson() (C API) and ModelCatalog (Android AAR). The build discovers model directories generically, so additional models can be dropped in without editing build files or the public API.

The tree and history contain no proprietary model sources or references; git grep -i gauss matches only the Qualcomm SDK GAUSSIAN constant.

Change

API: string-id descriptor registry + catalog JSON; per-model self-registration via constructors; a lazy (Meyers-singleton) registry so cross-library registration survives static-init order.
API: generic multimodal composer and a vision-encoder capability, decoupled from any specific model.
QNN: set the HTP backend-ext-config before multi-model sub-model loads; gemma4-e2b-qnn (NATIVE/NPU) bring-up.
AAR: ModelCatalog.selectableFamilies() to hide embedding-only models in the Run/OpenAI and Chat family pickers.
Build: model build hooks (meson + ndk-build) auto-discover model directories instead of naming them, so proprietary models plug in cleanly.
Guards: allow-list .gitignore and a pre-push hook that block any non-allow-listed model source directory from reaching the public remote (allow-list = src/models/qnn/gemma4-e2b-qnn).

Verified on device (S26 Ultra): the Chat and OpenAI tabs run qwen3-0.6b, gemma4-e2b-qnn (NPU), and function_gemma; the catalog lists no proprietary families.

## Summary Brings the Quick.AI public API up to the v0.4.0 surface in a single commit. Models are now identified by a string model id (not a C enum): each model self-registers its descriptor at load time and the catalog is exposed via `getModelCatalogJson()` (C API) and `ModelCatalog` (Android AAR). The build discovers model directories generically, so additional models can be dropped in without editing build files or the public API. The tree and history contain no proprietary model sources or references; `git grep -i gauss` matches only the Qualcomm SDK `GAUSSIAN` constant. ## Change - API: string-id descriptor registry + catalog JSON; per-model self-registration via constructors; a lazy (Meyers-singleton) registry so cross-library registration survives static-init order. - API: generic multimodal composer and a vision-encoder capability, decoupled from any specific model. - QNN: set the HTP backend-ext-config before multi-model sub-model loads; gemma4-e2b-qnn (NATIVE/NPU) bring-up. - AAR: `ModelCatalog.selectableFamilies()` to hide embedding-only models in the Run/OpenAI and Chat family pickers. - Build: model build hooks (meson + ndk-build) auto-discover model directories instead of naming them, so proprietary models plug in cleanly. - Guards: allow-list `.gitignore` and a `pre-push` hook that block any non-allow-listed model source directory from reaching the public remote (allow-list = `src/models/qnn/gemma4-e2b-qnn`). Verified on device (S26 Ultra): the Chat and OpenAI tabs run qwen3-0.6b, gemma4-e2b-qnn (NPU), and function_gemma; the catalog lists no proprietary families. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…edding + JNI/Kotlin)

…ction UI - Add user-editable MODEL BASE PATH field (default: /sdcard/Download/aistudio-mobile/models/) replacing the previously hardcoded path, available in both OpenAI and Chat tabs - Replace MODEL NAME dropdown with read-only folder name display derived from the model descriptor; show error message when the expected folder is missing - Remove Quantization chip selector from the OpenAI tab (W4A32 used internally) - Change default model from Gemma4 LiteRT/GPU to Gemma4 Native/NPU (GEMMA4_E2B_QNN) - Pass modelBasePath through createEngine() to LiteRTLm and buildLoadRequest() - Add bordered card style to Chat tab's model selection section - Preserve modelBasePathText across theme rebuilds from both OpenAI and Chat tabs Signed-off-by: jrock-oh <jrock.oh@samsung.com>

dlwlzzero requested review from Seunghui98, baek2sm, haehun, jaemini-shin, jayden0701 and jijoongmoon June 4, 2026 04:06

github-actions Bot added the Need Review label Jun 4, 2026

dlwlzzero and others added 3 commits June 5, 2026 16:57

Add plan and spec docs of siglip-pluggable design

3e93c2b

feat(aar,api): expose embedding encode API (encodeModelHandle/freeEmb…

df096e5

…edding + JNI/Kotlin)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[API] add string-id model catalog, multimodal composer, and QNN bring-up#36

[API] add string-id model catalog, multimodal composer, and QNN bring-up#36
dlwlzzero wants to merge 4 commits into
nntrainer:mainfrom
dlwlzzero:v0.4.0

dlwlzzero commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dlwlzzero commented Jun 4, 2026

Summary

Change

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants