Commit 3388e7c

metascroy authored and facebook-github-bot committed
Expose static llama in OSS
Summary: This exposes a static llama model for CoreML. We want to unify development behind one static model so that we do not need to apply fixes in multiple places (e.g., iOS 26 fixes).

Differential Revision: D88875745
1 parent 8d06ed3 commit 3388e7c

4 files changed, +856 -0 lines changed

.ci/scripts/test_ane_static_llama.sh

Lines changed: 2 additions & 0 deletions
@@ -30,4 +30,6 @@ download_stories_model_artifacts
 
 python export.py -n model.pte -p params.json -c stories110M.pt --seq_length 32 --max_seq_length 64 --dtype fp16 --coreml-quantize c4w --embedding-quantize 4,32
 
+python export_static_llm_coreml.py -n model.pte -p params.json -c stories110M.pt
+
 popd
