Commit 63161b4

Update GPULlama3_ROADMAP.md
1 parent de1838f commit 63161b4

1 file changed, +3 -4 lines changed

docs/GPULlama3_ROADMAP.md

Lines changed: 3 additions & 4 deletions
@@ -2,9 +2,9 @@

  - [Pending Merge] **LangChain4j integration**
  - [ ] **Additional quantization formats**
- - [ ] Q8
+ - [x] Q8
  - [ ] Q4
- - [ ] INT8 native support for GPUs
+ - [x] INT8 native support for GPUs
  - [ ] **Additional architectures and model format**
  - [x] Mistral/Mixtral models
  - [x] Qwen
@@ -20,5 +20,4 @@
  - [ ] **Performance optimizations**
  - [ ] Multi-GPU support
  - [X] Memory-efficient attention mechanisms
- - [ ] More Kernel fusion improvements
- - [ ] **GraalVM Native Image**
+ - [x] More Kernel fusion improvements
