Commit 2effbbb
authored
Fix Falcon3-7b WOQ TP=3 with int4 checkpoint (#3402)
* Falcon3-7b WOQ TP=3 with int4 checkpoint
* Add assert
* Split mha according to kv_heads
* Fix bug with old deepspeed
* Fix run_accuracy_with_deepspeed.py1 parent 136c1d0 commit 2effbbb
File tree
2 files changed
+69
-51
lines changed- examples/cpu/llm/inference/distributed
2 files changed
+69
-51
lines changedLines changed: 32 additions & 28 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
224 | 224 | | |
225 | 225 | | |
226 | 226 | | |
227 | | - | |
228 | | - | |
229 | | - | |
230 | | - | |
231 | | - | |
232 | | - | |
233 | | - | |
234 | | - | |
235 | | - | |
236 | | - | |
237 | | - | |
238 | | - | |
239 | | - | |
240 | | - | |
241 | | - | |
242 | | - | |
243 | | - | |
244 | | - | |
245 | | - | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
| 255 | + | |
246 | 256 | | |
247 | 257 | | |
248 | 258 | | |
| |||
420 | 430 | | |
421 | 431 | | |
422 | 432 | | |
423 | | - | |
424 | | - | |
425 | | - | |
| 433 | + | |
426 | 434 | | |
427 | 435 | | |
428 | 436 | | |
| |||
1495 | 1503 | | |
1496 | 1504 | | |
1497 | 1505 | | |
1498 | | - | |
1499 | | - | |
1500 | | - | |
| 1506 | + | |
1501 | 1507 | | |
1502 | 1508 | | |
1503 | 1509 | | |
| |||
2263 | 2269 | | |
2264 | 2270 | | |
2265 | 2271 | | |
2266 | | - | |
2267 | | - | |
2268 | | - | |
| 2272 | + | |
2269 | 2273 | | |
2270 | 2274 | | |
2271 | 2275 | | |
| |||
Lines changed: 37 additions & 23 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
486 | 486 | | |
487 | 487 | | |
488 | 488 | | |
489 | | - | |
490 | | - | |
491 | | - | |
492 | | - | |
493 | | - | |
494 | | - | |
495 | | - | |
496 | | - | |
497 | | - | |
498 | | - | |
499 | | - | |
500 | | - | |
501 | | - | |
502 | | - | |
503 | | - | |
504 | | - | |
505 | | - | |
506 | | - | |
507 | | - | |
| 489 | + | |
| 490 | + | |
| 491 | + | |
| 492 | + | |
| 493 | + | |
| 494 | + | |
| 495 | + | |
| 496 | + | |
| 497 | + | |
| 498 | + | |
| 499 | + | |
| 500 | + | |
| 501 | + | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
508 | 517 | | |
509 | 518 | | |
510 | 519 | | |
511 | 520 | | |
512 | 521 | | |
513 | 522 | | |
514 | 523 | | |
515 | | - | |
516 | | - | |
517 | | - | |
518 | 524 | | |
519 | 525 | | |
520 | 526 | | |
| |||
659 | 665 | | |
660 | 666 | | |
661 | 667 | | |
| 668 | + | |
| 669 | + | |
662 | 670 | | |
663 | 671 | | |
664 | 672 | | |
| |||
712 | 720 | | |
713 | 721 | | |
714 | 722 | | |
715 | | - | |
| 723 | + | |
| 724 | + | |
| 725 | + | |
| 726 | + | |
| 727 | + | |
| 728 | + | |
| 729 | + | |
716 | 730 | | |
717 | 731 | | |
718 | 732 | | |
| |||
0 commit comments