Commit cc1750b
StaticAttetnion runtime support for generate_full_logits=False (pytorch#16171)
Summary:
Prefill will produce the logits at position 0 when not generating full logits. Lookahead decoding requires full logits.
Reviewed By: billmguo
Differential Revision: D887904451 parent 5033840 commit cc1750b
File tree
3 files changed
+35
-5
lines changed- examples/models/llama
- runner
- tests
3 files changed
+35
-5
lines changedLines changed: 21 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
438 | 438 | | |
439 | 439 | | |
440 | 440 | | |
| 441 | + | |
| 442 | + | |
441 | 443 | | |
442 | 444 | | |
443 | 445 | | |
| |||
607 | 609 | | |
608 | 610 | | |
609 | 611 | | |
610 | | - | |
| 612 | + | |
611 | 613 | | |
612 | 614 | | |
| 615 | + | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
| 619 | + | |
| 620 | + | |
| 621 | + | |
613 | 622 | | |
614 | 623 | | |
615 | 624 | | |
| |||
622 | 631 | | |
623 | 632 | | |
624 | 633 | | |
625 | | - | |
| 634 | + | |
| 635 | + | |
| 636 | + | |
626 | 637 | | |
627 | 638 | | |
628 | | - | |
| 639 | + | |
629 | 640 | | |
630 | 641 | | |
631 | 642 | | |
| |||
648 | 659 | | |
649 | 660 | | |
650 | 661 | | |
| 662 | + | |
| 663 | + | |
| 664 | + | |
| 665 | + | |
| 666 | + | |
651 | 667 | | |
652 | 668 | | |
653 | 669 | | |
| |||
685 | 701 | | |
686 | 702 | | |
687 | 703 | | |
| 704 | + | |
688 | 705 | | |
689 | 706 | | |
690 | 707 | | |
| |||
968 | 985 | | |
969 | 986 | | |
970 | 987 | | |
| 988 | + | |
971 | 989 | | |
972 | 990 | | |
973 | 991 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
456 | 456 | | |
457 | 457 | | |
458 | 458 | | |
459 | | - | |
| 459 | + | |
| 460 | + | |
| 461 | + | |
| 462 | + | |
460 | 463 | | |
461 | 464 | | |
462 | 465 | | |
| |||
607 | 610 | | |
608 | 611 | | |
609 | 612 | | |
| 613 | + | |
| 614 | + | |
| 615 | + | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
610 | 619 | | |
611 | 620 | | |
612 | 621 | | |
613 | 622 | | |
614 | 623 | | |
615 | 624 | | |
616 | 625 | | |
| 626 | + | |
617 | 627 | | |
618 | 628 | | |
619 | 629 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
248 | 248 | | |
249 | 249 | | |
250 | 250 | | |
251 | | - | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
252 | 254 | | |
253 | 255 | | |
254 | 256 | | |
| |||
0 commit comments