Commit 927c8d5
StaticAttetnion runtime support for generate_full_logits=False (pytorch#16171)
Summary:
Prefill will produce the logits at position 0 when not generating full logits. Lookahead decoding requires full logits.
Reviewed By: billmguo
Differential Revision: D887904451 parent 71ebc50 commit 927c8d5
File tree
3 files changed
+33
-5
lines changed- examples/models/llama
- runner
- tests
3 files changed
+33
-5
lines changedLines changed: 19 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
438 | 438 | | |
439 | 439 | | |
440 | 440 | | |
| 441 | + | |
| 442 | + | |
441 | 443 | | |
442 | 444 | | |
443 | 445 | | |
| |||
607 | 609 | | |
608 | 610 | | |
609 | 611 | | |
610 | | - | |
| 612 | + | |
611 | 613 | | |
612 | 614 | | |
| 615 | + | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
| 619 | + | |
613 | 620 | | |
614 | 621 | | |
615 | 622 | | |
| |||
622 | 629 | | |
623 | 630 | | |
624 | 631 | | |
625 | | - | |
| 632 | + | |
| 633 | + | |
| 634 | + | |
626 | 635 | | |
627 | 636 | | |
628 | | - | |
| 637 | + | |
629 | 638 | | |
630 | 639 | | |
631 | 640 | | |
| |||
648 | 657 | | |
649 | 658 | | |
650 | 659 | | |
| 660 | + | |
| 661 | + | |
| 662 | + | |
| 663 | + | |
| 664 | + | |
651 | 665 | | |
652 | 666 | | |
653 | 667 | | |
| |||
685 | 699 | | |
686 | 700 | | |
687 | 701 | | |
| 702 | + | |
688 | 703 | | |
689 | 704 | | |
690 | 705 | | |
| |||
968 | 983 | | |
969 | 984 | | |
970 | 985 | | |
| 986 | + | |
971 | 987 | | |
972 | 988 | | |
973 | 989 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
456 | 456 | | |
457 | 457 | | |
458 | 458 | | |
459 | | - | |
| 459 | + | |
| 460 | + | |
| 461 | + | |
| 462 | + | |
460 | 463 | | |
461 | 464 | | |
462 | 465 | | |
| |||
607 | 610 | | |
608 | 611 | | |
609 | 612 | | |
| 613 | + | |
| 614 | + | |
| 615 | + | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
610 | 619 | | |
611 | 620 | | |
612 | 621 | | |
613 | 622 | | |
614 | 623 | | |
615 | 624 | | |
616 | 625 | | |
| 626 | + | |
617 | 627 | | |
618 | 628 | | |
619 | 629 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
248 | 248 | | |
249 | 249 | | |
250 | 250 | | |
251 | | - | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
252 | 254 | | |
253 | 255 | | |
254 | 256 | | |
| |||
0 commit comments