Commit 3600cc2
authored
llama : use n_swa + n_ubatch cells for SWA cache (ggml-org#13833)
* llama : use n_swa + n_ubatch cells for SWA cache
ggml-ci
* llama : add warning about multi-sqeuence SWA contexts1 parent c7e0a20 commit 3600cc2
File tree
6 files changed
+24
-11
lines changed- include
- src
- tools/server
6 files changed
+24
-11
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
366 | 366 | | |
367 | 367 | | |
368 | 368 | | |
| 369 | + | |
| 370 | + | |
369 | 371 | | |
370 | 372 | | |
371 | 373 | | |
| |||
502 | 504 | | |
503 | 505 | | |
504 | 506 | | |
| 507 | + | |
505 | 508 | | |
506 | 509 | | |
507 | 510 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
123 | 123 | | |
124 | 124 | | |
125 | 125 | | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
126 | 131 | | |
127 | 132 | | |
128 | 133 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1731 | 1731 | | |
1732 | 1732 | | |
1733 | 1733 | | |
1734 | | - | |
| 1734 | + | |
1735 | 1735 | | |
1736 | 1736 | | |
1737 | 1737 | | |
1738 | 1738 | | |
1739 | 1739 | | |
1740 | 1740 | | |
1741 | | - | |
| 1741 | + | |
1742 | 1742 | | |
1743 | 1743 | | |
1744 | 1744 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
339 | 339 | | |
340 | 340 | | |
341 | 341 | | |
342 | | - | |
| 342 | + | |
343 | 343 | | |
344 | 344 | | |
345 | 345 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
13230 | 13230 | | |
13231 | 13231 | | |
13232 | 13232 | | |
13233 | | - | |
| 13233 | + | |
13234 | 13234 | | |
13235 | 13235 | | |
13236 | 13236 | | |
| |||
13593 | 13593 | | |
13594 | 13594 | | |
13595 | 13595 | | |
| 13596 | + | |
| 13597 | + | |
| 13598 | + | |
| 13599 | + | |
13596 | 13600 | | |
13597 | 13601 | | |
13598 | 13602 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2016 | 2016 | | |
2017 | 2017 | | |
2018 | 2018 | | |
2019 | | - | |
2020 | | - | |
2021 | | - | |
2022 | | - | |
2023 | | - | |
2024 | 2019 | | |
2025 | 2020 | | |
2026 | 2021 | | |
| |||
3215 | 3210 | | |
3216 | 3211 | | |
3217 | 3212 | | |
3218 | | - | |
3219 | | - | |
| 3213 | + | |
| 3214 | + | |
| 3215 | + | |
| 3216 | + | |
| 3217 | + | |
| 3218 | + | |
| 3219 | + | |
| 3220 | + | |
3220 | 3221 | | |
3221 | 3222 | | |
3222 | 3223 | | |
| |||
0 commit comments