Commit 82df7a1
Cpu KV Cache feature (#997)
Co-authored-by: wangzaijun <[email protected]>
Co-authored-by: niushengxiao <[email protected]>1 parent 3291109 commit 82df7a1
File tree
46 files changed
+2795
-385
lines changed- lightllm
- common/basemodel
- triton_kernel
- distributed
- server
- audioserver
- core/objs
- detokenization
- embed_cache
- httpserver_for_pd_master
- httpserver
- metrics
- multi_level_kv_cache
- router
- dynamic_prompt
- model_infer
- mode_backend
- continues_batch/pd_mode
- decode_node_impl
- prefill_node_impl
- diverse_backend
- req_queue
- chunked_prefill
- visualserver
- utils
- test/benchmark/service
- unit_tests/server/router/dynamic_prompt
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
46 files changed
+2795
-385
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
61 | 61 | | |
62 | 62 | | |
63 | 63 | | |
| 64 | + | |
| 65 | + | |
64 | 66 | | |
65 | 67 | | |
66 | 68 | | |
| |||
110 | 112 | | |
111 | 113 | | |
112 | 114 | | |
| 115 | + | |
| 116 | + | |
113 | 117 | | |
114 | 118 | | |
115 | 119 | | |
116 | 120 | | |
117 | 121 | | |
118 | 122 | | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
119 | 128 | | |
120 | 129 | | |
121 | 130 | | |
| |||
352 | 361 | | |
353 | 362 | | |
354 | 363 | | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
355 | 367 | | |
356 | | - | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
357 | 371 | | |
358 | 372 | | |
359 | 373 | | |
| |||
505 | 519 | | |
506 | 520 | | |
507 | 521 | | |
| 522 | + | |
| 523 | + | |
| 524 | + | |
508 | 525 | | |
509 | 526 | | |
510 | 527 | | |
511 | 528 | | |
512 | 529 | | |
513 | 530 | | |
514 | 531 | | |
| 532 | + | |
| 533 | + | |
515 | 534 | | |
516 | 535 | | |
517 | 536 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
58 | 58 | | |
59 | 59 | | |
60 | 60 | | |
| 61 | + | |
| 62 | + | |
61 | 63 | | |
62 | 64 | | |
63 | 65 | | |
| |||
0 commit comments