Pull requests: alibaba/rtp-llm
feat: support return raw prompt and input ids by parameters
#421
opened Dec 2, 2025 by soaringk
feat: [rocm] support online ptpc quantization for python
#420
opened Dec 2, 2025 by muse-coder
feat: [rocm] support online ptpc quantization and reuse cache for python
#416
opened Dec 2, 2025 by muse-coder
feat: optimize prefill invokeAddFusedQKVBiasTranspose
#413
opened Nov 29, 2025 by Bruce-Lee-LY
fix bug: insert to cache only when stream's kv cache is computed
#391
opened Nov 24, 2025 by xinfei-shi
feat: replace cuda allocator with virtual memory allocator
#385
opened Nov 20, 2025 by ZhangZhiPku