Update README.md

WangRongsheng · web-flow · commit 55b350162b94 · 2025-11-23T18:12:02.000+08:00
diff --git a/README.md b/README.md
@@ -208,6 +208,7 @@ To speed up Long-context LLMs' inference, approximate and dynamic sparse calcula
 50. [Shimmy](https://github.com/Michael-A-Kuykendall/shimmy): Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary.
 51. [LlamaBarn](https://github.com/ggml-org/LlamaBarn): Run local LLMs on your Mac with a simple menu bar app.
 52. [Parallax](https://github.com/GradientHQ/parallax): a distributed model serving framework that lets you build your own AI cluster anywhere.
+53. [xLLM](https://github.com/jd-opensource/xllm): A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
 
 
 <div align="right">