Change the repository type filter
All
Repositories list
639 repositories
cccl
PublicCUDA Core Compute Librariesaistore
PublicAIStore: scalable storage for AI applicationscuopt
PublicGPU accelerated decision optimizationTensorRT-Incubator
PublicFuser
PublicOSMO
PublicMegatron-LM
PublicOngoing research training transformer models at scalenv-ingest
PublicNeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.multi-storage-client
Publiccuda-quantum
PublicC++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows- BioNeMo Framework: For building and adapting AI models in drug discovery at scale
TensorRT-Model-Optimizer
PublicA unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.gpu-operator
Publicsandbox-device-plugin
Publicspark-rapids-jni
Publicais-k8s
PublicTensorRT-LLM
PublicTensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.numba-cuda
Publicspark-rapids
Publiccuda-python
Publicskyhook
Publicnim-deploy
PublicNeMo-Agent-Toolkit
PublicNVSentinel
PubliccuEquivariance
PubliccuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.gds-nvidia-fs
Publictao_dataset_suite
Publictao_deploy
Public