NVIDIA Corporation

All

639 repositories

cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•297•2.1k•1.1k•193•Updated Dec 6, 2025Dec 6, 2025
aistore
Public
AIStore: scalable storage for AI applications
kubernetes high-performance distributed-storage high-availability object-storage multi-cloud batch-jobs s3-compatible multipart-upload ml-training
Go
•
MIT License
•227•1.7k•0•0•Updated Dec 6, 2025Dec 6, 2025
cuopt
Public
GPU accelerated decision optimization
gpu optimization cuda linear-programming
Cuda
•
Apache License 2.0
•97•595•72•27•Updated Dec 6, 2025Dec 6, 2025
TensorRT-Incubator
Public
Experimental projects related to TensorRT
MLIR
•18•116•37•11•Updated Dec 6, 2025Dec 6, 2025
Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•69•363•205•209•Updated Dec 6, 2025Dec 6, 2025
OSMO
Public
The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
Python
•
Apache License 2.0
•2•52•5•6•Updated Dec 6, 2025Dec 6, 2025
Megatron-LM
Public
Ongoing research training transformer models at scale
transformers model-para large-language-models
Python
•
Other
•3.3k•14k•329•246•Updated Dec 5, 2025Dec 5, 2025
nv-ingest
Public
NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
Python
•
Apache License 2.0
•277•2.8k•101•36•Updated Dec 5, 2025Dec 5, 2025
multi-storage-client
Public
Unified high-performance Python client for object and file stores.
Python
•
Apache License 2.0
•7•47•0•0•Updated Dec 5, 2025Dec 5, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•306•866•404•88•Updated Dec 5, 2025Dec 5, 2025
bionemo-framework
Public
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
machine-learning gpu pytorch drug-discovery
Jupyter Notebook
•104•596•60•102•Updated Dec 5, 2025Dec 5, 2025
TensorRT-Model-Optimizer
Public
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
Python
•
Apache License 2.0
•206•1.6k•68•44•Updated Dec 5, 2025Dec 5, 2025
gpu-operator
Public
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
kubernetes gpu cuda nvidia
Go
•
Apache License 2.0
•420•2.4k•95•68•Updated Dec 5, 2025Dec 5, 2025
sandbox-device-plugin
Public
Kubernetes Device Plugin to help cold plug vfio/iommufd GPUs in Kata VMs for Confidential Containers
Go
•
BSD 3-Clause "New" or "Revised" License
•1•1•0•6•Updated Dec 5, 2025Dec 5, 2025
spark-rapids-jni
Public
RAPIDS Accelerator JNI For Apache Spark
Cuda
•
Apache License 2.0
•74•52•82•6•Updated Dec 5, 2025Dec 5, 2025
ais-k8s
Public
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
Go
•
MIT License
•25•117•1•0•Updated Dec 5, 2025Dec 5, 2025
TensorRT-LLM
Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
cuda pytorch moe blackwell llm-serving
Python
•
Other
•1.9k•12k•614•460•Updated Dec 5, 2025Dec 5, 2025
numba-cuda
Public
The CUDA target for Numba
Python
•
BSD 2-Clause "Simplified" License
•47•222•99•25•Updated Dec 5, 2025Dec 5, 2025
accelerated-computing-hub
Public
NVIDIA curated collection of educational resources related to general purpose GPU programming.
Jupyter Notebook
•
Other
•164•917•13•3•Updated Dec 5, 2025Dec 5, 2025
aerial-cuda-accelerated-ran
Public
An SDK (Software Development Kit) for building commercial-grade, AI-native, 3GPP, and O-RAN compliant 5G/6G gNB software on NVIDIA-accelerated computing platforms.
C++
•
Other
•0•7•0•0•Updated Dec 5, 2025Dec 5, 2025
spark-rapids
Public
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
big-data spark gpu rapids
Scala
•
Apache License 2.0
•264•950•1.8k•29•Updated Dec 5, 2025Dec 5, 2025
cuda-python
Public
CUDA Python: Performance meets Productivity
Python
•
Other
•226•3.1k•196•14•Updated Dec 5, 2025Dec 5, 2025
skyhook
Public
A Kubernetes Operator to manage Node OS customizations.
Go
•
Apache License 2.0
•3•33•0•1•Updated Dec 5, 2025Dec 5, 2025
nim-deploy
Public
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
Jupyter Notebook
•
Apache License 2.0
•92•216•19•12•Updated Dec 5, 2025Dec 5, 2025
NeMo-Agent-Toolkit
Public
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Python
•
Apache License 2.0
•440•1.6k•54•26•Updated Dec 5, 2025Dec 5, 2025
NVSentinel
Public
NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
Go
•
Apache License 2.0
•25•98•40•6•Updated Dec 5, 2025Dec 5, 2025
cuEquivariance
Public
cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.
Python
•21•333•11•4•Updated Dec 5, 2025Dec 5, 2025
gds-nvidia-fs
Public
NVIDIA GPUDirect Storage Driver
C
•
Other
•52•304•31•4•Updated Dec 5, 2025Dec 5, 2025
tao_dataset_suite
Public
Set of advanced data augmentation and analytics tools
Python
•
Apache License 2.0
•4•9•0•3•Updated Dec 5, 2025Dec 5, 2025
tao_deploy
Public
Package for deploying deep learning models from TAO Toolkit
Python
•
Apache License 2.0
•5•23•5•3•Updated Dec 5, 2025Dec 5, 2025