Skip to content
Change the repository type filter

All

    Repositories list

    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      2972.1k1.1k193Updated Dec 6, 2025Dec 6, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2271.7k00Updated Dec 6, 2025Dec 6, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      975957227Updated Dec 6, 2025Dec 6, 2025
    • Experimental projects related to TensorRT
      MLIR
      181163711Updated Dec 6, 2025Dec 6, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      69363205209Updated Dec 6, 2025Dec 6, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      25256Updated Dec 6, 2025Dec 6, 2025
    • Ongoing research training transformer models at scale
      Python
      3.3k14k329246Updated Dec 5, 2025Dec 5, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2772.8k10136Updated Dec 5, 2025Dec 5, 2025
    • Unified high-performance Python client for object and file stores.
      Python
      74700Updated Dec 5, 2025Dec 5, 2025
    • C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      30686640488Updated Dec 5, 2025Dec 5, 2025
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      10459660102Updated Dec 5, 2025Dec 5, 2025
    • A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2061.6k6844Updated Dec 5, 2025Dec 5, 2025
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4202.4k9568Updated Dec 5, 2025Dec 5, 2025
    • Kubernetes Device Plugin to help cold plug vfio/iommufd GPUs in Kata VMs for Confidential Containers
      Go
      1106Updated Dec 5, 2025Dec 5, 2025
    • RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7452826Updated Dec 5, 2025Dec 5, 2025
    • ais-k8s

      Public
      Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
      Go
      2511710Updated Dec 5, 2025Dec 5, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      1.9k12k614460Updated Dec 5, 2025Dec 5, 2025
    • The CUDA target for Numba
      Python
      472229925Updated Dec 5, 2025Dec 5, 2025
    • NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      164917133Updated Dec 5, 2025Dec 5, 2025
    • An SDK (Software Development Kit) for building commercial-grade, AI-native, 3GPP, and O-RAN compliant 5G/6G gNB software on NVIDIA-accelerated computing platforms.
      C++
      0700Updated Dec 5, 2025Dec 5, 2025
    • Spark RAPIDS plugin - accelerate Apache Spark with GPUs
      Scala
      2649501.8k29Updated Dec 5, 2025Dec 5, 2025
    • CUDA Python: Performance meets Productivity
      Python
      2263.1k19614Updated Dec 5, 2025Dec 5, 2025
    • skyhook

      Public
      A Kubernetes Operator to manage Node OS customizations.
      Go
      33301Updated Dec 5, 2025Dec 5, 2025
    • A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
      Jupyter Notebook
      922161912Updated Dec 5, 2025Dec 5, 2025
    • The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4401.6k5426Updated Dec 5, 2025Dec 5, 2025
    • NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      2598406Updated Dec 5, 2025Dec 5, 2025
    • cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.
      Python
      21333114Updated Dec 5, 2025Dec 5, 2025
    • NVIDIA GPUDirect Storage Driver
      C
      52304314Updated Dec 5, 2025Dec 5, 2025
    • Set of advanced data augmentation and analytics tools
      Python
      4903Updated Dec 5, 2025Dec 5, 2025
    • Package for deploying deep learning models from TAO Toolkit
      Python
      52353Updated Dec 5, 2025Dec 5, 2025