@NVIDIA

NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 786 stars · 152 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 426 stars · 74 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.8k stars · 1.6k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.8k stars · 243 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 4.2k stars · 499 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.9k stars · 1k forks

Repositories

Showing 10 of 706 repositories
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 13,201 stars · 2,218 forks · 558 issues · 641 pull requests · Updated Mar 27, 2026
  • aicr Public

    Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

    Go · 220 stars · Apache-2.0 · 22 forks · 14 issues · 11 pull requests · Updated Mar 27, 2026
  • nova Public Forked from torvalds/linux

    Linux kernel source tree

    C · 6 stars · 64,095 forks · 0 issues · 5 pull requests · Updated Mar 27, 2026
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    TypeScript · 122 stars · Apache-2.0 · 23 forks · 49 issues · 9 pull requests · Updated Mar 26, 2026
  • cccl Public

    CUDA Core Compute Libraries

    C++ · 2,244 stars · 370 forks · 1,282 issues (6 issues need help) · 236 pull requests · Updated Mar 27, 2026
  • stdexec Public

    `std::execution`, the proposed C++ framework for asynchronous and parallel programming.

    C++ · 2,281 stars · Apache-2.0 · 234 forks · 128 issues · 15 pull requests · Updated Mar 27, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    Python · 2,247 stars · Apache-2.0 · 319 forks · 67 issues · 121 pull requests · Updated Mar 27, 2026
  • NeMo-Agent-Toolkit-UI Public

    The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application.

    TypeScript · 88 stars · 54 forks · 6 issues · 12 pull requests · Updated Mar 27, 2026
  • ncx-infra-controller-core Public

    NCX Infra Controller - Hardware Lifecycle Management and multitenant networking

    Rust · 108 stars · Apache-2.0 · 70 forks · 116 issues (4 issues need help) · 48 pull requests · Updated Mar 27, 2026
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    C++ · 972 stars · Apache-2.0 · 353 forks · 432 issues (16 issues need help) · 106 pull requests · Updated Mar 27, 2026