NVIDIA Corporation

All

541 repositories

NeMo
Public
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models
Python
•
Apache License 2.0
•2.6k•13k•35•70•Updated Jan 8, 2025Jan 8, 2025
cccl
Public
CUDA Core Compute Libraries
cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing
C++
•
Other
•174•1.4k•893•77•Updated Jan 8, 2025Jan 8, 2025
VisRTX
Public
NVIDIA OptiX based implementation of ANARI
C++
•
Other
•27•246•5•0•Updated Jan 8, 2025Jan 8, 2025
cuda-quantum
Public
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
python cpp quantum quantum-computing hacktoberfest quantum-programming-language quantum-algorithms quantum-machine-learning unitaryhack
C++
•
Other
•197•581•296•39•Updated Jan 8, 2025Jan 8, 2025
spark-rapids-jni
Public
RAPIDS Accelerator JNI For Apache Spark
Cuda
•
Apache License 2.0
•68•44•72•12•Updated Jan 8, 2025Jan 8, 2025
Fuser
Public
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++
•
Other
•54•289•189•129•Updated Jan 8, 2025Jan 8, 2025
aistore
Public
AIStore: scalable storage for AI applications
kubernetes sds erasure-coding object-storage software-defined multiple-backends batch-jobs distributed-shuffle linear-scalability etl-offload
Go
•
MIT License
•183•1.3k•1•0•Updated Jan 8, 2025Jan 8, 2025
TransformerEngine
Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
python machine-learning deep-learning gpu cuda pytorch jax fp8
Python
•
Apache License 2.0
•341•2.1k•157•45•Updated Jan 8, 2025Jan 8, 2025
NV-Kernels
Public
Ubuntu kernels which are optimized for NVIDIA server systems
C
•
Other
•14•27•0•10•Updated Jan 8, 2025Jan 8, 2025
kvpress
Public
LLM KV cache compression made easy
python transformers inference pytorch kv-cache large-language-models llm long-context kv-cache-compression
Python
•
Apache License 2.0
•17•286•5•1•Updated Jan 8, 2025Jan 8, 2025
cloudai
Public
CloudAI Benchmark Framework
Python
•
Apache License 2.0
•23•43•0•12•Updated Jan 8, 2025Jan 8, 2025
spark-rapids
Public
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
big-data gpu rapids spark
Scala
•
Apache License 2.0
•241•847•1.5k•19•Updated Jan 8, 2025Jan 8, 2025
bionemo-framework
Public
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
machine-learning gpu pytorch drug-discovery
Python
•
Other
•28•253•28•61•Updated Jan 8, 2025Jan 8, 2025
cuda-python
Public
CUDA Python: Performance meets Productivity
Python
•
Other
•88•1k•109•10•Updated Jan 8, 2025Jan 8, 2025
warp
Public
A Python framework for high performance GPU simulation and graphics
Python
•
Other
•253•4.4k•81•5•Updated Jan 8, 2025Jan 8, 2025
cutlass
Public
CUDA Templates for Linear Algebra Subroutines
deep-learning cpp nvidia deep-learning-library gpu cuda
C++
•
Other
•1k•6k•205•35•Updated Jan 8, 2025Jan 8, 2025
cuEquivariance
Public
cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks.
Python
•7•156•4•4•Updated Jan 8, 2025Jan 8, 2025
NeMo-Guardrails
Public
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Python
•
Other
•414•4.3k•83•23•Updated Jan 8, 2025Jan 8, 2025
nvidia-container-toolkit
Public
Build and run containers leveraging NVIDIA GPUs
Go
•
Apache License 2.0
•293•2.6k•329•37•Updated Jan 8, 2025Jan 8, 2025
DALI
Public
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
python machine-learning deep-learning neural-network mxnet gpu image-processing pytorch gpu-tensorflow data-processing
C++
•
Apache License 2.0
•624•5.2k•206•42•Updated Jan 8, 2025Jan 8, 2025
NeMo-Curator
Public
Scalable data pre processing and curation toolkit for LLMs
python data data-processing data-preparation deduplication data-quality data-curation data-prep fine-tuning fast-data-processing
Jupyter Notebook
•
Apache License 2.0
•94•710•66•27•Updated Jan 8, 2025Jan 8, 2025
NeMo-Aligner
Public
Scalable toolkit for efficient model alignment
Python
•
Apache License 2.0
•83•664•68•45•Updated Jan 8, 2025Jan 8, 2025
gpu-operator
Public
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
kubernetes gpu cuda nvidia
Go
•
Apache License 2.0
•313•1.9k•288•29•Updated Jan 8, 2025Jan 8, 2025
k8s-device-plugin
Public
NVIDIA device plugin for Kubernetes
kubernetes
Go
•
Apache License 2.0
•642•2.9k•131•33•Updated Jan 8, 2025Jan 8, 2025
TensorRT-LLM
Public
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
C++
•
Apache License 2.0
•1.1k•9.1k•334•71•Updated Jan 8, 2025Jan 8, 2025
TensorRT-Model-Optimizer
Public
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Python
•
Other
•48•655•61•0•Updated Jan 8, 2025Jan 8, 2025
JAX-Toolbox
Public
JAX-Toolbox
Jupyter Notebook
•
Apache License 2.0
•52•270•116•39•Updated Jan 8, 2025Jan 8, 2025
spark-rapids-ml
Public
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
Jupyter Notebook
•
Apache License 2.0
•30•74•24•1•Updated Jan 8, 2025Jan 8, 2025
nv-ingest
Public
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retrieval systems.
Python
•
Apache License 2.0
•113•713•46•9•Updated Jan 8, 2025Jan 8, 2025
k8s-operator-libs
Public
A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.
Go
•
Apache License 2.0
•18•21•1•3•Updated Jan 8, 2025Jan 8, 2025