cuda-programming

Here are 388 public repositories matching this topic...

taskflow / taskflow

A General-purpose Task-parallel Programming System using Modern C++

multi-threading parallel parallel-computing multithreading concurrent-programming high-performance-computing heterogeneous-parallel-programming threadpool parallel-programming work-stealing taskflow gpu-programming taskparallelism multicore-programming cuda-programming

Updated Feb 13, 2025
C++

brucefan1983 / CUDA-Programming

Star

Sample codes for my CUDA programming book

molecular-dynamics-simulation gpu-programming cuda-programming

Updated Feb 10, 2025
Cuda

NVIDIA / cccl

Star

CUDA Core Compute Libraries

cpp hpc gpu modern-cpp parallel-computing cuda nvidia gpu-acceleration cuda-kernels gpu-computing parallel-algorithm parallel-programming nvidia-gpu gpu-programming cuda-library cpp-programming cuda-programming accelerated-computing cuda-cpp

Updated Feb 13, 2025
C++

eyalroz / cuda-api-wrappers

Star

Thin, unified, C++-flavored wrappers for the CUDA APIs

gpu modern-cpp cuda gpgpu api-wrapper gpu-memory gpu-computing cuda-driver-api cuda-toolkit cuda-device cuda-runtime-api cuda-driver gpgpu-computing cuda-api-wrappers cuda-programming

Updated Feb 13, 2025
C++

mit-han-lab / TinyChatEngine

Star

TinyChatEngine: On-Device LLM Inference Library

c arm deep-learning cpp x86-64 quantization edge-computing cuda-programming on-device-ai large-language-models

Updated Jul 4, 2024
C++

sail-sg / Adan

Star

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Updated Jul 2, 2024
Python

coreylowman / cudarc

Sponsor

Star

Safe rust wrapper around CUDA toolkit

rust gpu cuda cublas gpu-acceleration cuda-kernels cudnn cuda-toolkit nccl curand cuda-programming nvrtc

Updated Jan 28, 2025
Rust

harleyszhang / llm_note

Star

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

cuda-programming transformer-models kv-cache llm vllm llm-inference triton-kernels

Updated Feb 11, 2025
Python

nosferalatu / SimpleGPUHashTable

Star

A simple GPU hash table implemented in CUDA using lock free techniques

gpu cuda data-structures cuda-programming gpu-cuda-programs

Updated Feb 7, 2024
Cuda

PaddleJitLab / CUDATutorial

Star

A self-learning tutorail for CUDA High Performance Programing.

deep-learning cuda-programming

Updated Dec 17, 2024
JavaScript

jaredhoberock / stanford-cs193g-sp2010

Star

This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010

cuda cuda-kernels gpu-programming cuda-programming

Updated Jun 24, 2022
C++

HMUNACHI / cuda-repo

Sponsor

Star

From zero to hero CUDA for accelerating maths and machine learning on GPU.

machine-learning cuda cuda-kernels maths cuda-programming

Updated Jul 23, 2024
Cuda

MuGdxy / muda

Star

μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updating.

cuda cuda-programming cuda-cpp

Updated Feb 7, 2025
C++

ROCm / HIP-CPU

Star

An implementation of HIP that works on CPUs, across OSes.

cuda cpp17 hip spmd stl-algorithms parallel-algorithms cuda-programming hip-runtime hip-kernel-language hip-portability

Updated Mar 19, 2024
C++

SunsetQuest / CudaPAD

Star

CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.

windows gpu cuda nvidia ptx cuda-programming ptx-utils

Updated Jan 17, 2023
C#

eyalroz / cuda-kat

Star

CUDA kernel author's tools

patterns algorithms gpu constexpr modern-cpp cuda printf cpp11 utility-library cuda-kernels gpu-programming cuda-library elegant-coding cuda-programming utility-functions printf-functions

Updated Apr 24, 2022
Cuda

tgautam03 / xGeMM

Star

Accelerated General (FP32) Matrix Multiplication from scratch in CUDA

matrix-multiplication gpu-programming sgemm cuda-programming

Updated Jan 9, 2025
Cuda

mikeroyal / CUDA-Guide

Star

CUDA Guide

machine-learning awesome deep-learning gpu cuda resources gpgpu graphics-programming awesome-list cuda-kernels cuda-toolkit cuda-opengl cuda-support cuda-development cuda-driver cuda-library gpgpu-computing cuda-programming awesome-readme

Updated Jan 4, 2024
Cuda

emptysoal / cuda-image-preprocess

Star

Speed up image preprocess with cuda when handle image or tensorrt inference

deep-learning cuda image-processing cnn cuda-kernels cuda-demo tensorrt cuda-programming

Updated Jan 18, 2025
Cuda

FahimFBA / CUDA-WSL2-Ubuntu

Star

Install CUDA on Windows11 using WSL2

machine-learning deep-learning cuda deep-reinforcement-learning wsl machinelearning deeplearning cuda-toolkit cuda-support deeplearning-ai wsl-ubuntu machinelearning-python cuda-programming wsl2 wsl-environment cuda-wsl

Updated Aug 2, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the cuda-programming topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cuda-programming topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda-programming

Here are 388 public repositories matching this topic...

taskflow / taskflow

brucefan1983 / CUDA-Programming

NVIDIA / cccl

eyalroz / cuda-api-wrappers

mit-han-lab / TinyChatEngine

sail-sg / Adan

coreylowman / cudarc

harleyszhang / llm_note

nosferalatu / SimpleGPUHashTable

PaddleJitLab / CUDATutorial

jaredhoberock / stanford-cs193g-sp2010

HMUNACHI / cuda-repo

MuGdxy / muda

ROCm / HIP-CPU

SunsetQuest / CudaPAD

eyalroz / cuda-kat

tgautam03 / xGeMM

mikeroyal / CUDA-Guide

emptysoal / cuda-image-preprocess

FahimFBA / CUDA-WSL2-Ubuntu

Improve this page

Add this topic to your repo