Fallback to triton if we fail to compile for CUDA #223

zbowling · 2025-02-26T22:56:23Z

The CUDA driver might be available while the full CUDA toolkit, ninja, and gcc/clang are not installed, in which case we don't have the tools needed to compile the kernel. When that happens, use triton as a fallback.
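A minimal sketch of the fallback pattern described above, not the actual code in this PR. The names `pick_kernel`, `compile_cuda`, and `load_triton` are hypothetical: the two callables stand in for whatever JIT-builds the CUDA extension and whatever returns the triton implementation.

```python
import warnings
from typing import Any, Callable, Tuple


def pick_kernel(
    compile_cuda: Callable[[], Any],
    load_triton: Callable[[], Any],
) -> Tuple[str, Any]:
    """Try the JIT-compiled CUDA kernel first, then fall back to triton.

    Both arguments are hypothetical zero-argument loaders. `compile_cuda`
    is expected to raise if the toolchain (CUDA toolkit, ninja, gcc/clang)
    is missing or the build fails.
    """
    try:
        return "cuda", compile_cuda()
    except Exception as exc:  # build failures surface as several exception types
        warnings.warn(f"CUDA kernel compilation failed ({exc!r}); using triton instead")
        return "triton", load_triton()
```

Catching a broad `Exception` is a deliberate choice in this sketch, since a missing compiler, a missing ninja, and a failed build may each raise something different. The warning keeps the fallback visible instead of silent.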

We could instead compile the kernel ahead of time (AOT) rather than JIT-compiling the extension, but that would greatly increase packaging complexity, since we would have to target multiple PyTorch extension ABIs.

Fallback if we fail to compile for cuda

ec8824c

zbowling changed the title ~~Fallback triton if we fail to compile for CUDA~~ Fallback to triton if we fail to compile for CUDA Feb 26, 2025

zbowling mentioned this pull request Feb 27, 2025

XGrammar forcing nvcc + ninja on Linux #224

Open

zbowling added 2 commits February 27, 2025 11:36

Update apply_token_bitmask_inplace_cuda.py

5a32100

Update apply_token_bitmask_inplace_cuda.py

6c82281
