Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fallback to triton if we fail to compile for CUDA #223

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

zbowling
Copy link
Contributor

@zbowling zbowling commented Feb 26, 2025

The CUDA driver might be available but the full CUDA toolkit, ninja, and gcc/clang might not be installed so we won't have enough tools to compile the kernel, and in this case use triton as a fallback.

We could consider doing an AOT of the kernel and not JITing the kernel extension, however this really blows the packaging complexity out of the water since we would have to target multiple PyTorch extension ABIs.

@zbowling zbowling changed the title Fallback triton if we fail to compile for CUDA Fallback to triton if we fail to compile for CUDA Feb 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant