CUDA with additional MPS and XPU support #1075

exdysa · 2025-03-16T02:09:39Z

These are a few simple lines that let MPS and XPU devices pass the CUDA-only checks, so that GPUs other than NVIDIA's can run.

Cache handling, device selection, and GPU assert statements have been changed, though it is possible I overlooked something.
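For reviewers, here is a minimal sketch of the kind of gatekeeping change described above. It is not the code in this PR: `pick_device` and `empty_cache` are hypothetical helper names, and the `getattr` guards assume that older PyTorch builds may lack `torch.xpu` or `torch.mps`. The `try`/`except` around the import only exists so the sketch runs where PyTorch is not installed.

```python
try:
    import torch
except ImportError:  # assumption: allow the sketch to run without PyTorch
    torch = None


def pick_device() -> str:
    """Return the first available backend: cuda, then mps, then xpu, else cpu."""
    if torch is None:
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    xpu = getattr(torch, "xpu", None)  # missing on older PyTorch builds
    if xpu is not None and xpu.is_available():
        return "xpu"
    return "cpu"


def empty_cache(device: str) -> None:
    """Free cached memory on the backend named by `device`, if it supports that."""
    if torch is None:
        return
    kind = str(device).split(":")[0]
    backend = getattr(torch, kind, None)  # torch.cuda, torch.mps or torch.xpu
    fn = getattr(backend, "empty_cache", None)
    if callable(fn):
        fn()


device = pick_device()
empty_cache(device)
print(device)
```

The point of the sketch is that every `torch.cuda.is_available()` check and every `torch.cuda.empty_cache()` call gets a backend-neutral equivalent, with CPU as the fallback.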

Note: I am unsure whether the following lines in cosyvoice/cli/model.py need to be changed. This one (lines 67 and 103) I left alone because I am not familiar with TensorRT:
self.flow.decoder.estimator_engine = trt.Runtime(trt.Logger(trt.Logger.INFO)).deserialize_cuda_engine(f.read())

And this one will assign nullcontext() on non-CUDA devices, but I know of no alternative:
67/336
self.llm_context = torch.cuda.stream(torch.cuda.Stream(self.device)) if torch.cuda.is_available() else nullcontext()
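One possible direction, offered only as an untested sketch: choose the stream context per backend and keep `nullcontext()` where no stream API is available. `stream_context` is a hypothetical helper, not part of CosyVoice. It assumes that recent PyTorch builds expose `torch.xpu.Stream` and `torch.xpu.stream` (guarded with `hasattr` in case they do not), and that MPS has no comparable public stream API, so MPS and CPU still fall through to `nullcontext()`.

```python
from contextlib import nullcontext

try:
    import torch
except ImportError:  # assumption: allow the sketch to run without PyTorch
    torch = None


def stream_context(device):
    """Return a stream context manager for `device`, or nullcontext() if none applies."""
    kind = str(device).split(":")[0]
    if torch is None:
        return nullcontext()
    if kind == "cuda" and torch.cuda.is_available():
        return torch.cuda.stream(torch.cuda.Stream(torch.device(device)))
    if kind == "xpu":
        xpu = getattr(torch, "xpu", None)
        # Assumption: Stream/stream exist on this build; otherwise fall through.
        if (xpu is not None and xpu.is_available()
                and hasattr(xpu, "Stream") and hasattr(xpu, "stream")):
            return xpu.stream(xpu.Stream(torch.device(device)))
    # MPS and CPU: no dedicated stream, so run on the default queue.
    return nullcontext()


with stream_context("cpu"):
    pass  # the LLM job would run here
```

With a helper like this, the assignment would become `self.llm_context = stream_context(self.device)`. Falling back to `nullcontext()` on MPS should still be functionally correct; it just means the LLM job is not placed on a separate stream.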


e1732a364fed · 2025-03-31T16:42:39Z

In cosyvoice/cli/model.py, the two places that reference mps are missing a closing quote:

 elif self.device == "mps:

exdysa#1

fix close quotes in model.py
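As a side note, string comparisons like the one above are easy to get wrong when `self.device` may be either a string such as `"cuda:0"` or a `torch.device`. A small sketch of one way to normalize it first follows; `device_kind` is a hypothetical helper, not something in the repository, and it relies only on `str()` of the device yielding the usual `type:index` form.

```python
def device_kind(device) -> str:
    """Return the backend name ('cuda', 'mps', 'xpu', 'cpu') for a str or torch.device."""
    # str(torch.device("cuda:0")) is "cuda:0", so both input types take the same path.
    return str(device).split(":")[0].lower()


for d in ("cuda:0", "mps", "xpu:1", "cpu"):
    kind = device_kind(d)
    if kind == "cuda":
        print(d, "-> CUDA path")
    elif kind == "mps":
        print(d, "-> MPS path")
    elif kind == "xpu":
        print(d, "-> XPU path")
    else:
        print(d, "-> CPU path")
```

Comparing the normalized kind with `==` avoids both the unbalanced-quote typo pattern and checks such as `"cuda" in self.device`, which only work when the device is a plain string.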

exdysa · 2025-03-31T18:21:31Z

In cosyvoice/cli/model.py, the two places that reference mps are missing a closing quote

Thank you! Now type checked & linted. Needs some testing.

I have not installed this, because I do not know how to fix these two lines. If you would like to test it with me, I can help further.

self.flow.decoder.estimator_engine = trt.Runtime(trt.Logger(trt.Logger.INFO)).deserialize_cuda_engine(f.read())

self.llm_context = torch.cuda.stream(torch.cuda.Stream(self.device)) if torch.cuda.is_available() else nullcontext()

exdysa added 8 commits March 15, 2025 21:30

~~CUDA only~~ CUDA +MPS +XPU

0cfb5b8

CUDA+MPS+XPU

ebcfa8e

CUDA+MPS+XPU

418eaf3

CUDA+MPS+XPU

cc81ceb

CUDA+MPS+XPU

8d22d18

CUDA+MPS+XPU

0a425a2

Update inference.py

b0d9a43

exdysa mentioned this pull request Mar 27, 2025

Is the Mac mini M4 supported? During generation the CPU is at 100% while the GPU sits idle with no load. #1011

Open

e1732a364fed and others added 5 commits March 31, 2025 09:51

fix close quotes in model.py

4ba8c13

fix if "cuda" in self.device: error in model.py

645821f

Merge pull request #1 from e1732a364fed/CUDA+MPS+XPU

ada35c1

fix close quotes in model.py

Correct available checks, gpu args & cache

ae13e20

Merge branch 'FunAudioLLM:main' into CUDA+MPS+XPU

94c3ea8

missed a torch.cuda.is_available() in haste

bb3c7db
