Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Will this awesome proj consider supporting GPU acceleration? #35

Open
galenyu opened this issue Apr 29, 2024 · 5 comments
Open

Will this awesome proj consider supporting GPU acceleration? #35

galenyu opened this issue Apr 29, 2024 · 5 comments

Comments

@galenyu
Copy link

galenyu commented Apr 29, 2024

A very impressive job!

But it doesn't seem to support the use of GPU. Does the author consider developing code that supports GPU acceleration?

Any suggestions to migrate this project to CUDA/HIP acceleration?

Thanks for any help!

@b4rtaz
Copy link
Owner

b4rtaz commented Apr 29, 2024

Hello @galenyu! Yes, GPU acceleration is planned.

@460130107
Copy link

hello, thanks for your job. when will the gpu acceleration version be released?

@lipere123
Copy link

lipere123 commented Aug 14, 2024

Hello.
I am currently trying dllama.
Also I have a supercomputer, 6 nodes, 96Cores, 768Go RAM, 6 PNY Nvidia RTX 4000 Ada Generation, and I need GPU support.

So, no promise, because I have other related project on the fire, but what is missing, please ?
Can you do me a summary of your advancement, ... etc.

Thanks in advance.
Best Regards.
Benjamin.

@lipere123
Copy link

I can run a Llama 3.1 70B Instruct Q40

@pcfreak30
Copy link

@b4rtaz im bumping this as im interested to know what the status of this is?

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants