Skip to content

chore: add llm hybrid inference use case #1061

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

andrei-stoian-zama
Copy link
Collaborator

@andrei-stoian-zama andrei-stoian-zama commented Apr 2, 2025

Adds a simple notebook that showcases GPT2 hybrid inference. It executes at 11s/token on a desktop gpu.

@cla-bot cla-bot bot added the cla-signed label Apr 2, 2025
@andrei-stoian-zama andrei-stoian-zama force-pushed the chore/add_llm_hybrid_inference_demo branch from 10f8053 to 882c96c Compare April 30, 2025 14:53
@andrei-stoian-zama andrei-stoian-zama marked this pull request as ready for review April 30, 2025 14:57
@andrei-stoian-zama andrei-stoian-zama requested a review from a team as a code owner April 30, 2025 14:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant