Is there anyone run llamacpp on Jetson Orin SoC or other devices, how about the performance? #5059
Unanswered
adamydwang
asked this question in
Q&A
Replies: 1 comment 3 replies
-
Probably a bit late, but on Jetson Orin AGX 64GB I get approx 280 tks/s on llama2 7B:
|
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I want to know the performance of 7b or 13b models on device chips, especially the first token latency
Beta Was this translation helpful? Give feedback.
All reactions