Replies: 1 comment
-
Hmm it looks like I cannot set a shared_expert with mistral models? Does that only work with QWEN models? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is this possible? I tried to use 3 Mistral Nemo models. But the final output marks this as:
Qwen2MoeForCausalLM
I tried setting the architecture to "Mixtral", but the output told me that the models were not compatible. I am assuming this means I cannot make an MOE using Mistral Nemo models? Thanks for reading and answering.
Beta Was this translation helpful? Give feedback.
All reactions