Skip to content

Commit 7e12d64

Browse files
authored
Add Mistral-Large-Instruct-2411 (#1876)
1 parent 7b26d35 commit 7e12d64

File tree

2 files changed

+21
-0
lines changed

2 files changed

+21
-0
lines changed

litgpt/config.py

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1663,6 +1663,26 @@ def norm_class(self) -> Type:
16631663
intermediate_size=28672,
16641664
)
16651665
)
1666+
configs.append(
1667+
# https://huggingface.co/mistralai/Mistral-Large-Instruct-2411/blob/main/config.json
1668+
dict(
1669+
name="Mistral-Large-Instruct-2411",
1670+
hf_config=dict(org="mistralai", name="Mistral-Large-Instruct-2411"),
1671+
padded_vocab_size=32768,
1672+
block_size=32768,
1673+
n_layer=88,
1674+
n_head=96,
1675+
n_embd=12288,
1676+
n_query_groups=8,
1677+
rotary_percentage=1.0,
1678+
parallel_residual=False,
1679+
bias=False,
1680+
norm_class_name="RMSNorm",
1681+
norm_eps=1e-05,
1682+
mlp_class_name="LLaMAMLP",
1683+
intermediate_size=28672,
1684+
)
1685+
)
16661686

16671687

16681688
############

tutorials/download_model_weights.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -171,6 +171,7 @@ mistralai/Mistral-7B-Instruct-v0.3
171171
mistralai/Mistral-7B-v0.1
172172
mistralai/Mistral-7B-v0.3
173173
mistralai/Mistral-Large-Instruct-2407
174+
mistralai/Mistral-Large-Instruct-2411
174175
mistralai/Mixtral-8x7B-Instruct-v0.1
175176
mistralai/Mixtral-8x7B-v0.1
176177
mistralai/Mixtral-8x22B-Instruct-v0.1

0 commit comments

Comments
 (0)