Skip to content

Commit 337f67a

Browse files
ErezSC42Mor Zusman
authored andcommitted
Merged in jurassic-2.5 (pull request vllm-project#1)
BA-78554: Jurassic 2.5 * worked on jurasic2.5 configuration file, updated jurassic2_5 modeling file to support alternating experts/attn layers * finished working the forward pass of jurassic3.py * finished working the forward pass of jurassic3.py * finished working the forward pass of jurassic3.py * jurassic_3 modeling file works, uses dummy weights initialized by "dummy" flag. Tokenizer raises issues, for now copying the mixtral tokenizer * changed default tokenizer vocab values, loading of custom .pt weight files works. * removed notebook * merging master to jurassic-2.5 to reset head * Merge branch 'master' into jurassic-2.5 * align to master Approved-by: Tomer Asida Approved-by: Mor Zusman
1 parent 0543476 commit 337f67a

File tree

5 files changed

+614
-0
lines changed

5 files changed

+614
-0
lines changed

vllm/model_executor/models/__init__.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,6 +54,7 @@
5454
"StableLmForCausalLM": ("stablelm", "StablelmForCausalLM"),
5555
"Starcoder2ForCausalLM": ("starcoder2", "Starcoder2ForCausalLM"),
5656
"XverseForCausalLM": ("xverse", "XverseForCausalLM"),
57+
"Jurassic3ForCausalLM": ("jurassic3", "Jurassic3ForCausalLM")
5758
}
5859

5960
# Architecture -> type.

0 commit comments

Comments
 (0)