Skip to content

Commit 4de0f2f

Browse files
authored
chore(model gallery): add open-r1_openr1-qwen-7b (#4867)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 42ae807 commit 4de0f2f

File tree

1 file changed

+14
-0
lines changed

1 file changed

+14
-0
lines changed

gallery/index.yaml

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3930,6 +3930,20 @@
39303930
- filename: nbeerbower_Dumpling-Qwen2.5-72B-Q4_K_M.gguf
39313931
sha256: 384de5ba5e60255846cd38e2bfad0374b059fb627ba8abb02273186f28684385
39323932
uri: huggingface://bartowski/nbeerbower_Dumpling-Qwen2.5-72B-GGUF/nbeerbower_Dumpling-Qwen2.5-72B-Q4_K_M.gguf
3933+
- !!merge <<: *qwen25
3934+
name: "open-r1_openr1-qwen-7b"
3935+
urls:
3936+
- https://huggingface.co/open-r1/OpenR1-Qwen-7B
3937+
- https://huggingface.co/bartowski/open-r1_OpenR1-Qwen-7B-GGUF
3938+
description: |
3939+
This is a finetune of Qwen2.5-Math-Instruct on OpenR1-220k-Math (default split). We train the model on the default split of OpenR1-220k-Math for 3 epochs. We use learning rate of 5e-5 and extend the context length from 4k to 32k, by increasing RoPE frequency to 300k. The training follows a linear learning rate schedule with a 10% warmup phase.
3940+
overrides:
3941+
parameters:
3942+
model: open-r1_OpenR1-Qwen-7B-Q4_K_M.gguf
3943+
files:
3944+
- filename: open-r1_OpenR1-Qwen-7B-Q4_K_M.gguf
3945+
sha256: d3bf99666cd19b637948ec9943044b591d3b906d0ee4f3ef1b3eb693ac8f66a6
3946+
uri: huggingface://bartowski/open-r1_OpenR1-Qwen-7B-GGUF/open-r1_OpenR1-Qwen-7B-Q4_K_M.gguf
39333947
- &llama31
39343948
url: "github:mudler/LocalAI/gallery/llama3.1-instruct.yaml@master" ## LLama3.1
39353949
icon: https://avatars.githubusercontent.com/u/153379578

0 commit comments

Comments
 (0)