File size: 3,981 Bytes
c9d1c75 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 |
---
base_model: []
library_name: transformers
tags:
- mergekit
- merge
---
# WeirdCompound-v1.5-24b
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Notes
This is a multi-stage merge. There's little method to my madness and I just stopped when I arrived at something that I liked.
Starting point was DepravedCartographer-v1.0-24b with slight changes.
### Changelog
v1.1
* /intermediate/model/B: replaced anthracite-core/Mistral-Small-3.1-24B-Instruct-2503-HF with anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML
v1.2
* /intermediate/model/B: replaced anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-ChatML with [anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only](https://huggingface.co/anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only) for default tokenizer config.
v1.3
* /intermediate/model/A: replaced TheDrummer/Cydonia-24B-v3 with TheDrummer/Cydonia-24B-v4
* /intermediate/model/A: replaced Doctor-Shotgun/MS3.1-24B-Magnum-Diamond with Doctor-Shotgun/MS3.2-24B-Magnum-Diamond
* /intermediate/model/A: replaced Delta-Vector/Austral-24B-Winton with Delta-Vector/MS3.2-Austral-Winton
v1.4
* /intermediate/model/B: change recipe to use Doctor-Shotgun/MS3.2-24B-Magnum-Diamond and Delta-Vector/MS3.2-Austral-Winton
v1.5
didn't particularly care for v1.4. IMHO v1.3 was better
* /intermediate/model/A: replaced Doctor-Shotgun/MS3.2-24B-Magnum-Diamond with zerofata/MS3.2-PaintedFantasy-24B
* /intermediate/model/B: change recipe to use PocketDoc/Dans-PersonalityEngine-V1.3.0-24b and zerofata/MS3.2-PaintedFantasy-24B
### Merge Method
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [TheDrummer/Cydonia-24B-v4](https://huggingface.co/TheDrummer/Cydonia-24B-v4) as a base.
This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
This model was merged using the NuSLERP merge method using /intermediate/model/B as a base.
### Models Merged
The following models were included in the merge:
* [Delta-Vector/MS3.2-Austral-Winton](https://huggingface.co/Delta-Vector/MS3.2-Austral-Winton)
* [zerofata/MS3.2-PaintedFantasy-24B](https://huggingface.co/zerofata/MS3.2-PaintedFantasy-24B)
* [aixonlab/Eurydice-24b-v3.5](https://huggingface.co/aixonlab/Eurydice-24b-v3.5)
* [PocketDoc/Dans-PersonalityEngine-V1.3.0-24b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.3.0-24b)
* [anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only](https://huggingface.co/anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only)
* /intermediate/model/A
* /intermediate/model/B
* /intermediate/model/C
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model: TheDrummer/Cydonia-24B-v4 # Cydonia v4
merge_method: model_stock
dtype: bfloat16
models:
- model: aixonlab/Eurydice-24b-v3.5 # storytelling / RP
- model: TheDrummer/Cydonia-24B-v4 # sprinkle in some extra Cydonia v4
- model: PocketDoc/Dans-PersonalityEngine-V1.3.0-24b # Prompt Adherence
- model: Delta-Vector/MS3.2-Austral-Winton # Adventure
- model: zerofata/MS3.2-PaintedFantasy-24B # animu
```
→ `/intermediate/model/A` →
```yaml
merge_method: slerp
dtype: bfloat16
base_model: anthracite-core/Mistral-Small-3.2-24B-Instruct-2506-Text-Only
models:
- model: /intermediate/model/A
parameters:
t: 0.4
```
→ `/intermediate/model/B` →
```yaml
merge_method: nuslerp
dtype: bfloat16
base_model: /intermediate/model/B
models:
- model: PocketDoc/Dans-PersonalityEngine-V1.3.0-24b
parameters:
weight: 0.6
- model: zerofata/MS3.2-PaintedFantasy-24B
parameters:
weight: 0.4
```
→ `/intermediate/model/C` →
```yaml
merge_method: slerp
dtype: bfloat16
base_model: /intermediate/model/B
models:
- model: /intermediate/model/C
parameters:
t: 0.5
```
→ WeirdCompound-v1.5-24b
|