jeffmeloy/Qwen2.5-7B-olm-v1.5

jeffmeloy committed on Feb 27

Commit db8921c · verified · 1 Parent(s): 317d855

Update README.md

Files changed (1)

README.md +27 -3

@@ -1,3 +1,27 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+base_model:
+- Qwen/Qwen2.5-7B
+pipeline_tag: text-generation
+language:
+- en
+library_name: transformers
+tags:
+- text-generation-inference
+---
+## Model Description
+Optimized Layer Merging (OLM)
+is a transformer optimization framework that implements automated layer recombination.
+OLM builds a Frankenstein's monster out of language models: it cherry-picks the best-performing layers from different models and combines them into a hybrid intended to outperform its sources.
+The core mechanism:
+- Takes multiple language models as input
+- Uses a base model as the foundation
+- Iteratively replaces individual layers, evaluating performance on specified datasets
+- Keeps the best-performing layer at each position, judged by metrics such as perplexity, exact match, and a custom "quality" score
+- Builds a fusion model layer-by-layer while maintaining or improving performance
+https://github.com/jeffmeloy/olm
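
The core mechanism described in the added README text can be sketched as a greedy, position-by-position search. This is a toy illustration only, not the code from the linked repository: layers are stood in for by plain floats, the three metrics are collapsed into one hypothetical scalar `score` (lower is better, as with perplexity), and the names `greedy_layer_merge` and `toy_score` are invented for this example.

```python
from typing import Callable, List, Sequence

# Assumption: a "layer" is a plain float here. In a real merge it would be
# the weight tensors of one transformer block.
Layer = float
Model = List[Layer]


def greedy_layer_merge(
    base: Model,
    donors: Sequence[Model],
    score: Callable[[Model], float],
) -> Model:
    """Greedily swap in donor layers, keeping a swap only if it lowers the score.

    `score` is a hypothetical evaluation function where lower is better.
    """
    merged = list(base)          # start from the base model
    best = score(merged)
    for pos in range(len(merged)):
        for donor in donors:
            candidate = list(merged)
            candidate[pos] = donor[pos]   # replace one layer at this position
            s = score(candidate)
            if s < best:                  # keep only strict improvements
                merged, best = candidate, s
    return merged


def toy_score(model: Model) -> float:
    """Stand-in metric: squared distance to an arbitrary target vector."""
    target = [1.0, 2.0, 3.0]
    return sum((a - b) ** 2 for a, b in zip(model, target))


base = [0.0, 2.0, 0.0]
donors = [[1.0, 9.0, 9.0], [5.0, 5.0, 3.0]]
merged = greedy_layer_merge(base, donors, toy_score)
print(merged, toy_score(merged))
```

Because a swap is accepted only when the score strictly improves, the merged model never scores worse than the base on the evaluation data, which matches the "maintaining or improving performance" claim in the list above. How the actual framework combines its several metrics is not stated in the README text, so the single scalar here is purely a simplification.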