sail
/

Zephyr-7B-DICE-Iter2

Text Generation

text-generation-inference

Model card Files Files and versions

Cameron-Chen commited on Mar 11

Commit

75daac7

·

verified ·

1 Parent(s): b2cc1fb

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -12,8 +12,6 @@ pipeline_tag: text-generation
 This model was developed using [Bootstrapping Language Models with DPO Implicit Rewards](https://arxiv.org/abs/2406.09760) (DICE) at iteration 2, based on the [HuggingFaceH4/zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) as the starting point.
-Code: https://github.com/sail-sg/dice
 ## Links to Other Models
 - [Zephyr-7B-DICE-Iter1](https://huggingface.co/sail/Zephyr-7B-DICE-Iter1)
 - [Zephyr-7B-DICE-Iter2](https://huggingface.co/sail/Zephyr-7B-DICE-Iter2)
@@ -33,6 +31,9 @@ Code: https://github.com/sail-sg/dice
 |[Zephyr-7B-DICE-Iter1](https://huggingface.co/sail/Zephyr-7B-DICE-Iter1) |19.03 |17.67
 |[Zephyr-7B-DICE-Iter2](https://huggingface.co/sail/Zephyr-7B-DICE-Iter2) |**20.71** |**20.16**
 ## Citation
 ```bibtex

 This model was developed using [Bootstrapping Language Models with DPO Implicit Rewards](https://arxiv.org/abs/2406.09760) (DICE) at iteration 2, based on the [HuggingFaceH4/zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) as the starting point.
 ## Links to Other Models
 - [Zephyr-7B-DICE-Iter1](https://huggingface.co/sail/Zephyr-7B-DICE-Iter1)
 - [Zephyr-7B-DICE-Iter2](https://huggingface.co/sail/Zephyr-7B-DICE-Iter2)
 |[Zephyr-7B-DICE-Iter1](https://huggingface.co/sail/Zephyr-7B-DICE-Iter1) |19.03 |17.67
 |[Zephyr-7B-DICE-Iter2](https://huggingface.co/sail/Zephyr-7B-DICE-Iter2) |**20.71** |**20.16**
+## Code
+https://github.com/sail-sg/dice
 ## Citation
 ```bibtex