Commit d7cb280
Parent(s): 358bd44
Typo in README fixed.
The description says the model is a 7B instead of 13B.
README.md CHANGED
@@ -2,7 +2,7 @@
 license: mit
 ---
 
-This model is a 7B [Self-RAG](https://selfrag.github.io/) model that generates outputs to diverse user queries as well as *reflection tokens* to call the retrieval system adaptively and criticize its own output and retrieved passages.
+This model is a 13B [Self-RAG](https://selfrag.github.io/) model that generates outputs to diverse user queries as well as *reflection tokens* to call the retrieval system adaptively and criticize its own output and retrieved passages.
 
 Self-RAG is trained on our instruction-following corpora with interleaving passages and reflection tokens using the standard next-token prediction objective, enabling efficient and stable learning with fine-grained feedback.
 At inference, we leverage reflection tokens covering diverse aspects of generations to sample the best output aligning users' preferences.
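
For context on the corrected description, a minimal sketch of how the 13B checkpoint could be loaded and queried with Hugging Face transformers. The repository ID, the Alpaca-style prompt template, and the decoding settings below are assumptions about the Self-RAG setup, not part of this commit.

```python
# Sketch only: load an assumed Self-RAG 13B checkpoint and generate a response
# that may interleave the answer with reflection tokens (e.g. retrieval decisions
# and self-critique markers, as described in the README).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "selfrag/selfrag_llama2_13b"  # assumed repository ID, not confirmed by this diff
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

# The model is instruction-tuned; an Alpaca-style prompt format is assumed here.
prompt = "### Instruction:\nWhat is the capital of France?\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Greedy decoding for a short answer; special tokens are kept in the decoded text
# so that any reflection tokens emitted by the model remain visible.
output_ids = model.generate(**inputs, max_new_tokens=100, do_sample=False)
print(tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=False))
```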