agi-css/hh-rlhf-sft

agi-css committed on May 29, 2023

Commit 876b22a · 1 parent: 6f56fc0

Update README.md

Files changed (1)

README.md +16 -1

@@ -38,4 +38,19 @@ We use the [Alpaca fine-tuning script](https://github.com/tatsu-lab/stanford_alp
 Although this project aims to better align current LMs with social norms, inappropriate content and inherent biases in the training data will still impair the alignment of the model.
-The model should not be used directly in any application, without a prior assessment of safety and fairness concerns specific to the application.

+The model should not be used directly in any application, without a prior assessment of safety and fairness concerns specific to the application.
+# Citation
+Please cite our paper if you use the data or code in this repo:
+```bibtex
+@misc{liu2023sociallyaligned,
+      title={Training Socially Aligned Language Models in Simulated Human Society},
+      author={Ruibo Liu and Ruixin Yang and Chenyan Jia and Ge Zhang and Denny Zhou and Andrew M. Dai and Diyi Yang and Soroush Vosoughi},
+      year={2023},
+      eprint={2305.16960},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```