ListUltraFeedback

TextsCC BY 4.0Introduced 2024-10-06

A listwise multi-response dataset for human preferences alignment. The dataset is derived from UltraFeedback and SimPO.