ListUltraFeedback
TextsCC BY 4.0Introduced 2024-10-06
A listwise multi-response dataset for human preferences alignment. The dataset is derived from UltraFeedback and SimPO.
A listwise multi-response dataset for human preferences alignment. The dataset is derived from UltraFeedback and SimPO.