SCST

Self-critical Sequence Training

Reinforcement LearningIntroduced 200012 papers