Large Language Model on Noun Phrase Canonicalization

Metric: Ambiguous dataset (higher is better)

LeaderboardDataset