migration-bench-java-utg

TextsApache 2.0Introduced 2025-05-14

🤗 MigrationBench is a large-scale code migration benchmark dataset at the repository level, across multiple programming languages.

  • Current and initial release includes java 8 repositories with the maven build system, as of May 2025.

It has 3 datasets:

  1. 🤗 migration-bench-java-full has 5,102 repos, and each of them has a test directory or at least one test case.

  2. 🤗 migration-bench-java-selected is a subset of migration-bench-java-full, with 300 repos.

  3. 🤗 migration-bench-java-utg contains 4,184 repos, complementary to migration-bench-java-full.