TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/JanusFlow

JanusFlow

Reported on 6 benchmarks across 6 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing3 results

  • Visual Question Answering (VQA)onMM-Vet
    GPT-4 score· 2024-11-12
    30.9
    best: 74.24 (MMCTAgent (GPT-4 + GPT-4V))
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975
  • Text-to-Image GenerationonGenEval
    Overall· 2024-11-12
    0.63
    best: 0.95 (SD3.5-Medium+Flow-GRPO)
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975
  • Visual Question AnsweringonMM-Vet
    GPT-4 score· 2024-11-12
    30.9
    best: 74.24 (MMCTAgent (GPT-4 + GPT-4V))
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975

Audio2 results

  • 10-shot image generationonGenEval
    Overall· 2024-11-12
    0.63
    best: 0.95 (SD3.5-Medium+Flow-GRPO)
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975
  • 1 Image, 2*2 StitchionGenEval
    Overall· 2024-11-12
    0.63
    best: 0.95 (SD3.5-Medium+Flow-GRPO)
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975

Medical1 result

  • Image GenerationonGenEval
    Overall· 2024-11-12
    0.63
    best: 0.95 (SD3.5-Medium+Flow-GRPO)
    JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and GenerationarXiv:2411.07975