TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/JEN-1: Text-Guided Universal Music Generation with Omnidir...

JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models

Peike Li, BoYu Chen, Yao Yao, Yikai Wang, Allen Wang, Alex Wang

2023-08-09Music GenerationText-to-Music Generation
PaperPDFCodeCode

Abstract

Music generation has attracted growing interest with the advancement of deep generative models. However, generating music conditioned on textual descriptions, known as text-to-music, remains challenging due to the complexity of musical structures and high sampling rate requirements. Despite the task's significance, prevailing generative models exhibit limitations in music quality, computational efficiency, and generalization. This paper introduces JEN-1, a universal high-fidelity model for text-to-music generation. JEN-1 is a diffusion model incorporating both autoregressive and non-autoregressive training. Through in-context learning, JEN-1 performs various generation tasks including text-guided music generation, music inpainting, and continuation. Evaluations demonstrate JEN-1's superior performance over state-of-the-art methods in text-music alignment and music quality while maintaining computational efficiency. Our demos are available at https://jenmusic.ai/audio-demos

Results

TaskDatasetMetricValueModel
Text-to-Music GenerationMusicCapsFAD2JEN-1
Text-to-Music GenerationMusicCapsKL_passt1.29JEN-1

Related Papers

WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling2025-07-14MusiScene: Leveraging MU-LLaMA for Scene Imagination and Enhanced Video Background Music Generation2025-07-08TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions with Full-Song Structure2025-06-29Exploring Adapter Design Tradeoffs for Low Resource Music Generation2025-06-26Let Your Video Listen to Your Music!2025-06-23MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners2025-06-23Benchmarking Music Generation Models and Metrics via Human Preference Studies2025-06-23AI-Generated Song Detection via Lyrics Transcripts2025-06-23