OpenMusic: SOTA Text-to-music (TTM) Generation
-
Updated
Jun 26, 2025 - Python
OpenMusic: SOTA Text-to-music (TTM) Generation
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Mustango: Toward Controllable Text-to-Music Generation
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Local windowed attention multi-instrumental music transformer for supervised music generation
SOTA Google's Perceiver-AR Music Transformer Implementation and Model
some generative audio tools for ComfyUI
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velocity and outro tokens
Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]
【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生视频(SVD),Ai 动作迁移,Ai 虚拟试衣,数字人,全模态理解(Omni),Ai音乐生成 干货学习 等 实战与经验。
[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
[DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, optimized for speed, efficiency, and performance
Turn your words into music! Describe a sound (e.g., happy, spooky) and this app generates a short piece based on your text.
[Exclusive for GitHub] deep-muse: Advanced Text-to-Music Generator Implementation
Exploring Bark, the Open-Source Text-to-Audio Generative Model
Connect your generative AI apps with Suno API in seconds!
James Skripchuk's code to convert text to midi. I am going to convert this into flask based app. It uses NLTK to read files, then convert to sentences, then words and then turn those words into an awesome pieces of music. Core work is done by James Skripchuk, I just gave the proper interface to his work.
Five AI function sections for an Office Personal Data Assistant (OPDA) in Conversations, Coding, Images and Video Production, Music Composing, and Tech Support Chat.
Generative AI version of the GeoGuesser game.
Add a description, image, and links to the text-to-music topic page so that developers can more easily learn about it.
To associate your repository with the text-to-music topic, visit your repo's landing page and select "manage topics."