From a single emotional sentence to a full YouTube music video — entirely automated, running 24/7 on a Hetzner VPS.
Pick a Random Lyric
One sentence is selected from a pool of 100 emotional phrases — "Don't go, please", "I still feel you", "Forever starts now" — each a raw human feeling distilled into words.
n8n Code Node
Build the Music Prompt
The lyric is wrapped in a structured prompt: instrumental intro, whispered vocal, emotional drops, and a fading outro. One of 20 diverse styles is randomly chosen — from tropical house to hip hop trap, K-pop to jazz fusion.
n8n Code Node — 20 Genres
Generate the Song
The prompt is sent to Suno AI via sunoapi.org. The V4.5-ALL model generates a full 2–4 minute track with vocals, instrumentation, and musical structure. Two tracks are generated; the longest is selected.
Suno AI — sunoapi.org
Generate AI Cover Art
A unique album cover is created using FLUX.1-schnell, with the image prompt dynamically built from the song's mood and lyric — neon cityscapes for future bass, ocean depths for deep house, cosmic landscapes for progressive.
SiliconFlow — FLUX.1-schnell
Create the Video
The cover art and audio are combined into a 1920×1080 HD MP4 using FFmpeg, running on a dedicated Python HTTP server. The result: a proper YouTube music video with full-quality audio.
FFmpeg — Python Server
Upload to YouTube
The video is automatically uploaded to the Words Between Silence channel via YouTube's OAuth API, with a generated title, description, hashtags, and category (Music).
YouTube Data API v3
Post to X / Twitter
A tweet is automatically composed and posted with the YouTube link, song title, style, and hashtags — sharing the new track with the world.
X API v2 — Tweepy