Words Between Silence

How It Works

From a single emotional sentence to a full YouTube music video — entirely automated, running 24/7 on a Hetzner VPS.

Pick a Random Lyric

One sentence is selected from a pool of 100 emotional phrases — "Don't go, please", "I still feel you", "Forever starts now" — each a raw human feeling distilled into words.

n8n Code Node

Build the Music Prompt

The lyric is wrapped in a structured prompt: instrumental intro, whispered vocal, emotional drops, and a fading outro. One of 20 diverse styles is randomly chosen — from tropical house to hip hop trap, K-pop to jazz fusion.

n8n Code Node — 20 Genres

Generate the Song

The prompt is sent to Suno AI via sunoapi.org. The V4.5-ALL model generates a full 2–4 minute track with vocals, instrumentation, and musical structure. Two tracks are generated; the longest is selected.

Suno AI — sunoapi.org

Generate AI Cover Art

A unique album cover is created using FLUX.1-schnell, with the image prompt dynamically built from the song's mood and lyric — neon cityscapes for future bass, ocean depths for deep house, cosmic landscapes for progressive.

SiliconFlow — FLUX.1-schnell

Create the Video

The cover art and audio are combined into a 1920×1080 HD MP4 using FFmpeg, running on a dedicated Python HTTP server. The result: a proper YouTube music video with full-quality audio.

FFmpeg — Python Server

Upload to YouTube

The video is automatically uploaded to the Words Between Silence channel via YouTube's OAuth API, with a generated title, description, hashtags, and category (Music).

YouTube Data API v3

Post to X / Twitter

A tweet is automatically composed and posted with the YouTube link, song title, style, and hashtags — sharing the new track with the world.

X API v2 — Tweepy

The Tech Stack

Built with open-source tools and cloud APIs — running on a single VPS.

🔧

n8n

Workflow orchestration — 13 nodes, webhook-triggered, self-hosted on Docker

🎵

Suno AI

AI music generation via sunoapi.org — V4.5-ALL model, 2–4 min tracks with vocals

🎨

SiliconFlow

AI cover art generation — FLUX.1-schnell, unique 1024×1024 images per song

🎬

FFmpeg

Video creation — combines cover + audio into 1920×1080 HD MP4, running as systemd service

📺

YouTube API

Automated upload with OAuth2 — title, description, category, public visibility

🐦

X / Twitter

Cross-posting via Tweepy — automated tweet with YouTube link and hashtags

🖥️

Hetzner VPS

CPX32 — 8GB RAM, Ubuntu 24.04, Docker, systemd services, cron scheduling

⏰

Cron + Webhook

Bulletproof scheduling — cron triggers n8n webhook every 5 hours, never misses

How It Works

The Tech Stack

Listen Now