Rex Automaton
All posts
AutomationAugust 25, 20255 min read

How to automate 150+ Shorts, Reels, and TikToks a month without a video editor

The exact pipeline for turning long-form content into 150 short videos a month. Source material, AI clip selection, captions, branding, and the publishing schedule that does not get you flagged.

By Jacky Lei

You can produce 150-plus short videos a month without hiring a video editor. The architecture: source material from existing long-form content (interviews, podcasts, talks, demos), AI-driven clip selection that identifies the strongest 30-to-90-second moments, automated caption and branding overlay, and platform-specific publishing that respects each platform's anti-spam rules.

This is what the pipeline actually looks like and where it works versus where it does not.

The source material problem

The pipeline starts with content. You cannot generate clips out of nothing. The viable sources:

  • Podcast episodes (60 to 120 minutes each, easily yield 10 to 30 strong clips per episode)
  • Long-form YouTube content (interviews, talks, walkthroughs)
  • Webinar recordings
  • Sales call transcripts (with permission and PII handled correctly)
  • Conference talks

A creator publishing one podcast episode per week has enough source material for 40 to 100 clips per month from that alone. Add in one talk per quarter and a few sales-call transcripts and you are at 150+ comfortably.

The mistake people make: trying to generate this from filmed sit-down "shorts" content. That is the bottleneck the whole automation is supposed to remove. Use the long-form content you are already producing.

The clip-selection layer

This is where AI earns its keep. An LLM reads the transcript of the long-form content and identifies moments that work as standalone clips:

  • A surprising statistic or counterintuitive claim
  • A specific story with a clear arc (problem, action, outcome)
  • A tactical "how-to" segment under 90 seconds
  • A controversial opinion that drives engagement
  • A useful framework explained concisely

The model scores each candidate clip on engagement potential, standalone clarity, and emotional hook. The top-ranked clips become your daily publishing queue.

Tools that do this well: OpusClip, Vizard, Submagic, or a custom pipeline using GPT-4 / Claude reading the transcript directly. Custom is usually better because you can tune the selection criteria to your specific brand voice.

The caption and branding layer

Modern caption generators handle word-level timing reliably. The key visual decisions:

  • Caption style: word-by-word reveal beats line-by-line for retention. Use a consistent font and color across all clips for brand recognition.
  • Branding: a small watermark in the lower third, consistent across every clip. Do not overdo intro animations. Mobile viewers swipe past anything longer than 1 second.
  • Aspect ratio: 9:16 vertical for all three platforms. Same export for TikTok, Reels, and Shorts.
  • Captions burned in vs platform-native: burn them in. Platform-native captions are inconsistent across viewers (some have them on, some do not, and the on-default behavior changes per platform).

This step automates cleanly with FFmpeg + a caption-overlay layer. Build it once and it runs across every clip.

The publishing schedule that does not get you flagged

Each platform has rate limits and anti-spam heuristics. Posting 50 clips in a day on a fresh account gets the account flagged and downranked. The pacing that works long-term:

  • TikTok: 1 to 3 clips per day, spaced across the day
  • Instagram Reels: 1 to 2 clips per day
  • YouTube Shorts: 1 to 2 clips per day

That gets you to roughly 90 to 150 clips per month per platform, comfortably within the anti-spam thresholds. Multiplied across three platforms, the total is the 150+ that the headline implies.

Use the official scheduling APIs where available (Meta's Graph API, TikTok's Content Posting API) rather than browser-automation tools. The official APIs are stable. The browser-automation approach breaks the day the platform updates its UI.

What this is not

This pipeline does NOT generate "AI-talking-head" content. It re-purposes the long-form content you (or your team) already produce. The voice, the face, the perspective is yours. The automation is in the editing, captioning, branding, and distribution work that used to take a video editor.

It also does not work if you do not produce long-form content. If you are a SaaS founder who hates being on camera and never appears in long-form anything, this is not the play for you. Find a different content strategy.

The realistic ROI

A video editor producing this output volume would cost $5K to $10K/month (in-house or contractor). The automated pipeline costs $200 to $500/month in tools plus a one-time build of $3K to $8K. The breakeven is month one.

The harder ROI question: does the volume actually drive business outcomes? The honest answer is "it depends on what you sell." For creators and educators, short-form volume drives followers, list growth, and downstream revenue cleanly. For B2B services and high-ticket products, the volume matters less than the targeting. A few well-placed strategic clips can outperform 100 generic ones.

Run the volume play if you are a creator. Run the strategic play if you are a B2B services or SaaS founder. They are different optimizations.

The mistakes that kill this

  • Letting the AI pick the brand voice. AI clip selection is fine. AI caption-writing tends to flatten your voice. Use AI for selection and timing, write the captions yourself or have a human review every one.
  • Over-automating the publishing. Posting on a perfect schedule looks like a bot. Vary the timing by 30 to 60 minutes per day to look human.
  • Skipping the brand-consistency layer. Random fonts and colors per clip looks like spam. One font, one color scheme, one watermark position across every clip is non-negotiable.
  • Not measuring what works. Track which clips drive followers, which drive list signups, which drive sales. Tune the AI selection criteria toward the clips that actually convert, not just the ones with high view counts.

How to start

Pick one long-form content source you already produce. Run a single episode through the pipeline manually (selection, captioning, branding, scheduling) so you understand each step. Then automate one layer at a time. By week 4 you have the full pipeline running with you reviewing 10 minutes of output per day.


If you have a backlog of long-form content and want a 15-minute conversation about whether short-form automation is the right play for your business, book a discovery call.

Want us to build this for you?

15-minute discovery call. No pitch. We tell you what to automate first.

Book a Discovery Call

Related reading