Just your average tech head teaching you how to use AI and your camera, specialising in Creator tools, Tech and Editing with 1M+ followers
Share
This Week in AI: Gemini CLI, HeyGen Agent, Higgsfield Soul, and More
Published 9 months ago • 2 min read
Your weekly source of AI and tech to help you elevate your creator journey.
Hi Reader,
This week, your apps are learning to talk and your phone’s running AI like it’s a supercomputer. We’re not just seeing new tools, we’re watching entire workflows disappear into a single prompt.
In today’s breakdown:
Google’s terminal-based AI drops with million-token memory
HeyGen’s new agent makes videos from prompts, zero editing
Higgsfield drops high-fashion AI portraits you’ll want to post
ElevenLabs' new assistant doesn’t just talk, it works
FLUX.1 gives you Photoshop-grade AI tools fully open-source
Let’s dive in.
Google Gemini CLI: 1M Token AI in Your Terminal
Google has released Gemini CLI, an open-source AI agent that brings Gemini's coding, content generation, and research capabilities directly into developers' terminals. This tool leverages Google's Gemini 2.5 Pro reasoning model, supporting a substantial 1 million token context window. Gemini CLI enables developers to interact with the AI using natural language prompts for tasks like coding, problem-solving, content creation, and task management.
Additionally, it supports image and video generation via Google's Veo and Imagen tools and includes integration with Gemini Code Assist, the Model Context Protocol (MCP), and Google Search.
Key details:
Free & Local-Friendly Connects to the Gemini API and runs from your terminal without extra setup.
1 Million Token Context Ideal for full-project debugging, config file editing, and multi-file tasks.
Built for Devs Includes helpers for GitHub workflows, Bash commands, and JSON parsing.
HeyGen has launched its most ambitious product yet: the Video Agent. Marketed as the first Creative OS, it’s built to go from text to finished video autonomously.
Key details:
Full-Stack AI Video Creation Write a prompt, and it generates a script, picks visuals, generates voiceovers, edits, and exports.
Built-in Voice + Avatar Uses HeyGen’s avatars and multilingual voice system for realistic delivery.
Designed for Scaling Content Perfect for teams that need explainer videos, product demos, or onboarding videos at scale.
Higgsfield Soul: AI Photos with Fashion-Grade Realism
Higgsfield has released Soul, a new photo model with curated aesthetics, designed to generate stunningly styled portraits with little to no prompt engineering.
Key details:
50+ Prebuilt Styles From “New York Streetwear” to “Hyperreal Editorial,” each style produces art-directed portraits.
Realism + Expression Skintones, poses, fashion details, and facial nuance are preserved at high fidelity.
Built for Creators Ideal for influencers, product mockups, and model portfolios.
Those ultra-realistic fashion and product UCGI ads powered by Higgsfield’s Canvas are still trending hard especially across TikTok, Instagram, and LinkedIn.
Creators are using it to swap outfits, generate lifestyle shoots, and produce scroll-stopping visuals without models, cameras, or sets.
If you want to recreate these exact-style AI ads step-by-step, I broke the whole workflow down in my latest YouTube video.
Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, This week, we’re not just generating content anymore… we’re directing it, simulating it, and editing reality itself in real time. What used to take full production teams, VFX studios, and expensive hardware can now happen inside a single interface. In today’s newsletter: Beeble VFX No green screen, pure magic . Higgsfield Cinema Studio 2.5 turns your screen into a full AI film set Freepik pushes video...
Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, The AI updates keep coming fast, and this week’s releases push creation even closer to professional-grade workflows. We’ve got a new talking-video model that acts with emotion and context, an open-source TTS engine simulating natural group conversations, a major leap in video control with start-to-end framing, and Google finally confirming “Nano Banana” as its most advanced image editor yet. In today’s...
Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, The AI frontier just leaped again and this week’s launches prove we’re right at the edge of imagination meeting execution. In today’s newsletter: Nano Banana breaks the internet and edits with uncanny precision. Runway Game Worlds evolves AI storytelling. ElevenLabs Video‑to‑Music in Studio scores your visuals. Act‑Two Voices empowers expressive AI performance. Kling 2.1 redefines image‑to‑video realism....