This Week in AI: Gemini CLI, HeyGen Agent, Higgsfield Soul, and More


Hi Reader,


This week
, your apps are learning to talk and your phone’s running AI like it’s a supercomputer.
We’re not just seeing new tools, we’re watching entire workflows disappear into a single prompt.

In today’s breakdown:

  • Google’s terminal-based AI drops with million-token memory
  • HeyGen’s new agent makes videos from prompts, zero editing
  • Higgsfield drops high-fashion AI portraits you’ll want to post
  • ElevenLabs' new assistant doesn’t just talk, it works
  • FLUX.1 gives you Photoshop-grade AI tools fully open-source

Let’s dive in.

Google Gemini CLI: 1M Token AI in Your Terminal

Google has released Gemini CLI, an open-source AI agent that brings Gemini's coding, content generation, and research capabilities directly into developers' terminals. This tool leverages Google's Gemini 2.5 Pro reasoning model, supporting a substantial 1 million token context window. Gemini CLI enables developers to interact with the AI using natural language prompts for tasks like coding, problem-solving, content creation, and task management.

Additionally, it supports image and video generation via Google's Veo and Imagen tools and includes integration with Gemini Code Assist, the Model Context Protocol (MCP), and Google Search.

Key details:

  • Free & Local-Friendly
    Connects to the Gemini API and runs from your terminal without extra setup.
  • 1 Million Token Context
    Ideal for full-project debugging, config file editing, and multi-file tasks.
  • Built for Devs
    Includes helpers for GitHub workflows, Bash commands, and JSON parsing.

Try Gemini CLI here 👇

🔗https://blog.google/technology/developers/introducing-gemini-cli-open-source-ai-agent/

HeyGen Video Agent: From Prompt to Publish

HeyGen has launched its most ambitious product yet: the Video Agent. Marketed as the first Creative OS, it’s built to go from text to finished video autonomously.

Key details:

  • Full-Stack AI Video Creation
    Write a prompt, and it generates a script, picks visuals, generates voiceovers, edits, and exports.
  • Built-in Voice + Avatar
    Uses HeyGen’s avatars and multilingual voice system for realistic delivery.
  • Designed for Scaling Content
    Perfect for teams that need explainer videos, product demos, or onboarding videos at scale.

Try HeyGen Video Agent here 👇

🔗https://www.heygen.com/

Higgsfield Soul: AI Photos with Fashion-Grade Realism

Higgsfield has released Soul, a new photo model with curated aesthetics, designed to generate stunningly styled portraits with little to no prompt engineering.

Key details:

  • 50+ Prebuilt Styles
    From “New York Streetwear” to “Hyperreal Editorial,” each style produces art-directed portraits.
  • Realism + Expression
    Skintones, poses, fashion details, and facial nuance are preserved at high fidelity.
  • Built for Creators
    Ideal for influencers, product mockups, and model portfolios.

Try Soul by Higgsfield here 👇

🔗https://goto.higgsfield.ai/seb

ElevenLabs 11a: The Voice Assistant That Acts

ElevenLabs’ new voice assistant, 11a, goes beyond chat, it integrates into your tools and does real work.

Key details:

  • Real Integrations
    Connects with Salesforce, Slack, Google Calendar, and more.
  • Task Execution
    Plan your day, send messages, book meetings, all by voice.
  • Built on 11Labs Voice Stack
    Ultra-natural delivery with controllable intonation, speed, and clarity.

Try ElevenLabs 11a here 👇

🔗https://elevenlabs.io/

FLUX.1 Kontext Dev: Open-Source Image Editing AI

Black Forest Labs has open-sourced FLUX.1 Kontext , the first precision image editing model that’s local-ready and uncensored.

Key details:

  • Full Control
    Apply brush-like edits while maintaining lighting, geometry, and detail.
  • Unlocked + Local
    Works on consumer GPUs and can be fine-tuned for niche use cases.
  • First of Its Kind
    It’s the highest-quality open model for inpainting, background swaps, and more

Try Flux Kontext here 👇

🔗https://flux1.ai/flux-kontext

Higgsfield’s AI Ads Are Still Dominating Feeds

Those ultra-realistic fashion and product UCGI ads powered by Higgsfield’s Canvas are still trending hard especially across TikTok, Instagram, and LinkedIn.

Creators are using it to swap outfits, generate lifestyle shoots, and produce scroll-stopping visuals without models, cameras, or sets.

If you want to recreate these exact-style AI ads step-by-step,
I broke the whole workflow down in my latest YouTube video.

🎥 Watch the full tutorial here 👇

video preview

More wild drops, breakdowns, and tools coming next week.

Catch you next week for another round of breakthroughs.

Stay Creative,

Sebastien Jefferies.

Free: My 100+ AI Toolkit to Supercharge Your Workflow
Get your copy here → [Access Now]

1 Parkshot, Richmond, Berkshire RG401WF
Unsubscribe · Preferences

Sebastien Jefferies

Just your average tech head teaching you how to use AI and your camera, specialising in Creator tools, Tech and Editing with 1M+ followers

Read more from Sebastien Jefferies

Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, This week, we’re not just generating content anymore… we’re directing it, simulating it, and editing reality itself in real time. What used to take full production teams, VFX studios, and expensive hardware can now happen inside a single interface. In today’s newsletter: Beeble VFX No green screen, pure magic . Higgsfield Cinema Studio 2.5 turns your screen into a full AI film set Freepik pushes video...

Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, The AI updates keep coming fast, and this week’s releases push creation even closer to professional-grade workflows. We’ve got a new talking-video model that acts with emotion and context, an open-source TTS engine simulating natural group conversations, a major leap in video control with start-to-end framing, and Google finally confirming “Nano Banana” as its most advanced image editor yet. In today’s...

Your weekly source of AI and tech to help you elevate your creator journey. Hi Reader, The AI frontier just leaped again and this week’s launches prove we’re right at the edge of imagination meeting execution. In today’s newsletter: Nano Banana breaks the internet and edits with uncanny precision. Runway Game Worlds evolves AI storytelling. ElevenLabs Video‑to‑Music in Studio scores your visuals. Act‑Two Voices empowers expressive AI performance. Kling 2.1 redefines image‑to‑video realism....