Content creation on YouTube is undergoing a significant shift as the platform integrates more sophisticated generative AI into its short-form video ecosystem. YouTube is now allowing users to create AI avatars that look and sound like them for employ in Shorts, providing a way for creators to appear in their videos without needing to be physically present for every recording.
This new capability allows for the generation of photorealistic avatars based on a user’s own likeness and voice. By leveraging advanced AI models, the feature aims to lower the barrier to entry for high-quality video production, enabling creators to produce content more efficiently although maintaining a personal connection with their audience.
The feature is a continuation of the Google Veo models integrated into YouTube Shorts. While YouTube had previously offered “ingredients-to-video” capabilities that allowed users to upload pictures, the addition of a synchronized, personalized voice is a new development for the platform.
How the AI Avatar Creation Process Works
To get started, users must complete a setup process known as a “live selfie.” This involves recording their face and voice by reading a series of prompts provided by the app. Once this data is captured, the system generates a photorealistic avatar that mimics the user’s appearance and vocal characteristics.

The creation process is integrated directly into the main YouTube app and the YouTube Create platform. Once the avatar is established, users can generate clips based on text prompts. Each individual prompt-based generation can be up to 8 seconds long, though creators can string multiple clips together to build a longer narrative. While the initial setup only needs to be performed once, users have the option to retake the live selfie at any time to update their avatar’s appearance.
Safety, Privacy, and Digital Authentication
Given the potential for misuse associated with synthetic media, YouTube has implemented several safeguards to ensure the feature is used securely. A primary security measure is that the selfie video and voice recording are used exclusively for avatar creation. no other user can access or use another person’s avatar to create original Shorts.
To maintain transparency and prevent the spread of misinformation, all avatar-generated videos will include digital labels and watermarks. These include SynthID and C2PA disclosures, which signal to viewers that the content is AI-generated. YouTube has automated the disclosure process, meaning creators using these generative AI tools do not need to take manual steps to label their content as altered or synthetic.
Regarding data retention, users can delete their avatars at any time. If an avatar remains unused for three years, YouTube will automatically delete it. However, any existing videos already published with the avatar will remain on the platform until the specific video clip is manually deleted by the creator.
Content Guidelines and Deepfake Prevention
All AI-generated content on the platform must adhere to YouTube’s Community Guidelines. YouTube has incorporated safeguards into the AI tools to prevent the generation of inappropriate content or the creation of deepfakes. For instance, the AI is designed to block prompts that attempt to generate photorealistic images of identifiable people who have not authorized the use of their likeness.
The platform may also block prompts that touch on sensitive topics to avoid the creation of harmful or misleading content. YouTube encourages creators to review all AI-generated output carefully before publishing, noting that as with any generative AI tool, mistakes can occur.
Step-by-Step: How to Access AI Avatars
For users who have received the rollout, Notice two primary ways to access the avatar features within the YouTube app:
- Via the Create Menu: Tap the ‘+’ (Create) button, then select the Gemini spark icon in the corner. From there, select “Create video” in the top-left and choose “Make a video with my avatar” to enter a prompt.
- Via the Remix Menu: Navigate to the Remix menu, select “Reimagine,” and then tap “Add me to this scene.”
Key Feature Summary
| Feature | Detail |
|---|---|
| Setup Method | Live Selfie (Face and Voice recording) |
| Max Clip Length | 8 seconds per prompt |
| Available Platforms | YouTube App, YouTube Create |
| Authentication | SynthID, C2PA watermarks |
| Auto-Deletion | After 3 years of inactivity |
As AI continues to blend with traditional content creation, these tools represent a move toward “virtual presence,” where the creator’s identity is decoupled from the physical act of filming. This allows for greater flexibility in storytelling and production schedules, provided the industry continues to balance innovation with rigorous safety standards.
YouTube will continue to update its AI tools based on creator feedback and safety evaluations. Users should monitor the YouTube Help center for further updates on generative AI policies and feature expansions.
Do you feel AI avatars will change how you consume short-form content, or do you prefer the authenticity of traditional filming? Share your thoughts in the comments below.