The intersection of human emotion and digital rendering has always been a precarious frontier in game development. For actors, the challenge is no longer just about delivering a line or hitting a mark. it is about projecting a soul into a mesh of polygons that may seem nothing like them. This psychological and technical gap was recently highlighted by Maxence Cazorla, who provides the performance for the character Gustave in the upcoming RPG Clair Obscur: Expedition 33.
Developed by Sandfall Interactive, the title has garnered significant attention for its ambitious blend of turn-based combat and high-fidelity visuals. However, behind the stunning environments lies a complex acting process known as performance capture. Unlike traditional voice acting, where a performer records audio in a booth, or standard motion capture, which tracks body movement, performance capture seeks to synchronize voice, body and facial expressions simultaneously. For Cazorla, this process introduces a unique dissonance: the struggle of acting with another face
.
The experience of performance capture in Clair Obscur Expedition 33 requires a level of vulnerability that differs from stage or screen acting. When an actor looks into a mirror or a camera, they witness their own reactions. In a capture volume, the actor is often covered in markers and wearing a head-mounted camera (HMC) that obscures their vision and replaces their reflection with a digital avatar. This creates a sensory disconnect where the performer must trust that their internal emotional state is being translated accurately to a character whose facial structure, bone density, and muscle movements differ from their own.
The Psychological Gap of Digital Performance
Maxence Cazorla has described the specific hurdles of this process, noting that the primary challenge is the lack of immediate visual feedback. In traditional acting, the face is the primary tool for communication; the subtle twitch of a lip or a narrowing of the eyes conveys a wealth of information. When those movements are mapped onto a digital face, the actor cannot see the result in real-time. They must rely on the direction of the developers and the belief that the software is capturing the nuance of their performance.
This disconnect can lead to a phenomenon where actors “over-act” to ensure the emotion registers on the digital model, or conversely, feel a sense of detachment as the character on the screen does not mirror their own physical identity. For Cazorla, the goal is to bridge this gap by focusing on the emotional truth of the character, Gustave, rather than the physical appearance of the avatar. The challenge lies in maintaining a consistent emotional arc when the physical “instrument” of the actor—their face—is effectively replaced by a digital proxy.
This struggle is not unique to Cazorla but is a recurring theme in the evolution of the medium. As games strive for “photorealism,” the “uncanny valley”—the point where a digital human looks almost real but not quite, causing a sense of unease in the viewer—becomes harder to avoid. The only way to cross this valley is through authentic performance. If the facial animation is merely a mathematical approximation of an emotion, the player feels the falseness. If the animation is rooted in a genuine human performance, the digital face becomes transparent, and the character’s humanity shines through.
Sandfall Interactive and the Technical Bridge
Sandfall Interactive has positioned Clair Obscur: Expedition 33 as a project that prioritizes narrative depth and character expression. To achieve this, the studio utilizes advanced performance capture pipelines that aim to minimize the loss of nuance between the actor’s movement and the final render. This involves high-resolution cameras and sophisticated software that tracks thousands of points on the actor’s face.
The process typically follows a rigorous pipeline:
- Pre-visualization: The scene is blocked out in a low-fidelity digital environment to establish timing and positioning.
- The Capture Session: Actors perform the scene in a volume, with their voice and facial movements recorded in sync.
- Data Cleaning: Technical artists remove “noise” from the data and ensure the markers are tracked accurately.
- Animation Retargeting: This is the most critical stage, where the actor’s movements are “retargeted” to the character’s digital skeleton. Because Gustave’s face is different from Cazorla’s, the software must translate a smile or a frown from one set of proportions to another without distorting the anatomy.
The “acting with another face” challenge is most acute during retargeting. If the digital character has a larger jaw or different eye spacing than the actor, a subtle expression can either be lost or exaggerated. This requires a collaborative loop between the actor and the animation team to ensure the emotional intent is preserved.
The Stakes of Emotional Fidelity in Turn-Based RPGs
Historically, turn-based RPGs have relied on text boxes and static portraits to convey emotion. However, the modern gaming landscape, influenced by cinematic titles, demands more. In Expedition 33
, the stakes are high because the game’s premise—a world where a Paintress erases people of a certain age every year—is inherently tragic and atmospheric. The narrative requires the players to feel the desperation and resolve of the characters.
When a character like Gustave speaks, the player isn’t just listening to a voice; they are reading a face. If the facial animation is disconnected from the vocal performance, the immersion is broken. By investing in full performance capture and addressing the challenges described by Cazorla, Sandfall Interactive is attempting to bring a level of cinematic intimacy to the turn-based genre that was previously reserved for action-adventure titles.
This shift reflects a broader trend in the industry where the line between “voice actor” and “performance actor” has blurred. Actors are now expected to be athletes, dancers, and emotional conduits all at once. The ability to inhabit a character that looks entirely different from oneself is becoming a core skill for the modern gaming professional.
Industry Context: From Motion Capture to Digital Humans
The challenges faced by Maxence Cazorla are mirrored in other high-budget productions. For years, actors like Andy Serkis have pioneered the art of “motion capture” (MoCap), but the transition to “performance capture” (PerfCap) has raised the bar. In titles like The Last of Us Part II
or God of War
, the industry saw a move toward capturing the “micro-expressions”—the tiny, involuntary movements of the face that signal true emotion.
The technical evolution is moving toward “Digital Humans,” where AI and machine learning are used to fill the gaps in performance data. However, as Cazorla’s experience suggests, technology cannot replace the actor’s intent. The “challenge” is not a technical bug to be fixed, but a creative hurdle to be overcome. The actor must essentially perform “blind,” trusting that their emotional energy will survive the translation into code.
For the global audience, So that the characters in Expedition 33
are not just puppets controlled by a script, but the result of a complex psychological negotiation between a human performer and a digital avatar. The result is a more empathetic gaming experience, where the player can sense the actor’s effort to bridge the gap between their own face and the one on the screen.
Key Takeaways on Performance Capture
| Method | What is Captured | Primary Challenge | Emotional Impact |
|---|---|---|---|
| Voice Acting | Audio only | Conveying all emotion through tone | Imagined by player |
| Motion Capture | Body movement | Physicality and spatial awareness | Physical presence |
| Performance Capture | Voice, Body, and Face | Sensory disconnect (The “Other Face”) | Cinematic empathy |
As Clair Obscur: Expedition 33 moves closer to its release, the industry will be watching to see how these performances translate to the final product. The success of the game will depend not only on its combat mechanics or art direction but on whether the players can feel the human heart beating beneath the digital skin of characters like Gustave.
The next major milestone for the title will be further gameplay reveals and official release date confirmations from Sandfall Interactive. As the studio continues to showcase the game’s narrative depth, the contributions of performers like Maxence Cazorla will likely be central to the discussion of the game’s quality.
Do you think digital avatars can ever truly capture the nuance of a human face, or will there always be a gap in the performance? Share your thoughts in the comments below.