The SG Com SDK provides access to Speech Graphics' real-time audio-driven facial animation technology, enabling developers to build applications with avatars driven by voice.
SG Com converts an audio stream into high-fidelity facial animation which may be streamed to other endpoints. The processing delay between input and output is 50 milliseconds.
The SDK includes:
-
SG Com API, a C API
-
SG Com Unreal Engine Plugin, for easy integration into the Unreal Engine
C# bindings and sample Unity integrations are also available on request.
SG Com features
-
Accurate lip sync in any language for any character
-
Automatic detection of emotion (positive, negative, neutral)
-
Automatic detection of nonverbal vocalizations such as laughter and grunts
-
Automatic detection of audible breaths
-
Full-face expressions, driven by detected emotion or vocalizations
-
Head motion appropriate for the current speech
-
Chest breath motion that is timed with the speech and audible breath
-
Blinks and eye microdarts
-
Fully live and interruptable behavior
-
Idle behavior during quiet periods
-
Roles for both speaking and listening (reacting to speech)
-
Variety of live behavior controls with which to direct the behavior mode, facial expression, and characteristics such as intensity and speed of movements or frequency of changes
-
Notifications for changes in behavior mode, expression, voice activity and breath