Textless Lipsync SDK

This SDK produces accurately timed mouth positions and phonemes from a wave file. It analyses the entire file, using a statistical process that is robust against different speakers. It does not require a text file transcription of the audio.

When time is critical, or when dealing in languages not supported by the Text Based Lipsync SDK, the Textless Lipsync SDK is an excellent option.

Platforms: Win32, MacOS

demo page

About the SDKs

Annosoft licenses multimedia speech SDKs. Written in C++ and assembly language, the SDKs are painless to integrate into any C++ application or platform. Additionally, a scriptable ActiveX Control is available for use in Visual Basic or other Microsoft technologies.

Annosoft SDKs are extremely flexible because speech models are not hard-coded into the SDKs. This allows our clients to choose from various "stock" speech models that are the best fit for their application. Our stock models give our clients the ability to tune their application (at any time) for 1) recognition speed. 2) recognition accuracy. 3) application footprint. Also, custom speech models can be trained based on the audio characteristics and speaker, producing an optimal model in terms of speed and accuracy.