OpenAI Voice Engine - AI voice cloning

OpenAI Voice Engine

TL;DR

OpenAI Voice Engine is a text-to-speech model that can clone a realistic voice from just a 15-second audio sample.

It produces emotive, natural-sounding speech despite using a small model and minimal training data.

Access has remained in limited preview since its 2024 announcement due to responsible AI concerns around voice cloning and impersonation.

Approved testers must obtain clear consent from voice providers and inform listeners that voices are AI-generated.

As of 2026, the technology is restricted to approved partners and researchers rather than general availability.

About

Voice cloning used to require hours of studio recordings and bespoke model training. OpenAI’s Voice Engine changes the equation: a 15-second audio sample is enough to produce a realistic, emotive voice clone. The capability is striking, which is exactly why OpenAI has kept it locked down since the initial preview. ...

March 31, 2024 · 2 min · James M