VALL-E can be used to synthesize high-quality personalized speech with only a three-second enrollment recording of a speaker as an acoustic prompt. The model of the voice can then be used for text-to-speech applications. The post Microsoft’s New AI Can Simulate Anyone’s Voice From a 3-Second Sample appeared first on TechNewsWorld.
from TechNewsWorld https://ift.tt/cPQ9J57
from TechNewsWorld https://ift.tt/cPQ9J57
Comments
Post a Comment