Clone a speaker's voice from 5 seconds of audio
Generate and animate full-body anime images
Generate anime images and videos
Generate audio from text using a voice synthesis model