{"id":149067,"date":"2025-06-05T23:16:48","date_gmt":"2025-06-06T04:16:48","guid":{"rendered":"https:\/\/www.synthtopia.com\/?p=149067"},"modified":"2025-06-05T23:29:53","modified_gmt":"2025-06-06T04:29:53","slug":"tts-arena-site-is-like-hot-or-not-for-voice-synthesis","status":"publish","type":"post","link":"https:\/\/www.synthtopia.com\/content\/2025\/06\/05\/tts-arena-site-is-like-hot-or-not-for-voice-synthesis\/","title":{"rendered":"TTS Arena Site Is Like &#8216;Hot Or Not&#8217; For Voice Synthesis"},"content":{"rendered":"<p><a href=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-149090 size-large\" src=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis-728x315.jpg\" alt=\"\" width=\"728\" height=\"315\" srcset=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis-728x315.jpg 728w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis-320x138.jpg 320w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis-768x332.jpg 768w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis-693x300.jpg 693w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-for-voice-synthesis.jpg 1500w\" sizes=\"auto, (max-width: 728px) 100vw, 728px\" \/><\/a><\/p>\n<p><strong>Hanabi AI<\/strong> has introduced <strong>OpenAudio S1<\/strong>, a new vocal synthesis tool that they say treats emotion as the core of AI voice experience, allowing users to direct the voice performance, adjust tone, pacing, and feeling as naturally as working with a human actor.<\/p>\n<figure id=\"attachment_149089\" aria-describedby=\"caption-attachment-149089\" style=\"width: 320px\" class=\"wp-caption alignright\"><a href=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-149089 size-medium\" src=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-320x180.jpg\" alt=\"\" width=\"320\" height=\"180\" srcset=\"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-320x180.jpg 320w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-728x410.jpg 728w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-768x432.jpg 768w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not-533x300.jpg 533w, https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not.jpg 1500w\" sizes=\"auto, (max-width: 320px) 100vw, 320px\" \/><\/a><figcaption id=\"caption-attachment-149089\" class=\"wp-caption-text\">&#8220;Hot or not, I&#8217;m generated using AI.&#8221;<\/figcaption><\/figure>\n<p>Ahead of launch, OpenAudio S1 was submitted to <a href=\"https:\/\/huggingface.co\/spaces\/TTS-AGI\/TTS-Arena-V2\">Hugging Face\u2019s TTS Arena<\/a>, which is sort of like the old &#8216;Hot or Not&#8217; site, but for text-to-speach (TTS) vocal synthesis. Instead of ranking photos of people, you vote on head-to-head comparisons of the results from two different vocal synthesis engines.<\/p>\n<p>Here&#8217;s how TTS Arena works:<\/p>\n<ul class=\"feature-list\">\n<li>Enter your text and select &#8216;synthesize&#8217;.<\/li>\n<li>Listen to two different TTS models synthesize the same content.<\/li>\n<li>Vote for the model that sounds better.<\/li>\n<li>Track overall model rankings on the leaderboard. You can also create an account on the site and create your own leaderboard.<\/li>\n<\/ul>\n<p>Hanabi AI shared this with us because, of course, OpenAudio S1 is currently the leading text-to-speech engine on TTS Arena.<!--more--><\/p>\n<p>OpenAudio S1 lets you &#8216;tag&#8217; your script with a variety of markers, including Emotion, Tone, and markers for things like laughing or sighing. The system uses these tags as hints for generating more realistic vocal synthesis results.<\/p>\n<p>\u201cThe future of AI voice-driven storytelling isn\u2019t just about generating speech\u2014it\u2019s about performance,\u201d said <strong>Shijia Liao<\/strong>, founder and CEO of Hanabi AI. \u201cWith OpenAudio S1, we\u2019re shaping what we see as the next creative frontier: AI voice acting.\u201d<\/p>\n<p>For more info on\u00a0OpenAudio S1, see their <a href=\"https:\/\/openaudio.com\/blogs\/s1\">launch blog post<\/a>.<\/p>\n<p>If you give TTS Arena or OpenAudio S1 a try, leave a comment, and let us know what you think of the current state of voice synthesis!<\/p>\n<p><strong>Pricing &amp; Availability:<\/strong><\/p>\n<p><strong>TTS Arena<\/strong> is <a href=\"https:\/\/huggingface.co\/spaces\/TTS-AGI\/TTS-Arena-V2\">free to try<\/a> or to view their leaderboard. <strong>OpenAudio S1<\/strong> is available now at <a href=\"https:\/\/fish.audio\/\">Fish.Audio<\/a>, priced at $15\/month or $120\/year. You can also check out the OpenAudio open source TTS repo on <a href=\"https:\/\/github.com\/fishaudio\">Github<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Instead of ranking photos of people, you vote on head-to-head comparisons of the results from two different vocal synthesis engines.&hellip; <a class=\"more-link\" href=\"https:\/\/www.synthtopia.com\/content\/2025\/06\/05\/tts-arena-site-is-like-hot-or-not-for-voice-synthesis\/\">Read More <span class=\"screen-reader-text\">TTS Arena Site Is Like &#8216;Hot Or Not&#8217; For Voice Synthesis<\/span><\/a><\/p>\n","protected":false},"author":3,"featured_media":149089,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[2592],"tags":[490],"class_list":["post-149067","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-virtual-musicians","tag-vocal-synthesis"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/www.synthtopia.com\/wp-content\/uploads\/2025\/06\/hot-or-not.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/posts\/149067","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/comments?post=149067"}],"version-history":[{"count":10,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/posts\/149067\/revisions"}],"predecessor-version":[{"id":149098,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/posts\/149067\/revisions\/149098"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/media\/149089"}],"wp:attachment":[{"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/media?parent=149067"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/categories?post=149067"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.synthtopia.com\/wp-json\/wp\/v2\/tags?post=149067"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}