{"id":35406,"date":"2025-09-01T06:56:02","date_gmt":"2025-09-01T06:56:02","guid":{"rendered":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/"},"modified":"2025-09-01T06:56:02","modified_gmt":"2025-09-01T06:56:02","slug":"stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio","status":"publish","type":"post","link":"https:\/\/youzum.net\/fr\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/","title":{"rendered":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio"},"content":{"rendered":"<p>The StepFun AI team has released <strong>Step-Audio 2 Mini<\/strong>, an 8B parameter speech-to-speech large audio language model (LALM) that delivers expressive, grounded, and real-time audio interaction. Released under the <strong>Apache 2.0 license<\/strong>, this open-source model achieves state-of-the-art performance across speech recognition, audio understanding, and speech conversation benchmarks\u2014surpassing commercial systems such as GPT-4o-Audio.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"560\" data-attachment-id=\"74202\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/31\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/screenshot-2025-08-31-at-11-17-51-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.51-PM-1.png\" data-orig-size=\"1364,746\" data-comments-opened=\"1\" 
data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2025-08-31 at 11.17.51\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.51-PM-1-300x164.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560.png\" alt=\"\" class=\"wp-image-74202\" \/><figcaption class=\"wp-element-caption\">https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Key Features<\/strong><\/h2>\n<h3 class=\"wp-block-heading\"><strong>1. Unified Audio\u2013Text Tokenization<\/strong><\/h3>\n<p>Unlike cascaded ASR+LLM+TTS pipelines, Step-Audio 2 integrates <strong>Multimodal Discrete Token Modeling<\/strong>, where <strong>text and audio tokens share a single modeling stream<\/strong>. <\/p>\n<p><strong>This enables:<\/strong><\/p>\n<ul class=\"wp-block-list\">\n<li>Seamless reasoning across text and audio.<\/li>\n<li>On-the-fly <strong>voice style switching<\/strong> during inference.<\/li>\n<li>Consistency in semantic, prosodic, and emotional outputs.<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><strong>2. Expressive and Emotion-Aware Generation<\/strong><\/h3>\n<p>The model doesn\u2019t just transcribe speech\u2014it interprets <strong>paralinguistic features<\/strong> like pitch, rhythm, emotion, timbre, and style. This allows conversations with realistic emotional tones such as whispering, sadness, or excitement. 
Benchmarks on <strong>StepEval-Audio-Paralinguistic<\/strong> show Step-Audio 2 achieving <strong>83.1% accuracy<\/strong>, far beyond GPT-4o Audio (43.5%) and Qwen-Omni (44.2%).<\/p>\n<h3 class=\"wp-block-heading\"><strong>3. Retrieval-Augmented Speech Generation<\/strong><\/h3>\n<p>Step-Audio 2 incorporates <strong>multimodal RAG (Retrieval-Augmented Generation)<\/strong>:<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Web search integration<\/strong> for factual grounding.<\/li>\n<li><strong>Audio search<\/strong>\u2014a novel capability that retrieves real voices from a large library and fuses them into responses, enabling <strong>voice timbre\/style imitation<\/strong> at inference time.<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><strong>4. Tool Calling and Multimodal Reasoning<\/strong><\/h3>\n<p>The system extends beyond speech synthesis by supporting <strong>tool invocation<\/strong>. Benchmarks show that Step-Audio 2 matches textual LLMs in <strong>tool selection and parameter accuracy<\/strong>, while uniquely excelling at <strong>audio search tool calls<\/strong>\u2014a capability unavailable in text-only LLMs.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Training and Data Scale<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li><strong>Text + Audio Corpus:<\/strong> 1.356T tokens<\/li>\n<li><strong>Audio Hours:<\/strong> 8M+ real and synthetic hours<\/li>\n<li><strong>Speaker Diversity:<\/strong> ~50K voices across languages and dialects<\/li>\n<li><strong>Pretraining Pipeline:<\/strong> multi-stage curriculum covering ASR, TTS, speech-to-speech translation, and emotion-labeled conversational synthesis.<\/li>\n<\/ul>\n<p>This large-scale training allows Step-Audio 2 Mini to retain strong text reasoning (via its Qwen2-Audio and CosyVoice foundation) while mastering fine-grained audio modeling.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Performance Benchmarks<\/strong><\/h2>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large 
is-resized\"><img decoding=\"async\" width=\"1024\" height=\"740\" data-attachment-id=\"74200\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/31\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/screenshot-2025-08-31-at-11-17-20-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.20-PM-1.png\" data-orig-size=\"1384,1000\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2025-08-31 at 11.17.20\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.20-PM-1-300x217.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.20-PM-1-1024x740.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.17.20-PM-1-1024x740.png\" alt=\"\" class=\"wp-image-74200\" \/><figcaption class=\"wp-element-caption\">https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini<\/figcaption><\/figure>\n<\/div>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" width=\"1024\" height=\"691\" data-attachment-id=\"74198\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/31\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/screenshot-2025-08-31-at-11-12-39-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.12.39-PM-1.png\" data-orig-size=\"1432,966\" data-comments-opened=\"1\" 
data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2025-08-31 at 11.12.39\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.12.39-PM-1-300x202.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.12.39-PM-1-1024x691.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-31-at-11.12.39-PM-1-1024x691.png\" alt=\"\" class=\"wp-image-74198\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/abs\/2507.16632<\/figcaption><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\"><strong>Automatic Speech Recognition (ASR)<\/strong><\/h3>\n<ul class=\"wp-block-list\">\n<li><strong>English:<\/strong> Average WER 3.14% (beats GPT-4o Transcribe at an average 4.5%).<\/li>\n<li><strong>Chinese:<\/strong> Average CER 3.08% (significantly lower than GPT-4o and Qwen-Omni).<\/li>\n<li>Robust across dialects and accents.<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><strong>Audio Understanding (MMAU Benchmark)<\/strong><\/h3>\n<ul class=\"wp-block-list\">\n<li><strong>Step-Audio 2:<\/strong> 78.0 average, outperforming Omni-R1 (77.0) and Audio Flamingo 3 (73.1).<\/li>\n<li>Strongest in <strong>sound and speech reasoning tasks<\/strong>.<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><strong>Speech Translation<\/strong><\/h3>\n<ul class=\"wp-block-list\">\n<li><strong>CoVoST 2 (S2TT):<\/strong> BLEU 39.26 (highest among open and closed models).<\/li>\n<li><strong>CVSS (S2ST):<\/strong> BLEU 30.87, ahead of GPT-4o (23.68).<\/li>\n<\/ul>\n<h3 class=\"wp-block-heading\"><strong>Conversational Benchmarks (URO-Bench)<\/strong><\/h3>\n<ul 
class=\"wp-block-list\">\n<li><strong>Chinese Conversations:<\/strong> Best overall at <strong>83.3 (basic)<\/strong> and <strong>68.2 (pro)<\/strong>.<\/li>\n<li><strong>English Conversations:<\/strong> Competitive with GPT-4o (83.9 vs. 84.5), far ahead of other open models.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><a href=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/1200x1200-infographics-scaled.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" data-attachment-id=\"74204\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/31\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/1200x1200-infographics\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/1200x1200-infographics-scaled.png\" data-orig-size=\"2560,2560\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"1200\u00d71200 infographics\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/1200x1200-infographics-300x300.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/1200x1200-infographics-1024x1024.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/1200x1200-infographics-1024x1024.png\" alt=\"\" class=\"wp-image-74204\" \/><\/a><figcaption class=\"wp-element-caption\">Source: Marktechpost.com<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Conclusion<\/strong><\/h2>\n<p><strong>Step-Audio 2 Mini<\/strong> makes advanced, multimodal speech intelligence accessible to developers and the research community. 
By combining <strong>Qwen2-Audio<\/strong>\u2019s reasoning capacity with <strong>CosyVoice\u2019s tokenization pipeline<\/strong>, and augmenting them with <strong>retrieval-based grounding<\/strong>, StepFun has delivered one of the most capable <strong>open audio LLMs<\/strong>.<\/p>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out the\u00a0<strong><a href=\"https:\/\/arxiv.org\/abs\/2507.16632\" target=\"_blank\" rel=\"noreferrer noopener\">PAPER<\/a> <\/strong>and<strong> <a href=\"https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini\" target=\"_blank\" rel=\"noreferrer noopener\">MODEL on HUGGING FACE<\/a>.<\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2025\/08\/31\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\">StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>The StepFun AI team has released Step-Audio 2 Mini, an 8B parameter speech-to-speech large audio 
language model (LALM) that delivers expressive, grounded, and real-time audio interaction. Released under the Apache 2.0 license, this open-source model achieves state-of-the-art performance across speech recognition, audio understanding, and speech conversation benchmarks\u2014surpassing commercial systems such as GPT-4o-Audio. https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini Key Features 1. Unified Audio\u2013Text Tokenization Unlike cascaded ASR+LLM+TTS pipelines, Step-Audio 2 integrates Multimodal Discrete Token Modeling, where text and audio tokens share a single modeling stream. This enables: Seamless reasoning across text and audio. On-the-fly voice style switching during inference. Consistency in semantic, prosodic, and emotional outputs. 2. Expressive and Emotion-Aware Generation The model doesn\u2019t just transcribe speech\u2014it interprets paralinguistic features like pitch, rhythm, emotion, timbre, and style. This allows conversations with realistic emotional tones such as whispering, sadness, or excitement. Benchmarks on StepEval-Audio-Paralinguistic show Step-Audio 2 achieving 83.1% accuracy, far beyond GPT-4o Audio (43.5%) and Qwen-Omni (44.2%). 3. Retrieval-Augmented Speech Generation Step-Audio 2 incorporates multimodal RAG (Retrieval-Augmented Generation): Web search integration for factual grounding. Audio search\u2014a novel capability that retrieves real voices from a large library and fuses them into responses, enabling voice timbre\/style imitation at inference time. 4. Tool Calling and Multimodal Reasoning The system extends beyond speech synthesis by supporting tool invocation. Benchmarks show that Step-Audio 2 matches textual LLMs in tool selection and parameter accuracy, while uniquely excelling at audio search tool calls\u2014a capability unavailable in text-only LLMs. 
Training and Data Scale Text + Audio Corpus: 1.356T tokens Audio Hours: 8M+ real and synthetic hours Speaker Diversity: ~50K voices across languages and dialects Pretraining Pipeline: multi-stage curriculum covering ASR, TTS, speech-to-speech translation, and emotion-labeled conversational synthesis. This large-scale training allows Step-Audio 2 Mini to retain strong text reasoning (via its Qwen2-Audio and CosyVoice foundation) while mastering fine-grained audio modeling. Performance Benchmarks https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini https:\/\/arxiv.org\/abs\/2507.16632 Automatic Speech Recognition (ASR) English: Average WER 3.14% (beats GPT-4o Transcribe at an average 4.5%). Chinese: Average CER 3.08% (significantly lower than GPT-4o and Qwen-Omni). Robust across dialects and accents. Audio Understanding (MMAU Benchmark) Step-Audio 2: 78.0 average, outperforming Omni-R1 (77.0) and Audio Flamingo 3 (73.1). Strongest in sound and speech reasoning tasks. Speech Translation CoVoST 2 (S2TT): BLEU 39.26 (highest among open and closed models). CVSS (S2ST): BLEU 30.87, ahead of GPT-4o (23.68). Conversational Benchmarks (URO-Bench) Chinese Conversations: Best overall at 83.3 (basic) and 68.2 (pro). English Conversations: Competitive with GPT-4o (83.9 vs. 84.5), far ahead of other open models. Source: Marktechpost.com Conclusion Step-Audio 2 Mini makes advanced, multimodal speech intelligence accessible to developers and the research community. By combining Qwen2-Audio\u2019s reasoning capacity with CosyVoice\u2019s tokenization pipeline, and augmenting them with retrieval-based grounding, StepFun has delivered one of the most capable open audio LLMs. Check out the\u00a0PAPER and MODEL on HUGGING FACE. 
The post StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio appeared first on MarkTechPost.<\/p>","protected":false},"author":2,"featured_media":35407,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"pmpro_default_level":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"_pvb_checkbox_block_on_post":false,"footnotes":""},"categories":[52,5,7,1],"tags":[],"class_list":["post-35406","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-club","category-committee","category-news","category-uncategorized","pmpro-has-access"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - YouZum<\/title>\n<meta name=\"description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta name=\"robots\" content=\"index, follow, 
max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/youzum.net\/fr\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - YouZum\" \/>\n<meta property=\"og:description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta property=\"og:url\" content=\"https:\/\/youzum.net\/fr\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\" \/>\n<meta property=\"og:site_name\" content=\"YouZum\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DroneAssociationTH\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-09-01T06:56:02+00:00\" \/>\n<meta name=\"author\" content=\"admin NU\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin NU\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\"},\"author\":{\"name\":\"admin 
NU\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\"},\"headline\":\"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio\",\"datePublished\":\"2025-09-01T06:56:02+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\"},\"wordCount\":529,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png\",\"articleSection\":[\"AI\",\"Committee\",\"News\",\"Uncategorized\"],\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\",\"url\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\",\"name\":\"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - 
YouZum\",\"isPartOf\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png\",\"datePublished\":\"2025-09-01T06:56:02+00:00\",\"description\":\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\",\"breadcrumb\":{\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png\",\"width\":1024,\"height\":560},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/youzum.net\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"
StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/yousum.gpucore.co\/#website\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"name\":\"YouSum\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/yousum.gpucore.co\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\",\"name\":\"Drone Association Thailand\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"width\":300,\"height\":300,\"caption\":\"Drone Association Thailand\"},\"image\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/DroneAssociationTH\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\",\"name\":\"admin NU\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"caption\":\"admin NU\"},\"url\":\"https:\/\/youzum.net\/fr\/members\/adminnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - YouZum","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/youzum.net\/fr\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/","og_locale":"fr_FR","og_type":"article","og_title":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - YouZum","og_description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","og_url":"https:\/\/youzum.net\/fr\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/","og_site_name":"YouZum","article_publisher":"https:\/\/www.facebook.com\/DroneAssociationTH\/","article_published_time":"2025-09-01T06:56:02+00:00","author":"admin NU","twitter_card":"summary_large_image","twitter_misc":{"\u00c9crit par":"admin NU","Dur\u00e9e de lecture estim\u00e9e":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#article","isPartOf":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/"},"author":{"name":"admin NU","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c"},"headline":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses 
GPT-4o-Audio","datePublished":"2025-09-01T06:56:02+00:00","mainEntityOfPage":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/"},"wordCount":529,"commentCount":0,"publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"image":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png","articleSection":["AI","Committee","News","Uncategorized"],"inLanguage":"fr-FR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/","url":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/","name":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio - 
YouZum","isPartOf":{"@id":"https:\/\/yousum.gpucore.co\/#website"},"primaryImageOfPage":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage"},"image":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png","datePublished":"2025-09-01T06:56:02+00:00","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","breadcrumb":{"@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#primaryimage","url":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png","width":1024,"height":560},{"@type":"BreadcrumbList","@id":"https:\/\/youzum.net\/stepfun-ai-releases-step-audio-2-mini-an-open-source-8b-speech-to-speech-ai-model-that-surpasses-gpt-4o-audio\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/youzum.net\/"},{"@type":"ListItem","position":2,"name":"StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses 
GPT-4o-Audio"}]},{"@type":"WebSite","@id":"https:\/\/yousum.gpucore.co\/#website","url":"https:\/\/yousum.gpucore.co\/","name":"YouSum","description":"","publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/yousum.gpucore.co\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/yousum.gpucore.co\/#organization","name":"Drone Association Thailand","url":"https:\/\/yousum.gpucore.co\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","width":300,"height":300,"caption":"Drone Association Thailand"},"image":{"@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DroneAssociationTH\/"]},{"@type":"Person","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c","name":"admin NU","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","caption":"admin 
NU"},"url":"https:\/\/youzum.net\/fr\/members\/adminnu\/"}]}},"rttpg_featured_image_url":{"full":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"landscape":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"portraits":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-150x150.png",150,150,true],"medium":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-300x164.png",300,164,true],"large":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"1536x1536":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"2048x2048":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE.png",1024,560,false],"trp-custom-language-flag":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-18x10.png",18,10,true],"woocommerce_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-300x300.png",300,300,true],"woocommerce_single":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-600x328.png",600,328,true],"woocommerce_gallery_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/09\/Screenshot-2025-08-31-at-11.17.51-PM-1-1024x560-UBHyHE-100x100.png",100,100,true]},"rttpg_author":{"display_name":"admin 
NU","author_link":"https:\/\/youzum.net\/fr\/members\/adminnu\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/youzum.net\/fr\/category\/ai-club\/\" rel=\"category tag\">AI<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/committee\/\" rel=\"category tag\">Committee<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","rttpg_excerpt":"The StepFun AI team has released Step-Audio 2 Mini, an 8B parameter speech-to-speech large audio language model (LALM) that delivers expressive, grounded, and real-time audio interaction. Released under the Apache 2.0 license, this open-source model achieves state-of-the-art performance across speech recognition, audio understanding, and speech conversation benchmarks\u2014surpassing commercial systems such as GPT-4o-Audio. https:\/\/huggingface.co\/stepfun-ai\/Step-Audio-2-mini Key Features\u2026","_links":{"self":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts\/35406","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/comments?post=35406"}],"version-history":[{"count":0,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts\/35406\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/media\/35407"}],"wp:attachment":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/media?parent=35406"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/categories?post=35406"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/tags?post=3540
6"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}