{"id":79820,"date":"2026-03-29T14:44:53","date_gmt":"2026-03-29T14:44:53","guid":{"rendered":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/"},"modified":"2026-03-29T14:44:53","modified_gmt":"2026-03-29T14:44:53","slug":"mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation","status":"publish","type":"post","link":"https:\/\/youzum.net\/fr\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/","title":{"rendered":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation"},"content":{"rendered":"<p>Mistral AI has released <strong>Voxtral TTS<\/strong>, an open-weight text-to-speech model that marks the company\u2019s first major move into audio generation. Following the release of its transcription and language models, Mistral is now providing the final \u2018output layer\u2019 of the audio stack, positioning itself as a direct competitor to proprietary voice APIs in the developer ecosystem.<\/p>\n<p>Voxtral TTS is more than just a synthetic voice generator. It is a high-performance, modular component designed to be integrated into real-time voice workflows. 
By releasing the model under a <strong>CC BY-NC license<\/strong>, Mistral continues its strategy of enabling developers to build and deploy frontier-grade capabilities without the constraints of closed-source API pricing or data privacy limitations.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1726\" height=\"1400\" data-attachment-id=\"78667\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/03\/28\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/screenshot-2026-03-28-at-1-48-14-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1.png\" data-orig-size=\"1726,1400\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-03-28 at 1.48.14\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-300x243.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-1024x831.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1.png\" alt=\"\" class=\"wp-image-78667\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/pdf\/2603.25551<\/figcaption><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\"><strong>Architecture: The 4B Parameter Hybrid Model<\/strong><\/h3>\n<p>While many recent developments in text-to-speech have focused on massive, resource-intensive architectures, Voxtral TTS is built with a focus on 
efficiency. The model has <strong>4B parameters<\/strong>, which is lightweight by current frontier standards.<\/p>\n<p>This parameter count is distributed across a hybrid architecture designed to address the common trade-off between generation speed and audio naturalness. <strong>The system comprises three primary components:<\/strong><\/p>\n<ol start=\"1\" class=\"wp-block-list\">\n<li><strong>Transformer Decoder Backbone:<\/strong> A 3.4B parameter module based on the Ministral architecture that handles text understanding and predicts semantic representations of speech.<\/li>\n<li><strong>Flow-Matching Acoustic Transformer:<\/strong> A 390M parameter module that converts those semantic representations into detailed acoustic features.<\/li>\n<li><strong>Neural Audio Codec:<\/strong> A 300M parameter decoder that maps the acoustic features back into a high-fidelity audio waveform.<\/li>\n<\/ol>\n<p>By separating the \u2018meaning\u2019 of the speech (semantic) from the \u2018texture\u2019 of the voice (acoustic), Voxtral TTS maintains long-range consistency while delivering the fine-grained nuances required for lifelike interaction.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Performance: 70ms Latency and High Throughput<\/strong><\/h3>\n<p>In production voice applications, latency is often the defining constraint. Mistral has optimized Voxtral TTS for low-latency streaming inference, making it suitable for conversational agents and real-time translation.<\/p>\n<p>The model achieves a <strong>70ms model latency<\/strong> for a typical 10-second voice sample and 500-character input. This speed is critical for reducing the perceived delay in voice-first applications, where even small pauses can disrupt the flow of human-machine interaction.<\/p>\n<p>Furthermore, the model reports a <strong>Real-Time Factor (RTF) of approximately 9.7x<\/strong>. 
This means the system can synthesize audio nearly ten times faster than it is spoken. For developers, this throughput translates to lower compute costs and the ability to handle high-concurrency workloads on standard inference hardware.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Global Reach: 9 Languages and Dialect Accuracy<\/strong><\/h3>\n<p>Voxtral TTS is natively multilingual, supporting <strong>9 languages<\/strong> at launch: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.<\/p>\n<p>The training objective for the model goes beyond simple phonetic translation. Mistral has emphasized the model\u2019s ability to capture <strong>diverse dialects<\/strong>, recognizing the subtle shifts in cadence and prosody that distinguish regional speakers. This precision makes the model an effective tool for global applications, from international customer support to localized content creation, where a generic, \u2018flattened\u2019 accent often fails to sound natural to native speakers.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Voice Adaptation<\/strong><\/h3>\n<p>One of the standout features for AI developers is the model\u2019s ease of <strong>voice adaptation<\/strong>. Voxtral TTS supports zero-shot and few-shot voice cloning, allowing it to adapt to a new voice using as little as <strong>3 seconds of reference audio<\/strong>.<\/p>\n<p>This capability allows for the creation of consistent brand voices or personalized user experiences without the need for extensive fine-tuning. 
Because the model uses a factorized representation, it can apply the characteristics of a reference voice (timbre, tone, and pitch) to any generated text while maintaining the correct linguistic prosody of the target language.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Benchmarks: A Challenge to the Proprietary Giants<\/strong><\/h3>\n<p>Mistral\u2019s evaluations focus on how Voxtral TTS stacks up against a current industry leader in synthetic speech, <strong>ElevenLabs<\/strong>. In human preference tests conducted by native speakers, <strong>Voxtral TTS<\/strong> demonstrated significant gains in naturalness and expressivity.<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Vs. ElevenLabs Flash v2.5:<\/strong> Voxtral TTS achieved a <strong>68.4% win rate<\/strong> in multilingual voice cloning evaluations.<\/li>\n<li><strong>Vs. ElevenLabs v3:<\/strong> The model achieved parity or higher scores in <strong>speaker similarity<\/strong>, indicating that an open-weight model can effectively match the fidelity of the most advanced proprietary flagship voices.<\/li>\n<\/ul>\n<p>These benchmarks suggest that for many enterprise use cases, the performance gap between open-weight models and high-cost APIs has effectively closed.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img decoding=\"async\" width=\"1176\" height=\"348\" data-attachment-id=\"78669\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/03\/28\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/screenshot-2026-03-28-at-1-50-11-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.50.11-PM-1.png\" data-orig-size=\"1176,348\" data-comments-opened=\"1\" 
data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2026-03-28 at 1.50.11\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.50.11-PM-1-300x89.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.50.11-PM-1-1024x303.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.50.11-PM-1.png\" alt=\"\" class=\"wp-image-78669\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/pdf\/2603.25551<\/figcaption><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\"><strong>Deployment and Integration<\/strong><\/h3>\n<p>Voxtral TTS is designed to function as part of a comprehensive <strong>Audio Intelligence<\/strong> stack. It integrates natively with <strong>Voxtral Transcribe<\/strong>, creating an end-to-end speech-to-speech (S2S) pipeline.<\/p>\n<p>For AI developers building on local or private cloud infrastructure, the model\u2019s small footprint is a significant advantage. Mistral\u2019s team has confirmed that the model is efficient enough to run on standard <strong>smartphone and laptop<\/strong> hardware once quantized. 
This \u2018edge-readiness\u2019 allows for a new class of private, offline applications, from secure corporate assistants to on-device accessibility tools.<\/p>\n<figure class=\"wp-block-table\">\n<table class=\"has-fixed-layout\">\n<thead>\n<tr>\n<td><strong>Specification<\/strong><\/td>\n<td><strong>Value<\/strong><\/td>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Model Size<\/strong><\/td>\n<td>4B Parameters<\/td>\n<\/tr>\n<tr>\n<td><strong>Latency (10s voice \/ 500 chars)<\/strong><\/td>\n<td>70ms<\/td>\n<\/tr>\n<tr>\n<td><strong>Real-Time Factor (RTF)<\/strong><\/td>\n<td>~9.7x<\/td>\n<\/tr>\n<tr>\n<td><strong>Supported Languages<\/strong><\/td>\n<td>9<\/td>\n<\/tr>\n<tr>\n<td><strong>Reference Audio Needed<\/strong><\/td>\n<td>3 \u2013 30 seconds<\/td>\n<\/tr>\n<tr>\n<td><strong>License<\/strong><\/td>\n<td>CC BY-NC<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/figure>\n<h3 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h3>\n<ul class=\"wp-block-list\">\n<li><strong>High-Efficiency 4B Parameter Model:<\/strong> Voxtral TTS is a frontier open-weight model with a <strong>4B parameter<\/strong> footprint, using a hybrid architecture that combines auto-regressive semantic generation with flow-matching for acoustic details.<\/li>\n<li><strong>Ultra-Low 70ms Latency:<\/strong> Optimized for real-time applications, the model achieves a <strong>70ms model latency<\/strong> for a typical 10-second voice sample (500-character input) and a <strong>Real-Time Factor (RTF) of approximately 9.7x<\/strong>.<\/li>\n<li><strong>Superior Multilingual Performance:<\/strong> The model supports <strong>9 languages<\/strong> (English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic) and outperformed <strong>ElevenLabs Flash v2.5<\/strong> with a <strong>68.4% win rate<\/strong> in human preference tests for multilingual voice cloning.<\/li>\n<li><strong>Instant Voice Adaptation:<\/strong> Developers can achieve high-fidelity voice 
cloning with as little as <strong>3 seconds of reference audio<\/strong>, enabling zero-shot cross-lingual adaptation where a speaker\u2019s unique identity is preserved across different languages.<\/li>\n<li><strong>Full Audio Stack Integration:<\/strong> Designed as the \u2018output layer\u2019 of a unified audio intelligence pipeline, it plugs natively into <strong>Voxtral Transcribe<\/strong> to create low-latency, end-to-end speech-to-speech workflows.<\/li>\n<\/ul>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out the <strong><a href=\"https:\/\/arxiv.org\/pdf\/2603.25551\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a>, <a href=\"https:\/\/huggingface.co\/mistralai\/Voxtral-4B-TTS-2603\" target=\"_blank\" rel=\"noreferrer noopener\">Model Weights<\/a><\/strong> and <strong><a href=\"https:\/\/mistral.ai\/news\/voxtral-tts\" target=\"_blank\" rel=\"noreferrer noopener\">Technical details<\/a><\/strong>.<\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/03\/28\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\">Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Mistral AI has released Voxtral TTS, an open-weight text-to-speech model that marks the company\u2019s first major move into audio generation. Following the release of its transcription and language models, Mistral is now providing the final \u2018output layer\u2019 of the audio stack, positioning itself as a direct competitor to proprietary voice APIs in the developer ecosystem. Voxtral TTS is more than just a synthetic voice generator. It is a high-performance, modular component designed to be integrated into real-time voice workflows. By releasing the model under a CC BY-NC license, Mistral continues its strategy of enabling developers to build and deploy frontier-grade capabilities without the constraints of closed-source API pricing or data privacy limitations. https:\/\/arxiv.org\/pdf\/2603.25551 Architecture: The 4B Parameter Hybrid Model While many recent developments in text-to-speech have focused on massive, resource-intensive architectures, Voxtral TTS is built with a focus on efficiency. The model features 4B parameters, categorized as a lightweight model by modern frontier standards. This parameter count is distributed across a hybrid architecture designed to solve the common trade-offs between generation speed and audio naturalness. 
The system comprises three primary components: Transformer Decoder Backbone: A 3.4B parameter module based on the Ministral architecture that handles the text understanding and predicts semantic representations of speech. Flow-Matching Acoustic Transformer: A 390M parameter module that converts those semantic representations into detailed acoustic features. Neural Audio Codec: A 300M parameter decoder that maps the acoustic features back into a high-fidelity audio waveform. By separating the \u2018meaning\u2019 of the speech (semantic) from the \u2018texture\u2019 of the voice (acoustic), Voxtral TTS maintains long-range consistency while delivering the fine-grained nuances required for lifelike interaction. Performance: 70ms Latency and High Throughput In the context of production-grade AI, latency is the defining constraint. Mistral has optimized Voxtral TTS for low-latency streaming inference, making it suitable for conversational agents and real-time translation. The model achieves a 70ms model latency for a typical 10-second voice sample and 500-character input. This speed is critical for reducing the perceived delay in voice-first applications, where even small pauses can disrupt the flow of human-machine interaction. Furthermore, the model boasts a high Real-Time Factor (RTF) of approximately 9.7x. This means the system can synthesize audio nearly ten times faster than it is spoken. For developers, this throughput translates to lower compute costs and the ability to handle high-concurrency workloads on standard inference hardware. Global Reach: 9 Languages and Dialect Accuracy Voxtral TTS is natively multilingual, supporting 9 languages out of the gate: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. The training objective for the model goes beyond simple phonetic translation. 
Mistral has emphasized the model\u2019s ability to capture diverse dialects, recognizing the subtle shifts in cadence and prosody that distinguish regional speakers. This technical precision makes the model an effective tool for global applications\u2014from international customer support to localized content creation\u2014where a generic, \u2018flattened\u2019 accent often fails to pass the human test. Adaptive Voice Adaptation One of the standout features for AI devs is the model\u2019s ease of voice adaptation. Voxtral TTS supports zero-shot and few-shot voice cloning, allowing it to adapt to a new voice using as little as 3 seconds of reference audio. This capability allows for the creation of consistent brand voices or personalized user experiences without the need for extensive fine-tuning. Because the model uses a factorized representation, it can apply the characteristics of a reference voice (timbre, tone, and pitch) to any generated text while maintaining the correct linguistic prosody of the target language. Benchmarks: A Challenge to the Proprietary Giants Mistral\u2019s evaluations focus on how Voxtral TTS stacks up against the current industry leaders in synthetic speech, specifically ElevenLabs. In human preference tests conducted by native speakers, Voxtral TTS demonstrated significant gains in naturalness and expressivity. Vs. ElevenLabs Flash v2.5: Voxtral TTS achieved a 68.4% win rate in multilingual voice cloning evaluations. Vs. ElevenLabs v3: The model achieved parity or higher scores in speaker similarity, proving that an open-weight model can effectively match the fidelity of the most advanced proprietary flagship voices. These benchmarks suggest that for many enterprise use cases, the performance gap between open-source tools and high-cost APIs has effectively closed. https:\/\/arxiv.org\/pdf\/2603.25551 Deployment and Integration Voxtral TTS is designed to function as part of a comprehensive Audio Intelligence stack. 
It integrates natively with Voxtral Transcribe, creating an end-to-end speech-to-speech (S2S) pipeline. For AI developers building on local or private cloud infrastructure, the model\u2019s small footprint is a significant advantage. Mistral\u2019s team has confirmed that the model is efficient enough to run on standard smartphone and laptop hardware once quantized. This \u2018edge-readiness\u2019 allows for a new class of private, offline applications, from secure corporate assistants to on-device accessibility tools. Specification Metric Model Size 4B Parameters Latency (10s voice \/ 500 chars) 70ms Real-Time Factor (RTF) ~9.7x Supported Languages 9 Reference Audio Needed 3 \u2013 30 seconds License CC BY-NC Key Takeaways High-Efficiency 4B Parameter Model: Voxtral TTS is a frontier open-weight model with a 4B parameter footprint, utilizing a hybrid architecture that combines auto-regressive semantic generation with flow-matching for acoustic details. Ultra-Low 70ms Latency: Optimized for real-time applications, the model achieves a 70ms model latency for a typical 10-second voice sample (500-character input) and an impressive Real-Time Factor (RTF) of approximately 9.7x. Superior Multilingual Performance: The model supports 9 languages (English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic) and outperformed ElevenLabs Flash v2.5 with a 68.4% win rate in human preference tests for multilingual voice cloning. Instant Voice Adaptation: Developers can achieve high-fidelity voice cloning with as little as 3 seconds of reference audio, enabling zero-shot cross-lingual adaptation where a speaker\u2019s unique identity is preserved across different languages. Full Audio Stack Integration: Designed as the \u2018output layer\u2019 of a unified audio intelligence pipeline, it plugs natively into Voxtral Transcribe to create low-latency, end-to-end speech-to-speech workflows. 
Check out\u00a0the\u00a0Paper, Model Weight\u00a0and\u00a0Technical details.\u00a0Also,\u00a0feel free to follow us on\u00a0Twitter\u00a0and don\u2019t forget to join our\u00a0120k+ ML SubReddit\u00a0and Subscribe to\u00a0our Newsletter. Wait! are you on telegram?\u00a0now you can join us on telegram as well. The post Mistral<\/p>","protected":false},"author":2,"featured_media":79821,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"pmpro_default_level":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"_pvb_checkbox_block_on_post":false,"footnotes":""},"categories":[52,5,7,1],"tags":[],"class_list":["post-79820","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-club","category-committee","category-news","category-uncategorized","pmpro-has-access"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - YouZum<\/title>\n<meta name=\"description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta name=\"robots\" content=\"index, follow, 
max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/youzum.net\/fr\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - YouZum\" \/>\n<meta property=\"og:description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta property=\"og:url\" content=\"https:\/\/youzum.net\/fr\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\" \/>\n<meta property=\"og:site_name\" content=\"YouZum\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DroneAssociationTH\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-03-29T14:44:53+00:00\" \/>\n<meta name=\"author\" content=\"admin NU\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin NU\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" 
class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\"},\"author\":{\"name\":\"admin NU\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\"},\"headline\":\"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation\",\"datePublished\":\"2026-03-29T14:44:53+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\"},\"wordCount\":1050,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png\",\"articleSection\":[\"AI\",\"Committee\",\"News\",\"Uncategorized\"],\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\",\"url\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\",
\"name\":\"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - YouZum\",\"isPartOf\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png\",\"datePublished\":\"2026-03-29T14:44:53+00:00\",\"description\":\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\",\"breadcrumb\":{\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png\",\"width\":1726,\"height\":1400},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#breadcrumb\",\"itemListElem
ent\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/youzum.net\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/yousum.gpucore.co\/#website\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"name\":\"YouSum\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/yousum.gpucore.co\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\",\"name\":\"Drone Association Thailand\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"width\":300,\"height\":300,\"caption\":\"Drone Association Thailand\"},\"image\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/DroneAssociationTH\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\",\"name\":\"admin NU\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"caption\":\"admin 
NU\"},\"url\":\"https:\/\/youzum.net\/fr\/members\/adminnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - YouZum","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/youzum.net\/fr\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/","og_locale":"fr_FR","og_type":"article","og_title":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - YouZum","og_description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","og_url":"https:\/\/youzum.net\/fr\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/","og_site_name":"YouZum","article_publisher":"https:\/\/www.facebook.com\/DroneAssociationTH\/","article_published_time":"2026-03-29T14:44:53+00:00","author":"admin NU","twitter_card":"summary_large_image","twitter_misc":{"\u00c9crit par":"admin NU","Dur\u00e9e de lecture estim\u00e9e":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#article","isPartOf":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/"},"author":{"name":"admin 
NU","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c"},"headline":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation","datePublished":"2026-03-29T14:44:53+00:00","mainEntityOfPage":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/"},"wordCount":1050,"commentCount":0,"publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"image":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png","articleSection":["AI","Committee","News","Uncategorized"],"inLanguage":"fr-FR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/","url":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/","name":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency Multilingual Voice Generation - 
YouZum","isPartOf":{"@id":"https:\/\/yousum.gpucore.co\/#website"},"primaryImageOfPage":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage"},"image":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png","datePublished":"2026-03-29T14:44:53+00:00","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","breadcrumb":{"@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#primaryimage","url":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png","width":1726,"height":1400},{"@type":"BreadcrumbList","@id":"https:\/\/youzum.net\/mistral-ai-releases-voxtral-tts-a-4b-open-weight-streaming-speech-model-for-low-latency-multilingual-voice-generation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/youzum.net\/"},{"@type":"ListItem","position":2,"name":"Mistral AI Releases Voxtral TTS: A 4B Open-Weight Streaming Speech Model for Low-Latency 
Multilingual Voice Generation"}]},{"@type":"WebSite","@id":"https:\/\/yousum.gpucore.co\/#website","url":"https:\/\/yousum.gpucore.co\/","name":"YouSum","description":"","publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/yousum.gpucore.co\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/yousum.gpucore.co\/#organization","name":"Drone Association Thailand","url":"https:\/\/yousum.gpucore.co\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","width":300,"height":300,"caption":"Drone Association Thailand"},"image":{"@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DroneAssociationTH\/"]},{"@type":"Person","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c","name":"admin NU","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","caption":"admin 
NU"},"url":"https:\/\/youzum.net\/fr\/members\/adminnu\/"}]}},"rttpg_featured_image_url":{"full":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png",1726,1400,false],"landscape":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png",1726,1400,false],"portraits":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png",1726,1400,false],"thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-150x150.png",150,150,true],"medium":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-300x243.png",300,243,true],"large":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-1024x831.png",1024,831,true],"1536x1536":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-1536x1246.png",1536,1246,true],"2048x2048":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA.png",1726,1400,false],"trp-custom-language-flag":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-15x12.png",15,12,true],"woocommerce_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-300x300.png",300,300,true],"woocommerce_single":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-600x487.png",600,487,true],"woocommerce_gallery_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/03\/Screenshot-2026-03-28-at-1.48.14-PM-1-QbSLZA-100x100.png",100,100,true]},"rttpg_author":{"display_name":"admin NU","author_link":"https:\/\/youzum.net\/fr\/members\/adminnu\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/youzum.net\/fr\/category\/ai-club\/\" rel=\"category 
tag\">AI<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/committee\/\" rel=\"category tag\">Committee<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/youzum.net\/fr\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","rttpg_excerpt":"Mistral AI has released Voxtral TTS, an open-weight text-to-speech model that marks the company\u2019s first major move into audio generation. Following the release of its transcription and language models, Mistral is now providing the final \u2018output layer\u2019 of the audio stack, positioning itself as a direct competitor to proprietary voice APIs in the developer ecosystem.\u2026","_links":{"self":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts\/79820","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/comments?post=79820"}],"version-history":[{"count":0,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/posts\/79820\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/media\/79821"}],"wp:attachment":[{"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/media?parent=79820"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/categories?post=79820"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/youzum.net\/fr\/wp-json\/wp\/v2\/tags?post=79820"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}