{"id":32052,"date":"2025-08-16T06:06:25","date_gmt":"2025-08-16T06:06:25","guid":{"rendered":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/"},"modified":"2025-08-16T06:06:25","modified_gmt":"2025-08-16T06:06:25","slug":"nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages","status":"publish","type":"post","link":"https:\/\/youzum.net\/ja\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/","title":{"rendered":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages"},"content":{"rendered":"<p>Nvidia has taken a major leap in the development of multilingual speech AI, unveiling <strong>Granary<\/strong>, the largest open-source speech dataset for European languages, and two state-of-the-art models: <strong>Canary-1b-v2<\/strong> and <strong>Parakeet-tdt-0.6b-v3<\/strong>. This release sets a new standard for accessible, high-quality resources in automatic speech recognition (ASR) and speech translation (AST), especially for underrepresented European languages.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Granary: The Foundation of Multilingual Speech AI<\/strong><\/h3>\n<p><strong>Granary<\/strong> is a massive, multilingual corpus developed in collaboration with Carnegie Mellon University and Fondazione Bruno Kessler. It delivers around <strong>one million hours of audio<\/strong>, with <strong>650,000 hours for speech recognition<\/strong> and <strong>350,000 for speech translation<\/strong>. The dataset covers 25 European languages\u2014representing nearly all official EU languages, plus Russian and Ukrainian\u2014with a critical focus on languages with limited annotated data, such as Croatian, Estonian, and Maltese.<\/p>\n<h4 class=\"wp-block-heading\"><strong>Key features:<\/strong><\/h4>\n<ul class=\"wp-block-list\">\n<li><strong>Largest open-source speech dataset<\/strong> for 25 European languages.<\/li>\n<li><strong>Pseudo-labeling pipeline:<\/strong> Unlabeled public audio data is processed using Nvidia NeMo\u2019s Speech Data Processor, which adds structure and enhances quality, reducing the need for resource-intensive manual annotation.<\/li>\n<li><strong>Supports both ASR and AST:<\/strong> Designed for transcription and translation tasks.<\/li>\n<li><strong>Open access:<\/strong> Available to the global developer community for flexible, production-scale model training.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"703\" data-attachment-id=\"73669\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/15\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/image-92\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-16.png\" data-orig-size=\"1509,1036\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-16-300x206.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-16-1024x703.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-16-1024x703.png\" alt=\"\" class=\"wp-image-73669\" \/><\/figure>\n<\/div>\n<p>By leveraging clean, high-quality data, <strong>Granary enables significantly faster model convergence<\/strong>. Research demonstrates that developers need <strong>half as much Granary data to reach target accuracies compared to competing datasets<\/strong>, making it especially valuable for resource-constrained languages and rapid prototyping.<\/p>\n<h3 class=\"wp-block-heading\"><strong>Canary-1b-v2: Multilingual ASR + Translation (En <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/16.0.1\/72x72\/2194.png\" alt=\"\u2194\" class=\"wp-smiley\" \/> 24 Languages)<\/strong><\/h3>\n<p><strong>Canary-1b-v2<\/strong> is a <strong>billion-parameter Encoder-Decoder model<\/strong> trained on Granary, delivering high-quality transcription and translation between English and 24 supported European languages. <\/p>\n<h4 class=\"wp-block-heading\"><strong>It\u2019s architected for accuracy and multitask capabilities:<\/strong><\/h4>\n<ul class=\"wp-block-list\">\n<li><strong>Languages supported:<\/strong> 25 European languages, doubling Canary\u2019s coverage from 4.<\/li>\n<li><strong>State-of-the-art performance:<\/strong> Comparable accuracy to models three times larger, but <strong>up to 10\u00d7 faster inference<\/strong>.<\/li>\n<li><strong>Multitask capability:<\/strong> Robust across both ASR and AST tasks.<\/li>\n<li><strong>Features:<\/strong> Automatic punctuation, capitalization, word and segment-level timestamps\u2014even timestamped translated outputs.<\/li>\n<li><strong>Architecture:<\/strong> FastConformer Encoder with Transformer Decoder; unified vocabulary for all languages via SentencePiece tokenizer.<\/li>\n<li><strong>Robustness:<\/strong> Maintains strong performance under noisy conditions and resists output hallucinations.<\/li>\n<\/ul>\n<h4 class=\"wp-block-heading\"><strong>Evaluation highlights:<\/strong><\/h4>\n<ul class=\"wp-block-list\">\n<li><strong>ASR Word Error Rate (WER):<\/strong> 7.15% (AMI dataset), 10.82% (LibriSpeech Clean).<\/li>\n<li><strong>AST COMET Scores:<\/strong> 79.3 (X\u2192English), 84.56 (English\u2192X).<\/li>\n<li><strong>Deployment:<\/strong> Available under CC BY 4.0 license; optimized for Nvidia GPU-accelerated systems, enabling fast training and inference for scalable production use.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" width=\"1024\" height=\"637\" data-attachment-id=\"73667\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/15\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/image-90\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-14.png\" data-orig-size=\"1539,957\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"image\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-14-300x187.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-14-1024x637.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/image-14-1024x637.png\" alt=\"\" class=\"wp-image-73667\" \/><\/figure>\n<\/div>\n<h3 class=\"wp-block-heading\"><strong>Parakeet-tdt-0.6b-v3: Real-Time Multilingual ASR<\/strong><\/h3>\n<p><strong>Parakeet-tdt-0.6b-v3<\/strong> is a <strong>600-million-parameter multilingual ASR model<\/strong> designed for high-throughput or large-volume transcription in all 25 supported languages. It extends the Parakeet family (previously English-centric) to full European coverage.<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Automatic language detection:<\/strong> Transcribes input audio without needing extra prompts.<\/li>\n<li><strong>Real-time capability:<\/strong> Efficiently transcribes up to 24-minute audio segments in a single inference pass.<\/li>\n<li><strong>Fast, scalable, and commercial-ready:<\/strong> Prioritizes low latency, batch processing, and accurate outputs, with word-level timestamps, punctuation, and capitalization.<\/li>\n<li><strong>Robustness:<\/strong> Reliable even on complex content (numbers, lyrics) and challenging audio conditions.<\/li>\n<\/ul>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large is-resized\"><img decoding=\"async\" width=\"1024\" height=\"644\" data-attachment-id=\"73666\" data-permalink=\"https:\/\/www.marktechpost.com\/2025\/08\/15\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/screenshot-2025-08-15-at-10-25-40-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-15-at-10.25.40-PM-1.png\" data-orig-size=\"1294,814\" data-comments-opened=\"1\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\"}' data-image-title=\"Screenshot 2025-08-15 at 10.25.40\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-15-at-10.25.40-PM-1-300x189.png\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-15-at-10.25.40-PM-1-1024x644.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2025\/08\/Screenshot-2025-08-15-at-10.25.40-PM-1-1024x644.png\" alt=\"\" class=\"wp-image-73666\" \/><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Impact on Speech AI Development<\/strong><\/h2>\n<p>Nvidia\u2019s Granary dataset and model suite accelerate the democratization of speech AI for Europe, enabling scalable development of:<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Multilingual chatbots<\/strong><\/li>\n<li><strong>Customer service voice agents<\/strong><\/li>\n<li><strong>Near-real-time translation services<\/strong><\/li>\n<\/ul>\n<p>Developers, researchers, and businesses can now build inclusive, high-quality applications supporting linguistic diversity, with open access to these super cool models and datasets<\/p>\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<p>Check out the\u00a0<strong><a href=\"https:\/\/huggingface.co\/datasets\/nvidia\/granary\" target=\"_blank\" rel=\"noreferrer noopener\">Granary<\/a>,<\/strong> <a href=\"https:\/\/huggingface.co\/nvidia\/canary-1b-v2\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>NVIDIA Canary-1b-v2<\/strong><\/a> and <a href=\"https:\/\/huggingface.co\/nvidia\/parakeet-tdt-0.6b-v3\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>NVIDIA Parakeet-tdt-0.6b-v3<\/strong><\/a>. Feel free to check out our\u00a0<strong><mark><a href=\"https:\/\/github.com\/Marktechpost\/AI-Tutorial-Codes-Included\" target=\"_blank\" rel=\"noreferrer noopener\">GitHub Page for Tutorials, Codes and Notebooks<\/a><\/mark><\/strong>.\u00a0Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">100k+ ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>.<\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2025\/08\/15\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\">NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Nvidia has taken a major leap in the development of multilingual speech AI, unveiling Granary, the largest open-source speech dataset for European languages, and two state-of-the-art models: Canary-1b-v2 and Parakeet-tdt-0.6b-v3. This release sets a new standard for accessible, high-quality resources in automatic speech recognition (ASR) and speech translation (AST), especially for underrepresented European languages. Granary: The Foundation of Multilingual Speech AI Granary is a massive, multilingual corpus developed in collaboration with Carnegie Mellon University and Fondazione Bruno Kessler. It delivers around one million hours of audio, with 650,000 hours for speech recognition and 350,000 for speech translation. The dataset covers 25 European languages\u2014representing nearly all official EU languages, plus Russian and Ukrainian\u2014with a critical focus on languages with limited annotated data, such as Croatian, Estonian, and Maltese. Key features: Largest open-source speech dataset for 25 European languages. Pseudo-labeling pipeline: Unlabeled public audio data is processed using Nvidia NeMo\u2019s Speech Data Processor, which adds structure and enhances quality, reducing the need for resource-intensive manual annotation. Supports both ASR and AST: Designed for transcription and translation tasks. Open access: Available to the global developer community for flexible, production-scale model training. By leveraging clean, high-quality data, Granary enables significantly faster model convergence. Research demonstrates that developers need half as much Granary data to reach target accuracies compared to competing datasets, making it especially valuable for resource-constrained languages and rapid prototyping. Canary-1b-v2: Multilingual ASR + Translation (En 24 Languages) Canary-1b-v2 is a billion-parameter Encoder-Decoder model trained on Granary, delivering high-quality transcription and translation between English and 24 supported European languages. It\u2019s architected for accuracy and multitask capabilities: Languages supported: 25 European languages, doubling Canary\u2019s coverage from 4. State-of-the-art performance: Comparable accuracy to models three times larger, but up to 10\u00d7 faster inference. Multitask capability: Robust across both ASR and AST tasks. Features: Automatic punctuation, capitalization, word and segment-level timestamps\u2014even timestamped translated outputs. Architecture: FastConformer Encoder with Transformer Decoder; unified vocabulary for all languages via SentencePiece tokenizer. Robustness: Maintains strong performance under noisy conditions and resists output hallucinations. Evaluation highlights: ASR Word Error Rate (WER): 7.15% (AMI dataset), 10.82% (LibriSpeech Clean). AST COMET Scores: 79.3 (X\u2192English), 84.56 (English\u2192X). Deployment: Available under CC BY 4.0 license; optimized for Nvidia GPU-accelerated systems, enabling fast training and inference for scalable production use. Parakeet-tdt-0.6b-v3: Real-Time Multilingual ASR Parakeet-tdt-0.6b-v3 is a 600-million-parameter multilingual ASR model designed for high-throughput or large-volume transcription in all 25 supported languages. It extends the Parakeet family (previously English-centric) to full European coverage. Automatic language detection: Transcribes input audio without needing extra prompts. Real-time capability: Efficiently transcribes up to 24-minute audio segments in a single inference pass. Fast, scalable, and commercial-ready: Prioritizes low latency, batch processing, and accurate outputs, with word-level timestamps, punctuation, and capitalization. Robustness: Reliable even on complex content (numbers, lyrics) and challenging audio conditions. Impact on Speech AI Development Nvidia\u2019s Granary dataset and model suite accelerate the democratization of speech AI for Europe, enabling scalable development of: Multilingual chatbots Customer service voice agents Near-real-time translation services Developers, researchers, and businesses can now build inclusive, high-quality applications supporting linguistic diversity, with open access to these super cool models and datasets Check out the\u00a0Granary, NVIDIA Canary-1b-v2 and NVIDIA Parakeet-tdt-0.6b-v3. Feel free to check out our\u00a0GitHub Page for Tutorials, Codes and Notebooks.\u00a0Also,\u00a0feel free to follow us on\u00a0Twitter\u00a0and don\u2019t forget to join our\u00a0100k+ ML SubReddit\u00a0and Subscribe to\u00a0our Newsletter. The post NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages appeared first on MarkTechPost.<\/p>","protected":false},"author":2,"featured_media":32053,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"pmpro_default_level":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"_pvb_checkbox_block_on_post":false,"footnotes":""},"categories":[52,5,7,1],"tags":[],"class_list":["post-32052","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-club","category-committee","category-news","category-uncategorized","pmpro-has-access"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum<\/title>\n<meta name=\"description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/youzum.net\/ja\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\" \/>\n<meta property=\"og:locale\" content=\"ja_JP\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum\" \/>\n<meta property=\"og:description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta property=\"og:url\" content=\"https:\/\/youzum.net\/ja\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\" \/>\n<meta property=\"og:site_name\" content=\"YouZum\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DroneAssociationTH\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-08-16T06:06:25+00:00\" \/>\n<meta name=\"author\" content=\"admin NU\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u57f7\u7b46\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin NU\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\"},\"author\":{\"name\":\"admin NU\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\"},\"headline\":\"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages\",\"datePublished\":\"2025-08-16T06:06:25+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\"},\"wordCount\":600,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png\",\"articleSection\":[\"AI\",\"Committee\",\"News\",\"Uncategorized\"],\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\",\"url\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\",\"name\":\"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum\",\"isPartOf\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png\",\"datePublished\":\"2025-08-16T06:06:25+00:00\",\"description\":\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\",\"breadcrumb\":{\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#breadcrumb\"},\"inLanguage\":\"ja\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png\",\"width\":1024,\"height\":703},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/youzum.net\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/yousum.gpucore.co\/#website\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"name\":\"YouSum\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/yousum.gpucore.co\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ja\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\",\"name\":\"Drone Association Thailand\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"width\":300,\"height\":300,\"caption\":\"Drone Association Thailand\"},\"image\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/DroneAssociationTH\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\",\"name\":\"admin NU\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ja\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"caption\":\"admin NU\"},\"url\":\"https:\/\/youzum.net\/ja\/members\/adminnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/youzum.net\/ja\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/","og_locale":"ja_JP","og_type":"article","og_title":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum","og_description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","og_url":"https:\/\/youzum.net\/ja\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/","og_site_name":"YouZum","article_publisher":"https:\/\/www.facebook.com\/DroneAssociationTH\/","article_published_time":"2025-08-16T06:06:25+00:00","author":"admin NU","twitter_card":"summary_large_image","twitter_misc":{"\u57f7\u7b46\u8005":"admin NU","\u63a8\u5b9a\u8aad\u307f\u53d6\u308a\u6642\u9593":"3\u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#article","isPartOf":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/"},"author":{"name":"admin NU","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c"},"headline":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages","datePublished":"2025-08-16T06:06:25+00:00","mainEntityOfPage":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/"},"wordCount":600,"commentCount":0,"publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"image":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png","articleSection":["AI","Committee","News","Uncategorized"],"inLanguage":"ja","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/","url":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/","name":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages - YouZum","isPartOf":{"@id":"https:\/\/yousum.gpucore.co\/#website"},"primaryImageOfPage":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage"},"image":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png","datePublished":"2025-08-16T06:06:25+00:00","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","breadcrumb":{"@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#breadcrumb"},"inLanguage":"ja","potentialAction":[{"@type":"ReadAction","target":["https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/"]}]},{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#primaryimage","url":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png","width":1024,"height":703},{"@type":"BreadcrumbList","@id":"https:\/\/youzum.net\/nvidia-ai-just-released-the-largest-open-source-speech-ai-dataset-and-state-of-the-art-models-for-european-languages\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/youzum.net\/"},{"@type":"ListItem","position":2,"name":"NVIDIA AI Just Released the Largest Open-Source Speech AI Dataset and State-of-the-Art Models for European Languages"}]},{"@type":"WebSite","@id":"https:\/\/yousum.gpucore.co\/#website","url":"https:\/\/yousum.gpucore.co\/","name":"YouSum","description":"","publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/yousum.gpucore.co\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ja"},{"@type":"Organization","@id":"https:\/\/yousum.gpucore.co\/#organization","name":"Drone Association Thailand","url":"https:\/\/yousum.gpucore.co\/","logo":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","width":300,"height":300,"caption":"Drone Association Thailand"},"image":{"@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DroneAssociationTH\/"]},{"@type":"Person","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c","name":"admin NU","image":{"@type":"ImageObject","inLanguage":"ja","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","caption":"admin NU"},"url":"https:\/\/youzum.net\/ja\/members\/adminnu\/"}]}},"rttpg_featured_image_url":{"full":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"landscape":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"portraits":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-150x150.png",150,150,true],"medium":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-300x206.png",300,206,true],"large":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"1536x1536":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"2048x2048":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM.png",1024,703,false],"trp-custom-language-flag":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-18x12.png",18,12,true],"woocommerce_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-300x300.png",300,300,true],"woocommerce_single":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-600x412.png",600,412,true],"woocommerce_gallery_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2025\/08\/image-16-1024x703-VTZAJM-100x100.png",100,100,true]},"rttpg_author":{"display_name":"admin NU","author_link":"https:\/\/youzum.net\/ja\/members\/adminnu\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/youzum.net\/ja\/category\/ai-club\/\" rel=\"category tag\">AI<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/committee\/\" rel=\"category tag\">Committee<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/youzum.net\/ja\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","rttpg_excerpt":"Nvidia has taken a major leap in the development of multilingual speech AI, unveiling Granary, the largest open-source speech dataset for European languages, and two state-of-the-art models: Canary-1b-v2 and Parakeet-tdt-0.6b-v3. This release sets a new standard for accessible, high-quality resources in automatic speech recognition (ASR) and speech translation (AST), especially for underrepresented European languages. Granary:&hellip;","_links":{"self":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts\/32052","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/comments?post=32052"}],"version-history":[{"count":0,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/posts\/32052\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/media\/32053"}],"wp:attachment":[{"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/media?parent=32052"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/categories?post=32052"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/youzum.net\/ja\/wp-json\/wp\/v2\/tags?post=32052"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}