{"id":98731,"date":"2026-06-20T18:11:02","date_gmt":"2026-06-20T18:11:02","guid":{"rendered":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/"},"modified":"2026-06-20T18:11:02","modified_gmt":"2026-06-20T18:11:02","slug":"vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline","status":"publish","type":"post","link":"https:\/\/youzum.net\/es\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/","title":{"rendered":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline"},"content":{"rendered":"<p class=\"wp-block-paragraph\">While recent breakthroughs in AI reasoning have largely been driven by massive scale, pouring in billions of parameters to cross complex cognitive thresholds\u2014<strong>VibeThinker-3B<\/strong> is charting a completely different path.<\/p>\n<p class=\"wp-block-paragraph\">Created by researchers from Sina Weibo Inc (China), this 3-billion-parameter model proves that efficiency can punch far above its weight class. Released under an open-source MIT license, VibeThinker-3B matches the performance of models hundreds of times its size on verifiable tasks like mathematics, coding, and STEM disciplines.<\/p>\n<h2 class=\"wp-block-heading\"><strong>What is VibeThinker-3B<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">VibeThinker-3B is a compact dense model built on the Qwen2.5-Coder-3B base. It is post-trained, not pretrained from scratch. The research team applies supervised fine-tuning, reinforcement learning, and self-distillation on top.<\/p>\n<p class=\"wp-block-paragraph\">The training framework continues the Spectrum-to-Signal Principle (SSP) from the earlier VibeThinker-1.5B. SFT (Supervised Fine-Tuning) builds a broad space of valid reasoning paths, the \u2018Spectrum.\u2019 RL then amplifies the correct paths, the \u2018Signal.\u2019<\/p>\n<p class=\"wp-block-paragraph\">The model targets one job: reasoning where a verifier can confirm the answer. The research team recommends larger general models for open-domain knowledge tasks. VibeThinker-3B is a specialist by design.<\/p>\n<p class=\"wp-block-paragraph\">It runs on standard stacks. The model weights require <code>transformers&gt;=4.54.0<\/code>. For faster inference it recommends <code>vLLM==0.10.1<\/code> or <code>SGLang&gt;=0.4.9.post6<\/code>. The BF16 weights are roughly 6 GB, small enough for a single GPU.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1656\" height=\"910\" data-attachment-id=\"80617\" data-permalink=\"https:\/\/www.marktechpost.com\/2026\/06\/19\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/screenshot-2026-06-19-at-3-07-21-pm-2\/\" data-orig-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1.png\" data-orig-size=\"1656,910\" data-comments-opened=\"0\" data-image-meta='{\"aperture\":\"0\",\"credit\":\"\",\"camera\":\"\",\"caption\":\"\",\"created_timestamp\":\"0\",\"copyright\":\"\",\"focal_length\":\"0\",\"iso\":\"0\",\"shutter_speed\":\"0\",\"title\":\"\",\"orientation\":\"0\",\"alt\":\"\"}' data-image-title=\"Screenshot 2026-06-19 at 3.07.21\u202fPM\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-1024x563.png\" src=\"https:\/\/www.marktechpost.com\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1.png\" alt=\"\" class=\"wp-image-80617\" \/><figcaption class=\"wp-element-caption\">https:\/\/arxiv.org\/pdf\/2606.16140v1<\/figcaption><\/figure>\n<\/div>\n<h2 class=\"wp-block-heading\"><strong>Benchmark<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">On AIME26, VibeThinker-3B scores 94.3. According to the research paper, this is comparable to DeepSeek V3.2 (671B) and Kimi K2.5 (1T).<\/p>\n<p class=\"wp-block-paragraph\">On LiveCodeBench v6, it reaches 80.2 Pass@1. On OJBench, another code benchmark, it scores 38.6, below the largest models. On HMMT25 it scores 89.3, and on BruMO25 it reaches 93.8. On IMO-AnswerBench, a 400-problem IMO-level set, it scores 76.4.<\/p>\n<p class=\"wp-block-paragraph\">The table below compares it against much larger reasoning models. The \u2018+CLR\u2019 row uses test-time scaling. It stands for Claim-Level Reliability Assessment<\/p>\n<figure class=\"wp-block-table\">\n<table class=\"has-fixed-layout\">\n<thead>\n<tr>\n<th>Model<\/th>\n<th>Params<\/th>\n<th>AIME26<\/th>\n<th>HMMT25<\/th>\n<th>IMO-Ans<\/th>\n<th>LCBv6<\/th>\n<th>GPQA-D<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>VibeThinker-3B<\/strong><\/td>\n<td><strong>3B<\/strong><\/td>\n<td><strong>94.3<\/strong><\/td>\n<td><strong>89.3<\/strong><\/td>\n<td><strong>76.4<\/strong><\/td>\n<td><strong>80.2<\/strong><\/td>\n<td><strong>70.2<\/strong><\/td>\n<\/tr>\n<tr>\n<td><strong>VibeThinker-3B +CLR<\/strong><\/td>\n<td><strong>3B<\/strong><\/td>\n<td><strong>97.1<\/strong><\/td>\n<td><strong>95.4<\/strong><\/td>\n<td><strong>80.6<\/strong><\/td>\n<td>\u2014<\/td>\n<td><strong>72.9<\/strong><\/td>\n<\/tr>\n<tr>\n<td>GPT-OSS (high)<\/td>\n<td>120B<\/td>\n<td>93.2<\/td>\n<td>90.0<\/td>\n<td>75.6<\/td>\n<td>81.9<\/td>\n<td>80.1<\/td>\n<\/tr>\n<tr>\n<td>DeepSeek V3.2<\/td>\n<td>671B<\/td>\n<td>94.2<\/td>\n<td>90.2<\/td>\n<td>78.3<\/td>\n<td>80.8<\/td>\n<td>82.4<\/td>\n<\/tr>\n<tr>\n<td>GLM-5<\/td>\n<td>744B<\/td>\n<td>95.8<\/td>\n<td>97.9<\/td>\n<td>82.5<\/td>\n<td>85.5<\/td>\n<td>86.0<\/td>\n<\/tr>\n<tr>\n<td>Kimi K2.5<\/td>\n<td>1T<\/td>\n<td>93.3<\/td>\n<td>95.4<\/td>\n<td>81.8<\/td>\n<td>85.0<\/td>\n<td>87.6<\/td>\n<\/tr>\n<\/tbody>\n<\/table><figcaption class=\"wp-element-caption\"><em>Source: VibeThinker-3B Technical Report, Table 2. GPQA-D is GPQA-Diamond.<\/em><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">The pattern is consistent. On verifiable math and code, the 3B model sits near the top cluster. On GPQA-Diamond, a knowledge-heavy benchmark, the gap to large models stays visible.<\/p>\n<p class=\"wp-block-paragraph\">The research team also ran an out-of-distribution coding test. It used recent LeetCode weekly and biweekly contests, from Apr 25 to May 31, 2026. The model passed 123 of 128 first-attempt Python submissions. That is a 96.1% acceptance rate on unseen problems.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Inside the Spectrum-to-Signal Pipeline<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">The post-training pipeline runs in four stages. Each one targets a different weakness of small reasoning models.<\/p>\n<p class=\"wp-block-paragraph\"><strong>First comes<\/strong> curriculum-based two-stage SFT. Stage 1 covers math, code, STEM, dialogue, and instruction following broadly. Stage 2 shifts to harder, longer-horizon samples filtered by reasoning length and difficulty. Diversity-Exploring Distillation preserves multiple valid solution paths through both stages.<\/p>\n<p class=\"wp-block-paragraph\"><strong>Second comes<\/strong> multi-domain Reasoning RL. The research team reuses MaxEnt-Guided Policy Optimization (MGPO). MGPO weights prompts near the model\u2019s current capability boundary, where correct and incorrect rollouts coexist. Training runs sequentially across Math, Code, and STEM.<\/p>\n<p class=\"wp-block-paragraph\">A notable detail: VibeThinker-3B drops progressive context expansion. The research team found high-truncation warm-up hurt long reasoning at this scale. So RL uses a single 64K long-context window throughout.<\/p>\n<p class=\"wp-block-paragraph\">Math RL adds a Long2Short stage. It redistributes reward among correct trajectories by length. Shorter correct answers get higher reward, longer ones lower, with the group mean unchanged. The goal is fewer redundant tokens without losing accuracy.<\/p>\n<p class=\"wp-block-paragraph\"><strong>Third,<\/strong> Offline Self-Distillation merges the RL checkpoints back into one student model. <strong>Fourth<\/strong>, Instruct RL improves instruction adherence. That stage explains the 93.4 IFEval and 74.5 IFBench scores. Both show reasoning tuning did not break controllability.<\/p>\n<h2 class=\"wp-block-heading\"><strong>CLR: Scaling at Test Time, Not Parameter Count<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">Claim-Level Reliability Assessment (CLR) is the report\u2019s test-time scaling method. It runs on answer-verifiable tasks and adds no parameters.<\/p>\n<p class=\"wp-block-paragraph\">The procedure has two steps. The model first generates K = 32 trajectories per problem. From each, it extracts M = 5 decision-relevant claims plus a final answer.<\/p>\n<p class=\"wp-block-paragraph\">The model then acts as its own verifier. It validates or falsifies each claim, producing binary verdicts. CLR maps these into a nonlinear trajectory reliability score, where one weak claim sharply lowers the weight.<\/p>\n<p class=\"wp-block-paragraph\">Answers are clustered by equivalence, and the highest reliability-weighted answer wins. The full flow runs 8 times, and the averaged Pass@1 is reported. CLR lifts AIME26 to 97.1 and BruMO25 to 99.2.<\/p>\n<p class=\"wp-block-paragraph\">The interactive demo below lets you flip claims and watch the score collapse. It also lets you switch benchmarks and compare against larger models.<\/p>\n<p><!-- VibeThinker-3B interactive explainer | paste into a WordPress \"Custom HTML\" block --><\/p>\n<p class=\"wp-block-paragraph\">\n<h2 class=\"wp-block-heading\"><strong>Use Cases With Examples<\/strong><\/h2>\n<\/p><p class=\"wp-block-paragraph\">The research team frames VibeThinker-3B as a specialist, so use cases follow the verifiable-reasoning boundary.<\/p>\n<ul class=\"wp-block-list\">\n<li><strong>Competitive math tutoring<\/strong>: It solves AIME and HMMT-style problems with full chains of reasoning. A study tool could generate worked solutions and self-check answers locally.<\/li>\n<li><strong>Algorithmic coding help<\/strong>: The 96.1% LeetCode acceptance rate suggests strong one-shot Python generation. An IDE assistant could draft contest-style solutions and run hidden tests.<\/li>\n<li><strong>Cost-sensitive RL or agent backends<\/strong>: A 3B model is cheap to serve at scale. Teams running many verifiable subtasks could route them here instead of a 600B+ model.<\/li>\n<li><strong>On-device reasoning.<\/strong> BF16 weights fit one consumer GPU. Edge or offline deployments gain a reasoning engine without cloud calls.<\/li>\n<\/ul>\n<h2 class=\"wp-block-heading\"><strong>Running It: Quick Start<\/strong><\/h2>\n<p class=\"wp-block-paragraph\">Serving with vLLM exposes an OpenAI-compatible endpoint:<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\"no-line-numbers\"><code class=\"no-wrap language-php\">pip install vllm\nvllm serve \"WeiboAI\/VibeThinker-3B\"\n\ncurl -X POST \"http:\/\/localhost:8000\/v1\/chat\/completions\" \n  -H \"Content-Type: application\/json\" \n  --data '{\n    \"model\": \"WeiboAI\/VibeThinker-3B\",\n    \"messages\": [{\"role\":\"user\",\"content\":\"Prove there are infinitely many primes.\"}],\n    \"temperature\": 1.0, \"top_p\": 0.95\n  }'<\/code><\/pre>\n<\/div>\n<\/div>\n<p class=\"wp-block-paragraph\">Direct Transformers usage mirrors the official card:<\/p>\n<div class=\"dm-code-snippet dark dm-normal-version default no-background-mobile\">\n<div class=\"control-language\">\n<div class=\"dm-buttons\">\n<div class=\"dm-buttons-left\">\n<div class=\"dm-button-snippet red-button\"><\/div>\n<div class=\"dm-button-snippet orange-button\"><\/div>\n<div class=\"dm-button-snippet green-button\"><\/div>\n<\/div>\n<div class=\"dm-buttons-right\"><a><span class=\"dm-copy-text\">Copy Code<\/span><span class=\"dm-copy-confirmed\">Copied<\/span><span class=\"dm-error-message\">Use a different Browser<\/span><\/a><\/div>\n<\/div>\n<pre class=\"no-line-numbers\"><code class=\"no-wrap language-php\">from transformers import AutoModelForCausalLM, AutoTokenizer\n\ntok = AutoTokenizer.from_pretrained(\"WeiboAI\/VibeThinker-3B\", trust_remote_code=True)\nmodel = AutoModelForCausalLM.from_pretrained(\n    \"WeiboAI\/VibeThinker-3B\", torch_dtype=\"bfloat16\", device_map=\"auto\")\n\nmsgs = [{\"role\": \"user\", \"content\": \"Your prompt\"}]\ntext = tok.apply_chat_template(msgs, tokenize=False, add_generation_prompt=True)\ninputs = tok([text], return_tensors=\"pt\").to(model.device)\nout = model.generate(**inputs, max_new_tokens=102400,\n                     do_sample=True, temperature=1.0, top_p=0.95)\nprint(tok.decode(out[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True))<\/code><\/pre>\n<\/div>\n<\/div>\n<p class=\"wp-block-paragraph\">The high <code>max_new_tokens<\/code> matters. The model produces long reasoning traces, so short caps can truncate answers.<\/p>\n<h2 class=\"wp-block-heading\"><strong>Key Takeaways<\/strong><\/h2>\n<ul class=\"wp-block-list\">\n<li>VibeThinker-3B is a 3B dense model, MIT-licensed, built on Qwen2.5-Coder-3B for verifiable reasoning.<\/li>\n<li>It scores 94.3 on AIME26, comparable to DeepSeek V3.2 (671B) and Kimi K2.5 (1T).<\/li>\n<li>CLR test-time scaling lifts AIME26 to 97.1 and BruMO25 to 99.2, with no extra parameters.<\/li>\n<li>On unseen LeetCode contests, it passed 123 of 128 first-attempt Python submissions (96.1%).<\/li>\n<li>The gain is narrow: it trails large models on GPQA-Diamond and broad open-domain knowledge.<\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\">\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n<\/p><p class=\"wp-block-paragraph\">Check out\u00a0the\u00a0<strong><a href=\"https:\/\/arxiv.org\/pdf\/2606.16140v1\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a><\/strong>, <strong><a href=\"https:\/\/huggingface.co\/WeiboAI\/VibeThinker-3B\" target=\"_blank\" rel=\"noreferrer noopener\">Model weight<\/a><\/strong> and <strong><a href=\"https:\/\/github.com\/WeiboAI\/VibeThinker\" target=\"_blank\" rel=\"noreferrer noopener\">Repo<\/a><\/strong>.<strong>\u00a0<\/strong>Also,\u00a0feel free to follow us on\u00a0<strong><a href=\"https:\/\/x.com\/intent\/follow?screen_name=marktechpost\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Twitter<\/mark><\/a><\/strong>\u00a0and don\u2019t forget to join our\u00a0<strong><a href=\"https:\/\/www.reddit.com\/r\/machinelearningnews\/\" target=\"_blank\" rel=\"noreferrer noopener\">150k+ML SubReddit<\/a><\/strong>\u00a0and Subscribe to\u00a0<strong><a href=\"https:\/\/www.aidevsignals.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">our Newsletter<\/a><\/strong>. Wait! are you on telegram?\u00a0<strong><a href=\"https:\/\/t.me\/machinelearningresearchnews\" target=\"_blank\" rel=\"noreferrer noopener\">now you can join us on telegram as well.<\/a><\/strong><\/p>\n<p class=\"wp-block-paragraph\">Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.?\u00a0<strong><a href=\"https:\/\/forms.gle\/wbash1wF6efRj8G58\" target=\"_blank\" rel=\"noreferrer noopener\"><mark>Connect with us<\/mark><\/a><\/strong><\/p>\n<p>The post <a href=\"https:\/\/www.marktechpost.com\/2026\/06\/19\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\">VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline<\/a> appeared first on <a href=\"https:\/\/www.marktechpost.com\/\">MarkTechPost<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>While recent breakthroughs in AI reasoning have largely been driven by massive scale, pouring in billions of parameters to cross complex cognitive thresholds\u2014VibeThinker-3B is charting a completely different path. Created by researchers from Sina Weibo Inc (China), this 3-billion-parameter model proves that efficiency can punch far above its weight class. Released under an open-source MIT license, VibeThinker-3B matches the performance of models hundreds of times its size on verifiable tasks like mathematics, coding, and STEM disciplines. What is VibeThinker-3B VibeThinker-3B is a compact dense model built on the Qwen2.5-Coder-3B base. It is post-trained, not pretrained from scratch. The research team applies supervised fine-tuning, reinforcement learning, and self-distillation on top. The training framework continues the Spectrum-to-Signal Principle (SSP) from the earlier VibeThinker-1.5B. SFT (Supervised Fine-Tuning) builds a broad space of valid reasoning paths, the \u2018Spectrum.\u2019 RL then amplifies the correct paths, the \u2018Signal.\u2019 The model targets one job: reasoning where a verifier can confirm the answer. The research team recommends larger general models for open-domain knowledge tasks. VibeThinker-3B is a specialist by design. It runs on standard stacks. The model weights require transformers&gt;=4.54.0. For faster inference it recommends vLLM==0.10.1 or SGLang&gt;=0.4.9.post6. The BF16 weights are roughly 6 GB, small enough for a single GPU. https:\/\/arxiv.org\/pdf\/2606.16140v1 Benchmark On AIME26, VibeThinker-3B scores 94.3. According to the research paper, this is comparable to DeepSeek V3.2 (671B) and Kimi K2.5 (1T). On LiveCodeBench v6, it reaches 80.2 Pass@1. On OJBench, another code benchmark, it scores 38.6, below the largest models. On HMMT25 it scores 89.3, and on BruMO25 it reaches 93.8. On IMO-AnswerBench, a 400-problem IMO-level set, it scores 76.4. The table below compares it against much larger reasoning models. The \u2018+CLR\u2019 row uses test-time scaling. It stands for Claim-Level Reliability Assessment Model Params AIME26 HMMT25 IMO-Ans LCBv6 GPQA-D VibeThinker-3B 3B 94.3 89.3 76.4 80.2 70.2 VibeThinker-3B +CLR 3B 97.1 95.4 80.6 \u2014 72.9 GPT-OSS (high) 120B 93.2 90.0 75.6 81.9 80.1 DeepSeek V3.2 671B 94.2 90.2 78.3 80.8 82.4 GLM-5 744B 95.8 97.9 82.5 85.5 86.0 Kimi K2.5 1T 93.3 95.4 81.8 85.0 87.6 Source: VibeThinker-3B Technical Report, Table 2. GPQA-D is GPQA-Diamond. The pattern is consistent. On verifiable math and code, the 3B model sits near the top cluster. On GPQA-Diamond, a knowledge-heavy benchmark, the gap to large models stays visible. The research team also ran an out-of-distribution coding test. It used recent LeetCode weekly and biweekly contests, from Apr 25 to May 31, 2026. The model passed 123 of 128 first-attempt Python submissions. That is a 96.1% acceptance rate on unseen problems. Inside the Spectrum-to-Signal Pipeline The post-training pipeline runs in four stages. Each one targets a different weakness of small reasoning models. First comes curriculum-based two-stage SFT. Stage 1 covers math, code, STEM, dialogue, and instruction following broadly. Stage 2 shifts to harder, longer-horizon samples filtered by reasoning length and difficulty. Diversity-Exploring Distillation preserves multiple valid solution paths through both stages. Second comes multi-domain Reasoning RL. The research team reuses MaxEnt-Guided Policy Optimization (MGPO). MGPO weights prompts near the model\u2019s current capability boundary, where correct and incorrect rollouts coexist. Training runs sequentially across Math, Code, and STEM. A notable detail: VibeThinker-3B drops progressive context expansion. The research team found high-truncation warm-up hurt long reasoning at this scale. So RL uses a single 64K long-context window throughout. Math RL adds a Long2Short stage. It redistributes reward among correct trajectories by length. Shorter correct answers get higher reward, longer ones lower, with the group mean unchanged. The goal is fewer redundant tokens without losing accuracy. Third, Offline Self-Distillation merges the RL checkpoints back into one student model. Fourth, Instruct RL improves instruction adherence. That stage explains the 93.4 IFEval and 74.5 IFBench scores. Both show reasoning tuning did not break controllability. CLR: Scaling at Test Time, Not Parameter Count Claim-Level Reliability Assessment (CLR) is the report\u2019s test-time scaling method. It runs on answer-verifiable tasks and adds no parameters. The procedure has two steps. The model first generates K = 32 trajectories per problem. From each, it extracts M = 5 decision-relevant claims plus a final answer. The model then acts as its own verifier. It validates or falsifies each claim, producing binary verdicts. CLR maps these into a nonlinear trajectory reliability score, where one weak claim sharply lowers the weight. Answers are clustered by equivalence, and the highest reliability-weighted answer wins. The full flow runs 8 times, and the averaged Pass@1 is reported. CLR lifts AIME26 to 97.1 and BruMO25 to 99.2. The interactive demo below lets you flip claims and watch the score collapse. It also lets you switch benchmarks and compare against larger models. Use Cases With Examples The research team frames VibeThinker-3B as a specialist, so use cases follow the verifiable-reasoning boundary. Competitive math tutoring: It solves AIME and HMMT-style problems with full chains of reasoning. A study tool could generate worked solutions and self-check answers locally. Algorithmic coding help: The 96.1% LeetCode acceptance rate suggests strong one-shot Python generation. An IDE assistant could draft contest-style solutions and run hidden tests. Cost-sensitive RL or agent backends: A 3B model is cheap to serve at scale. Teams running many verifiable subtasks could route them here instead of a 600B+ model. On-device reasoning. BF16 weights fit one consumer GPU. Edge or offline deployments gain a reasoning engine without cloud calls. Running It: Quick Start Serving with vLLM exposes an OpenAI-compatible endpoint: Copy CodeCopiedUse a different Browser pip install vllm vllm serve &#8220;WeiboAI\/VibeThinker-3B&#8221; curl -X POST &#8220;http:\/\/localhost:8000\/v1\/chat\/completions&#8221; -H &#8220;Content-Type: application\/json&#8221; &#8211;data &#8216;{ &#8220;model&#8221;: &#8220;WeiboAI\/VibeThinker-3B&#8221;, &#8220;messages&#8221;: [{&#8220;role&#8221;:&#8221;user&#8221;,&#8221;content&#8221;:&#8221;Prove there are infinitely many primes.&#8221;}], &#8220;temperature&#8221;: 1.0, &#8220;top_p&#8221;: 0.95 }&#8217; Direct Transformers usage mirrors the official card: Copy CodeCopiedUse a different Browser from transformers import AutoModelForCausalLM, AutoTokenizer tok = AutoTokenizer.from_pretrained(&#8220;WeiboAI\/VibeThinker-3B&#8221;, trust_remote_code=True) model = AutoModelForCausalLM.from_pretrained( &#8220;WeiboAI\/VibeThinker-3B&#8221;, torch_dtype=&#8221;bfloat16&#8243;, device_map=&#8221;auto&#8221;) msgs = [{&#8220;role&#8221;: &#8220;user&#8221;, &#8220;content&#8221;: &#8220;Your prompt&#8221;}] text = tok.apply_chat_template(msgs, tokenize=False, add_generation_prompt=True) inputs = tok([text], return_tensors=&#8221;pt&#8221;).to(model.device) out = model.generate(**inputs, max_new_tokens=102400, do_sample=True, temperature=1.0, top_p=0.95) print(tok.decode(out[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True)) The high max_new_tokens matters. The model produces long reasoning traces, so short caps can truncate answers. Key Takeaways VibeThinker-3B is a 3B dense model,<\/p>","protected":false},"author":2,"featured_media":98732,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"pmpro_default_level":"","site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"_pvb_checkbox_block_on_post":false,"footnotes":""},"categories":[52,5,7,1],"tags":[],"class_list":["post-98731","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-club","category-committee","category-news","category-uncategorized","pmpro-has-access"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.3 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum<\/title>\n<meta name=\"description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/youzum.net\/es\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum\" \/>\n<meta property=\"og:description\" content=\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\" \/>\n<meta property=\"og:url\" content=\"https:\/\/youzum.net\/es\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\" \/>\n<meta property=\"og:site_name\" content=\"YouZum\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DroneAssociationTH\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-20T18:11:02+00:00\" \/>\n<meta name=\"author\" content=\"admin NU\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin NU\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\"},\"author\":{\"name\":\"admin NU\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\"},\"headline\":\"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline\",\"datePublished\":\"2026-06-20T18:11:02+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\"},\"wordCount\":1064,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp\",\"articleSection\":[\"AI\",\"Committee\",\"News\",\"Uncategorized\"],\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\",\"url\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\",\"name\":\"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum\",\"isPartOf\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp\",\"datePublished\":\"2026-06-20T18:11:02+00:00\",\"description\":\"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19\",\"breadcrumb\":{\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp\",\"width\":1656,\"height\":910},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/youzum.net\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/yousum.gpucore.co\/#website\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"name\":\"YouSum\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/yousum.gpucore.co\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"es\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/yousum.gpucore.co\/#organization\",\"name\":\"Drone Association Thailand\",\"url\":\"https:\/\/yousum.gpucore.co\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png\",\"width\":300,\"height\":300,\"caption\":\"Drone Association Thailand\"},\"image\":{\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/DroneAssociationTH\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c\",\"name\":\"admin NU\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"contentUrl\":\"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png\",\"caption\":\"admin NU\"},\"url\":\"https:\/\/youzum.net\/es\/members\/adminnu\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/youzum.net\/es\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/","og_locale":"es_ES","og_type":"article","og_title":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum","og_description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","og_url":"https:\/\/youzum.net\/es\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/","og_site_name":"YouZum","article_publisher":"https:\/\/www.facebook.com\/DroneAssociationTH\/","article_published_time":"2026-06-20T18:11:02+00:00","author":"admin NU","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"admin NU","Tiempo de lectura":"6 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#article","isPartOf":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/"},"author":{"name":"admin NU","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c"},"headline":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline","datePublished":"2026-06-20T18:11:02+00:00","mainEntityOfPage":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/"},"wordCount":1064,"commentCount":0,"publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"image":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp","articleSection":["AI","Committee","News","Uncategorized"],"inLanguage":"es","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/","url":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/","name":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline - YouZum","isPartOf":{"@id":"https:\/\/yousum.gpucore.co\/#website"},"primaryImageOfPage":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage"},"image":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage"},"thumbnailUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp","datePublished":"2026-06-20T18:11:02+00:00","description":"\u0e01\u0e34\u0e08\u0e01\u0e23\u0e23\u0e21\u0e40\u0e01\u0e35\u0e48\u0e22\u0e27\u0e01\u0e31\u0e1a\u0e42\u0e14\u0e23\u0e19","breadcrumb":{"@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/"]}]},{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#primaryimage","url":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp","width":1656,"height":910},{"@type":"BreadcrumbList","@id":"https:\/\/youzum.net\/vibethinker-3b-a-3b-dense-reasoning-model-built-on-qwen2-5-coder-3b-with-the-spectrum-to-signal-post-training-pipeline\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/youzum.net\/"},{"@type":"ListItem","position":2,"name":"VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline"}]},{"@type":"WebSite","@id":"https:\/\/yousum.gpucore.co\/#website","url":"https:\/\/yousum.gpucore.co\/","name":"YouSum","description":"","publisher":{"@id":"https:\/\/yousum.gpucore.co\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/yousum.gpucore.co\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"es"},{"@type":"Organization","@id":"https:\/\/yousum.gpucore.co\/#organization","name":"Drone Association Thailand","url":"https:\/\/yousum.gpucore.co\/","logo":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/2024\/11\/tranparent-logo.png","width":300,"height":300,"caption":"Drone Association Thailand"},"image":{"@id":"https:\/\/yousum.gpucore.co\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DroneAssociationTH\/"]},{"@type":"Person","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/97fa48242daf3908e4d9a5f26f4a059c","name":"admin NU","image":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/yousum.gpucore.co\/#\/schema\/person\/image\/","url":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","contentUrl":"https:\/\/youzum.net\/wp-content\/uploads\/avatars\/2\/1746849356-bpfull.png","caption":"admin NU"},"url":"https:\/\/youzum.net\/es\/members\/adminnu\/"}]}},"rttpg_featured_image_url":{"full":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp",1656,910,false],"landscape":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp",1656,910,false],"portraits":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp",1656,910,false],"thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-150x150.webp",150,150,true],"medium":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-300x165.webp",300,165,true],"large":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-1024x563.webp",1024,563,true],"1536x1536":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-1536x844.webp",1536,844,true],"2048x2048":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW.webp",1656,910,false],"trp-custom-language-flag":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-18x10.webp",18,10,true],"woocommerce_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-300x300.webp",300,300,true],"woocommerce_single":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-600x330.webp",600,330,true],"woocommerce_gallery_thumbnail":["https:\/\/youzum.net\/wp-content\/uploads\/2026\/06\/Screenshot-2026-06-19-at-3.07.21-PM-1-RQOwnW-100x100.webp",100,100,true]},"rttpg_author":{"display_name":"admin NU","author_link":"https:\/\/youzum.net\/es\/members\/adminnu\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/youzum.net\/es\/category\/ai-club\/\" rel=\"category tag\">AI<\/a> <a href=\"https:\/\/youzum.net\/es\/category\/committee\/\" rel=\"category tag\">Committee<\/a> <a href=\"https:\/\/youzum.net\/es\/category\/news\/\" rel=\"category tag\">News<\/a> <a href=\"https:\/\/youzum.net\/es\/category\/uncategorized\/\" rel=\"category tag\">Uncategorized<\/a>","rttpg_excerpt":"While recent breakthroughs in AI reasoning have largely been driven by massive scale, pouring in billions of parameters to cross complex cognitive thresholds\u2014VibeThinker-3B is charting a completely different path. Created by researchers from Sina Weibo Inc (China), this 3-billion-parameter model proves that efficiency can punch far above its weight class. Released under an open-source MIT&hellip;","_links":{"self":[{"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/posts\/98731","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/comments?post=98731"}],"version-history":[{"count":0,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/posts\/98731\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/media\/98732"}],"wp:attachment":[{"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/media?parent=98731"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/categories?post=98731"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/youzum.net\/es\/wp-json\/wp\/v2\/tags?post=98731"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}