Noticias
Noticias
Towards Understanding the Cognitive Habits of Large Reasoning Models
arXiv:2506.21571v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs), which autonomously produce a reasoning Chain...
Time Magazine appears to accidentally publish embargoed story confirming new Anthropic model
Someone also appears to have published a full scrape of the Time article online on...
Three takeaways about AI’s energy use and climate impacts
This week, we published Power Hungry, a package all about AI and energy. At the...
This data set helps researchers spot harmful stereotypes in LLMs
AI models are riddled with culturally specific biases. A new data set, called SHADES, is...
This AI Paper Investigates Test-Time Scaling of English-Centric RLMs for Enhanced Multilingual Reasoning and Domain Generalization
Reasoning language models, or RLMs, are increasingly used to simulate step-by-step problem-solving by generating long...
This AI Paper Introduces WINGS: A Dual-Learner Architecture to Prevent Text-Only Forgetting in Multimodal Large Language Models
Multimodal LLMs: Expanding Capabilities Across Text and Vision Expanding large language models (LLMs) to handle...
This AI Paper Introduces WEB-SHEPHERD: A Process Reward Model for Web Agents with 40K Dataset and 10× Cost Efficiency
Web navigation focuses on teaching machines how to interact with websites to perform tasks such...
This AI Paper Introduces MathCoder-VL and FigCodifier: Advancing Multimodal Mathematical Reasoning with Vision-to-Code Alignment
Multimodal mathematical reasoning enables machines to solve problems involving textual information and visual components like...
This AI Paper Introduces LLaDA-V: A Purely Diffusion-Based Multimodal Large Language Model for Visual Instruction Tuning and Multimodal Reasoning
Multimodal large language models (MLLMs) are designed to process and generate content across various modalities...
This AI Paper Introduces Group Think: A Token-Level Multi-Agent Reasoning Paradigm for Faster and Collaborative LLM Inference
A prominent area of exploration involves enabling large language models (LLMs) to function collaboratively. Multi-agent...
This AI Paper Introduces GRIT: A Method for Teaching MLLMs to Reason with Images by Interleaving Text and Visual Grounding
The core idea of Multimodal Large Language Models (MLLMs) is to create models that can...
This AI Paper Introduces Differentiable MCMC Layers: A New AI Framework for Learning with Inexact Combinatorial Solvers in Neural Networks
Neural networks have long been powerful tools for handling complex data-driven tasks. Still, they often...