Actualités
Actualités
Can LLMs Really Judge with Reasoning? Microsoft and Tsinghua Researchers Introduce Reward Reasoning Models to Dynamically Scale Test-Time Compute for Better Alignment
Reinforcement learning (RL) has emerged as a fundamental approach in LLM post-training, utilizing supervision signals...
Can crowdsourced fact-checking curb misinformation on social media?
In a 2019 speech at Georgetown University, Mark Zuckerberg famously declared that he didn’t want...
Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding
arXiv:2505.03788v1 Announce Type: new Abstract: We introduce a novel approach for calibrating uncertainty quantification (UQ)...
ByteDance Researchers Introduce DetailFlow: A 1D Coarse-to-Fine Autoregressive Framework for Faster, Token-Efficient Image Generation
Autoregressive image generation has been shaped by advances in sequential modeling, originally seen in natural...
By putting AI into everything, Google wants to make it invisible
If you want to know where AI is headed, this year’s Google I/O has you...
Building voice AI that listens to everyone: Transfer learning and synthetic speech in action
Enterprises adopting voice AI must consider not just usability, but inclusion. Supporting users with disabilities...
Bryan Johnson wants to start a new religion in which “the body is God”
Bryan Johnson is on a mission to not die. The 47-year-old multimillionaire has already applied...
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning
arXiv:2501.18858v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in complex...
Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model
arXiv:2505.04132v1 Announce Type: new Abstract: Access to legal information is fundamental to access to justice...
Bridging Social Media and Search Engines: Dredge Words and the Detection of Unreliable Domains
arXiv:2406.11423v4 Announce Type: replace-cross Abstract: Proactive content moderation requires platforms to rapidly and continuously evaluate...
BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text
arXiv:2504.19467v2 Announce Type: replace Abstract: Large language models (LLMs) hold great promise for medical applications...
Breaking PEFT Limitations: Leveraging Weak-to-Strong Knowledge Transfer for Backdoor Attacks in LLMs
arXiv:2409.17946v4 Announce Type: replace-cross Abstract: Despite being widely applied due to their exceptional capabilities, Large...