YouZum

News

HIVMedQA: Benchmarking large language models for HIV medical decision support

arXiv:2507.18143v2 Announce Type: replace Abstract: Large language models (LLMs) are emerging as valuable tools to...

Highlighted at CVPR 2025: Google DeepMind’s ‘Motion Prompting’ Paper Unlocks Granular Video Control

Key Takeaways: Researchers from Google DeepMind, the University of Michigan, and Brown University have developed...

High-Entropy Token Selection in Reinforcement Learning with Verifiable Rewards (RLVR) Improves Accuracy and Reduces Training Cost for LLMs

Large Language Models (LLMs) generate step-by-step responses known as Chain-of-Thoughts (CoTs), where each token contributes...

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

arXiv:2506.04051v1 Announce Type: new Abstract: Large Language Models (LLMs) currently respond to every prompt. However...

Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents

arXiv:2507.22925v1 Announce Type: new Abstract: Long-term memory is one of the key factors influencing the...

Here’s how we picked this year’s Innovators Under 35

Next week, we’ll publish our 2025 list of Innovators Under 35, highlighting smart and talented...

HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization

arXiv:2508.04010v1 Announce Type: new Abstract: Large language models enable agents to autonomously perform tasks in...

Handling Numeric Expressions in Automatic Speech Recognition

arXiv:2408.00004v2 Announce Type: replace-cross Abstract: This paper addresses the problem of correctly formatting numeric expressions...

GuidedBench: Measuring and Mitigating the Evaluation Discrepancies of In-the-wild LLM Jailbreak Methods

arXiv:2502.16903v2 Announce Type: replace Abstract: Despite the growing interest in jailbreak methods as an effective...

Graph-R1: An Agentic GraphRAG Framework for Structured, Multi-Turn Reasoning with Reinforcement Learning

Introduction: Large Language Models (LLMs) have set new benchmarks in natural language processing, but their...

Graph-Based Spectral Decomposition for Parameter Coordination in Language Model Fine-Tuning

arXiv:2504.19583v2 Announce Type: replace-cross Abstract: This paper proposes a parameter collaborative optimization algorithm for large...

GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs

arXiv:2507.18043v1 Announce Type: new Abstract: Inference-time steering methods offer a lightweight alternative to fine-tuning large...
