Nachrichten
Nachrichten
HIVMedQA: Benchmarking large language models for HIV medical decision support
arXiv:2507.18143v2 Announce Type: replace Abstract: Large language models (LLMs) are emerging as valuable tools to...
Highlighted at CVPR 2025: Google DeepMind’s ‘Motion Prompting’ Paper Unlocks Granular Video Control
Key Takeaways: Researchers from Google DeepMind, the University of Michigan & Brown university have developed...
High-Entropy Token Selection in Reinforcement Learning with Verifiable Rewards (RLVR) Improves Accuracy and Reduces Training Cost for LLMs
Large Language Models (LLMs) generate step-by-step responses known as Chain-of-Thoughts (CoTs), where each token contributes...
High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
arXiv:2506.04051v1 Announce Type: new Abstract: Large Language Models (LLMs) currently respond to every prompt. However...
Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents
arXiv:2507.22925v1 Announce Type: new Abstract: Long-term memory is one of the key factors influencing the...
Here’s how we picked this year’s Innovators Under 35
Next week, we’ll publish our 2025 list of Innovators Under 35, highlighting smart and talented...
HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization
arXiv:2508.04010v1 Announce Type: new Abstract: Large language models enable agents to autonomously perform tasks in...
Handling Numeric Expressions in Automatic Speech Recognition
arXiv:2408.00004v2 Announce Type: replace-cross Abstract: This paper addresses the problem of correctly formatting numeric expressions...
GuidedBench: Measuring and Mitigating the Evaluation Discrepancies of In-the-wild LLM Jailbreak Methods
arXiv:2502.16903v2 Announce Type: replace Abstract: Despite the growing interest in jailbreak methods as an effective...
Graph-R1: An Agentic GraphRAG Framework for Structured, Multi-Turn Reasoning with Reinforcement Learning
Introduction Large Language Models (LLMs) have set new benchmarks in natural language processing, but their...
Graph-Based Spectral Decomposition for Parameter Coordination in Language Model Fine-Tuning
arXiv:2504.19583v2 Announce Type: replace-cross Abstract: This paper proposes a parameter collaborative optimization algorithm for large...
GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
arXiv:2507.18043v1 Announce Type: new Abstract: Inference-time steering methods offer a lightweight alternative to fine-tuning large...