ニュース
ニュース
Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents
arXiv:2507.22925v1 Announce Type: new Abstract: Long-term memory is one of the key factors influencing the...
Here’s how we picked this year’s Innovators Under 35
Next week, we’ll publish our 2025 list of Innovators Under 35, highlighting smart and talented...
HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization
arXiv:2508.04010v1 Announce Type: new Abstract: Large language models enable agents to autonomously perform tasks in...
Handling Numeric Expressions in Automatic Speech Recognition
arXiv:2408.00004v2 Announce Type: replace-cross Abstract: This paper addresses the problem of correctly formatting numeric expressions...
GuidedBench: Measuring and Mitigating the Evaluation Discrepancies of In-the-wild LLM Jailbreak Methods
arXiv:2502.16903v2 Announce Type: replace Abstract: Despite the growing interest in jailbreak methods as an effective...
Graph-R1: An Agentic GraphRAG Framework for Structured, Multi-Turn Reasoning with Reinforcement Learning
Introduction Large Language Models (LLMs) have set new benchmarks in natural language processing, but their...
Graph-Based Spectral Decomposition for Parameter Coordination in Language Model Fine-Tuning
arXiv:2504.19583v2 Announce Type: replace-cross Abstract: This paper proposes a parameter collaborative optimization algorithm for large...
GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
arXiv:2507.18043v1 Announce Type: new Abstract: Inference-time steering methods offer a lightweight alternative to fine-tuning large...
GPZ: A Next-Generation GPU-Accelerated Lossy Compressor for Large-Scale Particle Data
Particle-based simulations and point-cloud applications are driving a massive expansion in the size and complexity...
GPT-4o Understands Text, But Does It See Clearly? A Benchmarking Study of MFMs on Vision Tasks
Multimodal foundation models (MFMs) like GPT-4o, Gemini, and Claude have shown rapid progress recently, especially...
GPT and Prejudice: A Sparse Approach to Understanding Learned Representations in Large Language Models
arXiv:2510.01252v2 Announce Type: replace Abstract: As large language models (LLMs) are increasingly trained on massive...
Google’s Jules aims to out-code Codex in battle for the AI developer stack
Google released Jules, its coding agent, into beta as autonomous coding agents are quickly gaining...



