YouZum

ニュース

ニュース

MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based Agents

arXiv:2506.21605v1 Announce Type: new Abstract: Recent works have highlighted the significance of memory mechanisms in...

MemAgent: A Reinforcement Learning Framework Redefining Long-Context Processing in LLMs

Handling extremely long documents remains a persistent challenge for large language models (LLMs). Even with...

Meet Cathy Tie, Bride of “China’s Frankenstein”

Since the Chinese biophysicist He Jiankui was released from prison in 2022, he has sought...

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

arXiv:2505.24040v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable performance on various...

MCP and the innovation paradox: Why open standards will save AI from itself

Much like HTTP and REST standardized how web applications connect to services, MCP standardizes how...

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

arXiv:2506.04410v1 Announce Type: cross Abstract: Contemporary approaches to assisted scientific discovery use language models to...

MATCHA: Can Multi-Agent Collaboration Build a Trustworthy Conversational Recommender?

arXiv:2504.20094v1 Announce Type: cross Abstract: In this paper, we propose a multi-agent collaboration framework called...

Masked Gated Linear Unit

arXiv:2506.23225v1 Announce Type: cross Abstract: Gated Linear Units (GLUs) have become essential components in the...

Manus has kick-started an AI agent boom in China

Last year, China saw a boom in foundation models, the do-everything large language models that...
ja