YouZum

News

MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering

arXiv:2505.24040v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable performance on various...

MCP and the innovation paradox: Why open standards will save AI from itself

Much like HTTP and REST standardized how web applications connect to services, MCP standardizes how...

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

arXiv:2506.04410v1 Announce Type: cross Abstract: Contemporary approaches to assisted scientific discovery use language models to...

MATCHA: Can Multi-Agent Collaboration Build a Trustworthy Conversational Recommender?

arXiv:2504.20094v1 Announce Type: cross Abstract: In this paper, we propose a multi-agent collaboration framework called...

Masked Gated Linear Unit

arXiv:2506.23225v1 Announce Type: cross Abstract: Gated Linear Units (GLUs) have become essential components in the...

Manus has kick-started an AI agent boom in China

Last year, China saw a boom in foundation models, the do-everything large language models that...

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration

arXiv:2506.19835v1 Announce Type: new Abstract: Recent advancements in medical Large Language Models (LLMs) have showcased...

M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

arXiv:2506.02510v1 Announce Type: new Abstract: Recent breakthroughs in large language models (LLMs) have led to...

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

arXiv:2412.18424v3 Announce Type: replace-cross Abstract: Large vision language models (LVLMs) have improved the document understanding...

Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models

arXiv:2507.18263v1 Announce Type: new Abstract: Direct speech translation (ST) has garnered increasing attention nowadays, yet...

LLMs Struggle with Real Conversations: Microsoft and Salesforce Researchers Reveal a 39% Performance Drop in Multi-Turn Underspecified Tasks

Conversational artificial intelligence is centered on enabling large language models (LLMs) to engage in dynamic...