Nachrichten
Nachrichten
MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering
arXiv:2505.24040v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable performance on various...
MCP and the innovation paradox: Why open standards will save AI from itself
Much like HTTP and REST standardized how web applications connect to services, MCP standardizes how...
Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science
arXiv:2506.04410v1 Announce Type: cross Abstract: Contemporary approaches to assisted scientific discovery use language models to...
MATCHA: Can Multi-Agent Collaboration Build a Trustworthy Conversational Recommender?
arXiv:2504.20094v1 Announce Type: cross Abstract: In this paper, we propose a multi-agent collaboration framework called...
Master Generative AI in 2025 | Live Online Training
Continue reading on Medium »...
Masked Gated Linear Unit
arXiv:2506.23225v1 Announce Type: cross Abstract: Gated Linear Units (GLUs) have become essential components in the...
Manus has kick-started an AI agent boom in China
Last year, China saw a boom in foundation models, the do-everything large language models that...
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration
arXiv:2506.19835v1 Announce Type: new Abstract: Recent advancements in medical Large Language Models (LLMs) have showcased...
M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset
arXiv:2506.02510v1 Announce Type: new Abstract: Recent breakthroughs in large language models (LLMs) have led to...
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating
arXiv:2412.18424v3 Announce Type: replace-cross Abstract: Large vision language models (LVLMs) have improved the document understanding...
Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models
arXiv:2507.18263v1 Announce Type: new Abstract: Direct speech translation (ST) has garnered increasing attention nowadays, yet...
LLMs Struggle with Real Conversations: Microsoft and Salesforce Researchers Reveal a 39% Performance Drop in Multi-Turn Underspecified Tasks
Conversational artificial intelligence is centered on enabling large language models (LLMs) to engage in dynamic...