News

September 29, 2025admin NUAI,Committee,News,Uncategorized0

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

arXiv:2509.22461v1 Announce Type: cross Abstract: The ability to reason from audio, including speech, paralinguistic cues...

August 23, 2025admin NUAI,Committee,News,Uncategorized0

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

A new benchmark from Salesforce research evaluates model and agentic performance on real-life enterprise tasks.Read...

May 11, 2025admin NUAI,Committee,News,Uncategorized0

MCP and the innovation paradox: Why open standards will save AI from itself

Much like HTTP and REST standardized how web applications connect to services, MCP standardizes how...

August 8, 2025admin NUAI,Committee,News,Uncategorized0

McBE: A Multi-task Chinese Bias Evaluation Benchmark for Large Language Models

arXiv:2507.02088v2 Announce Type: replace Abstract: As large language models (LLMs) are increasingly applied to various...

October 21, 2025admin NUAI,Committee,News,Uncategorized0

Max It or Miss It: Benchmarking LLM On Solving Extremal Problems

arXiv:2510.12997v2 Announce Type: replace-cross Abstract: Test-time scaling has enabled Large Language Models (LLMs) with remarkable...

June 6, 2025admin NUAI,Committee,News,Uncategorized0

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

arXiv:2506.04410v1 Announce Type: cross Abstract: Contemporary approaches to assisted scientific discovery use language models to...

April 29, 2025admin NUAI,Committee,Drone Type,News,Uncategorized0

Master Generative AI in 2025 | Live Online Training

Continue reading on Medium »...

July 1, 2025admin NUAI,Committee,News,Uncategorized0

Masked Gated Linear Unit

arXiv:2506.23225v1 Announce Type: cross Abstract: Gated Linear Units (GLUs) have become essential components in the...

September 22, 2025admin NUAI,Committee,News,Uncategorized0

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

arXiv:2509.16197v1 Announce Type: cross Abstract: Unified multimodal Large Language Models (LLMs) that can both understand...

June 6, 2025admin NUAI,Committee,News,Uncategorized0

Manus has kick-started an AI agent boom in China

Last year, China saw a boom in foundation models, the do-everything large language models that...

June 25, 2025admin NUAI,Committee,News,Uncategorized0

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration

arXiv:2506.19835v1 Announce Type: new Abstract: Recent advancements in medical Large Language Models (LLMs) have showcased...

September 16, 2025admin NUAI,Committee,News,Uncategorized0

MALLM: Multi-Agent Large Language Models Framework

arXiv:2509.11656v1 Announce Type: cross Abstract: Multi-agent debate (MAD) has demonstrated the ability to augment collective...

News

News

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark

MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

MCP and the innovation paradox: Why open standards will save AI from itself

McBE: A Multi-task Chinese Bias Evaluation Benchmark for Large Language Models

Max It or Miss It: Benchmarking LLM On Solving Extremal Problems

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

Master Generative AI in 2025 | Live Online Training

Masked Gated Linear Unit

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Manus has kick-started an AI agent boom in China

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration

MALLM: Multi-Agent Large Language Models Framework

Our Services

Home

How it work

News

Pricing

Support

Help Center

Report an Issue

Give Feedback

Privacy Policy

User Account

Follow Us