ข่าว

ตุลาคม 21, 2025admin NUAI,Committee,ข่าว,Uncategorized0

Max It or Miss It: Benchmarking LLM On Solving Extremal Problems

arXiv:2510.12997v2 Announce Type: replace-cross Abstract: Test-time scaling has enabled Large Language Models (LLMs) with remarkable...

อ่านเพิ่มเติม

มิถุนายน 6, 2025admin NUAI,Committee,ข่าว,Uncategorized0

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

arXiv:2506.04410v1 Announce Type: cross Abstract: Contemporary approaches to assisted scientific discovery use language models to...

อ่านเพิ่มเติม

เมษายน 29, 2025admin NUAI,Committee,Drone Type,ข่าว,Uncategorized0

Master Generative AI in 2025 | Live Online Training

Continue reading on Medium »...

อ่านเพิ่มเติม

กรกฎาคม 1, 2025admin NUAI,Committee,ข่าว,Uncategorized0

Masked Gated Linear Unit

arXiv:2506.23225v1 Announce Type: cross Abstract: Gated Linear Units (GLUs) have become essential components in the...

อ่านเพิ่มเติม

กันยายน 22, 2025admin NUAI,Committee,ข่าว,Uncategorized0

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

arXiv:2509.16197v1 Announce Type: cross Abstract: Unified multimodal Large Language Models (LLMs) that can both understand...

อ่านเพิ่มเติม

มิถุนายน 6, 2025admin NUAI,Committee,ข่าว,Uncategorized0

Manus has kick-started an AI agent boom in China

Last year, China saw a boom in foundation models, the do-everything large language models that...

อ่านเพิ่มเติม

มิถุนายน 25, 2025admin NUAI,Committee,ข่าว,Uncategorized0

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration

arXiv:2506.19835v1 Announce Type: new Abstract: Recent advancements in medical Large Language Models (LLMs) have showcased...

อ่านเพิ่มเติม

กันยายน 16, 2025admin NUAI,Committee,ข่าว,Uncategorized0

MALLM: Multi-Agent Large Language Models Framework

arXiv:2509.11656v1 Announce Type: cross Abstract: Multi-agent debate (MAD) has demonstrated the ability to augment collective...

อ่านเพิ่มเติม

ตุลาคม 10, 2025admin NUAI,Committee,ข่าว,Uncategorized0

MacroBench: A Novel Testbed for Web Automation Scripts via Large Language Models

arXiv:2510.04363v2 Announce Type: replace-cross Abstract: We introduce MacroBench, a code-first benchmark that evaluates whether LLMs...

อ่านเพิ่มเติม

มิถุนายน 4, 2025admin NUAI,Committee,ข่าว,Uncategorized0

M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

arXiv:2506.02510v1 Announce Type: new Abstract: Recent breakthroughs in large language models (LLMs) have led to...

อ่านเพิ่มเติม

สิงหาคม 4, 2025admin NUAI,Committee,ข่าว,Uncategorized0

Loss Landscape Degeneracy and Stagewise Development in Transformers

arXiv:2402.02364v3 Announce Type: replace-cross Abstract: Deep learning involves navigating a high-dimensional loss landscape over the...

อ่านเพิ่มเติม

กรกฎาคม 16, 2025admin NUAI,Committee,ข่าว,Uncategorized0

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

arXiv:2412.18424v3 Announce Type: replace-cross Abstract: Large vision language models (LVLMs) have improved the document understanding...

อ่านเพิ่มเติม

ข่าว

ข่าว

Max It or Miss It: Benchmarking LLM On Solving Extremal Problems

Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science

Master Generative AI in 2025 | Live Online Training

Masked Gated Linear Unit

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Manus has kick-started an AI agent boom in China

MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration

MALLM: Multi-Agent Large Language Models Framework

MacroBench: A Novel Testbed for Web Automation Scripts via Large Language Models

M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

Loss Landscape Degeneracy and Stagewise Development in Transformers

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

บริการของเรา

หน้าแรก

วิธีการทำงาน

ข่าว

แพ็กเกจราคา

ฝ่ายสนับสนุน

ศูนย์ช่วยเหลือ

รายงานปัญหา

ให้ความคิดเห็น

นโยบายความเป็นส่วนตัว

บัญชีผู้ใช้

ติดตามเรา