新闻

5 月 17, 2025admin NUAI,Committee,新闻,Uncategorized0

Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation

Multimodal modeling focuses on building systems to understand and generate content across visual and textual...

6 月 6, 2025admin NUAI,Committee,新闻,Uncategorized0

Salesforce AI Introduces CRMArena-Pro: The First Multi-Turn and Enterprise-Grade Benchmark for LLM Agents

AI agents powered by LLMs show great promise for handling complex business tasks, especially in...

9 月 27, 2025admin NUAI,Committee,新闻,Uncategorized0

Sakana AI Released ShinkaEvolve: An Open-Source Framework that Evolves Programs for Scientific Discovery with Unprecedented Sample-Efficiency

Table of contents What problem is it actually solving? Does the sample-efficiency claim hold beyond...

6 月 15, 2025admin NUAI,Committee,新闻,Uncategorized0

Sakana AI Introduces Text-to-LoRA (T2L): A Hypernetwork that Generates Task-Specific LLM Adapters (LoRAs) based on a Text Description of the Task

Transformer models have significantly influenced how AI systems approach tasks in natural language understanding, translation...

10 月 15, 2025admin NUAI,Committee,新闻,Uncategorized0

SafeMT: Multi-turn Safety for Multimodal Language Models

arXiv:2510.12133v1 Announce Type: new Abstract: With the widespread use of multi-modal Large Language models (MLLMs)...

8 月 12, 2025admin NUAI,Committee,新闻,Uncategorized0

SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling

arXiv:2508.08211v1 Announce Type: new Abstract: Watermarking LLM-generated text is critical for content attribution and misinformation...

8 月 13, 2025admin NUAI,Committee,新闻,Uncategorized0

Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions

arXiv:2508.08287v1 Announce Type: new Abstract: Despite the increasing usage of Large Language Models (LLMs) in...

5 月 29, 2025admin NUAI,Committee,新闻,Uncategorized0

s3: The new RAG framework that trains search agents with minimal data

S3 decouples RAG search from generation, boosting efficiency and generalization for enterprise LLM applications with...

7 月 30, 2025admin NUAI,Committee,新闻,Uncategorized0

Rubrics as Rewards (RaR): A Reinforcement Learning Framework for Training Language Models with Structured, Multi-Criteria Evaluation Signals

Reinforcement Learning with Verifiable Rewards (RLVR) allows LLMs to perform complex reasoning on tasks with...

5 月 3, 2025admin NUAI,Committee,新闻,Uncategorized0

RSAC 2025: Why the AI agent era means more demand for CISOS

RSAC 2025 made one thing clear: AI agents are entering security workflows, but boards want...

5 月 2, 2025admin NUAI,Committee,新闻,Uncategorized0

Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning

arXiv:2505.00001v1 Announce Type: new Abstract: Large Language Models (LLMs) are primarily trained on high-resource natural...

7 月 11, 2025admin NUAI,Committee,新闻,Uncategorized0

Robust Multimodal Large Language Models Against Modality Conflict

arXiv:2507.07151v1 Announce Type: cross Abstract: Despite the impressive capabilities of multimodal large language models (MLLMs)...

新闻

新闻

Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation

Salesforce AI Introduces CRMArena-Pro: The First Multi-Turn and Enterprise-Grade Benchmark for LLM Agents

Sakana AI Released ShinkaEvolve: An Open-Source Framework that Evolves Programs for Scientific Discovery with Unprecedented Sample-Efficiency

Sakana AI Introduces Text-to-LoRA (T2L): A Hypernetwork that Generates Task-Specific LLM Adapters (LoRAs) based on a Text Description of the Task

SafeMT: Multi-turn Safety for Multimodal Language Models

SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling

Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions

s3: The new RAG framework that trains search agents with minimal data

Rubrics as Rewards (RaR): A Reinforcement Learning Framework for Training Language Models with Structured, Multi-Criteria Evaluation Signals

RSAC 2025: Why the AI agent era means more demand for CISOS

Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning

Robust Multimodal Large Language Models Against Modality Conflict

我们的服务

首页

工作原理

新闻

定价

支持

幫助中心

报告问题

提供反馈

隱私權政策

用户账户

关注我们