新闻
新闻
Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation
Multimodal modeling focuses on building systems to understand and generate content across visual and textual...
Salesforce AI Introduces CRMArena-Pro: The First Multi-Turn and Enterprise-Grade Benchmark for LLM Agents
AI agents powered by LLMs show great promise for handling complex business tasks, especially in...
Sakana AI Released ShinkaEvolve: An Open-Source Framework that Evolves Programs for Scientific Discovery with Unprecedented Sample-Efficiency
Table of contents What problem is it actually solving? Does the sample-efficiency claim hold beyond...
Sakana AI Introduces Text-to-LoRA (T2L): A Hypernetwork that Generates Task-Specific LLM Adapters (LoRAs) based on a Text Description of the Task
Transformer models have significantly influenced how AI systems approach tasks in natural language understanding, translation...
SafeMT: Multi-turn Safety for Multimodal Language Models
arXiv:2510.12133v1 Announce Type: new Abstract: With the widespread use of multi-modal Large Language models (MLLMs)...
SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling
arXiv:2508.08211v1 Announce Type: new Abstract: Watermarking LLM-generated text is critical for content attribution and misinformation...
Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions
arXiv:2508.08287v1 Announce Type: new Abstract: Despite the increasing usage of Large Language Models (LLMs) in...
s3: The new RAG framework that trains search agents with minimal data
S3 decouples RAG search from generation, boosting efficiency and generalization for enterprise LLM applications with...
Rubrics as Rewards (RaR): A Reinforcement Learning Framework for Training Language Models with Structured, Multi-Criteria Evaluation Signals
Reinforcement Learning with Verifiable Rewards (RLVR) allows LLMs to perform complex reasoning on tasks with...
RSAC 2025: Why the AI agent era means more demand for CISOS
RSAC 2025 made one thing clear: AI agents are entering security workflows, but boards want...
Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning
arXiv:2505.00001v1 Announce Type: new Abstract: Large Language Models (LLMs) are primarily trained on high-resource natural...
Robust Multimodal Large Language Models Against Modality Conflict
arXiv:2507.07151v1 Announce Type: cross Abstract: Despite the impressive capabilities of multimodal large language models (MLLMs)...






