News
Salesforce AI Introduces CRMArena-Pro: The First Multi-Turn and Enterprise-Grade Benchmark for LLM Agents
AI agents powered by LLMs show great promise for handling complex business tasks, especially in...
Sakana AI Introduces Text-to-LoRA (T2L): A Hypernetwork that Generates Task-Specific LLM Adapters (LoRAs) based on a Text Description of the Task
Transformer models have significantly influenced how AI systems approach tasks in natural language understanding, translation...
SAEMark: Multi-bit LLM Watermarking with Inference-Time Scaling
arXiv:2508.08211v1 (new): Watermarking LLM-generated text is critical for content attribution and misinformation...
Sacred or Synthetic? Evaluating LLM Reliability and Abstention for Religious Questions
arXiv:2508.08287v1 (new): Despite the increasing usage of Large Language Models (LLMs) in...
s3: The new RAG framework that trains search agents with minimal data
S3 decouples RAG search from generation, boosting efficiency and generalization for enterprise LLM applications with...
Rubrics as Rewards (RaR): A Reinforcement Learning Framework for Training Language Models with Structured, Multi-Criteria Evaluation Signals
Reinforcement Learning with Verifiable Rewards (RLVR) allows LLMs to perform complex reasoning on tasks with...
RSAC 2025: Why the AI agent era means more demand for CISOs
RSAC 2025 made one thing clear: AI agents are entering security workflows, but boards want...
Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning
arXiv:2505.00001v1 (new): Large Language Models (LLMs) are primarily trained on high-resource natural...
Robust Multimodal Large Language Models Against Modality Conflict
arXiv:2507.07151v1 (cross-listed): Despite the impressive capabilities of multimodal large language models (MLLMs)...
RoboBrain 2.0: The Next-Generation Vision-Language Model Unifying Embodied AI for Advanced Robotics
Advancements in artificial intelligence are rapidly closing the gap between digital reasoning and real-world interaction...
Roblox breaks ground on data center in Brazil for early 2026
At Gamescom Latam, Roblox announced it has broken ground on a new data center in Brazil...
ReviewGraph: A Knowledge Graph Embedding Based Framework for Review Rating Prediction with Sentiment Features
arXiv:2508.13953v1 (new): In the hospitality industry, understanding the factors that drive customer...