新闻
新闻
Understanding In-context Learning of Addition via Activation Subspaces
arXiv:2505.05145v2 Announce Type: replace-cross Abstract: To perform in-context learning, language models must extract signals from...
UltraCUA: A Foundation Computer-Use Agents Model that Bridges the Gap between General-Purpose GUI Agents and Specialized API-based Agents
Computer-use agents have been limited to primitives. They click, they type, they scroll. Long action...
Tutorial: Exploring SHAP-IQ Visualizations
In this tutorial, we’ll explore a range of SHAP-IQ visualizations that provide insights into how...
Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions
arXiv:2501.01872v5 Announce Type: replace Abstract: Large language models, despite extensive alignment with human values and...
TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers
arXiv:2505.08402v1 Announce Type: new Abstract: Recently, large language models(LLMs) have played an increasingly important role...
TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs
arXiv:2506.23423v1 Announce Type: new Abstract: Past work has studied the effects of fine-tuning on large...
Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence Calibration and Risk-Controlled Refusal
arXiv:2509.01455v1 Announce Type: new Abstract: Deployed language models must decide not only what to answer...
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv:2505.17826v2 Announce Type: replace-cross Abstract: Trinity-RFT is a general-purpose, unified and easy-to-use framework designed for...
Transforming Wearable Data into Personal Health Insights using Large Language Model Agents
arXiv:2406.06464v3 Announce Type: replace-cross Abstract: Deriving personalized insights from popular wearable trackers requires complex numerical...
Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models
arXiv:2508.03998v1 Announce Type: new Abstract: Successful group meetings, such as those implemented in group behavioral-change...
TransEvalnia: A Prompting-Based System for Fine-Grained, Human-Aligned Translation Evaluation Using LLMs
Translation systems powered by LLMs have become so advanced that they can outperform human translators...
Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors
arXiv:2507.22367v1 Announce Type: new Abstract: Accurate and reliable personality assessment plays a vital role in...
 
				 
				


 
				 
					           
					           
					           
					           
					           
					          