News
Counting Clues: A Lightweight Probabilistic Baseline Can Match an LLM
arXiv:2512.12868v1 Announce Type: new Abstract: Large language models (LLMs) excel on multiple-choice clinical diagnosis benchmarks...
Counterfactual LLM-based Framework for Measuring Rhetorical Style
arXiv:2512.19908v1 Announce Type: new Abstract: The rise of AI has fueled growing concerns about “hype”...
CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought
arXiv:2502.17214v2 Announce Type: replace Abstract: Large language models (LLMs) excel in many tasks but struggle...
CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning
arXiv:2509.04027v2 Announce Type: replace-cross Abstract: Reinforcement Learning (RL) has become a pivotal approach for enhancing...
CoSyn: The open-source tool that’s making GPT-4V-level vision AI accessible to everyone
Researchers at the University of Pennsylvania and the Allen Institute for Artificial Intelligence have developed...
ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
arXiv:2511.05359v1 Announce Type: cross Abstract: As language models evolve into autonomous agents that act and...
Conversation Forests: The Key to Fine Tuning Large Language Models for Multi-Turn Medical Conversations is Branching
arXiv:2507.04099v2 Announce Type: replace Abstract: Fine-tuning methods such as Direct Preference Optimization (DPO) and Group...
Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning
arXiv:2511.02755v1 Announce Type: new Abstract: Large language models (LLMs) exhibit complementary strengths across domains and...
Controlled Self-Evolution for Algorithmic Code Optimization
arXiv:2601.07348v4 Announce Type: replace Abstract: Self-evolution methods enhance code generation through iterative “generate-verify-refine” cycles, yet...
Context Selection and Rewriting for Video-based Educational Question Generation
arXiv:2504.19406v2 Announce Type: replace Abstract: Educational question generation (EQG) is a crucial component of intelligent...
Context Parametrization with Compositional Adapters
arXiv:2509.22158v1 Announce Type: new Abstract: Large language models (LLMs) often seamlessly adapt to new tasks...
Computational Fact-Checking of Online Discourse: Scoring scientific accuracy in climate change related news articles
arXiv:2505.07409v2 Announce Type: replace Abstract: Democratic societies need reliable information. Misinformation in popular media, such...
