News
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
arXiv:2508.03686v1 Announce Type: new Abstract: Answer verification is crucial not only for evaluating large language...
Comparison of different Unique hard attention transformer models by the formal languages they can recognize
arXiv:2506.03370v1 Announce Type: cross Abstract: This note is a survey of various results on the...
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes
arXiv:2507.13335v2 Announce Type: replace Abstract: Humour, as a complex language form, is derived from myriad...
Combining Evidence and Reasoning for Biomedical Fact-Checking
arXiv:2509.13879v1 Announce Type: new Abstract: Misinformation in healthcare, from vaccine hesitancy to unproven treatments, poses...
Combinatorial Optimization for All: Using LLMs to Aid Non-Experts in Improving Optimization Algorithms
arXiv:2503.10968v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) have shown notable potential in code...
Combating Confirmation Bias: A Unified Pseudo-Labeling Framework for Entity Alignment
arXiv:2307.02075v4 Announce Type: replace-cross Abstract: Entity alignment (EA) aims at identifying equivalent entity pairs across...
Combating Biomedical Misinformation through Multi-modal Claim Detection and Evidence-based Verification
arXiv:2509.13888v1 Announce Type: new Abstract: Misinformation in healthcare, from vaccine hesitancy to unproven treatments, poses...
Collaborative Stance Detection via Small-Large Language Model Consistency Verification
arXiv:2502.19954v2 Announce Type: replace Abstract: Stance detection on social media aims to identify attitudes expressed...
Cognitive Surgery: The Awakening of Implicit Territorial Awareness in LLMs
arXiv:2508.14408v1 Announce Type: new Abstract: Large language models (LLMs) have been shown to possess a...
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
arXiv:2508.00414v1 Announce Type: cross Abstract: General AI Agents are increasingly recognized as foundational frameworks for...
Codev lets enterprises avoid vibe coding hangovers with a team of agents that generate and document code
For many software developers using generative AI, vibe coding is a double-edged sword. The process...
CoCoTen: Detecting Adversarial Inputs to Large Language Models through Latent Space Features of Contextual Co-occurrence Tensors
arXiv:2508.02997v3 Announce Type: replace Abstract: The widespread use of Large Language Models (LLMs) in many...