Notizie
Notizie
Faster and Better LLMs via Latency-Aware Test-Time Scaling
arXiv:2505.19634v4 Announce Type: replace Abstract: Test-Time Scaling (TTS) has proven effective in improving the performance...
Fast, Slow, and Tool-augmented Thinking for LLMs: A Review
arXiv:2508.12265v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable progress in reasoning...
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning
arXiv:2505.08054v1 Announce Type: new Abstract: Safety alignment approaches in large language models (LLMs) often lead...
False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize
arXiv:2509.03888v1 Announce Type: new Abstract: Large Language Models (LLMs) can comply with harmful instructions, raising...
Fair-GPTQ: Bias-Aware Quantization for Large Language Models
arXiv:2509.15206v1 Announce Type: new Abstract: High memory demands of generative language models have drawn attention...
Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones
arXiv:2507.00322v1 Announce Type: new Abstract: Despite remarkable advances in coding capabilities, language models (LMs) still...
Exploring the Escalation of Source Bias in User, Data, and Recommender System Feedback Loop
arXiv:2405.17998v2 Announce Type: replace-cross Abstract: Recommender systems are essential for information access, allowing users to...
Exploring Procedural Data Generation for Automatic Acoustic Guitar Fingerpicking Transcription
arXiv:2508.07987v1 Announce Type: cross Abstract: Automatic transcription of acoustic guitar fingerpicking performances remains a challenging...
Exploring LLM Autoscoring Reliability in Large-Scale Writing Assessments Using Generalizability Theory
arXiv:2507.19980v1 Announce Type: new Abstract: This study investigates the estimation of reliability for large language...
Exploring Cross-Lingual Knowledge Transfer via Transliteration-Based MLM Fine-Tuning for Critically Low-resource Chakma Language
arXiv:2510.09032v1 Announce Type: new Abstract: As an Indo-Aryan language with limited available data, Chakma remains...
Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm
arXiv:2509.23946v2 Announce Type: replace-cross Abstract: Chain-of-Thought (CoT) and its variants have markedly advanced the reasoning...
Exploration of Plan-Guided Summarization for Narrative Texts: the Case of Small Language Models
arXiv:2504.09071v2 Announce Type: replace Abstract: Plan-guided summarization attempts to reduce hallucinations in small language models...