Noticias
Noticias
Exploiting Adaptive Contextual Masking for Aspect-Based Sentiment Analysis
arXiv:2402.13722v2 Announce Type: replace Abstract: Aspect-Based Sentiment Analysis (ABSA) is a fine-grained linguistics problem that...
Explaining Length Bias in LLM-Based Preference Evaluations
arXiv:2407.01085v4 Announce Type: replace-cross Abstract: The use of large language models (LLMs) as judges, particularly...
Explaining Large Language Models with gSMILE
arXiv:2505.21657v5 Announce Type: replace Abstract: Large Language Models (LLMs) such as GPT, LLaMA, and Claude...
Expanding the WMT24++ Benchmark with Rumantsch Grischun, Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader
arXiv:2509.03148v2 Announce Type: replace Abstract: The Romansh language, spoken in Switzerland, has limited resources for...
ExCyTIn-Bench: Evaluating LLM agents on Cyber Threat Investigation
arXiv:2507.14201v2 Announce Type: replace-cross Abstract: We present ExCyTIn-Bench, the first benchmark to Evaluate an LLM...
Everything you need to know about estimating AI’s energy and emissions burden
When we set out to write a story on the best available estimates for AI’s...
Everyone’s looking to get in on vibe coding — and Google is no different with Stitch, its follow-up to Jules
Google is looking to compete in vibe coding with Stitch, which designs user interfaces (UIs)...
EventHunter: Dynamic Clustering and Ranking of Security Events from Hacker Forum Discussions
arXiv:2507.09762v1 Announce Type: cross Abstract: Hacker forums provide critical early warning signals for emerging cybersecurity...
Evaluation of LLM Vulnerabilities to Being Misused for Personalized Disinformation Generation
arXiv:2412.13666v2 Announce Type: replace Abstract: The capabilities of recent large language models (LLMs) to generate...
Evaluation and Facilitation of Online Discussions in the LLM Era: A Survey
arXiv:2503.01513v3 Announce Type: replace Abstract: We present a survey of methods for assessing and enhancing...
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
arXiv:2412.09645v3 Announce Type: replace-cross Abstract: Recent advancements in visual generative models have enabled high-quality image...
Evaluating Speech-to-Text x LLM x Text-to-Speech Combinations for AI Interview Systems
arXiv:2507.16835v1 Announce Type: cross Abstract: Voice-based conversational AI systems increasingly rely on cascaded architectures combining...