YouZum

News

News

M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

arXiv:2506.02510v1 Announce Type: new Abstract: Recent breakthroughs in large language models (LLMs) have led to...

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

arXiv:2412.18424v3 Announce Type: replace-cross Abstract: Large vision language models (LVLMs) have improved the document understanding...

LLMs Struggle with Real Conversations: Microsoft and Salesforce Researchers Reveal a 39% Performance Drop in Multi-Turn Underspecified Tasks

Conversational artificial intelligence is centered on enabling large language models (LLMs) to engage in dynamic...

LLMs for Customized Marketing Content Generation and Evaluation at Scale

arXiv:2506.17863v1 Announce Type: new Abstract: Offsite marketing is essential in e-commerce, enabling businesses to reach...

LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models

arXiv:2506.05385v1 Announce Type: new Abstract: Semantic role labeling (SRL) is a crucial task of natural...

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

arXiv:2505.04253v1 Announce Type: new Abstract: Large Language Models~(LLMs) are prone to hallucinations, and Retrieval-Augmented Generation...

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey

arXiv:2505.00753v4 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have sparked growing...

LLM-as-a-Judge: Can Language Models Be Trusted to Evaluate Other Models?

Exploring the promise, pitfalls, and practical applications of using LLMs to automate AI evaluation — from synthetic...

LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation

arXiv:2506.11237v1 Announce Type: cross Abstract: In an effort to automatically evaluate and select the best...

LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning

arXiv:2507.08496v1 Announce Type: new Abstract: While large language models (LLMs) have advanced procedural planning for...

Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs

arXiv:2505.09338v1 Announce Type: new Abstract: We observe a novel phenomenon, contextual entrainment, across a wide...

Like humans, AI is forcing institutions to rethink their purpose

Like people undergoing cognitive migration, institutions must reassess what they were made for in this...
en_US