YouZum

新闻

新闻

Loss Landscape Degeneracy and Stagewise Development in Transformers

arXiv:2402.02364v3 Announce Type: replace-cross Abstract: Deep learning involves navigating a high-dimensional loss landscape over the...

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

arXiv:2412.18424v3 Announce Type: replace-cross Abstract: Large vision language models (LVLMs) have improved the document understanding...

Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models

arXiv:2507.18263v1 Announce Type: new Abstract: Direct speech translation (ST) has garnered increasing attention nowadays, yet...

LLMs Struggle with Real Conversations: Microsoft and Salesforce Researchers Reveal a 39% Performance Drop in Multi-Turn Underspecified Tasks

Conversational artificial intelligence is centered on enabling large language models (LLMs) to engage in dynamic...

LLMs for Customized Marketing Content Generation and Evaluation at Scale

arXiv:2506.17863v1 Announce Type: new Abstract: Offsite marketing is essential in e-commerce, enabling businesses to reach...

LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models

arXiv:2506.05385v1 Announce Type: new Abstract: Semantic role labeling (SRL) is a crucial task of natural...

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

arXiv:2505.04253v1 Announce Type: new Abstract: Large Language Models~(LLMs) are prone to hallucinations, and Retrieval-Augmented Generation...

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey

arXiv:2505.00753v4 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have sparked growing...

LLM-as-a-Judge: Can Language Models Be Trusted to Evaluate Other Models?

Exploring the promise, pitfalls, and practical applications of using LLMs to automate AI evaluation — from synthetic...

LLM-as-a-Judge for Reference-less Automatic Code Validation and Refinement for Natural Language to Bash in IT Automation

arXiv:2506.11237v1 Announce Type: cross Abstract: In an effort to automatically evaluate and select the best...

LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning

arXiv:2507.08496v1 Announce Type: new Abstract: While large language models (LLMs) have advanced procedural planning for...

Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs

arXiv:2505.09338v1 Announce Type: new Abstract: We observe a novel phenomenon, contextual entrainment, across a wide...

We use cookies to improve your experience and performance on our website. You can learn more at 隱私權政策 and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
zh_CN