YouZum

ข่าว

ข่าว

Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation — a Multilingual Perspective

arXiv:2505.06010v1 Announce Type: new Abstract: Current machine translation models provide us with high-quality outputs in...

Do Multilingual LLMs have specialized language heads?

arXiv:2602.08625v1 Announce Type: new Abstract: Multilingual large language models (LLMs) have gained significant popularity for...

Do LLMs Truly Understand When a Precedent Is Overruled?

arXiv:2510.20941v1 Announce Type: new Abstract: Large language models (LLMs) with extended context windows show promise...

DNACHUNKER: Learnable Tokenization for DNA Language Models

arXiv:2601.03019v1 Announce Type: cross Abstract: DNA language models have emerged as powerful tools for decoding...

Diversity or Precision? A Deep Dive into Next Token Prediction

arXiv:2512.22955v1 Announce Type: new Abstract: Recent advancements have shown that reinforcement learning (RL) can substantially...

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

arXiv:2505.16245v3 Announce Type: replace Abstract: Diverse language model responses are crucial for creative generation, open-ended...

DiVA: Fine-grained Factuality Verification with Agentic-Discriminative Verifier

arXiv:2601.03605v1 Announce Type: new Abstract: Despite the significant advancements of Large Language Models (LLMs), their...

DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model

arXiv:2509.09724v1 Announce Type: new Abstract: Technology opportunities are critical information that serve as a foundation...

Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv:2509.15888v3 Announce Type: replace Abstract: Adapting billion-parameter language models to a downstream task is still...

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

arXiv:2512.21002v2 Announce Type: replace Abstract: Distilling the capabilities from a large reasoning model (LRM) to...

Distilling Multilingual Vision-Language Models: When Smaller Models Stay Multilingual

arXiv:2510.26271v1 Announce Type: new Abstract: Vision-language models (VLMs) exhibit uneven performance across languages, a problem...

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

arXiv:2601.04377v2 Announce Type: replace Abstract: Retrieval-Augmented Generation (RAG) has emerged as an important means of...

We use cookies to improve your experience and performance on our website. You can learn more at นโยบายความเป็นส่วนตัว and manage your privacy settings by clicking Settings.

ตั้งค่าความเป็นส่วนตัว

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

ยอมรับทั้งหมด
จัดการความเป็นส่วนตัว
  • เปิดใช้งานตลอด

บันทึกการตั้งค่า
th