YouZum

News

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

arXiv:2507.00432v2 Announce Type: replace-cross Abstract: Math reasoning has become the poster child of progress in...

Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation

arXiv:2506.19998v1 Announce Type: new Abstract: REST APIs play important roles in enriching the action space...

Do reasoning models really “think” or not? Apple research sparks lively debate, response

Ultimately, the big takeaway for ML researchers is that before proclaiming an AI milestone—or obituary—make...

Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation — a Multilingual Perspective

arXiv:2505.06010v1 Announce Type: new Abstract: Current machine translation models provide us with high-quality outputs in...

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

arXiv:2505.16245v3 Announce Type: replace Abstract: Diverse language model responses are crucial for creative generation, open-ended...

DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model

arXiv:2509.09724v1 Announce Type: new Abstract: Technology opportunities are critical information that serve as a foundation...

Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv:2509.15888v3 Announce Type: replace Abstract: Adapting billion-parameter language models to a downstream task is still...

DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech

arXiv:2509.09631v1 Announce Type: cross Abstract: Zero-shot Text-to-Speech (TTS) aims to synthesize high-quality speech that mimics...

Diffusion Language Models Know the Answer Before Decoding

arXiv:2508.19982v1 Announce Type: new Abstract: Diffusion language models (DLMs) have recently emerged as an alternative...

Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models

arXiv:2506.09277v2 Announce Type: replace Abstract: Large Language Models (LLM) have demonstrated the capability of generating...

DICE: Structured Reasoning in LLMs through SLM-Guided Chain-of-Thought Correction

arXiv:2510.09211v2 Announce Type: replace Abstract: When performing reasoning tasks with user-specific requirements, such as strict...

DiaBlo: Diagonal Blocks Are Sufficient For Finetuning

arXiv:2506.03230v1 Announce Type: cross Abstract: Finetuning is a critical step for adapting large language models...
