YouZum

新闻

新闻

Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization

arXiv:2505.24229v1 Announce Type: new Abstract: Inverse Text Normalization (ITN) is crucial for converting spoken Automatic...

Dynamic Acoustic Model Architecture Optimization in Training for ASR

arXiv:2506.13180v2 Announce Type: replace Abstract: Architecture design is inherently complex. Existing approaches rely on either...

DVAGen: Dynamic Vocabulary Augmented Generation

arXiv:2510.17115v1 Announce Type: new Abstract: Language models trained with a fixed vocabulary struggle to generalize...

DualDistill and Agentic-R1: How AI Combines Natural Language and Tool Use for Superior Math Problem Solving

Existing long-CoT reasoning models have achieved state-of-the-art performance in mathematical reasoning by generating reasoning trajectories...

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

arXiv:2507.00432v2 Announce Type: replace-cross Abstract: Math reasoning has become the poster child of progress in...

Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation

arXiv:2506.19998v1 Announce Type: new Abstract: REST APIs play important roles in enriching the action space...

Do reasoning models really “think” or not? Apple research sparks lively debate, response

Ultimately, the big takeaway for ML researchers is that before proclaiming an AI milestone—or obituary—make...

Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation — a Multilingual Perspective

arXiv:2505.06010v1 Announce Type: new Abstract: Current machine translation models provide us with high-quality outputs in...

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

arXiv:2505.16245v3 Announce Type: replace Abstract: Diverse language model responses are crucial for creative generation, open-ended...

DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model

arXiv:2509.09724v1 Announce Type: new Abstract: Technology opportunities are critical information that serve as a foundation...

Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv:2509.15888v3 Announce Type: replace Abstract: Adapting billion-parameter language models to a downstream task is still...

DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech

arXiv:2509.09631v1 Announce Type: cross Abstract: Zero-shot Text-to-Speech (TTS) aims to synthesize high-quality speech that mimics...

We use cookies to improve your experience and performance on our website. You can learn more at 隱私權政策 and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
zh_CN