YouZum

News

News

Echoes of BERT: Do Modern Language Models Rediscover the Classical NLP Pipeline?

arXiv:2506.02132v4 Announce Type: replace Abstract: Large transformer-based language models dominate modern NLP, yet our understanding...

Echoes of Automation: The Increasing Use of LLMs in Newsmaking

arXiv:2508.06445v2 Announce Type: replace Abstract: The rapid rise of Generative AI (GenAI), particularly LLMs, poses...

Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization

arXiv:2505.24229v1 Announce Type: new Abstract: Inverse Text Normalization (ITN) is crucial for converting spoken Automatic...

Dynamic Acoustic Model Architecture Optimization in Training for ASR

arXiv:2506.13180v2 Announce Type: replace Abstract: Architecture design is inherently complex. Existing approaches rely on either...

DVAGen: Dynamic Vocabulary Augmented Generation

arXiv:2510.17115v1 Announce Type: new Abstract: Language models trained with a fixed vocabulary struggle to generalize...

DualDistill and Agentic-R1: How AI Combines Natural Language and Tool Use for Superior Math Problem Solving

Existing long-CoT reasoning models have achieved state-of-the-art performance in mathematical reasoning by generating reasoning trajectories...

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

arXiv:2507.00432v2 Announce Type: replace-cross Abstract: Math reasoning has become the poster child of progress in...

Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation

arXiv:2506.19998v1 Announce Type: new Abstract: REST APIs play important roles in enriching the action space...

Do reasoning models really “think” or not? Apple research sparks lively debate, response

Ultimately, the big takeaway for ML researchers is that before proclaiming an AI milestone—or obituary—make...

Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation — a Multilingual Perspective

arXiv:2505.06010v1 Announce Type: new Abstract: Current machine translation models provide us with high-quality outputs in...

Diverse, not Short: A Length-Controlled Data Selection Strategy for Improving Response Diversity of Language Models

arXiv:2505.16245v3 Announce Type: replace Abstract: Diverse language model responses are crucial for creative generation, open-ended...

DiTTO-LLM: Framework for Discovering Topic-based Technology Opportunities via Large Language Model

arXiv:2509.09724v1 Announce Type: new Abstract: Technology opportunities are critical information that serve as a foundation...

We use cookies to improve your experience and performance on our website. You can learn more at Privacy Policy and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
en_US