News
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization
arXiv:2505.24229v1 Announce Type: new Abstract: Inverse Text Normalization (ITN) is crucial for converting spoken Automatic...
Dynamic Acoustic Model Architecture Optimization in Training for ASR
arXiv:2506.13180v2 Announce Type: replace Abstract: Architecture design is inherently complex. Existing approaches rely on either...
Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation
arXiv:2506.19998v1 Announce Type: new Abstract: REST APIs play important roles in enriching the action space...
Do reasoning models really “think” or not? Apple research sparks lively debate, response
Ultimately, the big takeaway for ML researchers is that before proclaiming an AI milestone—or obituary—make...
Do Not Change Me: On Transferring Entities Without Modification in Neural Machine Translation — a Multilingual Perspective
arXiv:2505.06010v1 Announce Type: new Abstract: Current machine translation models provide us with high-quality outputs in...
Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models
arXiv:2506.09277v2 Announce Type: replace Abstract: Large Language Models (LLM) have demonstrated the capability of generating...
DiaBlo: Diagonal Blocks Are Sufficient For Finetuning
arXiv:2506.03230v1 Announce Type: cross Abstract: Finetuning is a critical step for adapting large language models...
Detecting Sockpuppetry on Wikipedia Using Meta-Learning
arXiv:2506.10314v1 Announce Type: cross Abstract: Malicious sockpuppet detection on Wikipedia is critical to preserving access...
Demystifying ChatGPT: How It Masters Genre Recognition
arXiv:2507.03875v1 Announce Type: new Abstract: The introduction of ChatGPT has garnered significant attention within the...
Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models
arXiv:2505.19121v2 Announce Type: replace Abstract: Despite the recent strides in large language models, studies have...
DeepSeek R1-0528 arrives in powerful open source challenge to OpenAI o3 and Google Gemini 2.5 Pro
Additionally, the model’s hallucination rate has been reduced, contributing to more reliable and consistent output.
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
arXiv:2506.11763v1 Announce Type: new Abstract: Deep Research Agents are a prominent category of LLM-based agents...