ข่าว
ข่าว
Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion
arXiv:2506.01365v1 Announce Type: cross Abstract: Voice Activity Detection (VAD) plays a key role in speech...
AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR
arXiv:2506.14190v1 Announce Type: new Abstract: Developing code-switched ASR systems is challenging due to language ambiguity...
AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science
arXiv:2506.13992v1 Announce Type: cross Abstract: Large language models (LLMs) have advanced the automation of data...
Apple Researchers Reveal Structural Failures in Large Reasoning Models Using Puzzle-Based Evaluation
Artificial intelligence has undergone a significant transition from basic language models to advanced models that...
Apple makes major AI advance with image generation technology rivaling DALL-E and Midjourney
Apple researchers develop STARFlow, a breakthrough AI image generation system that challenges diffusion models used...
Apple and Duke Researchers Present a Reinforcement Learning Approach That Enables LLMs to Provide Intermediate Answers, Enhancing Speed and Accuracy
Long CoT reasoning improves large language models’ performance on complex tasks but comes with drawbacks...
Anthropic study: Leading AI models show up to 96% blackmail rate against executives
Anthropic research reveals AI models from OpenAI, Google, Meta and others chose blackmail, corporate espionage...
Anthropic debuts Claude conversational voice mode on mobile that searches your Google Docs, Drive, Calendar
With the rollout of voice mode, Anthropic continues to broaden Claude’s functionality and accessibility to...
Announcing the 2025 finalists for VentureBeat Women in AI Awards
Announcing the finalists for the 2025 women in AI awards.Read More...
Announcing our 2025 VB Transform Innovation Showcase finalists
Seven companies will be sharing their latest AI innovations from the main stage at VB...
Analyzing and Improving Speaker Similarity Assessment for Speech Synthesis
arXiv:2507.02176v1 Announce Type: cross Abstract: Modeling voice identity is challenging due to its multifaceted nature...
Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model
arXiv:2503.13575v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) possess encompassing capabilities that can process...