YouZum

News

News

Waking Up an AI: A Quantitative Framework for Prompt-Induced Phase Transition in Large Language Models

arXiv:2504.21012v1 Announce Type: new Abstract: What underlies intuitive human thinking? One approach to this question...

Voice AI that actually converts: New TTS model boosts sales 15% for major brands

A new spoken language model can quickly generate “infinite” new voices of varying genders, ages...

Vocabulary Customization for Efficient Domain-Specific LLM Deployment

arXiv:2509.26124v1 Announce Type: new Abstract: When using an LLM to process text outside the training...

VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering

arXiv:2507.19995v1 Announce Type: new Abstract: The advent of large language models (LLMs) has led to...

VL-Cogito: Advancing Multimodal Reasoning with Progressive Curriculum Reinforcement Learning

Multimodal reasoning, where models integrate and interpret information from multiple sources such as text, images...

ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding

arXiv:2509.15235v4 Announce Type: replace-cross Abstract: Speculative decoding is a widely adopted technique for accelerating inference...

Visa launches ‘Intelligent Commerce’ platform, letting AI agents swipe your card—safely, it says

Visa launches Intelligent Commerce platform enabling AI assistants to make secure purchases with your credit...

Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics

arXiv:2405.19988v3 Announce Type: replace-cross Abstract: Natural language is often the easiest and most convenient modality...

VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

arXiv:2506.21582v1 Announce Type: new Abstract: Text analytics has traditionally required specialized knowledge in Natural Language...

VeBrain: A Unified Multimodal AI Framework for Visual Reasoning and Real-World Robotic Control

Bridging Perception and Action in Robotics Multimodal Large Language Models (MLLMs) hold promise for enabling...

Using Sentiment Analysis to Investigate Peer Feedback by Native and Non-Native English Speakers

arXiv:2507.22924v1 Announce Type: new Abstract: Graduate-level CS programs in the U.S. increasingly enroll international students...

We use cookies to improve your experience and performance on our website. You can learn more at Privacy Policy and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
en_US