YouZum

Actualités

Actualités

VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

arXiv:2512.15649v2 Announce Type: replace-cross Abstract: The computational and memory overheads associated with expanding the context...

Voice AI that actually converts: New TTS model boosts sales 15% for major brands

A new spoken language model can quickly generate “infinite” new voices of varying genders, ages...

Vocabulary Customization for Efficient Domain-Specific LLM Deployment

arXiv:2509.26124v1 Announce Type: new Abstract: When using an LLM to process text outside the training...

VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering

arXiv:2507.19995v1 Announce Type: new Abstract: The advent of large language models (LLMs) has led to...

vLLM vs TensorRT-LLM vs HF TGI vs LMDeploy, A Deep Technical Comparison for Production LLM Inference

Production LLM serving is now a systems problem, not a generate() loop. For real workloads...

VL-Cogito: Advancing Multimodal Reasoning with Progressive Curriculum Reinforcement Learning

Multimodal reasoning, where models integrate and interpret information from multiple sources such as text, images...

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

arXiv:2510.27623v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have advanced embodied agents by...

ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding

arXiv:2509.15235v4 Announce Type: replace-cross Abstract: Speculative decoding is a widely adopted technique for accelerating inference...

Visa launches ‘Intelligent Commerce’ platform, letting AI agents swipe your card—safely, it says

Visa launches Intelligent Commerce platform enabling AI assistants to make secure purchases with your credit...

Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics

arXiv:2405.19988v3 Announce Type: replace-cross Abstract: Natural language is often the easiest and most convenient modality...

VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

arXiv:2506.21582v1 Announce Type: new Abstract: Text analytics has traditionally required specialized knowledge in Natural Language...

Verdict: A Library for Scaling Judge-Time Compute

arXiv:2502.18018v2 Announce Type: replace Abstract: The use of LLMs as automated judges (“LLM-as-a-judge”) is now...

We use cookies to improve your experience and performance on our website. You can learn more at Politique de confidentialité and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
fr_FR