News
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
arXiv:2512.15649v2 Announce Type: replace-cross Abstract: The computational and memory overheads associated with expanding the context...
Voice AI that actually converts: New TTS model boosts sales 15% for major brands
A new spoken language model can quickly generate “infinite” new voices of varying genders, ages...
Vocabulary Customization for Efficient Domain-Specific LLM Deployment
arXiv:2509.26124v1 Announce Type: new Abstract: When using an LLM to process text outside the training...
VLQA: The First Comprehensive, Large, and High-Quality Vietnamese Dataset for Legal Question Answering
arXiv:2507.19995v1 Announce Type: new Abstract: The advent of large language models (LLMs) has led to...
vLLM vs TensorRT-LLM vs HF TGI vs LMDeploy, A Deep Technical Comparison for Production LLM Inference
Production LLM serving is now a systems problem, not a generate() loop. For real workloads...
VL-Cogito: Advancing Multimodal Reasoning with Progressive Curriculum Reinforcement Learning
Multimodal reasoning, where models integrate and interpret information from multiple sources such as text, images...
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
arXiv:2510.27623v1 Announce Type: cross Abstract: Multimodal large language models (MLLMs) have advanced embodied agents by...
ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding
arXiv:2509.15235v4 Announce Type: replace-cross Abstract: Speculative decoding is a widely adopted technique for accelerating inference...
Visa launches ‘Intelligent Commerce’ platform, letting AI agents swipe your card—safely, it says
Visa launches Intelligent Commerce platform enabling AI assistants to make secure purchases with your credit...
Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
arXiv:2405.19988v3 Announce Type: replace-cross Abstract: Natural language is often the easiest and most convenient modality...
VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
arXiv:2506.21582v1 Announce Type: new Abstract: Text analytics has traditionally required specialized knowledge in Natural Language...
Verdict: A Library for Scaling Judge-Time Compute
arXiv:2502.18018v2 Announce Type: replace Abstract: The use of LLMs as automated judges (“LLM-as-a-judge”) is now...



