Actualités
Actualités
CHEER-Ekman: Fine-grained Embodied Emotion Classification
arXiv:2506.01047v1 Announce Type: new Abstract: Emotions manifest through physical experiences and bodily reactions, yet identifying...
ChartHal: A Fine-grained Framework Evaluating Hallucination of Large Vision Language Models in Chart Understanding
arXiv:2509.17481v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have recently demonstrated remarkable progress, yet...
ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement
arXiv:2509.13282v1 Announce Type: new Abstract: Charts are a crucial visual medium for communicating and representing...
ChainReaction! Structured Approach with Causal Chains as Intermediate Representations for Improved and Explainable Causal Video Question Answering
arXiv:2508.21010v1 Announce Type: cross Abstract: Existing Causal-Why Video Question Answering (VideoQA) models often struggle with...
Chai Discovery Team Releases Chai-2: AI Model Achieves 16% Hit Rate in De Novo Antibody Design
TLDR: Chai Discovery Team introduces Chai-2, a multimodal AI model that enables zero-shot de novo...
Causal2Vec: Improving Decoder-only LLMs as Versatile Embedding Models
arXiv:2507.23386v1 Announce Type: new Abstract: Decoder-only large language models (LLMs) are increasingly used to build...
Catio wins ‘coolest tech’ award at VB Transform 2025
Catio also announced the upcoming launch of Archie, a conversational, multi-agent AI system.Read More...
Capabilities and Evaluation Biases of Large Language Models in Classical Chinese Poetry Generation: A Case Study on Tang Poetry
arXiv:2510.15313v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly applied to creative domains...
Can We Trust Machine Learning? The Reliability of Features from Open-Source Speech Analysis Tools for Speech Modeling
arXiv:2506.11072v1 Announce Type: cross Abstract: Machine learning-based behavioral models rely on features extracted from audio-visual...
Can We Improve Llama 3’s Reasoning Through Post-Training Alone? ASTRO Shows +16% to +20% Benchmark Gains
Improving the reasoning capabilities of large language models (LLMs) without architectural changes is a core...
Can Vision Language Models Infer Human Gaze Direction? A Controlled Study
arXiv:2506.05412v1 Announce Type: cross Abstract: Gaze-referential inference–the ability to infer what others are looking at–is...
Can structural correspondences ground real world representational content in Large Language Models?
arXiv:2506.16370v1 Announce Type: new Abstract: Large Language Models (LLMs) such as GPT-4 produce compelling responses...