YouZum

News

Training LLMs to be Better Text Embedders through Bidirectional Reconstruction

arXiv:2509.03020v1 Announce Type: new Abstract: Large language models (LLMs) have increasingly been explored as powerful...

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo

arXiv:2508.18370v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated exceptional capabilities when trained...

ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection

arXiv:2508.11281v1 Announce Type: new Abstract: Detecting toxic content using language models is crucial yet challenging...

Towards Understanding the Cognitive Habits of Large Reasoning Models

arXiv:2506.21571v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs), which autonomously produce a reasoning Chain...

Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

arXiv:2508.17184v1 Announce Type: new Abstract: Instruction tuning is a pivotal technique for aligning large language...

Toward Safe and Human-Aligned Game Conversational Recommendation via Multi-Agent Decomposition

arXiv:2504.20094v2 Announce Type: replace-cross Abstract: Conversational recommender systems (CRS) have advanced with large language models...

Toward LLM-Supported Automated Assessment of Critical Thinking Subskills

arXiv:2510.12915v1 Announce Type: cross Abstract: Critical thinking represents a fundamental competency in today’s education landscape...

Top Computer Vision CV Blogs & News Websites (2025)

Computer vision moved fast in 2025: new multimodal backbones, larger open datasets, and tighter model–systems...

Top 20 Voice AI Blogs and News Websites 2025: The Ultimate Resource Guide

Voice AI technology has experienced unprecedented growth in 2025, with revolutionary breakthroughs in real-time conversational...

Top 10 Local LLMs (2025): Context Windows, VRAM Targets, and Licenses Compared

Local LLMs matured fast in 2025: open-weight families like Llama 3.1 (128K context length (ctx))...

TokenTiming: A Dynamic Alignment Method for Universal Speculative Decoding Model Pairs

arXiv:2510.15545v1 Announce Type: new Abstract: Accelerating the inference of large language models (LLMs) has been...

TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability

arXiv:2507.19419v1 Announce Type: new Abstract: Understanding the relationship between training data and model behavior during...
