Noticias
Noticias
Training LLMs to be Better Text Embedders through Bidirectional Reconstruction
arXiv:2509.03020v1 Announce Type: new Abstract: Large language models (LLMs) have increasingly been explored as powerful...
Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
arXiv:2508.18370v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated exceptional capabilities when trained...
ToxiFrench: Benchmarking and Enhancing Language Models via CoT Fine-Tuning for French Toxicity Detection
arXiv:2508.11281v1 Announce Type: new Abstract: Detecting toxic content using language models is crucial yet challenging...
Towards Understanding the Cognitive Habits of Large Reasoning Models
arXiv:2506.21571v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs), which autonomously produce a reasoning Chain...
Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models
arXiv:2508.17184v1 Announce Type: new Abstract: Instruction tuning is a pivotal technique for aligning large language...
Toward Safe and Human-Aligned Game Conversational Recommendation via Multi-Agent Decomposition
arXiv:2504.20094v2 Announce Type: replace-cross Abstract: Conversational recommender systems (CRS) have advanced with large language models...
Toward LLM-Supported Automated Assessment of Critical Thinking Subskills
arXiv:2510.12915v1 Announce Type: cross Abstract: Critical thinking represents a fundamental competency in today’s education landscape...
Top Computer Vision CV Blogs & News Websites (2025)
Computer vision moved fast in 2025: new multimodal backbones, larger open datasets, and tighter model–systems...
Top 20 Voice AI Blogs and News Websites 2025: The Ultimate Resource Guide
Voice AI technology has experienced unprecedented growth in 2025, with revolutionary breakthroughs in real-time conversational...
Top 10 Local LLMs (2025): Context Windows, VRAM Targets, and Licenses Compared
Local LLMs matured fast in 2025: open-weight families like Llama 3.1 (128K context length (ctx))...
TokenTiming: A Dynamic Alignment Method for Universal Speculative Decoding Model Pairs
arXiv:2510.15545v1 Announce Type: new Abstract: Accelerating the inference of large language models (LLMs) has been...
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
arXiv:2507.19419v1 Announce Type: new Abstract: Understanding the relationship between training data and model behavior during...
 
				 
				

 
				 
					           
					           
					           
					           
					           
					          