Actualités
Actualités
Can a Small Language Model Predict Kernel Latency, Memory, and Model Accuracy from Code? A New Regression Language Model (RLM) Says Yes
Researchers from Cornell and Google introduce a unified Regression Language Model (RLM) that predicts numeric...
Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance
arXiv:2504.19811v2 Announce Type: replace Abstract: Accurately forecasting the performance of Large Language Models (LLMs) before...
Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding
arXiv:2505.03788v1 Announce Type: new Abstract: We introduce a novel approach for calibrating uncertainty quantification (UQ)...
C-VARC: A Large-Scale Chinese Value Rule Corpus for Value Alignment of Large Language Models
arXiv:2506.01495v5 Announce Type: replace Abstract: Ensuring that Large Language Models (LLMs) align with mainstream human...
ByteDance Researchers Introduce DetailFlow: A 1D Coarse-to-Fine Autoregressive Framework for Faster, Token-Efficient Image Generation
Autoregressive image generation has been shaped by advances in sequential modeling, originally seen in natural...
By putting AI into everything, Google wants to make it invisible
If you want to know where AI is headed, this year’s Google I/O has you...
Busted by the em dash — AI’s favorite punctuation mark, and how it’s blowing your cover
AI is brilliant at polishing and rephrasing. But like a child with glitter glue, you...
Building voice AI that listens to everyone: Transfer learning and synthetic speech in action
Enterprises adopting voice AI must consider not just usability, but inclusion. Supporting users with disabilities...
Building Transformer Models from Scratch with PyTorch (10-day Mini-Course)
Before we begin, let’s make sure you’re in the right place...
Building Task Bots with Self-learning for Enhanced Adaptability, Extensibility, and Factuality
arXiv:2508.19689v1 Announce Type: new Abstract: Developing adaptable, extensible, and accurate task bots with minimal or...
Building Patient Journeys in Hebrew: A Language Model for Clinical Timeline Extraction
arXiv:2512.11502v1 Announce Type: new Abstract: We present a new Hebrew medical language model designed to...
Building Domain-Specific Small Language Models via Guided Data Generation
arXiv:2511.21748v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown remarkable success in supporting...




