Actualités
Actualités
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
arXiv:2508.15096v1 Announce Type: new Abstract: Pretraining large language models (LLMs) on high-quality, structured data such...
Nebius AI Advances Open-Weight LLMs Through Reinforcement Learning for Capable SWE Agents
The landscape of software engineering automation is evolving rapidly, driven by advances in Large Language...
Natural language processing for African languages
arXiv:2507.00297v1 Announce Type: new Abstract: Recent advances in word embeddings and language models use large-scale...
Native RAG vs. Agentic RAG: Which Approach Advances Enterprise AI Decision-Making?
Retrieval-Augmented Generation (RAG) has emerged as a cornerstone technique for enhancing Large Language Models (LLMs)...
National University of Singapore Researchers Introduce Dimple: A Discrete Diffusion Multimodal Language Model for Efficient and Controllable Text Generation
In recent months, there has been growing interest in applying diffusion models—originally designed for continuous...
Narrowing the Gap: Supervised Fine-Tuning of Open-Source LLMs as a Viable Alternative to Proprietary Models for Pedagogical Tools
arXiv:2507.05305v1 Announce Type: cross Abstract: Frontier Large language models (LLMs) like ChatGPT and Gemini can...
Multimodal LLMs Without Compromise: Researchers from UCLA, UW–Madison, and Adobe Introduce X-Fusion to Add Vision to Frozen Language Models Without Losing Language Capabilities
LLMs have made significant strides in language-related tasks such as conversational AI, reasoning, and code...
Multimodal Large Language Models Meet Multimodal Emotion Recognition and Reasoning: A Survey
arXiv:2509.24322v1 Announce Type: new Abstract: In recent years, large language models (LLMs) have driven major...
Multimodal Foundation Models Fall Short on Physical Reasoning: PHYX Benchmark Highlights Key Limitations in Visual and Symbolic Integration
State-of-the-art models show human-competitive accuracy on AIME, GPQA, MATH-500, and OlympiadBench, solving Olympiad-level problems. Recent...
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach
arXiv:2505.07902v1 Announce Type: cross Abstract: Classroom discourse is an essential vehicle through which teaching and...
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
arXiv:2401.16092v4 Announce Type: replace Abstract: Text-to-image generation models have recently achieved astonishing results in image...
Multilingual Machine Translation with Quantum Encoder Decoder Attention-based Convolutional Variational Circuits
arXiv:2505.09407v1 Announce Type: new Abstract: Cloud-based multilingual translation services like Google Translate and Microsoft Translator...
 
				 
				




 
				 
					           
					           
					           
					           
					           
					          