Noticias
Noticias
High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
arXiv:2506.04051v1 Announce Type: new Abstract: Large Language Models (LLMs) currently respond to every prompt. However...
Handling Numeric Expressions in Automatic Speech Recognition
arXiv:2408.00004v2 Announce Type: replace-cross Abstract: This paper addresses the problem of correctly formatting numeric expressions...
GuidedBench: Measuring and Mitigating the Evaluation Discrepancies of In-the-wild LLM Jailbreak Methods
arXiv:2502.16903v2 Announce Type: replace Abstract: Despite the growing interest in jailbreak methods as an effective...
Graph-Based Spectral Decomposition for Parameter Coordination in Language Model Fine-Tuning
arXiv:2504.19583v2 Announce Type: replace-cross Abstract: This paper proposes a parameter collaborative optimization algorithm for large...
GPT-4o Understands Text, But Does It See Clearly? A Benchmarking Study of MFMs on Vision Tasks
Multimodal foundation models (MFMs) like GPT-4o, Gemini, and Claude have shown rapid progress recently, especially...
Google’s Jules aims to out-code Codex in battle for the AI developer stack
Google released Jules, its coding agent, into beta as autonomous coding agents are quickly gaining...
Google’s AlphaEvolve: The AI agent that reclaimed 0.7% of Google’s compute – and how to copy it
Google’s AlphaEvolve is the epitome of a best-practice AI agent orchestration. It offers a lesson...
Google’s ‘world-model’ bet: building the AI operating layer before Microsoft captures the UI
Google doubles down on its ‘world-model’ vision, racing to build an AI operating layer to...
Google Researchers Introduce LightLab: A Diffusion-Based AI Method for Physically Plausible, Fine-Grained Light Control in Single Images
Manipulating lighting conditions in images post-capture is challenging. Traditional approaches rely on 3D graphics methods...
Google Redefines Computer Science R&D: A Hybrid Research Model that Merges Innovation with Scalable Engineering
Computer science research has evolved into a multidisciplinary effort involving logic, engineering, and data-driven experimentation...
Google quietly launches AI Edge Gallery, letting Android phones run AI without the cloud
Google quietly launched AI Edge Gallery, an experimental Android app that runs AI models offline...
Google just leapfrogged every competitor with mind-blowing AI that can think deeper, shop smarter, and create videos with dialogue
Google unveiled major AI advancements at I/O 2025, including Gemini 2.5 with Deep Think, AI...