YouZum

ニュース

ニュース

GPT-4o Understands Text, But Does It See Clearly? A Benchmarking Study of MFMs on Vision Tasks

Multimodal foundation models (MFMs) like GPT-4o, Gemini, and Claude have shown rapid progress recently, especially...

Google’s Jules aims to out-code Codex in battle for the AI developer stack

Google released Jules, its coding agent, into beta as autonomous coding agents are quickly gaining...

Google’s AlphaEvolve: The AI agent that reclaimed 0.7% of Google’s compute – and how to copy it

Google’s AlphaEvolve is the epitome of a best-practice AI agent orchestration. It offers a lesson...

Google’s ‘world-model’ bet: building the AI operating layer before Microsoft captures the UI

Google doubles down on its ‘world-model’ vision, racing to build an AI operating layer to...

Google Researchers Introduce LightLab: A Diffusion-Based AI Method for Physically Plausible, Fine-Grained Light Control in Single Images

Manipulating lighting conditions in images post-capture is challenging. Traditional approaches rely on 3D graphics methods...

Google Redefines Computer Science R&D: A Hybrid Research Model that Merges Innovation with Scalable Engineering

Computer science research has evolved into a multidisciplinary effort involving logic, engineering, and data-driven experimentation...

Google quietly launches AI Edge Gallery, letting Android phones run AI without the cloud

Google quietly launched AI Edge Gallery, an experimental Android app that runs AI models offline...

Google finally launches NotebookLM mobile app at I/O: hands-on, first impressions

As NotebookLM matures, its emerging business-tier capabilities suggest growing alignment with the productivity and compliance…Read...

Google DeepMind Releases Gemma 3n: A Compact, High-Efficiency Multimodal AI Model for Real-Time On-Device Use

Researchers are reimagining how models operate as demand skyrockets for faster, smarter, and more private...
ja