YouZum

News

News

Delving into Multilingual Ethical Bias: The MSQAD with Statistical Hypothesis Tests for Large Language Models

arXiv:2505.19121v2 Announce Type: replace Abstract: Despite the recent strides in large language models, studies have...

DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization

arXiv:2510.18257v1 Announce Type: new Abstract: Prompt Optimization has emerged as a crucial approach due to...

DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments

arXiv:2506.00739v3 Announce Type: replace Abstract: Large language model (LLM) agents have shown impressive capabilities in...

DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection

arXiv:2511.01192v1 Announce Type: new Abstract: Detecting machine-generated text (MGT) has emerged as a critical challenge...

DEER: A Benchmark for Evaluating Deep Research Agents on Expert Report Generation

arXiv:2512.17776v2 Announce Type: replace Abstract: As large language models advance, deep research systems capable of...

DeepSeek Researchers Apply a 1967 Matrix Normalization Algorithm to Fix Instability in Hyper Connections

DeepSeek researchers are trying to solve a precise issue in large language model training. Residual...

DeepSeek R1-0528 arrives in powerful open source challenge to OpenAI o3 and Google Gemini 2.5 Pro

Additionally, the model’s hallucination rate has been reduced, contributing to more reliable and consistent output.Read...

DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For Sparse LLMs

Transformers use attention and Mixture-of-Experts to scale computation, but they still lack a native way...

DeepSeek AI Releases DeepSeek-OCR 2 with Causal Visual Flow Encoder for Layout Aware Document Understanding

DeepSeek AI released DeepSeek-OCR 2, an open source document OCR and understanding system that restructures...

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

arXiv:2506.11763v1 Announce Type: new Abstract: Deep Research Agents are a prominent category of LLM-based agents...

DeepReinforce Team Introduces CUDA-L1: An Automated Reinforcement Learning (RL) Framework for CUDA Optimization Unlocking 3x More Power from GPUs

Estimated reading time: 6 minutes Table of contents The Breakthrough: Contrastive Reinforcement Learning (Contrastive-RL) How...

Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper

arXiv:2602.22658v1 Announce Type: cross Abstract: Deepfake speech utterances can be forged by replacing one or...

We use cookies to improve your experience and performance on our website. You can learn more at Privacy Policy and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
en_US