YouZum

News

SwiReasoning: Entropy-Driven Alternation of Latent and Explicit Chain-of-Thought for Reasoning LLMs

SwiReasoning is a decoding-time framework that lets a reasoning LLM decide when to think in...

SwiftMem: Fast Agentic Memory via Query-aware Indexing

arXiv:2601.08160v1 Announce Type: new Abstract: Agentic memory systems have become critical for enabling LLM agents...

SWE-Bench Performance Reaches 50.8% Without Tool Use: A Case for Monolithic State-in-Context Agents

Recent advancements in LM agents have shown promising potential for automating intricate real-world tasks. These...

Surprise Calibration for Better In-Context Learning

arXiv:2506.12796v1 Announce Type: new Abstract: In-context learning (ICL) has emerged as a powerful paradigm for...

Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation

arXiv:2311.01766v5 Announce Type: replace Abstract: Mis- and disinformation online have become a major societal problem...

Superintelligence: Unlocking the Mysteries of the Future

Imagine a future where machines don’t just outperform humans in specific tasks but fundamentally outthink...

Style Over Story: A Process-Oriented Study of Authorial Creativity in Large Language Models

arXiv:2510.02025v2 Announce Type: replace Abstract: Evaluations of large language models (LLMs)’ creativity have focused primarily...

Studying the Effects of Collaboration in Interactive Theme Discovery Systems

arXiv:2408.09030v4 Announce Type: replace Abstract: NLP-assisted solutions have gained considerable traction to support qualitative data...

Stronger Normalization-Free Transformers

arXiv:2512.10938v1 Announce Type: cross Abstract: Although normalization layers have long been viewed as indispensable components...

StreamTensor: A PyTorch-to-Accelerator Compiler that Streams LLM Intermediates Across FPGA Dataflows

Why treat LLM inference as batched kernels to DRAM when a dataflow compiler can pipe...

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

arXiv:2509.08753v1 Announce Type: new Abstract: We introduce Delayed Streams Modeling (DSM), a flexible formulation for...

Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong

Anthropic’s open-source circuit tracing tool can help developers debug, optimize, and control AI for reliable...
