News
SwiReasoning: Entropy-Driven Alternation of Latent and Explicit Chain-of-Thought for Reasoning LLMs
SwiReasoning is a decoding-time framework that lets a reasoning LLM decide when to think in...
SwiftMem: Fast Agentic Memory via Query-aware Indexing
arXiv:2601.08160v1 Announce Type: new Abstract: Agentic memory systems have become critical for enabling LLM agents...
SWE-Bench Performance Reaches 50.8% Without Tool Use: A Case for Monolithic State-in-Context Agents
Recent advancements in LM agents have shown promising potential for automating intricate real-world tasks. These...
Surprise Calibration for Better In-Context Learning
arXiv:2506.12796v1 Announce Type: new Abstract: In-context learning (ICL) has emerged as a powerful paradigm for...
Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation
arXiv:2311.01766v5 Announce Type: replace Abstract: Mis- and disinformation online have become a major societal problem...
Superintelligence: Unlocking the Mysteries of the Future
Imagine a future where machines don’t just outperform humans in specific tasks but fundamentally outthink...
Style Over Story: A Process-Oriented Study of Authorial Creativity in Large Language Models
arXiv:2510.02025v2 Announce Type: replace Abstract: Evaluations of large language models (LLMs)’ creativity have focused primarily...
Studying the Effects of Collaboration in Interactive Theme Discovery Systems
arXiv:2408.09030v4 Announce Type: replace Abstract: NLP-assisted solutions have gained considerable traction to support qualitative data...
Stronger Normalization-Free Transformers
arXiv:2512.10938v1 Announce Type: cross Abstract: Although normalization layers have long been viewed as indispensable components...
StreamTensor: A PyTorch-to-Accelerator Compiler that Streams LLM Intermediates Across FPGA Dataflows
Why treat LLM inference as batched kernels to DRAM when a dataflow compiler can pipe...
Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling
arXiv:2509.08753v1 Announce Type: new Abstract: We introduce Delayed Streams Modeling (DSM), a flexible formulation for...
Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong
Anthropic’s open-source circuit tracing tool can help developers debug, optimize, and control AI for reliable...



