News
SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model
arXiv:2507.02822v1 Announce Type: new Abstract: With the widespread adoption of large language models (LLMs) in...
SwiReasoning: Entropy-Driven Alternation of Latent and Explicit Chain-of-Thought for Reasoning LLMs
SwiReasoning is a decoding-time framework that lets a reasoning LLM decide when to think in...
SWE-Bench Performance Reaches 50.8% Without Tool Use: A Case for Monolithic State-in-Context Agents
Recent advancements in LM agents have shown promising potential for automating intricate real-world tasks. These...
Surprise Calibration for Better In-Context Learning
arXiv:2506.12796v1 Announce Type: new Abstract: In-context learning (ICL) has emerged as a powerful paradigm for...
Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation
arXiv:2311.01766v5 Announce Type: replace Abstract: Mis- and disinformation online have become a major societal problem...
Superintelligence: Unlocking the Mysteries of the Future
Imagine a future where machines don’t just outperform humans in specific tasks but fundamentally outthink...
StreamTensor: A PyTorch-to-Accelerator Compiler that Streams LLM Intermediates Across FPGA Dataflows
Why treat LLM inference as batched kernels to DRAM when a dataflow compiler can pipe...
Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling
arXiv:2509.08753v1 Announce Type: new Abstract: We introduce Delayed Streams Modeling (DSM), a flexible formulation for...
Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong
Anthropic’s open-source circuit tracing tool can help developers debug, optimize, and control AI for reliable...
StepFun AI Releases Step-Audio 2 Mini: An Open-Source 8B Speech-to-Speech AI Model that Surpasses GPT-4o-Audio
The StepFun AI team has released Step-Audio 2 Mini, an 8B parameter speech-to-speech large audio...
STEPER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models
arXiv:2510.07923v1 Announce Type: new Abstract: Answering complex real-world questions requires step-by-step retrieval and integration of...
Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models
arXiv:2507.15512v3 Announce Type: replace Abstract: Test-Time Scaling (TTS) is a promising approach to progressively elicit...




