ข่าว
ข่าว
Token Buncher: Shielding LLMs from Harmful Reinforcement Learning Fine-Tuning
arXiv:2508.20697v1 Announce Type: cross Abstract: As large language models (LLMs) continue to grow in capability...
Together AI’s ATLAS adaptive speculator delivers 400% inference speedup by learning from workloads in real-time
Enterprises expanding AI deployments are hitting an invisible performance wall. The culprit? Static speculators that...
Time Magazine appears to accidentally publish embargoed story confirming new Anthropic model
Someone also appears to have published a full scrape of the Time article online on...
Tilde AI Releases TildeOpen LLM: An Open-Source Large Language Model with Over 30 Billion Parameters and Support Most European Languages
Latvian language-tech firm Tilde has released TildeOpen LLM, an open-source foundational large language model (LLM)...
Three takeaways about AI’s energy use and climate impacts
This week, we published Power Hungry, a package all about AI and energy. At the...
This researcher turned OpenAI’s open weights model gpt-oss-20b into a non-reasoning ‘base’ model with less alignment, more freedom
Morris found it could also reproduce verbatim passages from copyrighted works, including three out of...
This data set helps researchers spot harmful stereotypes in LLMs
AI models are riddled with culturally specific biases. A new data set, called SHADES, is...
This AI Paper Proposes a Novel Dual-Branch Encoder-Decoder Architecture for Unsupervised Speech Enhancement (SE)
Can a speech enhancer trained only on real noisy recordings cleanly separate speech and noise—without...
This AI Paper Investigates Test-Time Scaling of English-Centric RLMs for Enhanced Multilingual Reasoning and Domain Generalization
Reasoning language models, or RLMs, are increasingly used to simulate step-by-step problem-solving by generating long...
This AI Paper Introduces WINGS: A Dual-Learner Architecture to Prevent Text-Only Forgetting in Multimodal Large Language Models
Multimodal LLMs: Expanding Capabilities Across Text and Vision Expanding large language models (LLMs) to handle...
This AI Paper Introduces WEB-SHEPHERD: A Process Reward Model for Web Agents with 40K Dataset and 10× Cost Efficiency
Web navigation focuses on teaching machines how to interact with websites to perform tasks such...
This AI Paper Introduces PyVision: A Python-Centric Framework Where AI Writes Tools as It Thinks
Visual reasoning tasks challenge artificial intelligence models to interpret and process visual information using both...
 
				 
				






 
				 
					           
					           
					           
					           
					           
					          