ข่าว
ข่าว
AI agents are hitting a liability wall. Mixus has a plan to overcome it using human overseers on high-risk workflows
Mixus’s “colleague-in-the-loop” model blends automation with human judgment for safe deployment of AI agents.Read More...
Agent-based computing is outgrowing the web as we know it
AI agents are moving from passive assistants to active participants. Today, we ask them to...
AegisLLM: Scaling LLM Security Through Adaptive Multi-Agent Systems at Inference Time
The Growing Threat Landscape for LLMs LLMs are key targets for fast-evolving attacks, including prompt...
Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning
arXiv:2412.08587v2 Announce Type: replace Abstract: Both encoder-only models (e.g., BERT, RoBERTa) and large language models...
Adopting agentic AI? Build AI fluency, redesign workflows, don’t neglect supervision
How can organizations decide how to use human-in-the-loop mechanisms and collaborative frameworks with AI agents?Read...
Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning
arXiv:2505.09738v1 Announce Type: new Abstract: Pretrained language models (LLMs) are often constrained by their fixed...
AbstRaL: Teaching LLMs Abstract Reasoning via Reinforcement to Boost Robustness on GSM Benchmarks
Recent research indicates that LLMs, particularly smaller ones, frequently struggle with robust reasoning. They tend...
A Unified Representation for Continuity and Discontinuity: Syntactic and Computational Motivations
arXiv:2506.05686v1 Announce Type: new Abstract: This paper advances a unified representation of linguistic structure for...
A Survey on Latent Reasoning
arXiv:2507.06203v2 Announce Type: replace Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, especially...
A Survey on (M)LLM-Based GUI Agents
arXiv:2504.13865v2 Announce Type: replace-cross Abstract: Graphical User Interface (GUI) Agents have emerged as a transformative...
A Survey of Context Engineering for Large Language Models
arXiv:2507.13334v1 Announce Type: new Abstract: The performance of Large Language Models (LLMs) is fundamentally determined...
A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification
arXiv:2504.18884v2 Announce Type: replace Abstract: With the advance of large language models (LLMs), LLMs have...