YouZum

ข่าว

ข่าว

AI agents are hitting a liability wall. Mixus has a plan to overcome it using human overseers on high-risk workflows

Mixus’s “colleague-in-the-loop” model blends automation with human judgment for safe deployment of AI agents.Read More...

Agent-based computing is outgrowing the web as we know it

AI agents are moving from passive assistants to active participants. Today, we ask them to...

AegisLLM: Scaling LLM Security Through Adaptive Multi-Agent Systems at Inference Time

The Growing Threat Landscape for LLMs LLMs are key targets for fast-evolving attacks, including prompt...

Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning

arXiv:2412.08587v2 Announce Type: replace Abstract: Both encoder-only models (e.g., BERT, RoBERTa) and large language models...

Adopting agentic AI? Build AI fluency, redesign workflows, don’t neglect supervision

How can organizations decide how to use human-in-the-loop mechanisms and collaborative frameworks with AI agents?Read...

Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning

arXiv:2505.09738v1 Announce Type: new Abstract: Pretrained language models (LLMs) are often constrained by their fixed...

AbstRaL: Teaching LLMs Abstract Reasoning via Reinforcement to Boost Robustness on GSM Benchmarks

Recent research indicates that LLMs, particularly smaller ones, frequently struggle with robust reasoning. They tend...

A Unified Representation for Continuity and Discontinuity: Syntactic and Computational Motivations

arXiv:2506.05686v1 Announce Type: new Abstract: This paper advances a unified representation of linguistic structure for...

A Survey on Latent Reasoning

arXiv:2507.06203v2 Announce Type: replace Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, especially...

A Survey on (M)LLM-Based GUI Agents

arXiv:2504.13865v2 Announce Type: replace-cross Abstract: Graphical User Interface (GUI) Agents have emerged as a transformative...

A Survey of Context Engineering for Large Language Models

arXiv:2507.13334v1 Announce Type: new Abstract: The performance of Large Language Models (LLMs) is fundamentally determined...

A Simple Ensemble Strategy for LLM Inference: Towards More Stable Text Classification

arXiv:2504.18884v2 Announce Type: replace Abstract: With the advance of large language models (LLMs), LLMs have...
th