Noticias
Noticias
Benchmarking Chinese Commonsense Reasoning with a Multi-hop Reasoning Perspective
arXiv:2510.08800v1 Announce Type: new Abstract: While Large Language Models (LLMs) have demonstrated advanced reasoning capabilities...
Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression
arXiv:2412.05693v3 Announce Type: replace Abstract: Several works have developed eviction policies to remove key-value (KV)...
Base Models Beat Aligned Models at Randomness and Creativity
arXiv:2505.00047v2 Announce Type: replace Abstract: Alignment has quickly become a default ingredient in LLM development...
Baidu Releases ERNIE-4.5-VL-28B-A3B-Thinking: An Open-Source and Compact Multimodal Reasoning Model Under the ERNIE-4.5 Family
How can we get large model level multimodal reasoning for documents, charts and videos while...
Aya Vision: Advancing the Frontier of Multilingual Multimodality
arXiv:2505.08751v1 Announce Type: new Abstract: Building multimodal language models is fundamentally challenging: it requires aligning...
AWARE, Beyond Sentence Boundaries: A Contextual Transformer Framework for Identifying Cultural Capital in STEM Narratives
arXiv:2510.04983v3 Announce Type: replace Abstract: Identifying cultural capital (CC) themes in student reflections can offer...
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
arXiv:2509.07555v1 Announce Type: new Abstract: In a rapidly evolving world where information updates swiftly, knowledge...
AutoSpec: An Agentic Framework for Automatically Drafting Patent Specification
arXiv:2509.19640v1 Announce Type: new Abstract: Patents play a critical role in driving technological innovation by...
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
arXiv:2506.08140v1 Announce Type: cross Abstract: Despite long-standing efforts in accelerating scientific discovery with AI, building...
AutoRev: Multi-Modal Graph Retrieval for Automated Peer-Review Generation
arXiv:2505.14376v2 Announce Type: replace Abstract: Enhancing the quality and efficiency of academic publishing is critical...
AutoMixer: Checkpoint Artifacts as Automatic Data Mixers
arXiv:2506.21910v1 Announce Type: new Abstract: In language model training, it is desirable to equip models...
Automatically assessing oral narratives of Afrikaans and isiXhosa children
arXiv:2507.13205v1 Announce Type: new Abstract: Developing narrative and comprehension skills in early childhood is critical...
