Nachrichten
Nachrichten
Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression
arXiv:2412.05693v3 Announce Type: replace Abstract: Several works have developed eviction policies to remove key-value (KV)...
Base Models Beat Aligned Models at Randomness and Creativity
arXiv:2505.00047v2 Announce Type: replace Abstract: Alignment has quickly become a default ingredient in LLM development...
Aya Vision: Advancing the Frontier of Multilingual Multimodality
arXiv:2505.08751v1 Announce Type: new Abstract: Building multimodal language models is fundamentally challenging: it requires aligning...
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
arXiv:2509.07555v1 Announce Type: new Abstract: In a rapidly evolving world where information updates swiftly, knowledge...
AutoSpec: An Agentic Framework for Automatically Drafting Patent Specification
arXiv:2509.19640v1 Announce Type: new Abstract: Patents play a critical role in driving technological innovation by...
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
arXiv:2506.08140v1 Announce Type: cross Abstract: Despite long-standing efforts in accelerating scientific discovery with AI, building...
AutoRev: Multi-Modal Graph Retrieval for Automated Peer-Review Generation
arXiv:2505.14376v2 Announce Type: replace Abstract: Enhancing the quality and efficiency of academic publishing is critical...
AutoMixer: Checkpoint Artifacts as Automatic Data Mixers
arXiv:2506.21910v1 Announce Type: new Abstract: In language model training, it is desirable to equip models...
Automatically assessing oral narratives of Afrikaans and isiXhosa children
arXiv:2507.13205v1 Announce Type: new Abstract: Developing narrative and comprehension skills in early childhood is critical...
Automatic Prompt Optimization with Prompt Distillation
arXiv:2508.18992v2 Announce Type: replace Abstract: Autoprompting is the process of automatically selecting optimized prompts for...
Automatic Detection of Inauthentic Templated Responses in English Language Assessments
arXiv:2509.08355v1 Announce Type: new Abstract: In high-stakes English Language Assessments, low-skill test takers may employ...
Automated Generation of Research Workflows from Academic Papers: A Full-text Mining Framework
arXiv:2509.12955v2 Announce Type: replace Abstract: The automated generation of research workflows is essential for improving...