YouZum

News

Benchmarking the Pedagogical Knowledge of Large Language Models

arXiv:2506.18710v3 Announce Type: replace Abstract: Benchmarks like Massive Multitask Language Understanding (MMLU) have played a...

Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression

arXiv:2412.05693v3 Announce Type: replace Abstract: Several works have developed eviction policies to remove key-value (KV)...

Aya Vision: Advancing the Frontier of Multilingual Multimodality

arXiv:2505.08751v1 Announce Type: new Abstract: Building multimodal language models is fundamentally challenging: it requires aligning...

AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

arXiv:2506.08140v1 Announce Type: cross Abstract: Despite long-standing efforts in accelerating scientific discovery with AI, building...

AutoMixer: Checkpoint Artifacts as Automatic Data Mixers

arXiv:2506.21910v1 Announce Type: new Abstract: In language model training, it is desirable to equip models...

Automatically assessing oral narratives of Afrikaans and isiXhosa children

arXiv:2507.13205v1 Announce Type: new Abstract: Developing narrative and comprehension skills in early childhood is critical...

AutoLibra: Agent Metric Induction from Open-Ended Feedback

arXiv:2505.02820v2 Announce Type: replace-cross Abstract: Agents are predominantly evaluated and optimized via task success metrics...

Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion

arXiv:2506.01365v1 Announce Type: cross Abstract: Voice Activity Detection (VAD) plays a key role in speech...

Attention Basin: Why Contextual Position Matters in Large Language Models

arXiv:2508.05128v1 Announce Type: new Abstract: The performance of Large Language Models (LLMs) is significantly sensitive...

AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR

arXiv:2506.14190v1 Announce Type: new Abstract: Developing code-switched ASR systems is challenging due to language ambiguity...

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science

arXiv:2506.13992v1 Announce Type: cross Abstract: Large language models (LLMs) have advanced the automation of data...

Apple Researchers Reveal Structural Failures in Large Reasoning Models Using Puzzle-Based Evaluation

Artificial intelligence has undergone a significant transition from basic language models to advanced models that...