Actualités

octobre 2, 2025admin NUAI,Committee,Actualités,Uncategorized0

It Takes Two: Your GRPO Is Secretly DPO

arXiv:2510.00977v1 Announce Type: cross Abstract: Group Relative Policy Optimization (GRPO) is a prominent reinforcement learning...

octobre 28, 2025admin NUAI,Committee,Actualités,Uncategorized0

ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models

arXiv:2510.23558v1 Announce Type: cross Abstract: Large Audio Language Models (LALMs), which couple acoustic perception with...

octobre 12, 2025admin NUAI,Committee,Actualités,Uncategorized0

Is vibe coding ruining a generation of engineers?

AI tools are revolutionizing software development by automating repetitive tasks, refactoring bloated code, and identifying...

juillet 21, 2025admin NUAI,Committee,Actualités,Uncategorized0

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

arXiv:2502.13962v2 Announce Type: replace Abstract: Scaling the test-time compute of large language models has demonstrated...

octobre 8, 2025admin NUAI,Committee,Actualités,Uncategorized0

Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort

arXiv:2510.01367v3 Announce Type: replace-cross Abstract: Reward hacking, where a reasoning model exploits loopholes in a...

septembre 16, 2025admin NUAI,Committee,Actualités,Uncategorized0

Is In-Context Learning Learning?

arXiv:2509.10414v2 Announce Type: replace Abstract: In-context learning (ICL) allows some autoregressive models to solve tasks...

septembre 26, 2025admin NUAI,Committee,Actualités,Uncategorized0

Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown

arXiv:2411.15993v2 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated strong capabilities in text...

juin 10, 2025admin NUAI,Committee,Actualités,Uncategorized0

Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment

arXiv:2505.12452v2 Announce Type: replace Abstract: Large language models (LLMs) increasingly demonstrate signs of conceptual understanding...

septembre 19, 2025admin NUAI,Committee,Actualités,Uncategorized0

Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction

arXiv:2509.14504v1 Announce Type: new Abstract: In this paper, we introduce OmniGEC, a collection of multilingual...

octobre 17, 2025admin NUAI,Committee,Actualités,Uncategorized0

Interpreting the Latent Structure of Operator Precedence in Language Models

arXiv:2510.13908v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities but...

octobre 23, 2025admin NUAI,Committee,Actualités,Uncategorized0

Interpretable Question Answering with Knowledge Graphs

arXiv:2510.19181v1 Announce Type: new Abstract: This paper presents a question answering system that operates exclusively...

septembre 1, 2025admin NUAI,Committee,Actualités,Uncategorized0

Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization

arXiv:2507.05137v2 Announce Type: replace Abstract: Learning Japanese vocabulary is a challenge for learners from Roman...

Actualités

Actualités

It Takes Two: Your GRPO Is Secretly DPO

ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models

Is vibe coding ruining a generation of engineers?

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort

Is In-Context Learning Learning?

Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown

Introspective Growth: Automatically Advancing LLM Expertise in Technology Judgment

Introducing OmniGEC: A Silver Multilingual Dataset for Grammatical Error Correction

Interpreting the Latent Structure of Operator Precedence in Language Models

Interpretable Question Answering with Knowledge Graphs

Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization

Nos services

Accueil

Comment ça marche

Actualités

Tarifs

Support

Centre d'aide

Signaler un problème

Donner un retour

Politique de confidentialité

Compte utilisateur

Suivez-nous