News

June 26, 2025 | admin | NUAI, Committee, News, Uncategorized

Evaluating Rare Disease Diagnostic Performance in Symptom Checkers: A Synthetic Vignette Simulation Approach

arXiv:2506.19750v2 Announce Type: replace Abstract: Symptom Checkers (SCs) provide users with personalized medical information. To...

August 6, 2025 | admin | NUAI, Committee, News, Uncategorized

Evaluating LLMs on Real-World Forecasting Against Expert Forecasters

arXiv:2507.04562v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse...

May 13, 2025 | admin | NUAI, Committee, News, Uncategorized

Evaluating Creative Short Story Generation in Humans and Large Language Models

arXiv:2411.02316v5 Announce Type: replace Abstract: Story-writing is a fundamental aspect of human imagination, relying heavily...

June 16, 2025 | admin | NUAI, Committee, News, Uncategorized

Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions

arXiv:2506.11111v1 Announce Type: new Abstract: Large Language Models (LLMs) have gained enormous attention in recent...

August 26, 2025 | admin | NUAI, Committee, News, Uncategorized

Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG

arXiv:2406.13069v4 Announce Type: replace Abstract: How novel are texts generated by language models (LMs) relative...

July 14, 2025 | admin | NUAI, Committee, News, Uncategorized

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

arXiv:2503.08893v2 Announce Type: replace Abstract: An ideal model evaluation should achieve two goals: identifying where...

May 8, 2025 | admin | NUAI, Committee, News, Uncategorized

Estimating LLM Uncertainty with Logits

arXiv:2502.00290v4 Announce Type: replace Abstract: Over the past few years, Large Language Models (LLMs) have...

July 23, 2025 | admin | NUAI, Committee, News, Uncategorized

Erasing Conceptual Knowledge from Language Models

arXiv:2410.02760v3 Announce Type: replace Abstract: In this work, we introduce Erasure of Language Memory (ELM)...

June 16, 2025 | admin | NUAI, Committee, News, Uncategorized

EPFL Researchers Unveil FG2 at CVPR: A New AI Model That Slashes Localization Errors by 28% for Autonomous Vehicles in GPS-Denied Environments

Navigating the dense urban canyons of cities like San Francisco or New York can be...

June 17, 2025 | admin | NUAI, Committee, News, Uncategorized

News

EPFL Researchers Introduce MEMOIR: A Scalable Framework for Lifelong Model Editing in LLMs

Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations

Enterprise alert: PostgreSQL just became the database you can’t ignore for AI applications
