YouZum

Actualités

Actualités

Evaluating Rare Disease Diagnostic Performance in Symptom Checkers: A Synthetic Vignette Simulation Approach

arXiv:2506.19750v2 Announce Type: replace Abstract: Symptom Checkers (SCs) provide users with personalized medical information. To...

Evaluating LLMs on Real-World Forecasting Against Expert Forecasters

arXiv:2507.04562v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse...

Evaluating Creative Short Story Generation in Humans and Large Language Models

arXiv:2411.02316v5 Announce Type: replace Abstract: Story-writing is a fundamental aspect of human imagination, relying heavily...

Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions

arXiv:2506.11111v1 Announce Type: new Abstract: Large Language Models (LLMs) have gained enormous attention in recent...

Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG

arXiv:2406.13069v4 Announce Type: replace Abstract: How novel are texts generated by language models (LMs) relative...

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

arXiv:2503.08893v2 Announce Type: replace Abstract: An ideal model evaluation should achieve two goals: identifying where...

Estimating LLM Uncertainty with Logits

arXiv:2502.00290v4 Announce Type: replace Abstract: Over the past few years, Large Language Models (LLMs) have...

Erasing Conceptual Knowledge from Language Models

arXiv:2410.02760v3 Announce Type: replace Abstract: In this work, we introduce Erasure of Language Memory (ELM)...

EPFL Researchers Introduce MEMOIR: A Scalable Framework for Lifelong Model Editing in LLMs

The Challenge of Updating LLM Knowledge LLMs have shown outstanding performance for various tasks through...

Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations

arXiv:2509.05060v1 Announce Type: new Abstract: We introduce Entropy2Vec, a novel framework for deriving cross-lingual language...

Enterprise alert: PostgreSQL just became the database you can’t ignore for AI applications

Analysts provide insight on what the latest acquisition of a PostgreSQL database vendor means for...

We use cookies to improve your experience and performance on our website. You can learn more at Politique de confidentialité and manage your privacy settings by clicking Settings.

Privacy Preferences

You can choose your cookie settings by turning on/off each type of cookie as you wish, except for essential cookies.

Allow All
Manage Consent Preferences
  • Always Active

Save
fr_FR