News
PerCoR: Evaluating Commonsense Reasoning in Persian via Multiple-Choice Sentence Completion
arXiv:2510.22616v1 Announce Type: new Abstract: We introduced PerCoR (Persian Commonsense Reasoning), the first large-scale Persian...
Pensieve Grader: An AI-Powered, Ready-to-Use Platform for Effortless Handwritten STEM Grading
arXiv:2507.01431v1 Announce Type: cross Abstract: Grading handwritten, open-ended responses remains a major bottleneck in large...
Partitioner Guided Modal Learning Framework
arXiv:2507.11661v1 Announce Type: new Abstract: Multimodal learning benefits from multiple modal information, and each learned...
Partial Information Decomposition via Normalizing Flows in Latent Gaussian Distributions
arXiv:2510.04417v1 Announce Type: cross Abstract: The study of multimodality has garnered significant interest in fields...
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
arXiv:2509.06917v2 Announce Type: replace-cross Abstract: We introduce Paper2Agent, an automated framework that converts research papers...
PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier
arXiv:2506.10406v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in complex...
PadChest-GR: A Bilingual Chest X-ray Dataset for Grounded Radiology Report Generation
arXiv:2411.05085v2 Announce Type: replace-cross Abstract: Radiology report generation (RRG) aims to create free-text radiology reports...
P-React: Synthesizing Topic-Adaptive Reactions of Personality Traits via Mixture of Specialized LoRA Experts
arXiv:2406.12548v3 Announce Type: replace Abstract: Personalized large language models (LLMs) have attracted great attention in...
Optimizing Length Compression in Large Reasoning Models
arXiv:2506.14755v2 Announce Type: replace-cross Abstract: Large Reasoning Models (LRMs) have achieved remarkable success, yet they...
Optimizing Assembly Code with LLMs: Reinforcement Learning Outperforms Traditional Compilers
LLMs have shown impressive capabilities across various programming tasks, yet their potential for program optimization...
OpenThoughts: A Scalable Supervised Fine-Tuning (SFT) Data Curation Pipeline for Reasoning Models
The Growing Complexity of Reasoning Data Curation: Recent reasoning models, such as DeepSeek-R1 and o3...
OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic
The open source framework provides the data and training recipe for building powerful computer-use agents...


