Paper Recommendations

All Candidates

48 papers

Score	Paper	Authors	Category
6	Misaligned by Reward: Socially Undesirable Preferences in LLMs	Gayane Ghazaryan, Esra Dönmez	cs.CL, cs.AI, cs.CY
6	Why Expert Alignment Is Hard: Evidence from Subjective Evaluation	Tzu-Mi Lin, Wataru Hirota et al.	cs.CL
5	Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning	Alper Kamil Bozkurt, Xiaoan Xu et al.	cs.LG, cs.AI
5	Beyond Semantics: An Evidential Reasoning-Aware Multi-View Learning Framework for Trustworthy Mental Health Prediction	Yucheng Ruan, Ling Huang et al.	cs.CL
5	TabEmbed: Benchmarking and Learning Generalist Embeddings for Tabular Understanding	Minjie Qiang, Mingming Zhang et al.	cs.CL, cs.IR
4	Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation	Enhui Chai, Sicheng Chen et al.	cs.CV, cs.AI
4	Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement	Nicholas S. Kersting, Vittorio Castelli et al.	cs.CL, cs.AI, cs.CY
4	Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction	Dan Wilson, Mohamed Akrout	cs.LG, math.DS
4	MRI-Eval: A Tiered Benchmark for Evaluating LLM Performance on MRI Physics and GE Scanner Operations Knowledge	Perry E. Radau	eess.IV, cs.CL, physics.med-ph
4	The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences	Hubert Plisiecki, Sabina Siudaj et al.	cs.CL
4	UFAL-CUNI at SemEval-2026 Task 11: An Efficient Modular Neuro-symbolic Method for Syllogistic Reasoning	Ivan Kartáč, Kristýna Onderková et al.	cs.CL
3	Human-AI Co-Mentorship in Project-Based Learning: A Case Study in Financial Forecasting	Freyaa Chawla, Ahan Chawla et al.	cs.LG, cs.CY
3	Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models	Quintin Pope, Ajay Hayagreeve Balaji et al.	cs.CL, cs.AI
3	When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise	Philip Wootaek Shin, Ajay Narayanan Sridhar et al.	cs.CV, cs.CL
2	The First Token Knows: Single-Decode Confidence for Hallucination Detection	Mina Gabriel	cs.CL, cs.AI
2	Aes3D: Aesthetic Assessment in 3D Gaussian Splatting	Chuanzhi Xu, Boyu Wei et al.	cs.CV, cs.AI
2	Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting	Alper Yıldırım	cs.LG, cs.AI
2	Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation	Olivia Jullian Parra, Sara Zoccheddu et al.	cs.LG, cs.AI
2	Transformed Latent Variable Multi-Output Gaussian Processes	Xiaoyu Jiang, Xinxing Shi et al.	cs.LG
2	Conditional outlier detection for clinical alerting	Milos Hauskrecht, Michal Valko et al.	cs.LG, cs.CY
2	Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior	Daniel Wurgaft, Can Rager et al.	cs.LG
2	Adapting Large Language Models to a Low-Resource Agglutinative Language: A Comparative Study of LoRA and QLoRA for Bashkir	Mullosharaf K. Arabov, Svetlana S. Khaybullina	cs.CL
1	When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning	Lakshita Dodeja, Ondrej Biza et al.	cs.RO, cs.AI
1	PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation	Srikar Kashyap Pulipaka	cs.CL, cs.AI, cs.LG
1	LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts	Seungeun Rho, Shamel Fahmi et al.	cs.RO, cs.AI
1	Building informative materials datasets beyond targeted objectives	Rafael Espinosa Castañeda, Ashley Dale et al.	cond-mat.mtrl-sci, cs.AI, cs.DB, cs.LG, stat.AP
1	Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval	Nicholas Barnfield, Juno Kim et al.	stat.ML, cs.IT, cs.LG
1	Estimating the expected output of wide random MLPs more efficiently than sampling	Wilson Wu, Victor Lecomte et al.	cs.LG, cond-mat.dis-nn, stat.ML
1	Physiologically Grounded Driver Behavior Classification: SHAP-Driven Elite Feature Selection and Hybrid Gradient Boosting for Multimodal Physiological Signals	Sahar Askari, Mohammad Mahdi Mirza Ali Mohammadi et al.	cs.LG, eess.SP
1	On the Hardness of Junking LLMs	Marco Rando, Samuel Vaiter	cs.LG
1	Implicit Representations of Grammaticality in Language Models	Yingshan Susan Wang, Linlu Qiu et al.	cs.CL
1	Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals	Gijs van Dijk	cs.CL
1	Conceptors for Semantic Steering	Ilias Triantafyllopoulos, Young-Min Cho et al.	cs.LG, cs.CL
0	Taming Outlier Tokens in Diffusion Transformers	Xiaoyu Wu, Yifei Wang et al.	cs.CV, cs.AI, cs.LG
0	Grokability in five inequalities	Paata Ivanisvili, Xinyuan Xie	math.PR, cs.AI, math.AP, math.CA, math.FA
0	Almost-Orthogonality in Lp Spaces: A Case Study with Grok	Ziang Chen, Jaume de Dios Pont et al.	math.CA, cs.AI, math.CO, math.PR
0	What Matters in Practical Learned Image Compression	Kedar Tatwawadi, Parisa Rahimzadeh et al.	cs.CV, cs.AI, cs.LG
0	On the Wasserstein Gradient Flow Interpretation of Drifting Models	Arthur Gretton, Li Kevin Wenliang et al.	cs.LG, cs.AI, stat.ML
0	Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics	Andreas Pattichis, Constantine Dovrolis	cs.LG, cs.AI, cs.CL
0	Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer	Alexander Hsu, Zhaiming Shen et al.	cs.LG, math.NA
0	How Long Does Infinite Width Last? Signal Propagation in Long-Range Linear Recurrences	Mariia Seleznova	cs.LG
0	The Impossibility Triangle of Long-Context Modeling	Yan Zhou	cs.CL, cs.AI, cs.LG
0	Why Geometric Continuity Emerges in Deep Neural Networks: Residual Connections and Rotational Symmetry Breaking	Kyungwon Jeong, Won-Gi Paeng et al.	cs.LG, cs.AI, cs.CL

Editor's Rationale

Top Picks

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

Self-Induced Outcome Potential: Turn-Level Credit Assignment for Agents without Verifiers

Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

Rollout Pass-Rate Control: Steering Binary-Reward RL Toward Its Most Informative Regime

Executable World Models for ARC-AGI-3 in the Era of Coding Agents

All Candidates