ArXiv Intelligence

Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment

Topic · 大模型后训练

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

How Much Is a Dataset Worth? Scaling Laws, the Vendi Score, and Matrix Spectral Functions

Topic · 大模型底座

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

DELOS: Detecting Shallow Transits in Kepler Photometry Using a Contrastive-Learning Framework

Topic · 机器学习框架

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Beyond Bilingual Transfer: Multilingual Code-Switching in Instruction Tuning

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

Topic · 强化学习

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

On the Optimizer Dependence of Neural Scaling Laws

Topic · 大模型底座

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

TRACER: Persistent Regularization for Robust Multimodal Finetuning

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

SURGENT: A Surgical Multi-Agent Assistance System Across the Perioperative Workflow

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Does Distributed Training Undermine Compute Governance?

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Rethinking FID Through the Geometry of the Reference Dataset

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

GrepSeek: Training Search Agents for Direct Corpus Interaction

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

MusTBENCH: Benchmarking and Advancing Temporal Grounding in Music LLMs

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

2026-05-29 · 354 篇

Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment

Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs

How Much Is a Dataset Worth? Scaling Laws, the Vendi Score, and Matrix Spectral Functions

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

DELOS: Detecting Shallow Transits in Kepler Photometry Using a Contrastive-Learning Framework

Beyond Bilingual Transfer: Multilingual Code-Switching in Instruction Tuning

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

On the Optimizer Dependence of Neural Scaling Laws

Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies

TRACER: Persistent Regularization for Robust Multimodal Finetuning

SURGENT: A Surgical Multi-Agent Assistance System Across the Perioperative Workflow

Does Distributed Training Undermine Compute Governance?

Rethinking FID Through the Geometry of the Reference Dataset

GrepSeek: Training Search Agents for Direct Corpus Interaction

MusTBENCH: Benchmarking and Advancing Temporal Grounding in Music LLMs

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models