ArXiv Intelligence

Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison

Topic · Memory

Conformal Certification of Reasoning Trace Prefixes

Topic · Other

Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers

Topic · Other

Learning to Choose: An Empowerment-Guided Multi-Agent System with semantic communication for Adaptive Method Selection

Topic · Agent

Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

Topic · Other

Teaching Values to Machines: Simulating Human-Like Behavior in LLMs

Topic · Other

RAISE: RAG Design as an Architecture Search Problem

Topic · Other

From GPS Points to Travel Patterns: Flexible and Semantic Trajectory Generation with LLMs

Topic · Other

KairosAgent: Agentic Time Series Forecasting with Fused Semantic Reasoning

Topic · Agent

Cookie-Bench: Continuous On-screen Key Interaction Evaluation for Web Generation

Topic · Other

Accelerating Constrained Decoding with Token Space Compression

Topic · Other

Compass: Navigating Global Marine Lead Data Integration through Expert-Guided LLM Agent

Topic · Agent

Meta-Programming for Linear-time Temporal Answer Set Programming

Topic · Other

Formalizing Mathematics at Scale

Topic · Other

MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization

Topic · Other

Make LLM Learn to Synthesize from Streaming Experiences through Feedback

Topic · Other

It's All About Speed: AI's Impact on Workflow in Music Production

Topic · Agent

Toward AI Systems That Understand Self and Others: A Multi-Phase Inference Framework for Human Cognitive Diversity and World-Model Alignment

Topic · Reinforcement Learning

On the Geometry of Games and their Solvers

Topic · Other

Redundant or Necessary? A Benchmark for Detecting Redundant Steps in Agent Trajectories

Topic · Agent

2026-05-29 · 354 papers
