Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison
Selective QA over Conflicting Multi-Source Personal Memory: A Diagnostic Testbed and Method Comparison
Topic · 记忆
仅有原始 MD
Quick Read
LLM failed, fallback used
Conformal Certification of Reasoning Trace Prefixes
Conformal Certification of Reasoning Trace Prefixes
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers
Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Learning to Choose: An Empowerment-Guided Multi-Agent System with semantic communication for Adaptive Method Selection
Learning to Choose: An Empowerment-Guided Multi-Agent System with semantic communication for Adaptive Method Selection
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning
Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Teaching Values to Machines: Simulating Human-Like Behavior in LLMs
Teaching Values to Machines: Simulating Human-Like Behavior in LLMs
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
RAISE: RAG Design as an Architecture Search Problem
RAISE: RAG Design as an Architecture Search Problem
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
From GPS Points to Travel Patterns: Flexible and Semantic Trajectory Generation with LLMs
From GPS Points to Travel Patterns: Flexible and Semantic Trajectory Generation with LLMs
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
KairosAgent: Agentic Time Series Forecasting with Fused Semantic Reasoning
KairosAgent: Agentic Time Series Forecasting with Fused Semantic Reasoning
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Cookie-Bench: Continuous On-screen Key Interaction Evaluation for Web Generation
Cookie-Bench: Continuous On-screen Key Interaction Evaluation for Web Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Accelerating Constrained Decoding with Token Space Compression
Accelerating Constrained Decoding with Token Space Compression
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Compass: Navigating Global Marine Lead Data Integration through Expert-Guided LLM Agent
Compass: Navigating Global Marine Lead Data Integration through Expert-Guided LLM Agent
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Meta-Programming for Linear-time Temporal Answer Set Programming
Meta-Programming for Linear-time Temporal Answer Set Programming
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Formalizing Mathematics at Scale
Formalizing Mathematics at Scale
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization
MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Make LLM Learn to Synthesize from Streaming Experiences through Feedback
Make LLM Learn to Synthesize from Streaming Experiences through Feedback
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
It`s All About Speed: AI`s Impact on Workflow in Music Production
It`s All About Speed: AI`s Impact on Workflow in Music Production
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Toward AI Systems That Understand Self and Others: A Multi-Phase Inference Framework for Human Cognitive Diversity and World-Model Alignment
Toward AI Systems That Understand Self and Others: A Multi-Phase Inference Framework for Human Cognitive Diversity and World-Model Alignment
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
On the Geometry of Games and their Solvers
On the Geometry of Games and their Solvers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Redundant or Necessary? A Benchmark for Detecting Redundant Steps in Agent Trajectories
Redundant or Necessary? A Benchmark for Detecting Redundant Steps in Agent Trajectories
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used