Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction
Topic · Other
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
Topic · Other
LLMSurgeon: Diagnosing Data Mixture of Large Language Models
Topic · Other
Unlocking the Working Memory of Large Language Models for Latent Reasoning
Topic · Memory
GPIC: A Giant Permissive Image Corpus for Visual Generation
Topic · Other
Reasoning with Sampling: Cutting at Decision Points
Topic · Other
RoboWits: Unexpected Challenges for Robotic Creative Problem Solving
Topic · Embodied AI
On Language Generation in the Limit with Bounded Memory
Topic · Memory
In-Context Reward Adaptation for Robust Preference Modeling
Topic · Other
Gram: Assessing sabotage propensities via automated alignment auditing
Topic · LLM Post-Training
Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion
Topic · Other
Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes
Topic · Agent
Archon: A Unified Multimodal Model for Holistic Digital Human Generation
Topic · Other
City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images
Topic · Other
MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings
Topic · Other
Self-Trained Verification for Training- and Test-Time Self-Improvement
Topic · Other
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
Topic · Embodied AI
Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection
Topic · Agent
LLUMI: Improving LLM Writing Assistance for Mental Health Support with Online Community Feedback
Topic · Other
PhyGenHOI: Physically-Aware 4D Generation of Dynamic Human-Object Interactions
Topic · Other