Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling
Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting
FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
Reliable Reasoning with Large Language Models via Preference-Based Maximum Satisfiability
Reliable Reasoning with Large Language Models via Preference-Based Maximum Satisfiability
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
NICE: A Theory-Grounded Diagnostic Benchmark for Social Intelligence of LLMs
NICE: A Theory-Grounded Diagnostic Benchmark for Social Intelligence of LLMs
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems
Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents
GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation
TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
PTCG-Bench: Can LLM Agents Master Pokémon Trading Card Game?
PTCG-Bench: Can LLM Agents Master Pokémon Trading Card Game?
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Think Fast, Talk Smart: Partitioning Deterministic and Neural Computation for Structured Health Text Generation
Think Fast, Talk Smart: Partitioning Deterministic and Neural Computation for Structured Health Text Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning
LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
VikingMem: A Memory Base Management System for Stateful LLM-based Applications
VikingMem: A Memory Base Management System for Stateful LLM-based Applications
Topic · 记忆
仅有原始 MD
Quick Read
LLM failed, fallback used
Beyond Attack Success Rate: Temporal Logit Observability for LLM Safety Failures
Beyond Attack Success Rate: Temporal Logit Observability for LLM Safety Failures
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models
Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering
HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion
Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
FinVerBench: Benchmark Validity and Calibration in Large Language Model Financial Statement Verification
FinVerBench: Benchmark Validity and Calibration in Large Language Model Financial Statement Verification
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
GPS-Enhanced Tourist Mobility Modeling with Seasonal Spatial Priors and LLM-Based Activity Chain Generation
GPS-Enhanced Tourist Mobility Modeling with Seasonal Spatial Priors and LLM-Based Activity Chain Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Planning with the Views via Scene Self-Exploration
Planning with the Views via Scene Self-Exploration
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
ParaTool: Shifting Tool Representations from Context to Parameters
ParaTool: Shifting Tool Representations from Context to Parameters
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used