Research Stream

2026-05-29 · 354 篇

显示 61-80 / 354
筛选与排序 默认折叠,需要时再展开,当前条件会直接显示在右侧。
日期 2026-05-29
清空
快捷日期
更多筛选

Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling

Beyond Trajectory Rewards: Step-level Credit Assignment for Agentic Search via Graph Modeling

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting

FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting

Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used

Reliable Reasoning with Large Language Models via Preference-Based Maximum Satisfiability

Reliable Reasoning with Large Language Models via Preference-Based Maximum Satisfiability

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

NICE: A Theory-Grounded Diagnostic Benchmark for Social Intelligence of LLMs

NICE: A Theory-Grounded Diagnostic Benchmark for Social Intelligence of LLMs

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems

Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents

GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation

TRACE: Toulmin-based Reasoning Assessment through Constructive Elements for LLM CoT Evaluation

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

PTCG-Bench: Can LLM Agents Master Pokémon Trading Card Game?

PTCG-Bench: Can LLM Agents Master Pokémon Trading Card Game?

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Think Fast, Talk Smart: Partitioning Deterministic and Neural Computation for Structured Health Text Generation

Think Fast, Talk Smart: Partitioning Deterministic and Neural Computation for Structured Health Text Generation

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning

LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

VikingMem: A Memory Base Management System for Stateful LLM-based Applications

VikingMem: A Memory Base Management System for Stateful LLM-based Applications

Topic · 记忆
仅有原始 MD
Quick Read
LLM failed, fallback used

Beyond Attack Success Rate: Temporal Logit Observability for LLM Safety Failures

Beyond Attack Success Rate: Temporal Logit Observability for LLM Safety Failures

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering

HiKEY: Hierarchical Multimodal Retrieval for Open-Domain Document Question Answering

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion

Mind-Omni: A Unified Multi-Task Framework for Brain-Vision-Language Modeling via Discrete Diffusion

Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used

FinVerBench: Benchmark Validity and Calibration in Large Language Model Financial Statement Verification

FinVerBench: Benchmark Validity and Calibration in Large Language Model Financial Statement Verification

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

GPS-Enhanced Tourist Mobility Modeling with Seasonal Spatial Priors and LLM-Based Activity Chain Generation

GPS-Enhanced Tourist Mobility Modeling with Seasonal Spatial Priors and LLM-Based Activity Chain Generation

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning

DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning

Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used

Planning with the Views via Scene Self-Exploration

Planning with the Views via Scene Self-Exploration

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

ParaTool: Shifting Tool Representations from Context to Parameters

ParaTool: Shifting Tool Representations from Context to Parameters

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
PrevPage 4/18Next