GPF-LiveNews: A Streaming Evaluation Protocol for Group-Conditioned Framing in Large Language Models
GPF-LiveNews: A Streaming Evaluation Protocol for Group-Conditioned Framing in Large Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Thoughts-as-Planning: Latent World Models for Chain-of-Thoughts Optimization via Reinforcement Planning
Thoughts-as-Planning: Latent World Models for Chain-of-Thoughts Optimization via Reinforcement Planning
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
How Consistent Are LLM Agents? Measuring Behavioral Reproducibility in Multi-Step Tool-Calling Pipelines
How Consistent Are LLM Agents? Measuring Behavioral Reproducibility in Multi-Step Tool-Calling Pipelines
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Specialty-Specific Medical Language Model for Immune-Mediated Diseases
Specialty-Specific Medical Language Model for Immune-Mediated Diseases
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
SERC: LDPC-Inspired Semantic Error Correction for Retrieval-Augmented Generation
SERC: LDPC-Inspired Semantic Error Correction for Retrieval-Augmented Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand
No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling
GenesisFunc: Multi-Agent Data Generation for Accurate and Generalizable Function-Calling
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Assessing Dutch Syllabification Algorithms and Improving Accuracy by Combining Phonetic and Orthographic Information through Deep Learning
Assessing Dutch Syllabification Algorithms and Improving Accuracy by Combining Phonetic and Orthographic Information through Deep Learning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Transcribing Children's Speech: ASR Performance and Obtaining Reliable Orthographic Transcriptions
Transcribing Children's Speech: ASR Performance and Obtaining Reliable Orthographic Transcriptions
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
A comparative study of transformer-based embeddings for topic coherence
A comparative study of transformer-based embeddings for topic coherence
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
S3Mem: Structured Spatiotemporal Scene-Event Memory for Long-Horizon Interactive Question Answering
S3Mem: Structured Spatiotemporal Scene-Event Memory for Long-Horizon Interactive Question Answering
Topic · 记忆
仅有原始 MD
Quick Read
LLM failed, fallback used
Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation
Benchmarking Open-Source Safety Guard Models: A Comprehensive Evaluation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning
Aryabhata 2: Scaling Reinforcement Learning for Advanced STEM Reasoning
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models
Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used