Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment
Topic · LLM Post-Training
Forget Less, Generalize More: Unifying Temporal and Structural Adaptation for Dynamic Graphs
Topic · Other
How Much Is a Dataset Worth? Scaling Laws, the Vendi Score, and Matrix Spectral Functions
Topic · LLM Foundation Models
How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions
Topic · Agent
SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents
Topic · Agent
AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing
Topic · Other
DELOS: Detecting Shallow Transits in Kepler Photometry Using a Contrastive-Learning Framework
Topic · Machine Learning Frameworks
Beyond Bilingual Transfer: Multilingual Code-Switching in Instruction Tuning
Topic · Other
The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction
Topic · Other
Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge
Topic · Other
GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models
Topic · Reinforcement Learning
On the Optimizer Dependence of Neural Scaling Laws
Topic · LLM Foundation Models
Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies
Topic · Other
TRACER: Persistent Regularization for Robust Multimodal Finetuning
Topic · Other
SURGENT: A Surgical Multi-Agent Assistance System Across the Perioperative Workflow
Topic · Agent
Does Distributed Training Undermine Compute Governance?
Topic · Other
Rethinking FID Through the Geometry of the Reference Dataset
Topic · Other
GrepSeek: Training Search Agents for Direct Corpus Interaction
Topic · Agent
MusTBENCH: Benchmarking and Advancing Temporal Grounding in Music LLMs
Topic · Other
Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models
Topic · Other