Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software
Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations
SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection
Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents
Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Demystifying Data Organization for Enhanced LLM Training
Demystifying Data Organization for Enhanced LLM Training
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection
MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure
ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
mcp-proto-okn: Natural-language access to open scientific knowledge graphs through the Model Context Protocol
mcp-proto-okn: Natural-language access to open scientific knowledge graphs through the Model Context Protocol
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Persona Conditioning of Brand Recommendations in Retrieval-Augmented Commercial Chat: A Prominence-Stratified Cross-Provider Audit
Persona Conditioning of Brand Recommendations in Retrieval-Augmented Commercial Chat: A Prominence-Stratified Cross-Provider Audit
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Double-Edged Sword or Sharp Tool? Designing and Evaluating Triadic LLM-Teacher Collaboration for K-12 Writing at Scale
Double-Edged Sword or Sharp Tool? Designing and Evaluating Triadic LLM-Teacher Collaboration for K-12 Writing at Scale
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance
Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
BioRefusalAudit: Auditing Biosecurity Refusal Depth Using General and Domain-Fine-Tuned Sparse Autoencoders
BioRefusalAudit: Auditing Biosecurity Refusal Depth Using General and Domain-Fine-Tuned Sparse Autoencoders
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents
Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Temporal Stability and Few-Shot Prompting in Math Task Assessment
Temporal Stability and Few-Shot Prompting in Math Task Assessment
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Anchorless Diversification for Parallel LLM Ideation
Anchorless Diversification for Parallel LLM Ideation
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
AgentSchool: An LLM-Powered Multi-Agent Simulation for Education
AgentSchool: An LLM-Powered Multi-Agent Simulation for Education
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Enhancing Multi-Agent Communication through Attention Steering with Context Relevance
Enhancing Multi-Agent Communication through Attention Steering with Context Relevance
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing
VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
PokerSkill: LLMs Can Play Expert-Level Poker without Training or Solvers
PokerSkill: LLMs Can Play Expert-Level Poker without Training or Solvers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used