Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration
Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents
VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents
Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff
Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models
Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography
Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
A Taxonomy of Runtime Faults in Model Context Protocol Servers
A Taxonomy of Runtime Faults in Model Context Protocol Servers
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing
A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show
The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network
Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
LoRi: Low-Rank Distillation for Implicit Reasoning
LoRi: Low-Rank Distillation for Implicit Reasoning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference
Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation
Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Personal AI Agent for Camera Roll VQA
Personal AI Agent for Camera Roll VQA
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents
Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks
X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
NIV: Neural Axis Variations for Variable Font Generation
NIV: Neural Axis Variations for Variable Font Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability
From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation
Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used