ArXiv Intelligence

Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

A Taxonomy of Runtime Faults in Model Context Protocol Servers

Topic · 机器学习框架

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

LoRi: Low-Rank Distillation for Implicit Reasoning

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Topic · 强化学习

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Personal AI Agent for Camera Roll VQA

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

NIV: Neural Axis Variations for Variable Font Generation

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability

Topic · 其他

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

Topic · Agent

仅有原始 MD

Quick Read

LLM failed, fallback used

详情问答

2026-06-05 · 280 篇

Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration

VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff

Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models

Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography

A Taxonomy of Runtime Faults in Model Context Protocol Servers

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

LoRi: Low-Rank Distillation for Implicit Reasoning

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation

Personal AI Agent for Camera Roll VQA

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks

NIV: Neural Axis Variations for Variable Font Generation

From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation