AI 投研 – 第 5 页

6.0 AI 投研 Tech Xplore AI 2026-06-16

AI 投研 2026-06-16

AI robots can go rogue: A researcher on how easily it happens

AI 摘要：Earlier this year in Beijing, a humanoid robot crossed a half-marathon finish line in a blistering 50 mi

来源: Tech Xplore AI 6.0

6.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Studying inductive biases of random networks via local volumes

AI 摘要：In this post, we will study inductive biases of the parameter-function map of random neural networks usi

来源: EleutherAI Blog 6.0

8.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Research Update: Applications of Local Volume Measurement

本文为EleutherAI Blog于2025年6月23日发布的AI研究更新通报，核心围绕局部体积测量技术在下游AI任务中的应用展开。目前公开信息仅披露了研究主题，未提及具体技术细节、实验数据或落地成果，仅明确该内容属于

来源: EleutherAI Blog 8.0

AI Radar Summary

本文为EleutherAI Blog于2025年6月23日发布的AI研究更新通报，核心围绕局部体积测量技术在下游AI任务中的应用展开。目前公开信息仅披露了研究主题，未提及具体技术细节、实验数据或落地成果，仅明确该内容属于AI基础研究范畴的动态更新，暂未披露更多可验证的研究进展或相关案例，属于AI研究领域的常规资讯动态。

7.8 AI 投研 NVIDIA Technical Blog 2026-06-15

AI 投研 2026-06-15

Pretrained to Imagine, Fine-Tuned to Act: The Rise of World-Action Models

AI 摘要：Quick glossary for readers new to VLA/WAM terminology VLA Vision-Language-Action model: a robot policy t

来源: NVIDIA Technical Blog 7.8

6.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Pretraining Data Filtering for Open-Weight AI Safety

AI 摘要：Announcing Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weigh

来源: EleutherAI Blog 6.0

6.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Attention Probes

AI 摘要：Adding attention to linear probes

来源: EleutherAI Blog 6.0

8.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Reward Hacking Resarch Update

本文为EleutherAI官方博客于2025年10月7日发布的奖励黑客（Reward Hacking）研究中期进展报告，属于AI对齐领域的研究动态。公开片段仅说明该内容为持续性研究的阶段性更新，未披露具体实验设计、核心发

来源: EleutherAI Blog 8.0

AI Radar Summary

本文为EleutherAI官方博客于2025年10月7日发布的奖励黑客（Reward Hacking）研究中期进展报告，属于AI对齐领域的研究动态。公开片段仅说明该内容为持续性研究的阶段性更新，未披露具体实验设计、核心发现等细节。奖励黑客指AI系统利用奖励机制漏洞而非完成预设目标的现象，是当前AI安全领域的重点研究方向之一，本次更新为该领域的最新研究跟踪内容。

6.8 AI 投研 Tech Xplore AI 2026-06-15

AI 投研 2026-06-15

US order cutting access to Anthropic’s AI models sparks criticism

AI 摘要：The U.S. government's order for Anthropic to withdraw its most powerful artificial intelligence models h

来源: Tech Xplore AI 6.8

6.0 AI 投研 Tech Xplore AI 2026-06-15

AI 投研 2026-06-15

Courts cracking down on error-strewn AI-assisted legal briefs

AI 摘要：When a U.S. judge found fabricated quotes in a lawyer's brief earlier this year, the attorney admitted h

来源: Tech Xplore AI 6.0

6.0 AI 投研 EleutherAI Blog 2026-06-15

AI 投研 2026-06-15

Early Indicators of Reward Hacking via Reasoning Interpolation

AI 摘要：Using importance sampling with fine-tuned donor prefills to predict reward hacking emergence during trai

来源: EleutherAI Blog 6.0