Update: 2026-01-13 (09:11 PM)
Here is the Technical Intelligence Report for 2026-01-13.
Executive Summary
- AMD AGI Team Releases Athena-PRM: A new multimodal Process Reward Model (PRM) optimized for AMD Instinct GPUs has been released, demonstrating significant reasoning improvements (up to +10.2 points) on Qwen2.5-VL models.
- Kubernetes DRA Support Detected: Repository updates indicate imminent documentation for “DRA (Dynamic Resource Allocation) for GPU,” signaling improved Kubernetes scheduling for AMD hardware.
- NVIDIA & Eli Lilly $1B Partnership: NVIDIA confirms the deployment of DGX B300 (Blackwell) systems in a new joint lab, solidifying their dominance in the BioMed/HPC sector.
- Software Ecosystem Expansion: NVIDIA expands BioNeMo with `nvMolKit` and new recipes, continuing to raise the software moat in drug discovery.
🤖 ROCm Updates & Software
[2026-01-13] Athena-PRM: Multimodal Process Reward Model for Instinct GPUs
Source: ROCm Tech Blog / GitHub
Key takeaway relevant to AMD:
- Demonstrates AMD’s move up the stack into Model Architecture and AGI research, providing specific fine-tuning workflows (Reward Modeling) verified on Instinct hardware.
- Provides developers with a concrete example of achieving SOTA results on AMD hardware, countering the narrative that advanced RLHF/Reasoning workflows are NVIDIA-exclusive.
Summary:
- AMD has released a technical blog and model code for Athena-PRM, a model designed to evaluate the correctness of intermediate reasoning steps in Multimodal LLMs (MLLMs).
- The model addresses the high cost and noise associated with training Process Reward Models by using a “consistency” filter between weak and strong completers.
Details:
- Architecture & Methodology:
- Consistency Filtering: Uses both a “weak” and “strong” completer to estimate step labels; only steps where both agree are retained to remove bias.
- ORM Initialization: The PRM is initialized from an Outcome Reward Model (ORM) to leverage sample-level supervision before fine-tuning on process labels.
- Negative Up-sampling: Addresses label imbalance (where correct steps usually outnumber errors) by up-sampling negative data.
- Performance Metrics (on AMD Hardware):
- Test-Time Scaling: When used with Qwen2.5-VL-7B, Athena-PRM improved performance by 10.2 points on the WeMath benchmark and 7.1 points on MathVista compared to the base model.
- SOTA Results: Claims state-of-the-art results on VisualProcessBench, outperforming the previous SOTA by 3.9 F1 points.
- Comparison: Outperforms `VisualPRM-8B` and `InternVL2.5-8B` across the MathVerse, DynaMath, and MMMU benchmarks.
- Usage Scenarios:
- Best-of-N Verification: Ranking multiple solutions generated by a policy model.
- Direct Judgment: Single forward pass to identify error steps in a chain of thought.
- Reward-Ranked Fine-tuning: Generating synthetic data to fine-tune policy models (used to create `Athena-7B`).
- Availability: Code available at `AMD-AGI/Athena-PRM`; models trainable on AMD Instinct GPUs.
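The consistency-filtering and negative up-sampling steps described above can be sketched as follows. This is a minimal illustration, not AMD's implementation: the toy step labels, the exact agreement rule, and the 1:1 balancing target are all assumptions.

```python
import random

def consistency_filter(steps, weak_labels, strong_labels):
    """Keep only steps where the weak and strong completers agree on the label."""
    return [
        (step, wl)
        for step, wl, sl in zip(steps, weak_labels, strong_labels)
        if wl == sl  # agreement between the two completers' label estimates
    ]

def upsample_negatives(labeled_steps, seed=0):
    """Duplicate negative (erroneous) steps until the classes are balanced."""
    rng = random.Random(seed)
    pos = [s for s in labeled_steps if s[1] == 1]
    neg = [s for s in labeled_steps if s[1] == 0]
    if not neg or len(neg) >= len(pos):
        return labeled_steps
    extra = [rng.choice(neg) for _ in range(len(pos) - len(neg))]
    return labeled_steps + extra

# Toy example: 4 reasoning steps, labels from two completers (1 = correct step).
steps = ["s1", "s2", "s3", "s4"]
weak = [1, 1, 0, 1]
strong = [1, 0, 0, 1]
kept = consistency_filter(steps, weak, strong)  # "s2" dropped (disagreement)
balanced = upsample_negatives(kept)             # negatives duplicated to ~1:1
```

The disagreement on `s2` is exactly the noise the paper's filter is meant to discard; duplicating the lone negative step then counters the usual correct-step majority before PRM fine-tuning.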
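The Best-of-N verification scenario can be sketched as below. The step scorer here is a stand-in (a real pipeline would run Athena-PRM forward passes over each step), and aggregating a solution's step scores by their minimum is one common convention, not necessarily the one AMD uses.

```python
def best_of_n(candidates, step_scorer):
    """Pick the candidate solution whose weakest reasoning step scores highest.

    candidates:  list of solutions, each a list of reasoning-step strings.
    step_scorer: maps a step to a correctness score in [0, 1]
                 (a PRM forward pass in a real pipeline).
    """
    def solution_score(steps):
        # Aggregate by the minimum: one bad step sinks the whole solution.
        return min(step_scorer(s) for s in steps)
    return max(candidates, key=solution_score)

# Toy stand-in scorer: steps tagged "(wrong)" get a low score.
toy_scorer = lambda step: 0.1 if "(wrong)" in step else 0.9
solutions = [
    ["expand terms", "drop a sign (wrong)", "conclude"],
    ["expand terms", "collect terms", "conclude"],
]
best = best_of_n(solutions, toy_scorer)  # picks the error-free solution
```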
[2026-01-13] Dynamic Resource Allocation (DRA) for Kubernetes on ROCm
Source: ROCm Tech Blog / GitHub
Key takeaway relevant to AMD:
- Indicates AMD is modernizing its Kubernetes integration, moving towards the new native Kubernetes Dynamic Resource Allocation (DRA) standard, which is critical for efficient GPU slicing and orchestration in enterprise clouds.
Summary:
- A new documentation file, `blogs/software-tools-optimization/dra-gpu/README.md` (currently 362 lines), was added to the ROCm blogs repository.
- Wordlist updates in the same commit reveal specific technical keywords pointing to a Kubernetes implementation.
Details:
- New Terminology Added:
- `ResourceClaim`, `ResourceClaims`, `ResourceClass`, `ResourceSlice`, `ResourceSlices`: core API objects in the Kubernetes DRA specification.
- `GetPreferredAllocation`: a function likely related to the scheduler logic for selecting specific GPU topology.
- `smi` (likely `rocm-smi` integration) and `lifecycle` management.
- Implication: AMD is preparing to publish a guide or tool for “DRA-GPU,” likely a driver/plugin allowing more granular control over how AMD GPUs are claimed by pods in Kubernetes clusters (e.g., sharing GPUs or requesting specific topologies).
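As an illustration of what such a DRA integration might look like from the cluster user's side, the sketch below builds a hypothetical `ResourceClaim` manifest as a plain dictionary. The device class name `gpu.amd.com` is an assumption (the actual identifier will come from AMD's published driver/plugin), and the exact API group/version and field shape depend on the cluster's Kubernetes release.

```python
import json

# Hypothetical ResourceClaim requesting one AMD GPU via Kubernetes DRA.
# Field shapes follow the resource.k8s.io v1beta1 API; names marked
# "assumed" are placeholders, not confirmed AMD identifiers.
claim = {
    "apiVersion": "resource.k8s.io/v1beta1",
    "kind": "ResourceClaim",
    "metadata": {"name": "single-instinct-gpu"},
    "spec": {
        "devices": {
            "requests": [
                {
                    "name": "gpu",
                    "deviceClassName": "gpu.amd.com",  # assumed class name
                }
            ]
        }
    },
}

print(json.dumps(claim, indent=2))
```

A pod would then reference this claim by name in `spec.resourceClaims`, letting the DRA driver (rather than the legacy device-plugin API) decide which physical GPU, partition, or topology satisfies the request.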
🤼‍♂️ Market & Competitors
[2026-01-13] NVIDIA & Eli Lilly Deploy DGX B300 (Blackwell) in $1B Lab
Source: NVIDIA Blog
Key takeaway relevant to AMD:
- NVIDIA is aggressively deploying Blackwell (DGX B300) hardware into high-value pharmaceutical partnerships, setting a high bar for AMD’s MI325X/MI350 series in the BioMed vertical.
- The “Scientist-in-the-loop” framework suggests NVIDIA is deeply embedding its software stack (BioNeMo) into the physical workflow of drug discovery, increasing vendor lock-in.
Summary:
- NVIDIA and Eli Lilly announced a joint AI co-innovation lab in the San Francisco Bay Area with a combined investment of up to $1 billion over five years.
- The collaboration focuses on integrating “agentic wet labs” with computational “dry labs.”
Details:
- Hardware Deployment: The lab will utilize the NVIDIA DGX SuperPOD featuring DGX B300 systems. This is a confirmed deployment of NVIDIA’s Blackwell architecture for enterprise AI.
- Software Updates (BioNeMo Platform):
- nvMolKit: A new GPU-accelerated cheminformatics tool announced for molecular design.
- BioNeMo Recipes: Pre-configured scripts to accelerate training and scaling of biological foundation models.
- NVIDIA Clara: New open models for RNA structure prediction released.
- Ecosystem & Partners:
- Integration with Thermo Fisher for autonomous lab infrastructure.
- Integration with Multiply Labs for robotic cell therapy manufacturing.
- Highlighted startups using NVIDIA stacks: Isomorphic (AlphaFold), Recursion (OpenPhenom), and Insilico Medicine.
- Strategic Goal: To move from “artisanal drug-making” to an engineering problem by simulating massive numbers of molecules in silico before physical testing.