
Added gfx950a to pyt_mochi_inference.ubuntu dockerfile to enable fa #148

Open

vadseshu wants to merge 1 commit into ROCm:develop from vadseshu:develop

Conversation

@vadseshu

Motivation

PR to fix pyt_mochi_video_inference enablement on MI350/MI355.

Technical Details

Updated the GPU architecture list for MI350 in the Mochi inference Dockerfile.

Test Plan

Test Result

[Test result screenshot attached to the PR]

Submission Checklist

Copilot AI review requested due to automatic review settings April 16, 2026 13:53

Copilot AI left a comment


Pull request overview

Updates the AMD Ubuntu Mochi inference Dockerfile to include the MI350/MI355 GPU architecture in the ROCm arch list so flash-attention can be built/enabled on those GPUs.

Changes:

  • Add gfx950 to PYTORCH_ROCM_ARCH in pyt_mochi_inference.ubuntu.amd.Dockerfile.
Comments suppressed due to low confidence (1)

docker/pyt_mochi_inference.ubuntu.amd.Dockerfile:40

  • PYTORCH_ROCM_ARCH is a semicolon-delimited list, but it’s later expanded unquoted in the GPU_ARCHS=$(echo ${PYTORCH_ROCM_ARCH} | ...) command and then used in an env-var assignment. In sh, the semicolons will be treated as command separators (e.g., echo gfx950;gfx90a;...), which can break the Docker build. Quote the expansion and ensure the computed GPU_ARCHS value is passed/assigned in a way that preserves the semicolons (or convert the list to a delimiter that’s shell-safe before using it).
ARG PYTORCH_ROCM_ARCH=gfx950;gfx90a;gfx942;gfx1100;gfx1101;gfx1200;gfx1201
RUN git clone ${FA_REPO}
RUN cd flash-attention \
    && git submodule update --init \
    && GPU_ARCHS=$(echo ${PYTORCH_ROCM_ARCH} | sed -e 's/;gfx1[0-9]\{3\}//g') python3 setup.py bdist_wheel --dist-dir=dist \
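To make the quoting hazard concrete, here is a minimal shell sketch (the variable value is shortened for illustration; it is not the full Dockerfile arch list). Unquoted, each `;` in the expansion acts as a command separator; quoting the expansion keeps the whole list as one argument to `echo`:

```shell
# Illustrative sketch of the review comment, not the actual Dockerfile fix.
PYTORCH_ROCM_ARCH='gfx950;gfx90a;gfx942;gfx1100'

# Unquoted expansion (broken): the shell would see
#   echo gfx950;gfx90a;gfx942;gfx1100
# i.e. 'echo gfx950' followed by attempts to run 'gfx90a' etc. as commands.
# GPU_ARCHS=$(echo ${PYTORCH_ROCM_ARCH} | sed -e 's/;gfx1[0-9]\{3\}//g')

# Quoted expansion (safe): the semicolon-delimited list survives as one word,
# and sed strips the gfx1xxx entries as intended.
GPU_ARCHS=$(echo "${PYTORCH_ROCM_ARCH}" | sed -e 's/;gfx1[0-9]\{3\}//g')
echo "${GPU_ARCHS}"
```

Assigning the result with a quoted expansion (`GPU_ARCHS="..."`) likewise preserves the semicolons when the value is passed on to `setup.py`.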



  ARG FA_REPO="https://github.com/Dao-AILab/flash-attention.git"
- ARG PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx1100;gfx1101;gfx1200;gfx1201
+ ARG PYTORCH_ROCM_ARCH=gfx950;gfx90a;gfx942;gfx1100;gfx1101;gfx1200;gfx1201

Copilot AI Apr 16, 2026


The PR title mentions adding gfx950a, but the Dockerfile adds gfx950. Please confirm the intended ROCm arch string for MI350/MI355 and align either the Dockerfile value or the PR title/description to avoid confusion for downstream users.



3 participants