-
Notifications
You must be signed in to change notification settings - Fork 587
Fix small-LLM B200 dockerfile #826
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅ |
|
@suachong - can you please review this? |
|
@suachong - is this the error you're still getting? - @mmarcinkiewicz - can you please help with this one? NVIDIA-SMI 580.65.06 |
|
@mmarcinkiewicz is the Dockerfile for B200 working? I'm still running into the same error that shriya posted. |
|
I've rebuilt the image and I can repro now, which is weird because it used to work. |
|
@suachong please try now |
suachong
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Tested with training run to convergence.
|
@suachong is this working now and can we merge this? |
|
[AMD Official Use Only - AMD Internal Distribution Only]
Yeah it is working. I’ve already approved the changes yesterday. I believe Michal said he wanted to push more changes but the Dockerfile he provided now works.
From: Hiwot Tadese Kassa ***@***.***>
Sent: Thursday, September 11, 2025 11:47 AM
To: mlcommons/training ***@***.***>
Cc: Chong, Su Ann ***@***.***>; Mention ***@***.***>
Subject: Re: [mlcommons/training] Fix small-LLM B200 dockerfile (PR #826)
Caution: This message originated from an External Source. Use proper caution when opening attachments, clicking links, or responding.
[Image removed by sender.]hiwotadese left a comment (mlcommons/training#826)<#826 (comment)>
@suachong<https://github.com/suachong> is this working now and can we merge this?
—
Reply to this email directly, view it on GitHub<#826 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/BJADJOHFKOVSLYN6JQTCEIT3SGKOFAVCNFSM6AAAAACFEZUJ7CVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTEOBRGUZDQMZZGQ>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
No description provided.