@hsuanchi1506

✅ PR Checklist – Previous Submission Round repo (v5.0)

This repository adds implementation guidance for enabling multi-node inference for Llama-70B using Triton Server.

🔒 Safeguard Against Accidental Disclosure

  • I confirm that I am not committing results or system updates intended for a future round.
  • This PR does not contain any result, metadata, or logs for an unreleased submission round.

🛠️ Valid Post-Publication Changes (if any)

  • This PR does not alter the published accuracy, latency, or system identity of any prior result.

📄 PR Communication

  • I have clearly explained the reason for this change in the PR description.

…nstructions and configuration changes. Updated INFERENCE_HASH in Makefile.build, modified Docker run command in Makefile.docker, and added detailed multinode execution steps for Llama70b inference in README.md.
@hsuanchi1506 hsuanchi1506 requested review from a team as code owners November 11, 2025 01:11
@github-actions

MLCommons CLA bot:
Thank you very much for your submission, we really appreciate it. Before we can accept your contribution, we ask that you sign the MLCommons CLA (Apache 2). Please use this [Google form](https://forms.gle/Ew1KkBVpyeJDuRw67) to initiate authorization. If you are from an MLCommons member organization, we will request that you be added to the CLA. If you are not from a member organization, we will email you a CLA to sign. For any questions, please contact [email protected].
0 out of 1 committers have signed the MLCommons CLA.
@hsuanchi1506
You can retrigger this bot by commenting recheck in this Pull Request

@github-actions github-actions bot locked and limited conversation to collaborators Nov 11, 2025
@hsuanchi1506 hsuanchi1506 reopened this Nov 11, 2025
@hsuanchi1506 hsuanchi1506 marked this pull request as draft November 11, 2025 04:03
@hsuanchi1506 hsuanchi1506 marked this pull request as ready for review November 11, 2025 04:14
@hanyunfan
Contributor

Hi @hsuanchi1506, are you a member of the MLPerf Inference working group?

@anandhu-eng
Contributor

anandhu-eng commented Nov 12, 2025

Hi @hanyunfan, this and the following PRs were raised as part of the Student Cluster Competition 2025, where MLPerf Inference is one of the benchmarks.

Other PRs:
#20
mlcommons/inference#2387

Edit: I'm converting these PRs to drafts. Since they will be used to award bonus points to teams, there is no urgent need to merge.

@anandhu-eng anandhu-eng marked this pull request as draft November 12, 2025 07:53
@anandhu-eng anandhu-eng changed the base branch from main to for-scc-2025 November 13, 2025 08:01

3 participants