-
Notifications
You must be signed in to change notification settings - Fork 164
ARM hybrid CPU support #2356
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
ARM hybrid CPU support #2356
Conversation
If we have an inclusive range like `0-3`, that's four cores, not three.
260571c to
89b27b4
Compare
|
Note: I haven't touched the |
89b27b4 to
84c2e69
Compare
Kobzol
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The package-lock.json change looks unrelated, could you revert it please?
CC @Jamesbarford if you have any comments as an ARM expert :)
Otherwise I'm fine with merging ofc, as it solves your use-case.
As far as I understand there are 2 physical clusters with different cache sizes which could impact performance runs?
reference: https://docs.nvidia.com/dgx/dgx-spark-porting-guide/overview.html |
Specifically: - Add support for detecting ARM hybrid CPUs, via a heuristic on CPU "capacity". - Adjust ARM-specific event names as necessary, e.g. `armv8_pmuv3_0/instructions:u/` -> `instructions:u`. There is also some refactoring of the existing code for handling Intel hybrid architectures, e.g. merging `run_on_p_cores` into `performance_cores`, to avoid code duplication.
84c2e69 to
2902893
Compare
|
I moved the regex and updated the comment about the ASUS GX10 to mention the L3 cache. |
Jamesbarford
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me 👍. (I’m on a phone and GitHub won’t let me select the Approve radio button, thus this is a comment 🤷🏻♂️)

I have an ASUS GX10 Ascent which has an ARM CPU with 10 performance cores and 10 efficiency cores. This PR is enough to get rustc-perf working on it.