Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CPU simulated benchmarking for GKE cluster. #143

Merged
merged 10 commits into from
Oct 7, 2024
Merged

CPU simulated benchmarking for GKE cluster. #143

merged 10 commits into from
Oct 7, 2024

Conversation

Yash9060
Copy link
Collaborator

@Yash9060 Yash9060 commented Oct 7, 2024

Simulated CPU benchmarking code for GKE cluster. I have tested this on GKE cluster of 20 N2-standard-64.
Example of deployment-yaml: http://gpaste/6024689370791936

Using linear model here since it is easier to divide it amongst the processes (in order to ensure consistent size across the processes) .

Also not using trainer object (& trainer.fit) since training does consume some time (decreases dev velocity) and also produces checkpoints of varying sizes.

I will have a separate version of similar code which can be run on cloudtop for quick testing.

  • Tests pass

@Yash9060 Yash9060 requested a review from a team as a code owner October 7, 2024 18:49
demo/lightning/checkpoint/multinode/train2.py Outdated Show resolved Hide resolved
Copy link
Contributor

@awonak awonak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Yash9060 Yash9060 enabled auto-merge (squash) October 7, 2024 19:43
@Yash9060 Yash9060 self-assigned this Oct 7, 2024
@Yash9060 Yash9060 merged commit cdcb24b into main Oct 7, 2024
5 checks passed
@Yash9060 Yash9060 deleted the yashsha-sim branch October 7, 2024 20:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants