Windows Makefile review comments addressed. Remove extra commands and CI changes. #256

Closed


rosslwheeler
Contributor

Addressed the review comments. Removed the extra rename CUDA command. Tested against CI, WSL2, Windows 11, and Windows Server 2022.

@Ricardicus - please review and let me know if you have any comments. Tried to incorporate our discussed changes. This should be cleaner now. Thanks.

karpathy and others added 9 commits April 27, 2024 00:54
…tensors to put the layernorms at the end. the training loop seems to work ok, and the tests pass and the loss and optimization looks ok, but the gradients don't match. which can't be right. so there is a bug, but it's a bit too late in the day for me to debug right now, creating a PR and going to sleep, will fix tomorrow
Tested locally and reduced compilation time by 200 ms. Unfortunately for me, upgrading to 12.4 made my compilation times 2x slower, but at least this can make it a bit faster
Addressed the review comments. Remove the extra rename Cuda command. Tested against CI, WSL2, Windows 11 and Windows Server 2022.
@rosslwheeler rosslwheeler force-pushed the makefile_windows_fixes branch from e4ce057 to 25240ec Compare April 27, 2024 21:41
karpathy and others added 19 commits April 27, 2024 23:17
… for some tensors and i don't exactly know why sad
…fp32 or bf16 or fp16. fp16 will error, though
load bf16 directly, and some "quality of life" handling of fp32/fp16/bf16 precisions
…prone, but i think it is done. had to bump versions on all .bin files, invalidating the previous files. re-run the python training script to re-export the new version files. let's not do too much of things like this in the future lol. actually, fun fact i had a chance to do the padded vocab really really early in the history of llm.c development, and chose not to do it, thinking i'll just do it later. i should have done it. such is life, you make mistakes, you accumulate scar tissue, and you learn, and you become better, faster, stronger. this is the mindset one must have to lead a happy and fulfilling life. it's not important that you are perfect at any point in time, it's only important that you keep improving, every day.
@rosslwheeler rosslwheeler deleted the makefile_windows_fixes branch April 29, 2024 09:22
6 participants