
Refactored code from different base based on leyan_branch #588

Open
wants to merge 25 commits into base: master
Conversation

@cesposo commented Jan 9, 2025

Summary
This PR addresses two things. First, it extends model_ext.py and train_sat.py from leyan_branch with my additions from the previous PR. Second, it fixes runtime errors in the flash attention path caused by a dtype and shape mismatch when passing the attention_mask to PyTorch's scaled_dot_product_attention function. Flash attention expects either a boolean mask or a floating-point mask matching the query tensor's dtype, broadcastable to [batch_size, n_heads, seq_len, seq_len]. Our previous code passed a [batch_size, seq_len] mask of int64, which triggered the runtime error.

With these fixes in place, we now proceed with the training runs.
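For context, here is a minimal sketch of the kind of mask conversion involved. The helper name `build_sdpa_mask` is hypothetical and not part of this PR; the actual fix lives in model_ext.py, and the shapes below are illustrative only.

```python
import torch
import torch.nn.functional as F

def build_sdpa_mask(attention_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    """Convert an int64 [batch_size, seq_len] padding mask (1 = keep, 0 = pad)
    into an additive float mask broadcastable to
    [batch_size, n_heads, seq_len, seq_len] for scaled_dot_product_attention."""
    # [batch_size, seq_len] -> [batch_size, 1, 1, seq_len]
    mask = attention_mask[:, None, None, :].to(dtype)
    # 0.0 at attended positions, a large negative value at padded positions
    return (1.0 - mask) * torch.finfo(dtype).min

# Illustrative usage:
q = k = v = torch.randn(2, 8, 16, 64)            # [batch, n_heads, seq_len, head_dim]
attention_mask = torch.ones(2, 16, dtype=torch.int64)  # [batch, seq_len], e.g. from the tokenizer
attn_mask = build_sdpa_mask(attention_mask, q.dtype)
out = F.scaled_dot_product_attention(q, k, v, attn_mask=attn_mask)
```

Passing a boolean mask of shape [batch_size, 1, 1, seq_len] (True = attend) would also satisfy the broadcasting requirement; the key point is that the raw int64 [batch_size, seq_len] tensor cannot be passed through unchanged.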
