RAFT Recovery Mode for interruptions #410

Merged (7 commits) on May 4, 2024

@kaiwen129 (Contributor) commented May 4, 2024

Implemented a "safe"/recovery mode that periodically saves chunks into "checkpoint" datasets while also keeping track of the current chunk number. In the case of an interruption or crash, the script resumes at the saved chunk number. After all chunks have been processed, all checkpoint datasets are concatenated and saved as one final dataset.
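The checkpoint-and-resume flow described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code from this PR: `process_chunk`, `run_with_checkpoints`, the `state.json` file, the `checkpoint-*.json` naming, and the `save_every` and `fail_at` parameters are all hypothetical names chosen for the example, and JSON files stand in for the "checkpoint" datasets.

```python
import json
from pathlib import Path


def process_chunk(chunk):
    # Hypothetical stand-in for the real per-chunk generation step.
    return [{"chunk": chunk, "length": len(chunk)}]


def run_with_checkpoints(chunks, workdir, save_every=2, fail_at=None):
    """Process chunks, checkpointing every `save_every` chunks, and resume if state exists."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    state_file = workdir / "state.json"

    # Resume at the saved chunk number, or start from the beginning.
    start = 0
    if state_file.exists():
        start = json.loads(state_file.read_text())["next_chunk"]

    buffer = []
    last = len(chunks) - 1
    for i in range(start, len(chunks)):
        if fail_at is not None and i == fail_at:
            # Simulated crash, used only to demonstrate recovery.
            raise RuntimeError("simulated interruption")
        buffer.extend(process_chunk(chunks[i]))

        # Flush on fixed boundaries (absolute index) so a rerun rewrites the same files.
        if (i + 1) % save_every == 0 or i == last:
            checkpoint = workdir / f"checkpoint-{i + 1:06d}.json"
            checkpoint.write_text(json.dumps(buffer))
            # Record progress only after the checkpoint is on disk.
            state_file.write_text(json.dumps({"next_chunk": i + 1}))
            buffer = []

    # All chunks processed: concatenate the checkpoints into one final dataset.
    rows = []
    for checkpoint in sorted(workdir.glob("checkpoint-*.json")):
        rows.extend(json.loads(checkpoint.read_text()))
    (workdir / "final.json").write_text(json.dumps(rows))
    return rows
```

In this sketch, work done since the last checkpoint is lost on a crash and redone on the next run, so a smaller `save_every` trades extra disk writes for less repeated work.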

Added an argument allowing the user to choose whether to run RAFT in safe or fast mode (defaults to safe).
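For illustration, a mode switch of this kind could be exposed with `argparse` along these lines. The flag name `--mode` and its help text are assumptions made for this sketch; the PR description does not state the actual argument name used in raft.py.

```python
import argparse


def build_parser():
    # Hypothetical parser; the real script's argument name may differ.
    parser = argparse.ArgumentParser(description="RAFT dataset generation (sketch)")
    parser.add_argument(
        "--mode",
        choices=["safe", "fast"],
        default="safe",  # safe mode is the default, as described in this PR
        help="safe: checkpoint progress and resume after interruption; "
             "fast: skip checkpointing",
    )
    return parser
```

Calling `build_parser().parse_args([])` yields `mode == "safe"`, while `build_parser().parse_args(["--mode", "fast"])` selects fast mode.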

Close #394

@ShishirPatil ShishirPatil merged commit 624d371 into ShishirPatil:main May 4, 2024
devanshamin pushed a commit to devanshamin/gorilla that referenced this pull request Jul 9, 2024
Co-authored-by: Kaihao Wen <[email protected]>
aw632 pushed a commit to vinaybagade/gorilla that referenced this pull request Aug 22, 2024
Development

Successfully merging this pull request may close these issues.

RAFT Add support for resuming raft.py dataset generation in case of interruption