Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add alpha scaling to lora #8483

Merged
merged 29 commits into from
Feb 23, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
1489ac7
coldfix (#8412)
Jorjeous Feb 14, 2024
5c70924
Fixed errors in the CTM gen functions (#8416) (#8420)
github-actions[bot] Feb 14, 2024
859b8f2
Add change_vocabulary and save_tokenizers() support to Multitask ASR …
github-actions[bot] Feb 14, 2024
b951d64
fix path location and branch (#8314)
github-actions[bot] Feb 15, 2024
d43d29f
Add TP comm overlap knobs to AutocastTransformerLayer (#8290)
minitu Feb 15, 2024
7ae94a0
add deallocate pipeline output optimization (#8279) (#8318)
github-actions[bot] Feb 16, 2024
344f655
remove assertion (#8302) (#8321)
github-actions[bot] Feb 16, 2024
9a0c79d
Keep max_seqlen and cu_seqlens_argmin for later micro-batches when PP…
github-actions[bot] Feb 16, 2024
a28e153
Enable megatron core loggers for GPT pretraining (#8354) (#8384)
github-actions[bot] Feb 16, 2024
e89f8be
Fix dreambooth data sampler issue (#8400) (#8413)
github-actions[bot] Feb 16, 2024
8575c31
add ensemble decoding fix (#8427) (#8433)
github-actions[bot] Feb 16, 2024
d440d7d
NeVA Tutorial Notebook (#8217)
PannuMuthu Feb 16, 2024
cca9141
mcore customization doc minor fix (#8421) (#8437)
github-actions[bot] Feb 16, 2024
7110f68
Add `loop_labels` algorithm for TDT greedy decoding (#8215)
artbataev Feb 16, 2024
84c6c5a
Add dist ckpt support for regular optimizers (#7749) (#8293)
github-actions[bot] Feb 16, 2024
8d3ddb9
Multimodal r1.23.0 bug fix (#8315) (#8339)
github-actions[bot] Feb 16, 2024
9231b83
mcore ds fix (#8283) (#8385)
github-actions[bot] Feb 16, 2024
1e119a9
MCore dataset compatibility for tokenizers (#8390) (#8397)
github-actions[bot] Feb 16, 2024
ef17d91
Canary: inference tokenization improvements; preserving custom keys w…
pzelasko Feb 16, 2024
1e1cf63
add sbert to IR (#8445)
JRD971000 Feb 17, 2024
150784c
Update readme (#8440)
ericharper Feb 17, 2024
5406d91
NeMo-Mistral to HF converter bugfix. (#8353) (#8442)
github-actions[bot] Feb 20, 2024
b810c8c
Fixing mcore bert for TP, PP and SP (#8336) (#8443)
github-actions[bot] Feb 20, 2024
76b9eea
Add LoRA support to all linear layers (#7988)
ertkonuk Feb 21, 2024
223456a
Add Neva Template for NV-DPO Models (#8358)
HuiyingLi Feb 21, 2024
25e02f4
Rebase scaling alpha
michal2409 Feb 22, 2024
6d3ad7b
default for alpha
arendu Jan 25, 2024
1ec1892
Rebase scaling alpha
pre-commit-ci[bot] Jan 25, 2024
1912933
Merge branch 'adithyare/lora_alpha' into mfutrega/lora_alpha
michal2409 Feb 23, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@
# See the License for the specific language governing permissions and
# limitations under the License.

ARG BASE_IMAGE=nvcr.io/nvidia/pytorch:23.12-py3
ARG BASE_IMAGE=nvcr.io/nvidia/pytorch:24.01-py3

# build an image that includes only the nemo dependencies, ensures that dependencies
# are included first for optimal caching, and useful for building a development
Expand Down Expand Up @@ -66,19 +66,19 @@ WORKDIR /workspace/
# We leave it here in case we need to work off of a specific commit in main
RUN git clone https://github.com/NVIDIA/Megatron-LM.git && \
cd Megatron-LM && \
git checkout 27cbe46714a50c43ed290f1b1472db8d2780c55c && \
git checkout 240a8ef7a21df201e47b5b2ae33cc5f4c5486849 && \
pip install .

# Performance optimizations for distributed optimizer: https://github.com/NVIDIA/apex/pull/1771
RUN git clone https://github.com/NVIDIA/apex.git && \
cd apex && \
git checkout b496d85fb88a801d8e680872a12822de310951fd && \
git checkout f058162b215791b15507bb542f22ccfde49c872d && \
pip install -v --no-build-isolation --disable-pip-version-check --no-cache-dir --config-settings "--build-option=--cpp_ext --cuda_ext --fast_layer_norm --distributed_adam --deprecated_fused_adam" ./

# Transformer Engine 1.2.0
RUN git clone https://github.com/NVIDIA/TransformerEngine.git && \
cd TransformerEngine && \
git fetch origin 4f9662fbe621671f5f905e772fc1138953af77f6 && \
git fetch origin da30634a6c9ccdbb6c587b6c93b1860e4b038204 && \
git checkout FETCH_HEAD && \
git submodule init && git submodule update && \
NVTE_FRAMEWORK=pytorch NVTE_WITH_USERBUFFERS=1 MPI_HOME=/usr/local/mpi pip install .
Expand Down
Loading
Loading