[feat,refactor,fix] Major change: multitasking, See details #173

Merged: apsdehal merged 10 commits into v0.4 from multitasking on Sep 11, 2019

apsdehal

  • This PR has breaking API changes and will break a lot of things
    before v0.3.1
  • Upgrades for PyTorch 1.2.0
  • Add support for MultiTasking: multiple datasets can now be trained
    together
  • Add proper version for fastText
  • Remove the concept of tasks; datasets are now first-class citizens
  • Update the folder structure to reflect datasets as first-class
    citizens
  • Fixes for Distributed setup
  • Fixes Advanced MultiTasking Support #160
  • Fixes [enhancement] Upgrade to PyTorch 1.2.0 #76
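
To illustrate what training multiple datasets together can look like, here is a minimal sketch. It is not the implementation in this PR: the function name `iterate_multitask`, the size-proportional sampling rule, and the toy datasets are all assumptions made for illustration.

```python
import random
from typing import Dict, Iterator, List, Sequence, Tuple


def iterate_multitask(
    datasets: Dict[str, Sequence],
    batch_size: int,
    num_batches: int,
    seed: int = 0,
) -> Iterator[Tuple[str, List]]:
    """Yield (dataset_name, batch) pairs, choosing one dataset per batch.

    Assumption: a dataset is picked with probability proportional to its
    size. The PR may use a different policy.
    """
    rng = random.Random(seed)
    names = list(datasets)
    weights = [len(datasets[name]) for name in names]
    for _ in range(num_batches):
        # Pick which dataset this batch comes from.
        name = rng.choices(names, weights=weights, k=1)[0]
        data = datasets[name]
        # Draw batch_size samples (with replacement) from that dataset.
        batch = [data[rng.randrange(len(data))] for _ in range(batch_size)]
        yield name, batch


# Toy stand-ins for real datasets; the contents are placeholders.
toy_datasets = {"vqa2": list(range(1000)), "textvqa": list(range(100))}
for dataset_name, batch in iterate_multitask(toy_datasets, batch_size=4, num_batches=3):
    print(dataset_name, batch)
```

Each batch comes from a single dataset, so a trainer built on this pattern can route the batch to that dataset's own loss and metrics.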

@apsdehal requested review from vedanuj and abhiskk August 27, 2019 23:17
@facebook-github-bot added the "CLA Signed" label (Do not delete this pull request or issue due to inactivity.) Aug 27, 2019
@apsdehal changed the title from "[feat,refactor,bug] Major change: multitasking, See details" to "[feat,refactor,bug] Major change: multitasking, see details" Aug 27, 2019
@apsdehal changed the title from "[feat,refactor,bug] Major change: multitasking, see details" to "[feat,refactor,fix] Major change: multitasking, See details" Aug 27, 2019

@vedanuj left a comment

Overall looks good. A few nits. Please test thoroughly before merging, as it has a lot of backward-compatibility (BC) breaking changes.

vqa2:
  image_features:
    train:
    - /private/home/asg/datasets/COCO/detectron_fix_100/fc6/train_val_2014,coco/resnet152/train_val_2014

Remove /private/home/asg/
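
One way to keep user-specific prefixes out of configs is to store relative paths and resolve them against a configurable root at load time. The sketch below assumes a hypothetical helper `resolve_feature_paths` and a hypothetical `data_root` setting; how this project actually resolves feature paths is not shown in this review.

```python
import posixpath
from typing import List


def resolve_feature_paths(entry: str, data_root: str) -> List[str]:
    """Split a comma-separated feature entry and anchor relative paths.

    Hypothetical helper: absolute paths are kept as-is, relative paths are
    joined onto data_root.
    """
    resolved = []
    for path in entry.split(","):
        path = path.strip()
        if not posixpath.isabs(path):
            path = posixpath.join(data_root, path)
        resolved.append(path)
    return resolved


# The config entry with the user-specific prefix removed.
entry = "datasets/COCO/detectron_fix_100/fc6/train_val_2014,coco/resnet152/train_val_2014"
for feature_path in resolve_feature_paths(entry, data_root="/data"):
    print(feature_path)
```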

vqa2:
  image_features:
    train:
    - /private/home/asg/datasets/COCO/detectron_fix_100/fc6/train_val_2014,coco/resnet152/train_val_2014

Same as above.

        self._load_imdb(imdb_path)

    def _load_imdb(self, imdb_path):
        if imdb_path.endswith(".npy"):
            self._load_npy(imdb_path)
        elif imdb_path.endswith(".jsonl"):
            self._load_jsonl(imdb_path)
        elif imdb_path.contains("visdial") or imdb_path.contains("visual_dialog"):

What is the difference between visdial and visual_dialog?
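
A side note on the quoted branch: Python's `str` has no `contains` method, so that `elif` would raise `AttributeError` if it were reached. Substring tests use the `in` operator. The sketch below shows the same dispatch written that way. The function name `pick_imdb_loader`, the returned labels, and the final `ValueError` are assumptions for illustration, not code from this PR.

```python
def pick_imdb_loader(imdb_path: str) -> str:
    """Return a label for the loader the quoted branch logic would select.

    Hypothetical stand-in for the method under review: it returns a label
    instead of calling the loader methods.
    """
    if imdb_path.endswith(".npy"):
        return "npy"
    if imdb_path.endswith(".jsonl"):
        return "jsonl"
    # Substring checks use `in`; str has no .contains() method.
    if any(marker in imdb_path for marker in ("visdial", "visual_dialog")):
        return "visdial"
    # Assumption: unknown formats are rejected. The original fallback is not shown.
    raise ValueError(f"Unknown IMDB type: {imdb_path}")


print(pick_imdb_loader("imdb/vqa2/train.npy"))
print(pick_imdb_loader("imdb/visual_dialog/train.json"))
```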

@@ -57,7 +58,7 @@ def _init_process_group(self):
             raise RuntimeError(
                 "Unable to initialize process group: NCCL is not available"
             )
-        torch.distributed.init_process_group(backend="nccl")
+        torch.distributed.init_process_group(backend="nccl", init_method="env://")

The default value of init_method is env://. Do we need to specify it explicitly?
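
For context, the PyTorch documentation describes `env://` as the default when neither `init_method` nor `store` is passed, so the explicit argument mainly documents intent. With `env://`, the rendezvous settings are read from environment variables. The sketch below checks for them before initializing. The helper names `missing_env_vars` and `init_distributed` are hypothetical, and the list of variables assumes `rank` and `world_size` are not passed as arguments.

```python
import os
from typing import List, Mapping, Optional

# Variables documented for env:// initialization when rank and world_size
# are not passed to init_process_group directly.
REQUIRED_ENV_VARS = ("MASTER_ADDR", "MASTER_PORT", "RANK", "WORLD_SIZE")


def missing_env_vars(environ: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the env:// variables that are not set (hypothetical helper)."""
    if environ is None:
        environ = os.environ
    return [name for name in REQUIRED_ENV_VARS if name not in environ]


def init_distributed(backend: str = "nccl") -> None:
    """Initialize the default process group via env:// (illustrative sketch)."""
    # Imported lazily so the helper above works without PyTorch installed.
    import torch.distributed as dist

    if backend == "nccl" and not dist.is_nccl_available():
        raise RuntimeError("Unable to initialize process group: NCCL is not available")
    missing = missing_env_vars()
    if missing:
        raise RuntimeError(f"env:// initialization needs: {', '.join(missing)}")
    # Explicit init_method; equivalent to the default when no store is given.
    dist.init_process_group(backend=backend, init_method="env://")


print(missing_env_vars({"MASTER_ADDR": "localhost", "MASTER_PORT": "29500"}))
```

Launchers that spawn one process per GPU typically export these variables for each worker, which is why the `env://` method pairs naturally with them.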

import numpy as np
from torch.utils.data import Dataset
from torch.utils.data import DataLoader
# from torch.utils.data.distributed import DistributedSampler

Remove this commented-out import.

@apsdehal merged commit 926d3b0 into v0.4 Sep 11, 2019
@apsdehal deleted the multitasking branch September 11, 2019 01:30
apsdehal added a commit that referenced this pull request May 8, 2020
* [feat,refactor,bug] Major change: multitasking, See details for the commit in description

- This PR has breaking changes for API and will break a lot of things
before v0.3.1
- Upgrades for 1.2.0
- Add support for MultiTasking, multiple datasets can be trained
together now
- Add proper version for fastText
- Remove concept of tasks, datasets are first class citizens
- Update the folder structure to reflect datasets as first class
citizens
- Fixes for Distributed setup

* [fix] Remove import for single dataset

* [fix] Fix metrics name in configs

* [fix] Fix values for tests due to changes in PyTorch 1.2

* [fix] Remove print statement in test

* [fix] Address comments

* [fix] Pythia train and val configuration

* [fix] TextVQA config

* Update multi_dataset.py

* Update base_trainer.py
facebook-github-bot pushed a commit that referenced this pull request May 8, 2021
Summary:
Pull Request resolved: fairinternal/mmf-internal#173

* adds a feature in download.py to allow it to work for manifold
* onboards clip_processor to work with all three config below:
  * local on disk file
  * manifold file
  * http file

Reviewed By: vedanuj

Differential Revision: D27760358

fbshipit-source-id: 1b7b8eff09a21e8afc48971d69d46df18a8ced6b