[feat,refactor,fix] Major change: multitasking, See details #173

apsdehal · 2019-08-27T23:17:32Z

This PR has breaking changes for API and will break a lot of things
before v0.3.1
Upgrades for 1.2.0
Add support for MultiTasking, multiple datasets can be trained
together now
Add proper version for fastText
Remove concept of tasks, datasets are first class citizens
Update the folder structure to reflect datasets as first class
citizens
Fixes for Distributed setup
Fixes Advanced MultiTasking Support #160
Fixes [enhancement] Upgrade to PyTorch 1.2.0 #76

…ommit in description - This PR has breaking changes for API and will break a lot of things before v0.3.1 - Upgrades for 1.2.0 - Add support for MultiTasking, multiple datasets can be trained together now - Add proper version for fastText - Remove concept of tasks, datasets are first class citizens - Update the folder structure to reflect datasets as first class citizens - Fixes for Distributed setup

vedanuj

Overall looks good. Few nits. Please test thoroughly before merging as it has lot of BC.

vedanuj · 2019-08-29T01:33:41Z

configs/vqa/vqa2/pythia.yml

+      vqa2:
+        image_features:
+          train:
+          - /private/home/asg/datasets/COCO/detectron_fix_100/fc6/train_val_2014,coco/resnet152/train_val_2014


Remove /private/home/asg/

vedanuj · 2019-08-29T01:34:27Z

configs/vqa/vqa2/pythia_train_and_val.yml

+  vqa2:
+    image_features:
+      train:
+      - /private/home/asg/datasets/COCO/detectron_fix_100/fc6/train_val_2014,coco/resnet152/train_val_2014


Same as above.

vedanuj · 2019-08-29T01:38:12Z

pythia/datasets/image_database.py

        self._load_imdb(imdb_path)

    def _load_imdb(self, imdb_path):
        if imdb_path.endswith(".npy"):
            self._load_npy(imdb_path)
        elif imdb_path.endswith(".jsonl"):
            self._load_jsonl(imdb_path)
+        elif imdb_path.contains("visdial") or imdb_path.contains("visual_dialog"):


What is the difference between visdial and visual_dialog?

vedanuj · 2019-08-29T01:42:27Z

pythia/trainers/base_trainer.py

@@ -57,7 +58,7 @@ def _init_process_group(self):
                raise RuntimeError(
                    "Unable to initialize process group: NCCL is not available"
                )
-            torch.distributed.init_process_group(backend="nccl")
+            torch.distributed.init_process_group(backend="nccl", init_method="env://")


init_method default value is env://. Do we need to specify explicitly?

vedanuj · 2019-08-29T01:47:59Z

pythia/datasets/multi_dataset.py

+import numpy as np
+from torch.utils.data import Dataset
+from torch.utils.data import DataLoader
+# from torch.utils.data.distributed import DistributedSampler


* [feat,refactor,bug] Major change: multitasking, See details for the commit in description - This PR has breaking changes for API and will break a lot of things before v0.3.1 - Upgrades for 1.2.0 - Add support for MultiTasking, multiple datasets can be trained together now - Add proper version for fastText - Remove concept of tasks, datasets are first class citizens - Update the folder structure to reflect datasets as first class citizens - Fixes for Distributed setup * [fix] Remove import for single dataset * [fix] Fix metrics name in configs * [fix] Fix values for tests due to changes in PyTorch 1.2 * [fix] Remove print statement in test * [fix] Address comments * [fix] Pythia train and val configuration * [fix] TextVQA config * Update multi_dataset.py * Update base_trainer.py

Summary: Pull Request resolved: fairinternal/mmf-internal#173 * adds a feature in download.py to allow it to work for manifold * onboards clip_processor to work with all three config below: * local on disk file * manifold file * http file Reviewed By: vedanuj Differential Revision: D27760358 fbshipit-source-id: 1b7b8eff09a21e8afc48971d69d46df18a8ced6b

apsdehal requested review from vedanuj and abhiskk August 27, 2019 23:17

facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Aug 27, 2019

[fix] Remove import for single dataset

74c89f6

apsdehal changed the title ~~[feat,refactor,bug] Major change: multitasking, See details~~ [feat,refactor,bug] Major change: multitasking, see details Aug 27, 2019

apsdehal changed the title ~~[feat,refactor,bug] Major change: multitasking, see details~~ [feat,refactor,fix] Major change: multitasking, See details Aug 27, 2019

apsdehal added 3 commits August 27, 2019 16:27

[fix] Fix metrics name in configs

22e9c5c

[fix] Fix values for tests due to changes in PyTorch 1.2

f5e83a8

[fix] Remove print statement in test

c63acd4

vedanuj approved these changes Aug 29, 2019

View reviewed changes

apsdehal added 5 commits September 10, 2019 18:20

[fix] Address comments

f603b2f

[fix] Pythia train and val configuration

659ae9d

[fix] TextVQA config

5efd487

Update multi_dataset.py

e183627

Update base_trainer.py

716596f

apsdehal merged commit 926d3b0 into v0.4 Sep 11, 2019

apsdehal deleted the multitasking branch September 11, 2019 01:30

ronghanghu mentioned this pull request Jan 17, 2020

[feat] add M4C model for TextVQA #213

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feat,refactor,fix] Major change: multitasking, See details #173

[feat,refactor,fix] Major change: multitasking, See details #173

apsdehal commented Aug 27, 2019

vedanuj left a comment

vedanuj Aug 29, 2019

vedanuj Aug 29, 2019

vedanuj Aug 29, 2019

vedanuj Aug 29, 2019

vedanuj Aug 29, 2019

[feat,refactor,fix] Major change: multitasking, See details #173

[feat,refactor,fix] Major change: multitasking, See details #173

Conversation

apsdehal commented Aug 27, 2019

vedanuj left a comment

Choose a reason for hiding this comment

vedanuj Aug 29, 2019

Choose a reason for hiding this comment

vedanuj Aug 29, 2019

Choose a reason for hiding this comment

vedanuj Aug 29, 2019

Choose a reason for hiding this comment

vedanuj Aug 29, 2019

Choose a reason for hiding this comment

vedanuj Aug 29, 2019

Choose a reason for hiding this comment