[Scala] Training differences between macOS and Ubuntu? #12553

mariussoutier · 2018-09-13T16:36:24Z

Tested this with both MXNet 1.2.1 and 1.3.0 Staging. I have the identical code and dataset to train a MLP and a CNN on image data. On my Mac (MBP Late 2013) it converges easily within 5-10 epochs to an accuracy of 80% using learning rate of 0.00001. On my Dell laptop, using Ubuntu 18 and both with and without GPU, it essentially doesn't converge at all (accuracy around 1,6%).

How is this possible?

The text was updated successfully, but these errors were encountered:

kalyc · 2018-09-13T17:09:51Z

Thanks for submitting this issue @mariussoutier
@mxnet-label-bot[Scala, OSX, Ubuntu]

lanking520 · 2018-09-13T19:59:21Z

Hi @mariussoutier thanks for your issue, could you please provide a minimum reproducible code? This looks weird to me too.

gigasquid · 2018-09-13T20:18:41Z

@mariussoutier A Clojure user had a similar problem. Maybe something in this issue can diagnose (The Clojure package has since joined the main project) gigasquid/clojure-mxnet#5

A final solution wasn't found for his laptop but he could run on the 18.04 server

mariussoutier · 2018-09-13T20:35:00Z

Interesting. I've also noticed that MXNet-CPU is slower on my Ubuntu laptop than on my MacBook. The MacBook is from 2013 and the Dell from 2017, so has newer CPU, twice the RAM, and way faster SSD.

I just don't know where I should investigate, I'm pretty new to Ubuntu and it already took me a day to set this all up. Would building MXNet from source on the laptop help?

gigasquid · 2018-09-13T20:44:36Z

I would try using the Scala jars and comparing your dependencies against these Clojure docker files
for 18.04
https://hub.docker.com/r/magnetcoop/mxnet-clj-cpu/
https://hub.docker.com/r/magnetcoop/mxnet-clj-gpu/

The Dockerfiles in this project's ci are 16.04 so might not be as relevant to you

gigasquid · 2018-09-15T12:26:41Z

I found this and thought it might be helpful https://mc.ai/install-mxnet-on-ubuntu-18-04/
especially the part about gcc7 vs gcc6

mariussoutier · 2018-09-15T16:54:32Z

I thought it was just the Scala API that was problematic. pip install mxnet-cu90mkl installs fine, I have to rewrite my code in Python to verify this assumption.

lanking520 · 2018-09-15T17:42:06Z

@mariussoutier maybe this can be helpful: #11303. We will try to bring instruction on 18.04 since you are not the only one who asked for this... About the performance issue, could you please provide some code that can reproduce it? I will test to see what the issues came from

mariussoutier · 2018-09-15T18:27:24Z

@lanking520 Ah thanks, then I'll stop trying to compile it on Ubuntu. About the training performance, I'm seeing this with the MLP from the tutorials.

piyushghai · 2019-01-14T23:59:52Z

@mariussoutier Are you seeing differences in Python API v/s Scala API as well in terms of training ?

mariussoutier · 2019-01-15T14:55:26Z

@piyushghai I gave up trying to train in Scala, am only using it for inference now.

lanking520 · 2019-01-28T20:12:32Z

@mariussoutier Currently we do support 18.04 now since we successfully get it static linked. Please feel free to try it again. Close this issue for now.

marcoabreu added OSX Scala Ubuntu labels Sep 13, 2018

ddavydenko mentioned this issue Sep 25, 2018

Scala: DataDesc IllegalArgumentException with simple example #12409

Closed

lanking520 closed this as completed Jan 28, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Scala] Training differences between macOS and Ubuntu? #12553

[Scala] Training differences between macOS and Ubuntu? #12553

mariussoutier commented Sep 13, 2018

kalyc commented Sep 13, 2018

lanking520 commented Sep 13, 2018

gigasquid commented Sep 13, 2018 •

edited

Loading

mariussoutier commented Sep 13, 2018

gigasquid commented Sep 13, 2018 •

edited

Loading

gigasquid commented Sep 15, 2018 •

edited

Loading

mariussoutier commented Sep 15, 2018

lanking520 commented Sep 15, 2018 •

edited

Loading

mariussoutier commented Sep 15, 2018

piyushghai commented Jan 14, 2019

mariussoutier commented Jan 15, 2019

lanking520 commented Jan 28, 2019

[Scala] Training differences between macOS and Ubuntu? #12553

[Scala] Training differences between macOS and Ubuntu? #12553

Comments

mariussoutier commented Sep 13, 2018

kalyc commented Sep 13, 2018

lanking520 commented Sep 13, 2018

gigasquid commented Sep 13, 2018 • edited Loading

mariussoutier commented Sep 13, 2018

gigasquid commented Sep 13, 2018 • edited Loading

gigasquid commented Sep 15, 2018 • edited Loading

mariussoutier commented Sep 15, 2018

lanking520 commented Sep 15, 2018 • edited Loading

mariussoutier commented Sep 15, 2018

piyushghai commented Jan 14, 2019

mariussoutier commented Jan 15, 2019

lanking520 commented Jan 28, 2019

gigasquid commented Sep 13, 2018 •

edited

Loading

gigasquid commented Sep 13, 2018 •

edited

Loading

gigasquid commented Sep 15, 2018 •

edited

Loading

lanking520 commented Sep 15, 2018 •

edited

Loading