[WIP] DL4J toSameDiff conversion method #495

Draft
rnett wants to merge 70 commits into master from rn_dl4j_to_samediff

Conversation

@rnett commented Jun 23, 2020

Adds a toSameDiff method (plus overloads) to MultiLayerNetwork and ComputationGraph, and adds the corresponding define* methods for layers, vertices, activations, and losses.
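
A rough usage sketch (only the method name toSameDiff comes from this PR; the no-argument overload and the surrounding setup are assumptions):

```java
import org.deeplearning4j.nn.conf.MultiLayerConfiguration;
import org.deeplearning4j.nn.conf.NeuralNetConfiguration;
import org.deeplearning4j.nn.conf.layers.OutputLayer;
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.nd4j.autodiff.samediff.SameDiff;
import org.nd4j.linalg.activations.Activation;
import org.nd4j.linalg.lossfunctions.LossFunctions;

public class ToSameDiffExample {
    public static void main(String[] args) {
        MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
                .list()
                .layer(new OutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
                        .nIn(4).nOut(3).activation(Activation.SOFTMAX).build())
                .build();
        MultiLayerNetwork network = new MultiLayerNetwork(conf);
        network.init();

        // Convert the initialized DL4J network to a SameDiff graph.
        // Assumed no-argument overload; the PR adds toSameDiff plus overloads.
        SameDiff sd = network.toSameDiff();
        System.out.println(sd.summary());
    }
}
```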

Need a SameDiff Mish function (a composition workaround is sketched below).
There is currently no support for the Truncate convolution mode in SameDiff.
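
Until a dedicated op exists, Mish can in principle be composed from existing ops, since mish(x) = x * tanh(softplus(x)). A minimal sketch; the helper name is ours:

```java
import org.nd4j.autodiff.samediff.SDVariable;
import org.nd4j.autodiff.samediff.SameDiff;

// Sketch: compose Mish from tanh and softplus while no fused op exists.
static SDVariable mish(SameDiff sd, SDVariable x) {
    // mish(x) = x * tanh(softplus(x))
    return x.mul(sd.math().tanh(sd.nn().softplus(x)));
}
```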

rectifiedTanh and rationalTanh were in SameDiff.math but not SameDiff.nn.

New SameDiff functions needed for optimization:

  • A ReLU that supports a leaky slope and a custom threshold at the same time (a composition sketch follows this list).
  • A non-weighted cross-entropy loss. The weightedCrossEntropyWithLogits javadoc says it supports null weights, but I get an exception. Same for BinaryCrossentropy.
  • For LossSparseMCXENT, a version of OneHot that takes depth as an SDVariable. The current version takes a double and should be passed input.shape[-1], which I can't get without offline shape inference.
  • 1D versions of the subsampling and upsampling ops.
  • A way to set depthMultiplier for depthwise convolutions.
  • Weight format on all convolution configurations.
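
A stopgap sketch for the first item, composed from comparison and arithmetic ops (the semantics below, alpha * x under the threshold, are an assumption about what the fused op should do):

```java
import org.nd4j.autodiff.samediff.SDVariable;

// Sketch: leaky ReLU with a custom threshold,
// out = x where x > threshold, alpha * x otherwise, blended via a 0/1 mask.
static SDVariable leakyReluWithThreshold(SDVariable x, double alpha, double threshold) {
    SDVariable mask = x.gt(threshold).castTo(x.dataType()); // 1.0 where x > threshold
    return x.mul(mask).add(x.mul(alpha).mul(mask.rsub(1.0)));
}
```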

Not being implemented in the first pass:

  • Losses
    • LossMultiLabel
    • OCNNLossFunction
    • LossMixtureDensity
  • Activations
    • ActivationMish
  • Layers
    • Mask Layer (no mask support yet)
    • GlobalPoolingLayer (no SameDiff op?)
    • MaskZeroLayer
    • TFOpLayer
    • Yolo2OutputLayer
    • CenterLossOutputLayer (should be done as a loss function)
    • OCNNOutputLayer maybe?
    • EmbeddingLayer (JVM crash issues)
    • EmbeddingSequenceLayer
    • Autoencoder and VAE
  • Vertices (mostly because there's no way to get the rank of an SDVariable as an int; an SDIndex.ellipses() would also solve this)
    • SubsetVertex
    • DuplicateToTimeSeriesVertex
  • Every dropout type except Dropout (needs SameDiff support).

Need to add support for custom ILossLayer.computeScore methods, e.g. CnnLossLayer.

What's the difference between FrozenLayerWithBackprop and FrozenLayer?

  • toSameDiff
    • MultiLayerNetwork
    • ComputationGraph
  • define*
    • Layers
    • Vertices
    • Activations
    • Losses
    • Dropout (and support).
  • Updater state
  • MNIST test
  • Add to layer tests
  • Add to loss gradient tests

Ryan Nett added 3 commits June 22, 2020 14:00
@rnett requested a review from AlexDBlack June 23, 2020 19:04
@rnett self-assigned this Jun 23, 2020
Ryan Nett added 5 commits June 23, 2020 12:12
@rnett removed the request for review from AlexDBlack June 23, 2020 22:11
Ryan Nett added 9 commits June 23, 2020 15:27
@AlexDBlack left a comment

Overall looks good so far. See comments though.

}

// layer
//TODO regularizations? No SameDiff support for per-layer/weight regularizers

Hm... it's only global ATM, true. Might be worth adding (just not in this PR). Another issue to be opened perhaps.

}

public static SDVariable batchAverage(@NonNull SDVariable loss) {
    return loss.sum().div(loss.shape().get(SDIndex.point(0)));
}

loss.shape().get(SDIndex.point(0)) -> sd.sizeAt(loss, 0)

@rnett (author) replied:

Can we add that to SDVariable (not this PR)? squeeze and expandDims too.
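
For reference, the helper with that suggestion applied (the getSameDiff() hop is how you'd reach sizeAt from the variable; a sketch, not necessarily the final form):

```java
public static SDVariable batchAverage(@NonNull SDVariable loss) {
    // Use sizeAt instead of indexing into the shape vector, per the review.
    SDVariable batchSize = loss.getSameDiff().sizeAt(loss, 0);
    return loss.sum().div(batchSize);
}
```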

Ryan Nett added 5 commits June 26, 2020 20:49
@AlexDBlack left a comment

Only halfway through the second review; I'll submit another review for the rest soon.

Main issues here are:
(a) We'll need to work out a way to do reshapes, permutes, etc of weights at the INDArray level (not SDVariable level) for performance and memory reasons.
(b) We'll need some sort of coverage checking. There are two models we can use for that:
https://github.com/eclipse/deeplearning4j/blob/master/deeplearning4j/deeplearning4j-core/src/test/java/org/deeplearning4j/nn/dtypes/DTypeTests.java
or
https://github.com/eclipse/deeplearning4j/blob/master/nd4j/nd4j-backends/nd4j-tests/src/test/java/org/nd4j/OpValidationSuite.java

Having the tests in the gradient checks is definitely good, and we don't want to needlessly write redundant tests...
That said, the DTypeTests approach allows us to assert that all layers/preprocessors/etc. are checked (and fail a test if not), which is definitely nice; i.e., it stops us introducing a bug if we add a new layer and forget to write/test the SameDiff conversion. A rough sketch of that pattern is below.
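
Roughly what that DTypeTests-style check could look like for conversion coverage (the class name, registration hook, and use of org.reflections are assumptions; DTypeTests uses a similar reflection scan):

```java
import java.lang.reflect.Modifier;
import java.util.HashSet;
import java.util.Set;
import org.deeplearning4j.nn.conf.layers.Layer;
import org.junit.AfterClass;
import org.reflections.Reflections;
import static org.junit.Assert.fail;

public class ToSameDiffCoverageTest { // hypothetical test class
    // Every conversion test registers the layer configurations it exercised.
    private static final Set<Class<?>> seen = new HashSet<>();

    public static void registerSeen(Layer layerConf) {
        seen.add(layerConf.getClass());
    }

    @AfterClass
    public static void assertAllLayersCovered() {
        // Scan for all concrete Layer configs and fail if any were never tested.
        Reflections r = new Reflections("org.deeplearning4j");
        for (Class<? extends Layer> c : r.getSubTypesOf(Layer.class)) {
            if (!Modifier.isAbstract(c.getModifiers()) && !seen.contains(c)) {
                fail("No toSameDiff conversion test for layer: " + c.getName());
            }
        }
    }
}
```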

}

testSameDiffActivations(model, network, input, true);
testSameDiffLoss(model, network, input, labels);

It would also be great to test fitting, i.e., that parameters are the same after each fit step, for say 3 steps. A sketch of such a check is below.
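
A sketch of that check, assuming the conversion keeps DL4J parameter names as SameDiff variable names and that the training config carries over (both assumptions):

```java
import java.util.Map;
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.nd4j.autodiff.samediff.SameDiff;
import org.nd4j.linalg.api.ndarray.INDArray;
import org.nd4j.linalg.dataset.MultiDataSet;
import org.nd4j.linalg.dataset.adapter.SingletonMultiDataSetIterator;
import static org.junit.Assert.assertEquals;

static void testSameDiffFitting(MultiLayerNetwork network, SameDiff sd,
                                INDArray input, INDArray labels) {
    for (int step = 0; step < 3; step++) {
        network.fit(input, labels); // DL4J side
        sd.fit(new SingletonMultiDataSetIterator(new MultiDataSet(
                new INDArray[]{input}, new INDArray[]{labels})), 1); // SameDiff side
        for (Map.Entry<String, INDArray> e : network.paramTable().entrySet()) {
            // assumes parameter names survive the conversion unchanged
            assertEquals("Param mismatch at step " + step + ": " + e.getKey(),
                    e.getValue(), sd.getVariable(e.getKey()).getArr());
        }
    }
}
```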

// out = out.add(bias);
//
// return doActivation(out);
throw new UnsupportedOperationException("Can't convert EmbeddingLayer to SameDiff");

Let's try to isolate this crash.
If it's the squeeze (bad shape) we can maybe work around it via .reshape(-1).castTo(DataType.INT64)

Also, int32 input should be fine for this:
https://github.com/eclipse/deeplearning4j/blob/master/libnd4j/include/ops/declarable/generic/transforms/gather.cpp#L88
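
Sketch of that workaround inside the (assumed) embedding-layer define method; variable names here are assumptions:

```java
import org.nd4j.autodiff.samediff.SDVariable;
import org.nd4j.linalg.api.buffer.DataType;

// Flatten and cast the index input instead of squeezing it:
SDVariable indices = input.reshape(-1).castTo(DataType.INT64); // int32 also fine per gather.cpp
SDVariable out = sd.gather(weights, indices, 0); // look up embedding rows
```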

Ryan Nett added 3 commits June 30, 2020 13:57
Ryan Nett added 30 commits July 2, 2020 15:17