Training DNNs (Aurélien Géron's book)
Problems
Vanishing gradients/Exploding gradients
Solutions
Glorot and He Initialization
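A minimal Keras sketch of both schemes (assuming TensorFlow 2.x; the layer sizes are arbitrary). Glorot initialization is the default for Dense layers; He initialization is selected by name:

```python
from tensorflow import keras

# Glorot (Xavier) initialization is the Keras default for Dense layers.
glorot_layer = keras.layers.Dense(100, activation="tanh")

# He initialization, the usual choice for ReLU and its variants.
he_layer = keras.layers.Dense(100, activation="relu",
                              kernel_initializer="he_normal")
```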
Nonsaturating Activation Functions
ReLU (rectified linear unit)
Leaky ReLU
ELU (exponential linear unit)
SELU (scaled exponential linear unit)
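A sketch wiring these activations into a small Keras model (TensorFlow 2.x assumed; in newer Keras releases LeakyReLU's argument is named negative_slope rather than alpha). Mixing them in one model is only for illustration; SELU's self-normalization assumes a plain stack of SELU layers:

```python
from tensorflow import keras

model = keras.models.Sequential([
    keras.layers.Flatten(input_shape=[28, 28]),
    # Leaky ReLU is added as its own layer after a linear Dense layer.
    keras.layers.Dense(300, kernel_initializer="he_normal"),
    keras.layers.LeakyReLU(alpha=0.2),
    # ELU can be requested by name.
    keras.layers.Dense(100, activation="elu", kernel_initializer="he_normal"),
    # SELU self-normalizes, provided the weights use LeCun initialization.
    keras.layers.Dense(100, activation="selu",
                       kernel_initializer="lecun_normal"),
    keras.layers.Dense(10, activation="softmax"),
])
```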
Batch Normalization
momentum hyperparameter
axis hyperparameter
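A sketch showing where the two hyperparameters appear (TensorFlow 2.x assumed): momentum controls the decay of the moving mean/variance statistics, and axis selects which axis is normalized (the last, i.e. the features axis, by default):

```python
from tensorflow import keras

model = keras.models.Sequential([
    keras.layers.Flatten(input_shape=[28, 28]),
    keras.layers.BatchNormalization(momentum=0.99, axis=-1),
    keras.layers.Dense(300, activation="relu"),
    keras.layers.BatchNormalization(),  # defaults: momentum=0.99, axis=-1
    keras.layers.Dense(10, activation="softmax"),
])
```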
Gradient Clipping
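A sketch of gradient clipping via the optimizer (TensorFlow 2.x assumed; the tiny model is just a placeholder):

```python
from tensorflow import keras

model = keras.models.Sequential([keras.Input(shape=[8]),
                                 keras.layers.Dense(1)])

# Clip every gradient component into [-1.0, 1.0]:
optimizer = keras.optimizers.SGD(clipvalue=1.0)
# Alternatively, clip the whole gradient vector by its L2 norm,
# which preserves the gradient's direction:
# optimizer = keras.optimizers.SGD(clipnorm=1.0)

model.compile(loss="mse", optimizer=optimizer)
```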
Lack of training data
Solutions
Reusing Pretrained Layers = Transfer Learning (see the sketch below)
Unsupervised Pretraining
Pretraining on an Auxiliary Task
Self-supervised learning
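A transfer-learning sketch for the first item above ("model_A.h5" and the task-B output layer are hypothetical; note the reused layers are shared with model_A unless you clone the model first):

```python
from tensorflow import keras

# Load a model pretrained on a related task (hypothetical file name).
model_A = keras.models.load_model("model_A.h5")

# Reuse every layer except the output layer, then add a new head for task B.
model_B = keras.models.Sequential(model_A.layers[:-1])
model_B.add(keras.layers.Dense(1, activation="sigmoid"))

# Freeze the reused layers at first so the new head's large initial
# gradients do not wreck the pretrained weights.
for layer in model_B.layers[:-1]:
    layer.trainable = False
model_B.compile(loss="binary_crossentropy", optimizer="sgd")
```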
Slow training
Applying a good initialization strategy for the connection weights
Using good activation functions
Batch Normalization
Faster optimizers (see the sketch after this list)
Learning rate scheduling
There are many strategies for reducing the learning rate during training, such as power, exponential, piecewise constant, and performance scheduling.
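A sketch combining a faster optimizer with exponential learning-rate decay (TensorFlow 2.x assumed; the constants are illustrative):

```python
from tensorflow import keras

# Exponential decay: lr = 0.01 * 0.1 ** (step / 10_000).
lr_schedule = keras.optimizers.schedules.ExponentialDecay(
    initial_learning_rate=0.01,
    decay_steps=10_000,
    decay_rate=0.1,
)
# Adam is one of the "faster optimizers"; RMSProp and Nadam are similar.
optimizer = keras.optimizers.Adam(learning_rate=lr_schedule)
```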
Overfitting
Regularization
Early stopping
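A sketch of early stopping as a Keras callback (the fit() call is commented out because X_train and the other names are hypothetical):

```python
from tensorflow import keras

# Interrupt training once the validation loss has not improved for
# 10 epochs, and roll back to the best weights seen so far.
early_stopping = keras.callbacks.EarlyStopping(patience=10,
                                               restore_best_weights=True)
# history = model.fit(X_train, y_train, epochs=100,
#                     validation_data=(X_valid, y_valid),
#                     callbacks=[early_stopping])
```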
Batch Normalization
l1 and l2 Regularization
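A sketch of l2 regularization on a single layer (the 0.01 factor is illustrative):

```python
from tensorflow import keras

layer = keras.layers.Dense(100, activation="relu",
                           kernel_initializer="he_normal",
                           kernel_regularizer=keras.regularizers.l2(0.01))
# For l1 (which pushes weights toward sparsity), or both at once:
# keras.regularizers.l1(0.01)
# keras.regularizers.l1_l2(l1=0.01, l2=0.01)
```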
Dropout
Dropout rate 'p'
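A dropout sketch: rate is the drop probability p (0.2 here is illustrative), and the layers are active only during training:

```python
from tensorflow import keras

model = keras.models.Sequential([
    keras.layers.Flatten(input_shape=[28, 28]),
    keras.layers.Dropout(rate=0.2),  # drop each input with probability p=0.2
    keras.layers.Dense(300, activation="relu"),
    keras.layers.Dropout(rate=0.2),
    keras.layers.Dense(10, activation="softmax"),
])
```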
Max-norm Regularization
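A max-norm sketch: each neuron's incoming weight vector is rescaled after each training step if its L2 norm exceeds the limit (1.0 here is illustrative):

```python
from tensorflow import keras

layer = keras.layers.Dense(100, activation="relu",
                           kernel_initializer="he_normal",
                           kernel_constraint=keras.constraints.max_norm(1.0))
```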
Normalize the input features
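A sketch of input standardization using training-set statistics only (the arrays are synthetic stand-ins):

```python
import numpy as np

rng = np.random.default_rng(42)
X_train = rng.normal(10.0, 3.0, size=(1000, 8))  # stand-in training data
X_valid = rng.normal(10.0, 3.0, size=(200, 8))

# Fit the scaling on the training set, then reuse it everywhere else.
mean, std = X_train.mean(axis=0), X_train.std(axis=0)
X_train_scaled = (X_train - mean) / std
X_valid_scaled = (X_valid - mean) / std
```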