LSTM attention, Audio, AST
LSTM attention
DAGA
LRP is usually used for image processing
Arras et al.: LRP for LSTM
LRP for audio
CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings
Audio Feature Discovery with Convolutional Neural Networks
Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals
1-second signals
Relevance for each time-frequency bin (sum of magnitude- and phase-channel relevance)
Additive spectrograms
The point of the paper: going beyond visual comparison
Magnitude and phase: individual channels
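In code, the combination is just an element-wise sum over the two channel relevance maps (a minimal sketch; array names and shapes are assumptions):

```python
import numpy as np

r_magnitude = np.random.randn(257, 100)   # stand-in for magnitude-channel relevance (freq, time)
r_phase = np.random.randn(257, 100)       # stand-in for phase-channel relevance
r_tf = r_magnitude + r_phase              # total relevance per time-frequency bin
```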
LRP-αβ, LRP-ε; for the LSTM: LRP-ε with ε = 0.01
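A minimal numpy sketch of the LRP-ε rule for a single dense layer, using ε = 0.01 as noted above (bias terms omitted; function and variable names are assumptions):

```python
import numpy as np

def lrp_epsilon(a, W, R_out, eps=0.01):
    """Redistribute the output relevance R_out of one dense layer
    back to its inputs a, using the epsilon-stabilized LRP rule."""
    z = a @ W                    # pre-activations z_k = sum_j a_j * w_jk
    z = z + eps * np.sign(z)     # epsilon stabilizer keeps z away from zero
    s = R_out / z                # relevance per unit of pre-activation
    return a * (s @ W.T)         # R_j = a_j * sum_k w_jk * s_k
```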
In Arras et al.: only the last LSTM time step
Here: backward pass through all LSTM time steps
Attention: simply pick a fixed number of LSTM steps to feed into the Dense layer
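A rough PyTorch sketch of that idea: keep a fixed number of final LSTM steps and feed them to a Dense layer instead of learning attention weights (all sizes and names are assumptions):

```python
import torch
import torch.nn as nn

class FixedStepHead(nn.Module):
    """LSTM classifier head that flattens a fixed number of the
    last LSTM time steps into a single Dense (Linear) layer."""
    def __init__(self, n_feat=40, hidden=64, n_steps=8, n_classes=10):
        super().__init__()
        self.lstm = nn.LSTM(n_feat, hidden, batch_first=True)
        self.n_steps = n_steps
        self.fc = nn.Linear(n_steps * hidden, n_classes)

    def forward(self, x):                  # x: (batch, time, n_feat)
        h, _ = self.lstm(x)                # h: (batch, time, hidden)
        last = h[:, -self.n_steps:, :]     # fixed number of LSTM steps
        return self.fc(last.flatten(1))
```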
Auralization: inverse STFT will introduce artifacts.
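A sketch of such an auralization with librosa: weight the STFT by a relevance map and invert it (the map below is a random placeholder). The weighted magnitude/phase pairs no longer form a consistent STFT, which is where the artifacts come from:

```python
import numpy as np
import librosa

y, sr = librosa.load(librosa.ex("trumpet"))     # any mono signal works
S = librosa.stft(y)
relevance = np.random.rand(*S.shape)            # placeholder for a real relevance map
y_aural = librosa.istft(S * relevance, length=len(y))
```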
Reliable explanations
Higher order gradient calculations
[51] Singla, “Understanding impacts of high-order loss approximations and features in deep learning interpretation”
IG, LRP -- gradient methods with a modified gradient function
“Towards better understanding of gradient-based attribution methods for deep neural networks”
Few theoretical comparisons
They analyze sensitivity and compare against perturbation methods
The problem formulations differ slightly across methods
DeepLIFT: compares activations for the baseline and for x. The baseline inside the network: the value obtained by propagating the baseline from the previous layer
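For reference, a Captum DeepLIFT call with an explicit baseline on a toy model (shapes and the zero baseline are assumptions):

```python
import torch
from captum.attr import DeepLift

model = torch.nn.Sequential(torch.nn.Linear(8, 4), torch.nn.ReLU(),
                            torch.nn.Linear(4, 2))
x = torch.randn(3, 8)
# Attributions compare activations on x against activations obtained
# by propagating the baseline through the network.
attr = DeepLift(model).attribute(x, baselines=torch.zeros_like(x), target=0)
```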
Occlusion: the patch size affects the result
Occlusion: the full sample is still needed
Although one could compute chunk relevance * occlusion relevance
Occlusion: slow, too many forward passes (one for each individual pixel)
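A Captum occlusion sketch on a spectrogram-shaped input; the sliding_window_shapes and strides below are arbitrary choices (the patch-size sensitivity noted above), and every window position costs a full forward pass, which is the speed problem:

```python
import torch
from captum.attr import Occlusion

model = torch.nn.Sequential(torch.nn.Flatten(),
                            torch.nn.Linear(64 * 100, 10))
x = torch.randn(1, 1, 64, 100)                # (batch, channel, freq, time)
occ = Occlusion(model)
attr = occ.attribute(x, target=3,
                     sliding_window_shapes=(1, 8, 10),   # occluded patch size
                     strides=(1, 4, 5))
```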
Guided backprop, Grad-CAM, Deconv -- not universal (architecture-dependent)
Quality metrics
Completeness: Axiomatic attribution for deep networks.
Summation to delta: IG
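A quick numerical check of summation to delta with Captum's IG on a toy model (model and tolerance are assumptions): the attributions should sum to F(x) - F(baseline):

```python
import torch
from captum.attr import IntegratedGradients

model = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.Tanh(),
                            torch.nn.Linear(8, 1))
f = lambda x: model(x).squeeze(-1)            # one scalar output per sample
x, b = torch.randn(2, 8), torch.zeros(2, 8)
attr, delta = IntegratedGradients(f).attribute(
    x, baselines=b, n_steps=256, return_convergence_delta=True)
assert torch.allclose(attr.sum(dim=1), f(x) - f(b), atol=1e-2)
```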
Are transformers definitely ruled out?
Training transformers from scratch requires much more data than CNNs, because CNNs encode prior knowledge about the image domain, such as translational equivariance.
However, the high performance of ViT results from pre-training on a large dataset such as JFT-300M, and its dependence on large datasets is attributed to its low locality inductive bias.
No locality inductive bias
Rizal uses a TransformerEncoderLayer, not a full network: no pretraining
Conformer
RNNs are the de facto choice for ASR
RNNs model audio sequences well: A. Graves, “Sequence transduction with recurrent neural networks,” arXiv preprint arXiv:1211.3711, 2012.
Long distance interactions
High training efficiency
CNN ASR
J. Li, V. Lavrukhin, B. Ginsburg, R. Leary, O. Kuchaiev, J. M. Cohen, H. Nguyen, and R. T. Gadde, “Jasper: An end-to-end convolutional neural acoustic model,” arXiv preprint arXiv:1904.03288, 2019.
“QuartzNet: Deep automatic speech recognition with 1D time-channel separable convolutions”
“ContextNet: Improving convolutional neural networks for automatic speech recognition with global context”
“Deep convolutional neural networks for LVCSR”
“Convolutional neural networks for speech recognition”
Improving saliency guided training
Iteratively mask features with small gradients
Benchmarking also uses masking
Maximize agreement between masked and unmasked outputs
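A rough sketch of one such training step, masking with zeros for simplicity (the cited work substitutes masked values differently; model, x, y, and the masked fraction k are placeholders):

```python
import torch
import torch.nn.functional as F

def saliency_guided_loss(model, x, y, k=0.5):
    # Input-gradient magnitude serves as the per-feature saliency.
    x = x.clone().requires_grad_(True)
    grad, = torch.autograd.grad(F.cross_entropy(model(x), y), x)

    # Mask the fraction k of features with the SMALLEST |gradient|.
    flat = grad.abs().flatten(1)
    idx = flat.argsort(dim=1)[:, : int(k * flat.shape[1])]
    x_masked = x.detach().clone().flatten(1)
    x_masked.scatter_(1, idx, 0.0)
    x_masked = x_masked.view_as(x)

    # Keep masked and unmasked predictions in agreement (KL term)
    # while still fitting the labels on the unmasked input.
    out, out_masked = model(x), model(x_masked)
    kl = F.kl_div(F.log_softmax(out_masked, dim=-1),
                  F.softmax(out, dim=-1), reduction="batchmean")
    return F.cross_entropy(out, y) + kl
```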
Captum
IG uses forward_func(input) if no target is provided
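For example, with a forward function that returns one scalar per sample, attribute can be called without a target (toy model; names are assumptions):

```python
import torch
from captum.attr import IntegratedGradients

model = torch.nn.Linear(10, 1)
forward_func = lambda x: model(x).squeeze(-1)    # one scalar per sample
ig = IntegratedGradients(forward_func)
x = torch.randn(4, 10)
attr = ig.attribute(x, baselines=torch.zeros_like(x))   # no target needed
```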
grads = torch.autograd.grad(torch.unbind(outputs), inputs)
torch.autograd.grad: computes and returns the sum of gradients of outputs with respect to the inputs
https://pytorch.org/docs/stable/autograd.html
allow_unused
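A self-contained sketch of both points, the unbind trick and allow_unused, on toy tensors (illustrative, not Captum's actual code):

```python
import torch

x = torch.randn(4, 10, requires_grad=True)
w = torch.randn(10, 1)
outputs = (x @ w).squeeze(-1)        # shape (4,): one scalar per sample

# grad() returns the SUM of gradients of the unbound scalar outputs
# w.r.t. x; since sample i's output depends only on x[i], the sum
# recovers per-sample gradients.
(grads,) = torch.autograd.grad(torch.unbind(outputs), x, retain_graph=True)

# allow_unused=True: an input that never entered the graph yields
# None instead of raising a RuntimeError.
unused = torch.randn(3, requires_grad=True)
g_x, g_unused = torch.autograd.grad(torch.unbind(outputs), (x, unused),
                                    allow_unused=True)
assert g_unused is None
```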
Audio
ICASSP
Tampere
Sony
Hitachi
AST
DataParallel