L11.5 Weight Initialization -- Why Do We Care?

Поделиться
HTML-код
  • Опубликовано: 18 дек 2024

Комментарии • 2

  • @haochengzhao7483
    @haochengzhao7483 2 года назад

    Even if we use BN, is initialization still important?

    • @SebastianRaschka
      @SebastianRaschka  2 года назад

      I think it is far less important then. But it's hard to say exactly because in practice most people use the PyTorch defaults (which is currently He initialization I think)