Batch normalization provably avoids ranks collapse for randomly initialised deep networks

NeurIPS 2020