Skip to content

About truncated normal distribution based weight initialization #1284

Answered by rwightman
developer0hye asked this question in Q&A
Discussion options

You must be logged in to vote

@developer0hye moving to discussions, it was to attempt to match initialization for some networks implemented in JAX and Tensorflow (as that is a more common default layer init there)

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@developer0hye
Comment options

Answer selected by developer0hye
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1283 on May 28, 2022 22:04.