Thank you very much for the update to support llama 3 model!
I noticed that `config.initializer_range` defaults to 0.02, and that `jax.nn.initializers.normal(self.config.initializer_range / np.sqrt(config.hidden_size))` is now used for initialization.
In the old version of EasyLM, `config.initializer_range` also defaulted to 0.02, but `jax.nn.initializers.normal(self.config.initializer_range)` was used instead.
Since the standard deviation is now divided by `sqrt(hidden_size)`, the new initialization has a variance that is smaller by a factor of `hidden_size`. Is that by design?
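To make the difference concrete, here is a minimal sketch comparing the two initializers. The value `hidden_size = 1024` is an assumption chosen only for illustration (it is not taken from any EasyLM config), and the variable names are hypothetical.

```python
import jax
import jax.numpy as jnp
import numpy as np

initializer_range = 0.02  # default value mentioned above
hidden_size = 1024        # assumed value, for illustration only

# Old behaviour: stddev is initializer_range itself.
old_init = jax.nn.initializers.normal(initializer_range)
# New behaviour: stddev is scaled down by sqrt(hidden_size).
new_init = jax.nn.initializers.normal(
    float(initializer_range / np.sqrt(hidden_size))
)

key = jax.random.PRNGKey(0)
shape = (hidden_size, hidden_size)
old_w = old_init(key, shape, jnp.float32)
new_w = new_init(key, shape, jnp.float32)

# Empirical standard deviations of the sampled weights.
old_std = float(jnp.std(old_w))
new_std = float(jnp.std(new_w))
std_ratio = old_std / new_std

print(f"old std: {old_std:.6f}")
print(f"new std: {new_std:.6f}")
print(f"std ratio: {std_ratio:.2f}, variance ratio: {std_ratio ** 2:.1f}")
```

Under these assumptions the standard deviations should differ by roughly `sqrt(hidden_size)` and the variances by roughly `hidden_size`, which is the gap I am asking about.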
Thank you very much for your time and help!