Hi @sz85512678, I am sorry about the missing file. The file is basically a variable-by-variable copy from the trained transformer to the newly architected transformer. Note that all the layers are kept; the only difference is that there are neural ODE layers before each transformer block.
Unfortunately, I graduated two years ago and my account was deleted. If you have additional questions while implementing this, please feel free to raise them in this thread.