Skip to content

I'm baffled by "post-normalized vision transformers don't not converge". Thoughts? #745

Unanswered
alexander-soare asked this question in General
Discussion options

You must be logged in to vote

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@alexander-soare
Comment options

@rwightman
Comment options

@alexander-soare
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants