Skip to content

Nest small and Nest base parameters for Cifar100 #1023

Answered by alexander-soare
abdohelmy asked this question in Q&A
Discussion options

You must be logged in to vote

@abdohelmy maybe you are already on top of this, but according to the official implementation, the only params that change should be embedding size and number of heads, ie they double.

If that is indeed what you are doing, then I'd be curious to know at what point it falls apart. Perhaps you could try changing only one of those two params? Or not doubling them, but only slightly increasing them?

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@abdohelmy
Comment options

Answer selected by abdohelmy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants