Skip to content

为什么参数量并没有下降反而上升了好几倍?? #17

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
HeiHeiCCC opened this issue Apr 20, 2023 · 1 comment
Open

Comments

@HeiHeiCCC
Copy link

我自己测试了一下用nn.Conv2d(16, 64, 1),输入大小是(1, 16, 224, 224),这个参数量只有1088,但是如果用ACmix得到的参数量是8604,这差了快8倍了,但是文章说 “同时与纯卷积或self-attention相比具有最小的计算开销”,好像没有体现,这是咋回事啊?

@ShenAoChen2001
Copy link

哥们你复现成功了么,为啥我在ImageNet1K上训练都不收敛

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants