sgemm gives incorrect results on 2.4 when transA != transB and N is 7, 11, 15, ... #246
(As for why I don't use 2.8 or 2.10: e.g. Travis build failures for 2.8 and 2.10 on Mac.)
Isn't the OSX issue fixed in develop?
Apparently so (https://travis-ci.org/hughperkins/cltorch/builds/118785864). But that's not the only reason I prefer 2.4. I'm not sure whether the 'path of least resistance' is 2.10 or 2.4, but 2.4 has the following in its favor:
Well, I've migrated cltorch and clnn to use clBLAS 2.11/develop for now (https://github.com/hughperkins/cltorch/#recent-changes). We'll see how that goes. If I get a zillion complaints about builds, bugs, or OpenCL 1.2 support, I might try to fix 2.4; otherwise I note that 2.11/develop does seem to:
Example test program:
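For anyone wanting to check the failing cases from the title (transA != transB, N = 7, 11, 15, ...) independently, here is a minimal CPU-only sketch of a reference sgemm to compare a library result against. It is not the program from this report and makes no clBLAS calls; the names `ref_sgemm` and `max_abs_diff` are hypothetical, and column-major storage is an assumption.

```c
#include <stddef.h>

/* Column-major access: element (row, col) of a matrix with leading dimension ld. */
static float at(const float *m, size_t ld, size_t row, size_t col) {
    return m[col * ld + row];
}

/* Hypothetical reference: C = alpha * op(A) * op(B) + beta * C, column-major.
 * op(X) is X when its trans flag is 0 and X^T otherwise.
 * op(A) is M x K, op(B) is K x N, C is M x N. */
void ref_sgemm(int transA, int transB, size_t M, size_t N, size_t K,
               float alpha, const float *A, size_t lda,
               const float *B, size_t ldb,
               float beta, float *C, size_t ldc) {
    for (size_t j = 0; j < N; ++j) {
        for (size_t i = 0; i < M; ++i) {
            float acc = 0.0f;
            for (size_t k = 0; k < K; ++k) {
                float a = transA ? at(A, lda, k, i) : at(A, lda, i, k);
                float b = transB ? at(B, ldb, j, k) : at(B, ldb, k, j);
                acc += a * b;
            }
            /* Follow the BLAS convention: when beta is 0, C is not read. */
            float prev = (beta == 0.0f) ? 0.0f : beta * C[j * ldc + i];
            C[j * ldc + i] = alpha * acc + prev;
        }
    }
}

/* Largest absolute element-wise difference between two buffers of n floats. */
float max_abs_diff(const float *x, const float *y, size_t n) {
    float worst = 0.0f;
    for (size_t i = 0; i < n; ++i) {
        float d = x[i] - y[i];
        if (d < 0.0f) d = -d;
        if (d > worst) worst = d;
    }
    return worst;
}
```

To use it, run the library sgemm and `ref_sgemm` on the same inputs for each transA/transB combination and for N in 7, 11, 15, ..., then compare the two outputs with `max_abs_diff` against a small tolerance.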