Fix Float32/16 raised to integer typemin #57488

kuszmaul · 2025-02-21T04:45:43Z

The code for x^n where x::Float32 and n::Int previously failed for Float32(1.1)^typemin(Int) because it would reduce the problem to inv(x)^-n. That works fine unless n is typemin, in which case n==-n.

This PR makes a special case for n==typemin to effectively compute (x^(n/2))^2. Just in case n is also odd, we do x^cld(n,2) * x^fld(n,2).

test/math.jl

LilithHafner

Thanks for working on this! You picked a great first issue to contribute based on.

base/math.jl

oscardssmith · 2025-02-21T20:41:05Z

This looks like a functional fix, but I think it might be better to use the same approach that #53967 adds where we use a floating point algorithm for large powers (especially since it would turn this PR into a performance improvement rather than a regression. Specifically, I think the following would work well. (testing now)

@constprop :aggressive @inline function ^(x::Float32, n::Integer)
    n = clamp(n, Int64)
    n == 0 && return one(x)
    if use_power_by_squaring(n)
        n < 0 && return oftype(x, Base.power_by_squaring(inv(widen(x)), -n))
        return oftype(x, Base.power_by_squaring(inv(widen(x)), n))
    else
        s = ifelse(x < 0 && isodd(n), -1f0, 1f0)
        x = abs(x)
        y = float(n)
        return copysign(Float32(exp2(log2(Float64(x)*y))), s)
    end
end

kuszmaul · 2025-02-21T21:54:19Z

It does look like a performance improvement (once you get y outside of the log2. It should be exp2(log2(Float64(x))*y)
And it seems not to suffer from any errors in precision. You can just use pow_body.

kuszmaul · 2025-02-21T22:01:29Z

Although I don't think this PR is a regression. It runs the same code that it always ran for cases except typemin, which previously crashed.

oscardssmith · 2025-02-21T22:05:37Z

I meant performance regression. This code is pretty fast to the point that extra branches can have a significant speed impact. This way the branch is still there, but it is providing a speedup for lots of large powers as well.

kuszmaul · 2025-02-22T12:48:09Z

The branch is highly predictable (it's almost never taken), so it probably doesn't have any performance impact. This path already has many branches that aren't so predictable. Is there some evidence of a performance regression?

The x^y=exp(log(x)*y) approach does seem faster, however, and seems likely to be just as accurate with widening. Do you plan to prepare a PR for it?

mikmoore · 2025-02-24T15:48:45Z

Whatever we do needs to give (-1.0f0) ^ (2^60+1) == - (-1.0f0) ^ (2^60+2), so be careful with converting the exponent to a float.

This issue looks like it could be solved with the one-line change of replacing -n with Base.uabs(n). This is exactly what Base.uabs is for.

oscardssmith · 2025-02-24T16:30:45Z

That case is already handled (since s = ifelse(x < 0 && isodd(n), -1f0, 1f0 will flip the sign based on n and log2(x) is 0 so y won't factor into the equation at all.

LilithHafner · 2025-03-16T17:54:48Z

@oscardssmith, do you plan to prepare a PR with your suggestion, as @kuszmaul asked? Or would you rather put that implementation into this PR?

oscardssmith · 2025-03-16T23:00:55Z

oops, sorry for forgetting this. I'm happy to make the PR.

alternative to #57488 --------- Co-authored-by: Lilith Orion Hafner <[email protected]>

KlausC · 2025-04-12T08:27:28Z

base/math.jl

+        # It won't work to do `inv(x)^-n` if `n` and `-n` are both less than zero (e.g., if
+        # `n` is typemin).
+        if -n < 0
+            i = inv(widen(x))
+            y = Base.power_by_squaring(i, -fld(n, 2))
+            y = y * y
+            isodd(n) && (y = y * i)
+            return oftype(x, y)
+        end
+        return oftype(x, Base.power_by_squaring(inv(widen(x)),-n))


Why not simply for all n < 0:

i = inv(widen(x)) return oftype(x, Base.power_by_squaring(i, -(n+1)) * i)

KlausC · 2025-04-12T08:35:24Z

base/math.jl

@@ -1245,7 +1245,18 @@ end
 function ^(x::Union{Float16,Float32}, n::Integer)
    n == -2 && return (i=inv(x); i*i)


cases n == 0, n == 1and n == -1 perform unnecessary conversions (widen and recast).

KlausC

The fix seems more complex than necessary. The cases abs(n) <= 2 should be special cased IMO.

KlausC · 2025-04-12T08:39:19Z

test/math.jl

+    @test Float32(1.1)^big(0) === Float32(1.0)
+
+    # By using a limited-precision integer (3 bits) we can trigger issue 57464
+    # for a case where the answer isn't zero.


The same is true, if you use for example Int8 or Int16. So why define Int3 ?

oscardssmith · 2025-04-12T15:43:58Z

Is this PR not fixed by #57829?

KlausC · 2025-04-12T15:54:57Z

Is this PR not fixed by #57829?

Yes, it is superseded by that - I left similar comments there about the test cases. Unfortunately I saw this PR first.

KlausC · 2025-04-12T16:14:12Z

The #57829 is much better, of course, ant this PR should be closed IMO.

kuszmaul and others added 2 commits February 21, 2025 04:45

Fix JuliaLang#57464

1c08564

Merge branch 'master' into bck-RAI-57464c

14fa661

giordano reviewed Feb 21, 2025

View reviewed changes

test/math.jl Outdated Show resolved Hide resolved

LilithHafner reviewed Feb 21, 2025

View reviewed changes

base/math.jl Outdated Show resolved Hide resolved

LilithHafner added bugfix This change fixes an existing bug maths Mathematical functions labels Feb 21, 2025

kuszmaul and others added 2 commits February 21, 2025 15:03

Merge branch 'master' into bck-RAI-57464c

b32bad4

Better testing and fixes from the testing

419fd68

gbaraldi requested a review from oscardssmith February 21, 2025 20:14

oscardssmith self-assigned this Mar 16, 2025

oscardssmith mentioned this pull request Mar 19, 2025

fix Float32/Float16 power for giant integer exponents #57829

Merged

oscardssmith added a commit that referenced this pull request Mar 26, 2025

fix Float32/Float16 power for giant integer exponents (#57829)

638a6a2

alternative to #57488 --------- Co-authored-by: Lilith Orion Hafner <[email protected]>

KlausC reviewed Apr 12, 2025

View reviewed changes

KlausC suggested changes Apr 12, 2025

View reviewed changes

oscardssmith closed this Apr 12, 2025

oscardssmith removed their assignment Apr 23, 2025

		@@ -1245,7 +1245,18 @@ end
		function ^(x::Union{Float16,Float32}, n::Integer)
		n == -2 && return (i=inv(x); i*i)

Uh oh!

Fix Float32/16 raised to integer typemin #57488

Fix Float32/16 raised to integer typemin #57488

Uh oh!

Conversation

kuszmaul commented Feb 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

LilithHafner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

oscardssmith commented Feb 21, 2025

Uh oh!

kuszmaul commented Feb 21, 2025

Uh oh!

kuszmaul commented Feb 21, 2025

Uh oh!

oscardssmith commented Feb 21, 2025

Uh oh!

kuszmaul commented Feb 22, 2025

Uh oh!

mikmoore commented Feb 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

oscardssmith commented Feb 24, 2025

Uh oh!

LilithHafner commented Mar 16, 2025

Uh oh!

oscardssmith commented Mar 16, 2025

Uh oh!

KlausC Apr 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

KlausC Apr 12, 2025

Choose a reason for hiding this comment

Uh oh!

KlausC left a comment

Choose a reason for hiding this comment

Uh oh!

KlausC Apr 12, 2025

Choose a reason for hiding this comment

Uh oh!

oscardssmith commented Apr 12, 2025

Uh oh!

KlausC commented Apr 12, 2025

Uh oh!

KlausC commented Apr 12, 2025

Uh oh!

Uh oh!

kuszmaul commented Feb 21, 2025 •

edited

Loading

mikmoore commented Feb 24, 2025 •

edited

Loading

KlausC Apr 12, 2025 •

edited

Loading