proposal: math/rand: rework for Go 2

# Introduction

I propose that math/rand be overhauled for Go 2.

Rob Pike has proposed some aspects of a better math/rand for Go 2 in #21835. This proposal is intended to complement and extend that proposal. It may make sense to merge them, although their scopes are (intentionally) mostly disjoint.

This proposal is not fully polished, but I have been sitting on it for months. It past time to post it. I look forward to a vigorous discussion.


A re-think of math/rand should include:

* compatibility guarantees and reproducibility
* the `Source` interface
* the core PRNG
* the concurrency strategy
* the routines that convert raw PRNG output to other forms and distributions
* the API surface
* migration and interoperability between Go 1 and Go 2

This proposal will discuss all of these, in turn, and then knit the results of those discussions into a single proposal.

Please note that I am *not* a subject matter expert. Input from experts (and non-experts) is welcomed.

# Compatibility guarantees

I begin with compatibility guarantees, as decisions here impact almost every aspect of the rest of the proposal.

The Go 1 Compatibility Guarantee left some ambiguity about when the values produced math/rand should remain stable. For the life of Go 1, math/rand has remained extremely stable. To my knowledge, only a single change to the value stream has ever occurred (#16124). Many opportunities to break stream stability have been declined. There's a fair amount of discussion in #8013; it is recommended background reading. More discussion can also be found in: #13215, #14416, #11871, #8731, #12290, #6721.

There are plausible use cases in which stability matters, such as test cases discovered using testing/quick. There are also plausible use cases in which stability is irrelevant, such as adding jitter to exponential backoff code. Stability adds significant brittleness; there's not much more to math/rand beyond the exact value steam.

I propose a compromise.

Top-level convenience functions, like `Intn`, will offer no stability guarantees. They will be documented accordingly. The top level `Seed` function will be removed; the PRNG will be seeded on startup. (See #11871 for discussion of how to seed; probably package `time`.) Using a new seed on every program execution will prevent accidental dependence on a particular value stream. People who just want some random numbers can just get them, and the implementation remains free to improve over time.

The `Rand` methods will provide stability for the lifetime of Go 2 and will be documented as such. Notably, to use `Rand` methods, you must provide a `Source`, which may be `Seed`ed. This explicitness fits well with the stability requirement.

This means that any given `Source` implemented in `math/rand` must provide stability. However, since `Source` is an interface, if there are significant advances in PRNGs during Go 2, they can be added as new `Source`s, or provided by third party libraries.

This also means that the `Rand` methods must be stable. This is the biggest source of brittleness, since the `Rand` methods are not easily swappable the way `Source`s are. And we learned during Go 1 that these methods will be found over time to have bugs, to have non-optimal implementations, and to use non-optimal algorithms (#16213). We could decide to make these converter methods pluggable, but that adds significant API overhead.

Taken together, it is likely that the top level convenience functions will diverge over time from the stable `Rand` methods. That is OK. Among other things, it means that if/when the time for Go 3 arrives, we will have well-tested, state of the art functions that we can promote to `Rand` methods. (This is the opposite approach to the one advocated in #25551.)


# The `Source` interface

As Rob Pike notes in #21835, `Source64` should become the default interface (and possibly renamed to `Source`); `Source` should be deprecated and removed.

I propose further that `Seed` be modified to accept a `uint64` seed instead of an `int64` seed. This better matches `Source64`, and highlights the opacity of the bits involved.


# The core PRNG

The Go 1 core PRNG has several shortcomings. They are ably discussed by Rob Pike in #21835. He proposes adopting PCG as a replacement, and that (after a series of compatibility steps) its constructor be renamed to `NewSource`.

I propose that we adopt PCG, but that the end goal be for it retain the name PGC and be implemented as a struct rather than using a constructor:

```go
type PCG struct {
	// unexported fields
}

func (*PCG) Uint64() uint64 { /* ... */ }
func (*PCG) Seed(uint64)     { /* ... */ }
```

Giving it an non-generic name makes it easier to add new PRNGs during Go 2, should there be significant improvements in the state of the art, or if it becomes clear that it would be valuable to have multiple PRNGs with different characteristics.

Exposing it as a struct reduces API surface area and allows performance-sensitive clients to make direct calls to its methods.


# Concurrency

The Go 1 top-level convenience functions are concurrency-safe. The mutex they contain is a hidden source of performance pain. See #24121, #25057, #20387, and #21393 for discussion. CLs 43611 and 109817 attempt to work around this while preserving stability; they have both, to my mind, met with failure.

If you relax the stability requirement, as I propose, then there are many ways to improve scalability while retaining concurrency-safety, including: (1) sharded mutexes, as in [CL 43611](https://go.dev/cl/43611); (2) a sync.Pool, as in [CL 109817](https://go.dev/cl/109817); and (3) per-CPU storage, as proposed in #18802.

One could also imagine adding a `sync.Locker` field to `Rand`, to be used only when non-nil, to assist with concurrency safety when using `Rand` methods. However, if the convenience functions are performant and scale well, the primary motivation to use `Rand` methods is reproducibility, and concurrent access is inimical to reproducibility. This suggests that asking developers to manage their own locking in such cases is very reasonable, and that we are better off avoiding the bloat.

See also recent related discussion in #25988.


# Conversion routines

Discussed elsewhere in this proposal.


# API surface

Go 2 provides an opportunity to improve the API. I have already proposed some modifications.

I further propose:

* Remove `Perm` and `Rand.Perm`, as it is trivially implemented using `Shuffle`. See also discussion in #20480.
* Make `NewZipf` a top-level function and add `Rand.NewZipf`, so that it matches the usage pattern of the rest of the package.
* Change `Int31n` and `Rand.Int31n` to `Uint32n` and `Rand.Uint32n`. Similarly so for `Int63n` and `Intn`. These are better primitives; you can implement `Int31n` in terms of `Uint32n`, but not vice versa, because of the lost bit.
* Considering adding normal distribution generators that accept a stddev and mean. This is a common use case. Also, I suspect (but do not actually know) that by pushing stddev and mean into the generation, we might be able to reduce floating point error, using ratio-of-uniforms generation instead of ziggurat. Maybe something similar for ExpFloat64 and rate parameter? Expert input on this point would be appreciated.
* Remove `Uint32` and `Uint64`, as they are trivially implemented in terms of the new `Source` interface--the former as a cast, the latter by calling the `Uint64` method directly.


# Migration and interoperability between Go 1 and Go 2

Rob Pike laid out a graceful migration plan for the PCG source in #21835. I will not repeat it here.

The changes in this proposal are more drastic. They involve changing APIs and changing reproducibility guarantees.

The modifications are gofix-able, assuming that you are OK losing reproducibility. But that is a large assumption. Given that backdrop, I'm inclined to lean on vgo (or whatever versioning support gets added to the toolchain) to allow Go 1 math/rand and Go 2 math/rand to co-exist. Code can be updated using an automated tool, but with the user's understanding that the value streams will change. The challenge, then, is primarily a documentation one, with some assists from gofix.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

proposal: math/rand: rework for Go 2 #26263

Introduction

Compatibility guarantees

The `Source` interface

The core PRNG

Concurrency

Conversion routines

API surface

Migration and interoperability between Go 1 and Go 2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

proposal: math/rand: rework for Go 2 #26263

Description

Introduction

Compatibility guarantees

The Source interface

The core PRNG

Concurrency

Conversion routines

API surface

Migration and interoperability between Go 1 and Go 2

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

The `Source` interface