jax.lax.map + batch_size + explicit sharding: How to? #29164
-
Is there a reasonably simple way to implement something like the code below?

```python
import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, PartitionSpec as P, AxisType
from jax.experimental.shard_map import shard_map
from jax.experimental.shard import reshard, explicit_axes
# Setup: 4 CPU devices arranged in a 2x2 mesh
jax.config.update("jax_num_cpu_devices", 4)
devices = np.array(jax.devices())
mesh = jax.make_mesh((2,2),("s","p"), axis_types=(AxisType.Explicit,AxisType.Explicit,),)
jax.sharding.set_mesh(mesh) # Set this as the default mesh for jax.
# Define a simple function: takes w, x -> outputs batch of scalars
def simple_func(w, x):
    # w: (4,), x: (batch, 4)
    return jnp.sum(w * x, axis=-1)
# Make inputs
w = jnp.array([1.0, 2.0, 3.0, 4.0])  # weights, shape (4,)
x = jnp.ones((8, 4))                 # batch of 8 vectors, shape (8, 4)
# Setup sharding
replicated_w = reshard(w, P("p")) # replicated
sharded_x = reshard(x, P("s", None)) # replicated
# --- Evaluate normally ---
out = simple_func(replicated_w, sharded_x)
print("Simple call works:", out)
# works
jax.lax.map(lambda _x: simple_func(w, _x), x, batch_size=2)
# how to do this (only x is sharded) [ this does not work ]
jax.lax.map(lambda _x: simple_func(w, _x), sharded_x, batch_size=2)
# and this (harder, both x and w are sharded) [ this does not work ]
jax.lax.map(lambda _x: simple_func(replicated_w, _x), sharded_x, batch_size=2)
```

The last two examples fail with errors.
-
If you want to keep your inputs sharded on dim 0 that eventually get passed to `scan`, then I would suggest dropping into Auto mode via the `auto_axes` API. If you want to stay in Explicit mode, then don't shard your input on dim 0 (i.e. the dimension that is getting scanned over).
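A minimal sketch of the second suggestion (staying in Explicit mode), reusing the mesh, `reshard`, `simple_func`, `w`, and `x` from the question above; this is an illustration, not code from the reply:

```python
# Sketch only: assumes the mesh, reshard, simple_func, w and x defined in the question.
from jax.sharding import PartitionSpec as P
from jax.experimental.shard import reshard

# Keep the scanned (leading) dimension unsharded; here x is simply replicated,
# but any spec whose first entry is None (e.g. P(None, "s")) follows the same idea.
x_repl = reshard(x, P(None, None))
out = jax.lax.map(lambda _x: simple_func(w, _x), x_repl, batch_size=2)
```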
-
For anybody passing through here, this is my current working implementation (jax 0.6.2) that wraps jax.lax.map to support sharding across the leading dimension.

```python
from functools import wraps
import jax
import jax.numpy as jnp
from jax.sharding import SingleDeviceSharding, NamedSharding
from jax.tree_util import tree_flatten, tree_unflatten, tree_map


@wraps(jax.lax.map)
def custom_map(f, x, batch_size: int | None = None):
    """
    Equivalent to jax.lax.map, but handles arbitrary NamedSharding
    across the first axis, and works on pytrees.
    """
    # If no explicit axes in the current mesh, just defer to lax.map
    if not jax.sharding.get_abstract_mesh()._any_axis_explicit:
        return jax.lax.map(f, x, batch_size=batch_size)

    # Flatten the pytree
    leaves, treedef = tree_flatten(x)
    # Get each leaf's aval and sharding
    avals = [jax.typeof(leaf) for leaf in leaves]
    shardings = [aval.sharding for aval in avals]

    # Case 1: all SingleDeviceSharding → direct
    if all(isinstance(s, SingleDeviceSharding) for s in shardings):
        return jax.lax.map(f, x, batch_size=batch_size)

    # Mixed sharding types: some Named but not all
    if any(isinstance(s, NamedSharding) for s in shardings) and not all(
        isinstance(s, NamedSharding) for s in shardings
    ):
        raise ValueError(
            "Mixed sharding types: some inputs are sharded while others are not. "
            "Please shard all inputs the same."
        )

    # Case 2: all NamedSharding → check and peel off the first-axis shard
    if all(isinstance(s, NamedSharding) for s in shardings):
        specs0 = [s.spec[0] for s in shardings]
        # if *none* of them shard the first axis, direct
        if all(sp is None for sp in specs0):
            return jax.lax.map(f, x, batch_size=batch_size)
        # require *all* to shard the same named axis
        if any(sp is None for sp in specs0) or len({*specs0}) != 1:
            raise ValueError(
                f"Inconsistent first-axis sharding across pytree: {specs0}"
            )
        axis_name = specs0[0]
        mesh = shardings[0].mesh
        # number of devices along that mesh axis
        n_devs = mesh.shape[axis_name]

        # reshape + transpose helper
        def peel_and_move(leaf, sh):
            # leaf.shape = (B, *rest), where B = n_devs * local_batch
            local_shape = sh.shard_shape(leaf.shape)
            # first reshape → (n_devs, local_batch, *rest)
            y = jnp.reshape(leaf, (n_devs,) + tuple(local_shape))
            # then bring the local_batch in front → (local_batch, n_devs, *rest)
            y = jnp.transpose(y, (1, 0) + tuple(range(2, y.ndim)))
            return y

        # apply to every leaf
        peeled = [
            peel_and_move(leaf, sh)
            for leaf, sh in zip(leaves, shardings)
        ]
        x_tr = tree_unflatten(treedef, peeled)

        # vmap over the extra axis to emulate lax.map semantics
        mapped = jax.lax.map(jax.vmap(f), x_tr, batch_size=batch_size)

        # inverse reshape+transpose helper
        def reassemble(y):
            # y.shape = (local_batch, n_devs, *rest)
            # undo transpose → (n_devs, local_batch, *rest)
            y2 = jnp.transpose(y, (1, 0) + tuple(range(2, y.ndim)))
            # flatten back → (B, *rest)
            return jnp.reshape(y2, (-1,) + y2.shape[2:])

        res = tree_map(reassemble, mapped)
        return res

    # anything else is unsupported
    raise NotImplementedError(
        f"Unsupported sharding types: {set(type(s) for s in shardings)}"
    )
```
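The idea is that each leaf of shape (B, *rest) is reshaped and transposed to (local_batch, n_devs, *rest): jax.lax.map then scans over the unsharded per-device batch dimension while jax.vmap covers the device-sharded axis, and reassemble undoes the transpose/reshape afterwards. Below is a small check of the wrapper on a 1-D explicit mesh with two CPU devices: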
```python
import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, PartitionSpec as P, AxisType
from jax.experimental.shard import reshard, explicit_axes
# Setup: 2 CPU devices
jax.config.update("jax_num_cpu_devices", 2)
devices = np.array(jax.devices())
mesh = jax.make_mesh((2,),("s",), axis_types=(AxisType.Explicit,),)
jax.sharding.set_mesh(mesh) # Set this as the default mesh for jax.
def simple_func(w, x):
    return jnp.sum(w * x, axis=-1)
# Make inputs
w = jnp.array([1.0, 2.0, 3.0, 4.0])
x = jnp.ones((10, 4))
# Setup sharding
xs = reshard(x, P("s", None))
r1 = jax.lax.map(lambda _x: simple_func(w, _x), x, batch_size=2)
r2 = custom_map(lambda _x: simple_func(w, _x), xs, batch_size=2)
```
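Assuming the wrapper behaves like jax.lax.map, r1 and r2 should agree: for these inputs every row sums to 1.0 + 2.0 + 3.0 + 4.0 = 10.0, with r2 computed from the explicitly sharded xs.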