Bitpacked GF2 representation #594

Conversation

amirebrahimi (Author):

This draft PR continues from #583.

The approach is an extended version of GF2 (GF2BP) that is produced when np.packbits is called on a GF2 instance. The following operations have been overridden to operate within the bitpacked representation, unless unpacking is required to complete the operation:

  • add
  • subtract
  • multiply
  • divide
  • matmul (handles matrix-vector and matrix-matrix multiplication via a repack of operand b)
  • concatenate
  • inverse
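To illustrate the matmul bullet with plain numpy, independent of the GF2BP API: a GF(2) matrix-vector product over packed bytes reduces to a byte-wise AND followed by a bit-parity per row. This is a minimal sketch of the idea, not the PR's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.integers(0, 2, size=(4, 16), dtype=np.uint8)  # GF(2) matrix
x = rng.integers(0, 2, size=16, dtype=np.uint8)       # GF(2) vector

A_packed = np.packbits(A, axis=1)  # shape (4, 2): each row packed into 2 bytes
x_packed = np.packbits(x)          # shape (2,)

# y_i = XOR-reduction of (row_i AND x) = parity of the set bits in the row.
anded = A_packed & x_packed
y = np.unpackbits(anded, axis=1).sum(axis=1) & 1

# Reference result computed on the unpacked bits.
y_ref = (A @ x) % 2
```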

Performance-wise, these operations see roughly a 4-8x speedup, as well as an 8x reduction in memory.
Bits can be packed along any axis, but depending on the operation the operands may need to be unpacked.
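The 8x memory claim and the elementwise speedup both come from the same fact: GF(2) addition is XOR, so one byte-wise XOR processes 8 elements at once. A plain-numpy sketch of this, independent of the galois API:

```python
import numpy as np

rng = np.random.default_rng(42)
a = rng.integers(0, 2, size=10_000, dtype=np.uint8)
b = rng.integers(0, 2, size=10_000, dtype=np.uint8)

a_packed, b_packed = np.packbits(a), np.packbits(b)
memory_ratio = a.nbytes // a_packed.nbytes  # 8x (modulo padding of the last byte)

# GF(2) addition is XOR, so each byte-wise XOR adds 8 elements at once.
s_packed = np.bitwise_xor(a_packed, b_packed)
s = np.unpackbits(s_packed)[: a.size]
```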

Due to all the ways one can index arrays in Python, the majority of the logic lives in GF2BP._normalize_indexing_to_tuple and GF2BP.get_index_parameters.
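To show why index normalization is fiddly: ints, slices, Ellipsis, and partial tuples all need to be canonicalized into one full tuple before the packed axis can be translated. The helper below is a hypothetical stand-in for illustration only, not the PR's actual GF2BP._normalize_indexing_to_tuple, and it handles basic indexing only (no None or boolean masks).

```python
def normalize_index_to_tuple(index, ndim):
    """Canonicalize basic indexing forms into a full tuple of length ndim."""
    if not isinstance(index, tuple):
        index = (index,)
    if Ellipsis in index:
        pos = index.index(Ellipsis)
        fill = ndim - (len(index) - 1)  # number of dims the Ellipsis stands for
        index = index[:pos] + (slice(None),) * fill + index[pos + 1:]
    # Pad any remaining trailing dimensions with full slices.
    return index + (slice(None),) * (ndim - len(index))
```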

```python
# Should this be using arr's data (as would be the case without packbits) or a new array?
reshaped = arr[:, np.newaxis]
reshaped = np.packbits(reshaped)
reshaped[:, 0] = GF([0, 0, 0, 0])
```
amirebrahimi (Author):

This is still an issue because the result is a separate, new array rather than an alias into a block of arr's memory, which is what callers would expect. Perhaps we have to disallow this or issue a warning.
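The aliasing problem can be reproduced with plain numpy: np.packbits always allocates fresh memory, so writes to the packed result never propagate back, unlike a normal ndarray view.

```python
import numpy as np

arr = np.array([1, 1, 1, 1, 0, 0, 0, 0], dtype=np.uint8)
packed = np.packbits(arr)  # packbits allocates new memory; it cannot alias arr

packed[0] = 0              # mutate the packed result...
# ...and the original array is untouched, unlike writing through a view.
```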

```python
if func in field._FUNCTIONS_REQUIRING_VIEW:
    output = field._view(output) if not np.isscalar(output) else field(output, dtype=self.dtype)
```
mhostetter (Owner):

What is the need for these various changes to __array_function__()? Is it to ensure that _packbits() is called for GF2BP and not for GF2?

amirebrahimi (Author):

FunctionMixin previously overrode convolve, fft, and ifft for all classes, which is why no try/except was needed. I needed to add packbits, unpackbits, and concatenate as well, but only GF2BP overrides those, so every other class needs to fall back to the numpy implementation.
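The dispatch pattern being described can be sketched with numpy's __array_function__ protocol: a subclass handles only the functions it registers and defers everything else to numpy. This is illustrative only, using a hypothetical BitpackedArray and _packbits, not the PR's FunctionMixin code.

```python
import numpy as np

class BitpackedArray(np.ndarray):
    """Sketch of per-subclass function overrides with a numpy fallback."""

    _OVERRIDES = {}

    def __array_function__(self, func, types, args, kwargs):
        handler = self._OVERRIDES.get(func)
        if handler is not None:
            return handler(*args, **kwargs)
        # Any function this subclass does not override falls back to numpy.
        return super().__array_function__(func, types, args, kwargs)

def _packbits(a, axis=None, bitorder="big"):
    # Hypothetical override; a real GF2BP version would return a bitpacked instance.
    return np.packbits(np.asarray(a), axis=axis, bitorder=bitorder)

BitpackedArray._OVERRIDES[np.packbits] = _packbits

x = np.array([1, 0, 1, 1, 0, 0, 0, 1], dtype=np.uint8).view(BitpackedArray)
packed = np.packbits(x)  # dispatches to the registered override
total = np.sum(x)        # no override registered, so numpy's fallback runs
```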

```python
cls._negative = negative_ufunc(cls, override=np.positive)
cls._subtract = subtract_ufunc_bitpacked(cls, override=np.bitwise_xor)
cls._multiply = multiply_ufunc_bitpacked(cls, override=np.bitwise_and)
cls._reciprocal = not_implemented  # reciprocal(cls)
```
mhostetter (Owner):

What happens if np.reciprocal() is called on a GF2BP instance?

amirebrahimi (Author), Apr 10, 2025:

Good catch. These were left as potential TODOs, but I didn't mark them as such. I can work up bitpacked versions of them if you like.
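For reference, a bitpacked GF(2) reciprocal is nearly trivial, since 1 is the only invertible element and 1**-1 == 1: the operation is the identity once zeros have been ruled out. A hypothetical sketch (not the PR's code):

```python
import numpy as np

def reciprocal_bitpacked(packed, bit_length):
    """Hypothetical bitpacked GF(2) reciprocal: the identity, provided
    no element is 0 (0 has no multiplicative inverse)."""
    bits = np.unpackbits(packed)[:bit_length]  # ignore the pad bits
    if not bits.all():
        raise ZeroDivisionError("Cannot compute the reciprocal of 0 in GF(2)")
    return packed.copy()

ones = np.packbits(np.ones(12, dtype=np.uint8))
recip = reciprocal_bitpacked(ones, 12)
```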

mhostetter (Owner):

Thanks for submitting this! This is a big change, so I'll need some time to review, checkout the code, play with it, and look for issues. I appreciate the contribution.
