Feature/ssb benchmark #2280

Open · wants to merge 2,765 commits into base: main
Conversation

@ghafek ghafek commented Jun 29, 2025

No description provided.

mboehm7 and others added 30 commits October 24, 2024 19:54
This patch resolves a remaining FIXME after improved rewrite code
coverage by fixing the expressions and other rewrite configs so the
test actually triggers the existing rewrite.
This patch makes some simple performance improvements in order to
reduce the runtime of the sparse component tests (300+s -> 30s). In
detail the runtime of specific tests improved as follows:

* SparseBlockMerge:         149s -> 14.7s
* SparseBlockIndexRange:    110s -> 13.4s
* SparseBlockGetFirstIndex:  29s ->  1.3s
This patch adds real-data tests for the new adasyn builtin function,
and changes it to a vectorized implementation that extracts
over-sampled rows via a randomized permutation matrix multiply.
On the Diabetes dataset (with moderate class imbalance of 500 vs 268)
ADASYN slightly improves the test accuracy from 78.3 to 78.7%. It is
also noteworthy that the original ADASYN paper from 2008 only achieved
0.6831 and 0.6833 (with ADASYN) on this dataset.
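A minimal sketch of the selection-matrix idea in plain Java (class and method names are illustrative, not the actual builtin implementation; a full permutation matrix would additionally shuffle, here we only select rows):

```java
import java.util.Random;

// Illustrative sketch (not SystemDS internals): over-sampling rows of X by
// multiplying with a randomized 0/1 selection matrix P, so that P %*% X
// yields k sampled rows via a single matrix multiply.
public class SelectionMultiply {
	public static double[][] sampleRows(double[][] X, int k, long seed) {
		int n = X.length, m = X[0].length;
		Random rnd = new Random(seed);
		// P is k x n with exactly one 1 per row (the sampled row index)
		double[][] P = new double[k][n];
		for (int i = 0; i < k; i++)
			P[i][rnd.nextInt(n)] = 1.0;
		// out = P %*% X
		double[][] out = new double[k][m];
		for (int i = 0; i < k; i++)
			for (int j = 0; j < n; j++)
				if (P[i][j] != 0)
					for (int c = 0; c < m; c++)
						out[i][c] += P[i][j] * X[j][c];
		return out;
	}
}
```

Each output row is an exact copy of some input row, which is what makes the extraction amenable to a single (sparse) matrix multiply.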
This generalizes the adasyn test for additional real data set. On the
titantic dataset, adasyn gives a 1.6% improvement of test accuracy
(for a basic logreg model, 0.781 -> 0.797).
This patch fixes endless loops in transformencode when the tfspec
references columns outside the valid column range.
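The kind of guard that prevents such a loop can be sketched as follows (a hypothetical helper, not the actual transformencode code): reject out-of-range tfspec column ids up front instead of iterating past the frame's columns.

```java
// Illustrative guard: fail fast if the tfspec references a column id
// outside [1, ncol], rather than looping endlessly during encoding.
public class TfSpecGuard {
	public static void validateColumns(int[] specCols, int ncol) {
		for (int c : specCols)
			if (c < 1 || c > ncol)
				throw new IllegalArgumentException("tfspec references column "
					+ c + " outside the valid range [1," + ncol + "]");
	}
}
```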
The multi-threaded transpose of ultra-sparse matrices has a couple
of shortcomings (e.g., counting column nnz, block allocation, too-late
fallback to single-threaded execution). On a large 85M x 85M graph with 90M
non-zeros, the transpose did not finish in hours. This patch
introduces a more sophisticated sparse row iterator (with row and column
lower/upper bounds) in order to facilitate a simple and fast transpose
for ultra-sparse matrices. However, this implementation was still much
slower than falling back to single-threaded operations, and thus we now
use the single-threaded transpose for all ultra-sparse matrices instead of
only when nnz < max(rows,cols). Now this operation completes in <9s.
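The dispatch change can be sketched as follows (illustrative code, not SystemDS internals; the ultra-sparse classification is approximated here by a tiny sparsity threshold, which is an assumption):

```java
// Illustrative sketch of the dispatch change: previously the single-threaded
// fallback applied only if nnz < max(rows, cols); the broadened rule sends
// all ultra-sparse matrices down the single-threaded path.
public class TransposeDispatch {
	// old condition: fall back only for extremely few non-zeros
	public static boolean oldFallback(long rows, long cols, long nnz) {
		return nnz < Math.max(rows, cols);
	}
	// broadened condition: classify by sparsity (threshold is an assumption)
	public static boolean newFallback(long rows, long cols, long nnz,
		double ultraSparseThreshold)
	{
		double sparsity = (double) nnz / rows / cols;
		return sparsity < ultraSparseThreshold;
	}
}
```

With the 85M x 85M graph and 90M non-zeros from above, the old condition does not trigger (90M >= max(85M, 85M)), while a sparsity-based check does, since the sparsity is on the order of 1e-8.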
There was a regression where all sparse matrix-vector elementwise
operations were executed single-threaded. This patch fixes
the most important branch for sparse-safe matrix-vector operations;
the remaining cases need to be fixed in subsequent tasks.

When running connected components on the Europe road network, the
individual binary multiply operations improved by 10-20x on a box with
48 vcores. End-to-end the entire components() invocation with 20
iterations improved from 282s (246s for b(*)) to 112s (75s for b(*)).
The 10x improvements do not carry fully through because the output MCSR
is converted to CSR when appending to the buffer pool (57s of 75s).
This patch adds the missing multi-threading for all cases of binary
elementwise operations, except one special case that directly constructs
a CSR output. Furthermore, in safeBinaryMVSparseDenseRow we now avoid
unnecessary allocation of temporary vectors by filling in place
on the first output row of every task.
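The row-partitioned multi-threading pattern described above can be sketched like this (dense elementwise multiply shown; illustrative code, not SystemDS's actual task framework):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Illustrative sketch of row-partitioned multi-threading for a binary
// elementwise operation: split the rows into k contiguous blocks,
// process each block in its own task, and wait for all tasks.
public class ParallelElementwise {
	public static double[][] multiply(double[][] A, double[][] B, int k) {
		final int rows = A.length, cols = A[0].length;
		final double[][] C = new double[rows][cols];
		ExecutorService pool = Executors.newFixedThreadPool(k);
		try {
			List<Future<?>> tasks = new ArrayList<>();
			int blk = (rows + k - 1) / k; // rows per task, rounded up
			for (int t = 0; t < k; t++) {
				final int lo = t * blk, hi = Math.min(rows, lo + blk);
				tasks.add(pool.submit(() -> {
					for (int i = lo; i < hi; i++)
						for (int j = 0; j < cols; j++)
							C[i][j] = A[i][j] * B[i][j];
				}));
			}
			for (Future<?> f : tasks) {
				try { f.get(); } // propagate task failures
				catch (Exception e) { throw new RuntimeException(e); }
			}
		}
		finally {
			pool.shutdown();
		}
		return C;
	}
}
```

Tasks write disjoint row ranges of the shared output, so no synchronization is needed beyond joining the futures.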
This patch adds a test that systematically applies the single- and
multi-threaded writers/readers for matrices and frames, all formats,
as well as dense and sparse data.

These tests also revealed bugs in the HDF5 readers/writers, where
incorrect data was read for single-threaded sparse as well as
multi-threaded dense and sparse cases.
Baunsgaard and others added 25 commits May 15, 2025 11:51
This commit adds vectorized kernels for matrix multiplication.

Using the Vector API, single-threaded dense matrix multiplication improves
by ~80% on our AMD box and by ~60% on Intel. These measurements include
the allocation overhead of the output and assume ideal conditions where
the input is cached and JIT compilation is done.

The biggest change for users is that SystemDS now requires
`--add-modules=jdk.incubator.vector` on all execution calls. This commit
modifies all scripts accordingly. However, any calling code that bypasses
bin/systemds and invokes Java directly must be modified as well.

To measure the performance difference on your machine, use the added script:

src/test/scripts/performance/matrixMultiplication.sh

Closes apache#2216
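Concretely, a direct Java invocation needs the incubator module flag added; a sketch, assuming the standard DMLScript entry point (the jar path and script name below are placeholders):

```shell
# Direct Java invocation with the required incubator module enabled
java --add-modules=jdk.incubator.vector \
  -cp target/SystemDS.jar org.apache.sysds.api.DMLScript \
  -f myScript.dml
```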
This patch adds an initial version of the representation optimizer for the Scuro library. It is a two-stage optimization: in the first stage, the best unimodal representation for the given raw modalities is found; in the second stage, the k-best unimodal representations are combined into multimodal representations and evaluated against the target downstream task. Additionally, this patch adds tests for each stage of the optimizer.

Closes apache#2267
This patch downgrades the library versions of Scuro dependencies.

Closes apache#2269
This patch fixes the incorrect size propagation of unique which led
to incorrect results if the dimensions are used in subsequent ops.
Thanks to Chi-Hsin Huang for catching this bug.

Furthermore, this patch also includes minor updates for code quality
(removed unused imports and annotated unused functions).
This patch fixes issues in the test DML scripts in terms of missing
casts from 1-by-1 matrices to scalars. Interestingly, the tests ran
fine in local environments because the parser validation runs
differently there, and these 1-by-1 matrices were automatically
rewritten to scalars.