Skip to content

feat: Introduce gzipped test fixtures and steps to reproduce. #11

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Mar 12, 2025

Conversation

clflushopt
Copy link
Owner

In order to build a nice test suite we need some initial fixtures that can be easily reproduced. In this change I add two sets of data files that I generated from the Java implementation for small scale factors (0.01 + 0.001) the idea is to have a small set of test data that we can easily build our test suite against.

In the change are two patch files that explain how to reproduce the data itself (this is important since we should aim to have a reproducible flow).

This PR is the first step to address #4 (I will have a separate PR for #3), since we still want a tiered package of tests (unit for fast iteration and integration for ensuring compatibility).

@alamb FYI

In order to build a nice test suite we need some initial fixtures
that can be easily reproduced. In this change I add two sets of data
files that I generated from the Java implementation for small scale
factors (0.01 + 0.001) the idea is to have a small set of test data
that we can easily build our test suite against.

In the change are two patch files that explain how to reproduce the data
itself (this is important since we should aim to have a reproducible
flow).
@clflushopt clflushopt requested review from alamb and removed request for alamb March 12, 2025 03:36
@clflushopt clflushopt added the enhancement New feature or request label Mar 12, 2025
@clflushopt clflushopt added this to the v0.1.0 milestone Mar 12, 2025
@clflushopt clflushopt self-assigned this Mar 12, 2025
@clflushopt clflushopt merged commit 8096a8f into main Mar 12, 2025
1 check passed
@alamb
Copy link
Collaborator

alamb commented Mar 12, 2025

Thanks @clflushopt

In this change I add two sets of data files that I generated from the Java implementation for small scale factors (0.01 + 0.001) the idea is to have a small set of test data that we can easily build our test suite against.

I will try and do the same thing with the original c implementation just to check that they match

@alamb alamb deleted the clflsuhopt/feature/add-csv-test-fixtures branch March 12, 2025 10:53
@alamb
Copy link
Collaborator

alamb commented Mar 12, 2025

Hi @clflushopt -- I compared the results that you checked in in this PR with the results from dbgen (the orignal C one) for both scale factors and I found one discrepancy

I also made a PR with instructions that explain what I did:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants