Skip to content

Releases: clflushopt/tpchgen-rs

v1.1.1

06 Jun 04:15
v1.1.1
d16b0db
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.1.0...v1.1.1

v1.1.0

29 Apr 21:32
v1.1.0
2ed321e
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.0.0...v1.1.0

v1.0.0

12 Apr 20:30
ea94a80
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.1...v1.0.0

v0.1.1

05 Apr 15:15
6fa2b4e
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.0...v0.1.1

v0.1.0

30 Mar 20:26
4568384
Compare
Choose a tag to compare

What's Changed

This is the initial release of the tpchgen-rs package that contains the tpchgen data generator crate and tpchgen-cli which is a drop-in replacement for dbgen written in Rust !

Changelog

  • fix: run clippy fix to address some linting issues by @clflushopt in #8
  • feat: Introduce gzipped test fixtures and steps to reproduce. by @clflushopt in #11
  • Add instructions for verifying checked in results by @alamb in #13
  • fix: date format for fixtures by @clflushopt in #15
  • feat: Introduce end-to-end conformance integration tests. by @clflushopt in #16
  • chore: Add issue and pr templates by @clflushopt in #18
  • feat: Create TextPool once per process rather than once per table by @alamb in #19
  • feat: Use tbl format for fixtures and tests by @clflushopt in #20
  • feat: scaffold basic dbgen like api by @clflushopt in #12
  • feat: Update conformance test to run at scale factors up to 1 by @clflushopt in #22
  • feat: Create CONTRIBUTING.md by @clflushopt in #23
  • docs: Some small README improvements by @clflushopt in #25
  • Change Random* generators to return &str or structs rather than String (10% faster) by @alamb in #26
  • feat: optimize single threaded performance by @clflushopt in #21
  • Change date handling to use TPCHDate type rather than String (10% faster) by @alamb in #27
  • feat: Drop unused dependencies and regex crate by @clflushopt in #29
  • feat: Add fmt + clippy to the build action by @clflushopt in #30
  • Avoid copying String when generating LineItem (20% faster) by @alamb in #32
  • Unbox writer (3-5% faster) by @alamb in #33
  • Revert "Avoid copying String when generating LineItem (20% faster… by @alamb in #36
  • Add lifetimes to generators and types by @scsmithr in #35
  • Remove lazy_static dep, replace with std::sync::LazyLock by @scsmithr in #40
  • Pass distribution references to generators by @scsmithr in #43
  • Avoid copying strings for Addresses (10% faster for customers, 3% faster overall) by @alamb in #39
  • feat: Derive Debug for all generator and iterator types by @clflushopt in #47
  • Remove more String copying (5% faster) by @alamb in #37
  • Add decimal type, avoid floating point calculations (7% faster) by @scsmithr in #48
  • Precompute date strings (10% faster) by @alamb in #50
  • Make 'static lifetime explicit in Generators by @alamb in #38
  • Add ARCHITECTURE.md documentation with design goals (keep data generator crate dependencies minimal) by @alamb in #49
  • Add docs, improve doc tests by @alamb in #55
  • Add into_inner for wrapper types, and derive some more traits` by @alamb in #52
  • Implement output as CSV by @alamb in #54
  • Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) by @alamb in #62
  • (POC): Remove some hashmap lookups in distribution lookup by @scsmithr in #59
  • Avoid String creation while loading Distributions (2% less startup time) by @alamb in #63
  • feat: Use lookup tables for julian date format (3-5% improvement) by @clflushopt in #64
  • Parallel output in tpchgen-cli (Nx faster, where N is number of cores) by @alamb in #58
  • Speed up initial TextPool generation more (15% faster) by @alamb in #67
  • feat: remove chrono dependency by @clflushopt in #68
  • Minor README update by @alamb in #70
  • fix: quote columns that may contain comma for csv output by @niebayes in #66
  • Provide access to TPCHDates as (y,m,d) by @alamb in #69
  • Add tpchgen-arrow crate to generate Arrow RecordBatches, implement conversion for LineItem by @alamb in #71
  • Implement arrow conversion for remaining tables by @alamb in #74
  • Support writing directly to parquet files by @alamb in #61
  • Stop early on parquet write error by @alamb in #82
  • Avoid double buffering when writing CSV/TBL by @alamb in #79
  • Quit early on error for tbl/csv by @alamb in #84
  • feat: allow writing to stdout by @clflushopt in #83
  • fix: Use a different shorthand for parquet compression argument by @clflushopt in #86
  • Support writing parquet to stdout, document use of pv by @alamb in #85
  • Limit number of parquet row groups by @alamb in #87
  • Do not generate CSV header multiple times by @alamb in #78
  • Add documentation for zstd compression by @alamb in #88
  • fix: Align the cli with the original dbgen by @clflushopt in #90
  • fix: Make sure table name handling is done by the crate instead of clap by @clflushopt in #92
  • fix: Align all manifest files to use workspace level configuration for v0.1.0 by @clflushopt in #93

New Contributors

Full Changelog: https://github.com/clflushopt/tpchgen-rs/commits/v0.1.0