Releases: clflushopt/tpchgen-rs
Releases · clflushopt/tpchgen-rs
v1.1.1
What's Changed
- fix: Inconsistent implementation of IntoIterator by @askyx in #148
- feat: prepare v1.1.1 by @clflushopt in #149
New Contributors
Full Changelog: v1.1.0...v1.1.1
v1.1.0
What's Changed
- Add rust-toolchain.toml to control rust version by @alamb in #120
- Change arrow version by @matthewmturner in #122
- fix: Move README to the top level to fix broken link issue by @clflushopt in #125
- Documentation tweaks by @alamb in #117
- feat: fix CLI option docs by @kevinjqliu in #127
- feat: fix CI and run CI for every PR by @kevinjqliu in #128
- [feat] update tpchgen-cli's README by @kevinjqliu in #129
- add python CLI bindings by @kevinjqliu in #121
- fix: Document via examples the behavior of multipart generation by @clflushopt in #135
- Fix up
tpchgen-cli
's README by @kevinjqliu in #133 - [feat] pyproject use cargo version by @kevinjqliu in #136
- [feat] publish to pypi by @kevinjqliu in #134
- [feat] skip conformance test for markdown file changes by @kevinjqliu in #137
- Replace ByteArrayBuilder with Vec by @scsmithr in #138
- fix: Drop unused TextPoolGenerator by @clflushopt in #141
- feat: Add TPC-H queries by @clflushopt in #140
- feat: prepare v1.1.0 by @clflushopt in #142
- feat: prepare v1.1.0 by @clflushopt in #143
New Contributors
- @kevinjqliu made their first contribution in #127
Full Changelog: v1.0.0...v1.1.0
v1.0.0
What's Changed
- Move chrono to dev dep by @matthewmturner in #107
- Improve documentation on additional shasum checks by @alamb in #109
- Fix typo by @alamb in #111
- fix: Use lowercase for possible values of table names by @szarnyasg in #114
- refactor: Simplify DuckDB export by @szarnyasg in #112
- feat: Link to @alamb's demo and blog in README by @clflushopt in #115
- feat: Small documentation improvements by @clflushopt in #116
- feat: prepare cargo release for v1.0.0 by @clflushopt in #118
New Contributors
- @matthewmturner made their first contribution in #107
- @szarnyasg made their first contribution in #114
Full Changelog: v0.1.1...v1.0.0
v0.1.1
What's Changed
- Update project documentation and benchmarking by @alamb in #72
- Add
as_str
helper toOrderStatus
by @scsmithr in #95 - fix: pin local crate versions by @clflushopt in #94
- fix: minor README improvements by @clflushopt in #96
- Update with benchmark numbers by @alamb in #98
- feat: add to_unix_epoch helper to TPCHDate by @clflushopt in #97
- chore: tweak doc comments, add
inline
for good measure by @alamb in #102 - feat: pin chrono package to 0.4.39 by @clflushopt in #105
- fix: bump tpchgen-arrow and tpchgen-cli to 0.1.1 by @clflushopt in #106
- fix: bump tpchgen to 0.1.1 by @clflushopt in #108
Full Changelog: v0.1.0...v0.1.1
v0.1.0
What's Changed
This is the initial release of the tpchgen-rs
package that contains the tpchgen
data generator crate and tpchgen-cli
which is a drop-in replacement for dbgen
written in Rust !
Changelog
- fix: run clippy fix to address some linting issues by @clflushopt in #8
- feat: Introduce gzipped test fixtures and steps to reproduce. by @clflushopt in #11
- Add instructions for verifying checked in results by @alamb in #13
- fix: date format for fixtures by @clflushopt in #15
- feat: Introduce end-to-end conformance integration tests. by @clflushopt in #16
- chore: Add issue and pr templates by @clflushopt in #18
- feat: Create
TextPool
once per process rather than once per table by @alamb in #19 - feat: Use tbl format for fixtures and tests by @clflushopt in #20
- feat: scaffold basic dbgen like api by @clflushopt in #12
- feat: Update conformance test to run at scale factors up to 1 by @clflushopt in #22
- feat: Create CONTRIBUTING.md by @clflushopt in #23
- docs: Some small README improvements by @clflushopt in #25
- Change Random* generators to return
&str
or structs rather thanString
(10% faster) by @alamb in #26 - feat: optimize single threaded performance by @clflushopt in #21
- Change date handling to use TPCHDate type rather than String (10% faster) by @alamb in #27
- feat: Drop unused dependencies and regex crate by @clflushopt in #29
- feat: Add fmt + clippy to the build action by @clflushopt in #30
- Avoid copying
String
when generatingLineItem
(20% faster) by @alamb in #32 - Unbox writer (3-5% faster) by @alamb in #33
- Revert "Avoid copying
String
when generatingLineItem
(20% faster… by @alamb in #36 - Add lifetimes to generators and types by @scsmithr in #35
- Remove lazy_static dep, replace with std::sync::LazyLock by @scsmithr in #40
- Pass distribution references to generators by @scsmithr in #43
- Avoid copying strings for Addresses (10% faster for customers, 3% faster overall) by @alamb in #39
- feat: Derive Debug for all generator and iterator types by @clflushopt in #47
- Remove more String copying (5% faster) by @alamb in #37
- Add decimal type, avoid floating point calculations (7% faster) by @scsmithr in #48
- Precompute date strings (10% faster) by @alamb in #50
- Make
'static
lifetime explicit in Generators by @alamb in #38 - Add
ARCHITECTURE.md
documentation with design goals (keep data generator crate dependencies minimal) by @alamb in #49 - Add docs, improve doc tests by @alamb in #55
- Add
into_inner
for wrapper types, and derive some more traits` by @alamb in #52 - Implement output as CSV by @alamb in #54
- Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) by @alamb in #62
- (POC): Remove some hashmap lookups in distribution lookup by @scsmithr in #59
- Avoid String creation while loading
Distributions
(2% less startup time) by @alamb in #63 - feat: Use lookup tables for julian date format (3-5% improvement) by @clflushopt in #64
- Parallel output in tpchgen-cli (Nx faster, where N is number of cores) by @alamb in #58
- Speed up initial TextPool generation more (15% faster) by @alamb in #67
- feat: remove chrono dependency by @clflushopt in #68
- Minor README update by @alamb in #70
- fix: quote columns that may contain comma for csv output by @niebayes in #66
- Provide access to TPCHDates as (y,m,d) by @alamb in #69
- Add
tpchgen-arrow
crate to generate ArrowRecordBatch
es, implement conversion forLineItem
by @alamb in #71 - Implement arrow conversion for remaining tables by @alamb in #74
- Support writing directly to parquet files by @alamb in #61
- Stop early on parquet write error by @alamb in #82
- Avoid double buffering when writing CSV/TBL by @alamb in #79
- Quit early on error for tbl/csv by @alamb in #84
- feat: allow writing to stdout by @clflushopt in #83
- fix: Use a different shorthand for parquet compression argument by @clflushopt in #86
- Support writing parquet to stdout, document use of
pv
by @alamb in #85 - Limit number of parquet row groups by @alamb in #87
- Do not generate CSV header multiple times by @alamb in #78
- Add documentation for zstd compression by @alamb in #88
- fix: Align the cli with the original dbgen by @clflushopt in #90
- fix: Make sure table name handling is done by the crate instead of clap by @clflushopt in #92
- fix: Align all manifest files to use workspace level configuration for v0.1.0 by @clflushopt in #93
New Contributors
- @alamb made their first contribution in #13
- @scsmithr made their first contribution in #35
- @niebayes made their first contribution in #66
Full Changelog: https://github.com/clflushopt/tpchgen-rs/commits/v0.1.0