Skip to content

Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) #62

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 5 commits into from
Mar 25, 2025

Conversation

alamb
Copy link
Collaborator

@alamb alamb commented Mar 25, 2025

This PR is based on great work from @scsmithr in #59 and makes initial Distribution loading time 2x faster.

The key change is to loading Distributions so the names are resolved once on load rather than on each access.

This significantly speeds up the startup time

Testing with

time target/release/tpchgen-cli --tables line-item -s 0.001 --output-dir=/tmp/tpchdbgen-rs
Header Header
main 0m2.036s
this branch 0m0.985s

@alamb alamb changed the title Remove hashmap lookups in distribution creation, remove IndexMap dependency Remove hashmap lookups in distribution creation, remove IndexMap dependency (2x startup time speedup) Mar 25, 2025
@alamb alamb marked this pull request as ready for review March 25, 2025 00:07
Copy link
Contributor

@scsmithr scsmithr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀 🚀 🚀

@alamb
Copy link
Collaborator Author

alamb commented Mar 25, 2025

Thank you for the super quick review @scsmithr

@alamb alamb merged commit 22c754a into clflushopt:main Mar 25, 2025
6 checks passed
@alamb alamb deleted the sean/dist-speed2 branch March 25, 2025 10:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants