Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Record hashing perf #781

Merged
merged 31 commits into from
Dec 15, 2021
Merged

Record hashing perf #781

merged 31 commits into from
Dec 15, 2021

Conversation

johnkerl
Copy link
Owner

@johnkerl johnkerl commented Dec 15, 2021

The hashRecords=false narrative at https://github.com/johnkerl/miller/blob/main/internal/pkg/types/mlrmap.go#L1-L52 has been in place ever since the start of Miller in 2015 -- through the C implementation and the Go port as well.

On #779 (https://github.com/johnkerl/miller/blame/main/internal/pkg/types/mlrmap.go#L61) two days ago I left hashRecords=true experimentally for a while. However I find that I can't produce (yet) situations where hashRecords=false isn't faster. Performance improvement ranges from 5% to 40% for performance tests here, here, and here.

So, experiment complete -- I'm going back to the historical (and well-reasoned) behavior, and documenting it for future reference.

(For reference, this PR was incorrectly rebased -- of the 31 commits on this PR, only the last one -- 447f139 -- is new.)

@johnkerl johnkerl merged commit 58d9ad1 into main Dec 15, 2021
@johnkerl johnkerl deleted the record-hashing-perf branch December 15, 2021 03:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant