Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable the minimum number of matches and common code thresholds by default #45

Merged
merged 2 commits into from
Dec 29, 2023

Conversation

Gadiguibou
Copy link
Contributor

There is no point to setting min-matches to a value above 0 by default. Even though it may yield a few project pairs that have very few matches, it may also eliminate project pairs with large, valid matches after match expansion.

A similar argument could be made for disabling common-code-threshold by default: For a small set of submissions (< 20), the threshold corresponds to < 2 submissions which implies no match gets reported.

There is no point to setting min-matches to a value above 0 by default. Even though it may yield a few project pairs that have very few matches, it may also eliminate project pairs with large, valid matches after match expansion.

A similar argument could be made for disabling `common-code-threshold` by default: For a small set of submissions (< 20), the threshold corresponds to < 2 submissions which implies no match gets reported.
@Gadiguibou Gadiguibou self-assigned this Dec 29, 2023
@louis-hildebrand
Copy link
Contributor

Good point! I think it would be a good idea to also set the common code threshold to zero, as you suggested.

It might be nice to add a "verbose" flag which makes the tool print out various stats like number of projects found, number of project pairs, number of project pairs after applying min-matches, etc. That way users can figure out which command-line arguments are problematic.

@Gadiguibou
Copy link
Contributor Author

Good point! I think it would be a good idea to also set the common code threshold to zero, as you suggested.

It might be nice to add a "verbose" flag which makes the tool print out various stats like number of projects found, number of project pairs, number of project pairs after applying min-matches, etc. That way users can figure out which command-line arguments are problematic.

Yes we should move that to another issue though.

@Gadiguibou Gadiguibou force-pushed the disable-min-matches-by-default branch from 0f43ef5 to fe06dd3 Compare December 29, 2023 19:17
@Gadiguibou Gadiguibou merged commit 7035045 into main Dec 29, 2023
2 checks passed
@Gadiguibou Gadiguibou deleted the disable-min-matches-by-default branch December 29, 2023 19:19
@Gadiguibou Gadiguibou changed the title Set the default value for min-matches to 0 Disabled the minimum number of matches and common code thresholds by default Dec 29, 2023
@Gadiguibou Gadiguibou changed the title Disabled the minimum number of matches and common code thresholds by default Disable the minimum number of matches and common code thresholds by default Dec 29, 2023
@louis-hildebrand
Copy link
Contributor

Yes we should move that to another issue though.

I created a new issue for it: #46.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants