based in part on experiences in #3534, I think we could improve our human-readable output for inspecting taxonomy results. this is particularly important as we head towards more comprehensive examination of tax membership
a few specific things come to mind -
- options to limit display to either total % explained (e.g. "I want to see what explains 99% of tax assignments, and no more", to avoid the tons of .01% matches), or total number of hits ("I only want to see n hits"), or only hits above
- hierarchical output organized by higher-level ranks, e.g. "order hits by phylum-level % explained, but then show me details for the top x% of species underneath that"
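To make the two ideas above concrete, here is a rough sketch in plain Python. Everything in it is hypothetical: the function names `limit_hits` and `by_phylum`, the `(lineage, fraction)` tuple layout, and the made-up example lineages are illustration only, not existing sourmash API or output. It also simplifies "top x% of species" to "top n species" per phylum.

```python
from collections import defaultdict

# Made-up example data: (semicolon-separated lineage, fraction explained).
hits = [
    ("d__Bacteria;p__Bacteroidota;s__A", 0.50),
    ("d__Bacteria;p__Bacillota;s__B", 0.30),
    ("d__Bacteria;p__Bacteroidota;s__C", 0.15),
    ("d__Bacteria;p__Bacillota;s__D", 0.04),
    ("d__Bacteria;p__Pseudomonadota;s__E", 0.01),
]


def limit_hits(hits, max_cumulative=None, max_hits=None):
    """Keep the largest hits until a cumulative fraction or a hit count is reached."""
    kept, total = [], 0.0
    for lineage, fraction in sorted(hits, key=lambda h: h[1], reverse=True):
        if max_hits is not None and len(kept) >= max_hits:
            break
        if max_cumulative is not None and total >= max_cumulative:
            break
        kept.append((lineage, fraction))
        total += fraction
    return kept


def by_phylum(hits, top_species=2):
    """Order phyla by summed fraction, keeping the top species under each one."""
    groups = defaultdict(list)
    for lineage, fraction in hits:
        # Assumption: the phylum is the second field of the lineage string.
        phylum = lineage.split(";")[1]
        groups[phylum].append((lineage, fraction))
    report = []
    for phylum, members in groups.items():
        members.sort(key=lambda h: h[1], reverse=True)
        phylum_total = sum(fraction for _, fraction in members)
        report.append((phylum, phylum_total, members[:top_species]))
    report.sort(key=lambda r: r[1], reverse=True)
    return report


# Show only what explains the first 90%, then a phylum-first summary.
for lineage, fraction in limit_hits(hits, max_cumulative=0.9):
    print(f"{fraction:6.1%}  {lineage}")
for phylum, phylum_total, members in by_phylum(hits, top_species=1):
    print(f"{phylum_total:6.1%}  {phylum}")
    for lineage, fraction in members:
        print(f"    {fraction:6.1%}  {lineage}")
```

The cutoff in `limit_hits` includes the hit that crosses the threshold, so the displayed rows always explain at least the requested fraction when enough hits exist; stopping just short of the threshold would be an equally reasonable choice.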
for multiple files, etc. Right now it's specialized for AllTheBacteria/AllTheBacteria#59, but it'd be easy to make it nicer. It's reasonably fast: about 5 minutes for 2,000 files containing 3.3M fastmultigather results. Uses polars, of course ;).
this could definitely be a plugin!
see also https://github.com/ctb/2025-explore-sourmash-gather for a similarly-motivated script for exploring gather results w/o taxonomy.