Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data load error #53

Closed
abhiachoudhary opened this issue Nov 3, 2023 · 3 comments
Closed

Data load error #53

abhiachoudhary opened this issue Nov 3, 2023 · 3 comments

Comments

@abhiachoudhary
Copy link

I just starting using BPCells and was following along the tutorial. But in the RNA matrix conversion section, the command with open_matrix_10x_hdf5 gets stuck (doesn't finish for hours):
mat_raw <- open_matrix_10x_hdf5("pbmc_3k_10x.h5", feature_type="Gene Expression") %>% write_matrix_dir("pbmc_3k_rna_raw")

It shows no error, progress, nor it stops running. Is there a bug in open_matrix_10x_hdf5, or I'm doing something wrong?

@bnprks
Copy link
Owner

bnprks commented Nov 4, 2023

This looks likely to be a bug in BPCells, potentially related to issue #52. It looks like I'm getting a similar issue when I test this out right now too -- I see continuous 100% CPU usage on one core, whereas the import should finish within seconds if things were working properly. So probably my fault, not yours -- maybe I changed something recently that inadvertently broke the 10x matrix import. I'll do some closer looking and hopefully post back here soon with a fix.

@bnprks
Copy link
Owner

bnprks commented Nov 4, 2023

I think I likely found + fixed the cause with a one-line bugfix 🤦 . This should be fixed in 21f8dcf, so if you re-install from the latest github things should be good.

It looks like I introduced this bug on October 18th in 3711a40, and the effect is to re-read the entire list of cell IDs / gene IDs for each time we want to read one ID (in this case resulting in about 100,000x too much work).

Thanks so much for reporting this bug and sorry for the inconvenience to you and the other ~400 people who have installed a broken version of BPCells in those two weeks. Let me know if you continue to have issues after this attempted fix

@abhiachoudhary
Copy link
Author

That fixed it. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants