VCF - Mismatch AS/SP in headers doesn't fall back #62

Closed
keiranmraine opened this issue May 15, 2018 · 4 comments
@keiranmraine
Contributor

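If the AS (assembly) and SP (species) values in the BAM headers are mismatched, a warning should be emitted and the script should fall back to the command-line option; that would be an acceptable resolution. At present the merge fails instead: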

ERROR: Trying to merge contigs with conflicting data:
$VAR1 = bless( {
                 '_checksum' => undef,
                 '_length' => '249250621',
                 '_species' => 'Human',
                 '_assembly' => 'NCBI37',
                 '_name' => '1'
               }, 'Sanger::CGP::Vcf::Contig' );
 
$VAR1 = bless( {
                 '_checksum' => undef,
                 '_length' => 249250621,
                 '_species' => 'human',
                 '_assembly' => 'GRCh37d5',
                 '_name' => '1'
               }, 'Sanger::CGP::Vcf::Contig' );
at /opt/wtsi-cgp/bin/pindel_2_combined_vcf.pl line 70.
@keiranmraine
Contributor Author

Looks like the $contig->compare() method will have to be dropped in favour of a manual check of -name and -length.

The alternative is to alter cgpVcf, passing options to ignore species/build mismatches (behaviour should only change when they are defined).

https://github.com/cancerit/cgpVcf/blob/bd2ba581c6392cce01a79f0df8eff43d13fedba7/lib/Sanger/CGP/Vcf/Contig.pm#L84-L107

However, if the checksum is absent from one input the comparison should pass; if it is present in both, a mismatch should fail.
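To make the proposed rules concrete, here is a minimal sketch of a manual check on name and length plus the checksum rule described above. It is written in Python purely for illustration (the real code is Perl), and the `Contig` class, its field names, and both function names are hypothetical; they only mirror the keys in the dumped objects and are not the cgpVcf API.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class Contig:
    """Hypothetical stand-in for Sanger::CGP::Vcf::Contig (illustration only)."""
    name: str
    length: Union[int, str]  # the dump above shows both '249250621' and 249250621
    species: Optional[str] = None
    assembly: Optional[str] = None
    checksum: Optional[str] = None


def checksums_compatible(a: Contig, b: Contig) -> bool:
    """Pass when either checksum is absent; otherwise they must be equal."""
    if a.checksum is None or b.checksum is None:
        return True
    return a.checksum == b.checksum


def same_sequence(a: Contig, b: Contig) -> bool:
    """Manual check on name and length only, ignoring species and assembly."""
    return (
        a.name == b.name
        and int(a.length) == int(b.length)  # normalise string vs integer lengths
        and checksums_compatible(a, b)
    )
```

With the two contigs from the error above (same name and length, no checksums), `same_sequence` returns true even though species and assembly differ.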

@sb43 what are your thoughts? Caveman doesn't use the function in cgpVcf.

@sb43
Member

sb43 commented Nov 23, 2018

@keiranmraine I think modifying cgpVcf is the ideal option; I will look into it.

@sb43
Member

sb43 commented Nov 26, 2018

As discussed with @keiranmraine, separate functions to compare length & name and assembly & species will be added to the pindel code base.
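A rough sketch of how the two separate comparisons could fit together with the warn-and-fall-back behaviour requested in the issue. This is Python for illustration only, not the Perl that went into pindel; the function names and the `cli_assembly` / `cli_species` parameters are assumptions. The thread does not say what should happen when no command-line value is supplied, so this sketch simply leaves the field unset in that case.

```python
import warnings
from typing import Optional


def name_and_length_match(a: dict, b: dict) -> bool:
    """Hard requirement: contigs must agree on name and length."""
    return a["name"] == b["name"] and int(a["length"]) == int(b["length"])


def assembly_and_species_match(a: dict, b: dict) -> bool:
    """Soft requirement: a mismatch here only triggers a warning."""
    return (
        a.get("assembly") == b.get("assembly")
        and a.get("species") == b.get("species")
    )


def merge_contig(
    a: dict,
    b: dict,
    cli_assembly: Optional[str] = None,  # hypothetical command-line value
    cli_species: Optional[str] = None,   # hypothetical command-line value
) -> dict:
    """Merge two contig records, falling back to command-line values."""
    if not name_and_length_match(a, b):
        raise ValueError(f"Conflicting name/length for contig {a['name']!r}")
    merged = dict(a)
    if not assembly_and_species_match(a, b):
        warnings.warn(
            f"Assembly/species mismatch for contig {a['name']!r}; "
            "using command-line values"
        )
        # Assumption: an unset command-line value leaves the field as None.
        merged["assembly"] = cli_assembly
        merged["species"] = cli_species
    return merged
```

Splitting the checks this way keeps the name/length mismatch fatal while downgrading the AS/SP mismatch to a warning.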

@sb43
Member

sb43 commented Dec 10, 2018

Fixed in v3.0.6

sb43 closed this as completed Dec 10, 2018