Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regression on BigQuery load on scio 0.14.12 #5607

Open
turb opened this issue Feb 13, 2025 · 5 comments
Open

Regression on BigQuery load on scio 0.14.12 #5607

turb opened this issue Feb 13, 2025 · 5 comments

Comments

@turb
Copy link
Contributor

turb commented Feb 13, 2025

Loading in BigQuery with saveAsTypedBigQueryTable (method = STORAGE_WRITE_API) with scio 0.14.11 (or on branch fix-typed-bq-avro from #5529) works just fine.

However after an upgrade to 0.14.12 every entity goes to failed inserts, which can be observed using tap.output(BigQueryIO.FailedStorageApiInserts):

syntax error while parsing value - invalid literal; last read: 'G' with: GenericData{classInfo=[f], {keyColumn=keyValue, otherColumn=otherValue, jsonColumn=GenericData{classInfo=[f], {jsonKey=jsonValue, otherJsonKey=otherJsonValue}}, dateColumn=142507353448901608}}

(anonymized data)

It looks like either the TableRow or the JSON column is sent to BigQuery as its toString, not its JSON serialization.

This data has a JSON property, I did not test without.

@turb
Copy link
Contributor Author

turb commented Feb 13, 2025

cc @clairemcginty I suppose it's a side effect of #5598.

@clairemcginty
Copy link
Contributor

thanks for reporting ! reproduced it here. looking into a fix

@turb
Copy link
Contributor Author

turb commented Feb 24, 2025

Is it fixed by #5611?

@clairemcginty
Copy link
Contributor

hey @turb! Yes. However, we discovered an additional bug that breaks writing byte-arrays when using BigQueryType + storage write API. It's been fixed in Beam 2.63 and we're working on an upgrade PR in #5613. Hoping to have a patch release with everything by mid-week 👍

@clairemcginty
Copy link
Contributor

0.14.13 is released -- let me know if you have any further issues!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants