Reputation: 636
I've got my pipeline up and running, but I need to save a number to a file so that I can resume from earlier processing if something goes wrong (or when I want to deploy a new version of the pipeline and cannot use the update-pipeline feature).
I tried adding the google-cloud-storage libraries, but I get errors at runtime: some classes from google-cloud-core are not found, even though all my dependencies are up to date. There must be a simple way just to save (and then read) a file. I would be grateful for a snippet or a link.
Upvotes: 0
Views: 258
Reputation: 151
It is probably too late to answer this :D, but perhaps it can help other people. If you are running into dependency hell because you use several GCP components, use https://github.com/googleapis/google-cloud-java/tree/master/google-cloud-bom. It provides a pom file (a BOM) that keeps track of the compatible versions of the different components.
Upvotes: 1
Reputation: 5049
If you are stuck in dependency hell and must do this step from within Dataflow, then I would recommend looking at side inputs and outputs, and then pairing them with some simple TextIO operations.
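As a rough illustration of the save-then-read step itself, here is a minimal sketch in plain Java that uses only `java.nio.file`, so it pulls in no GCP dependencies. The class name `CheckpointFile`, its `save` and `load` methods, and the file name are hypothetical, and the sketch assumes a path on a local filesystem. It does not talk to GCS, so inside a Dataflow job the actual write would still have to go through something GCS-aware, such as the TextIO route mentioned above.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.OptionalLong;

/** Hypothetical helper: stores a single number as text and reads it back. */
final class CheckpointFile {
    private CheckpointFile() {}

    /** Writes the value to a temp file beside the target, then moves it into place. */
    static void save(Path target, long value) throws IOException {
        Path dir = target.toAbsolutePath().getParent();
        Files.createDirectories(dir);
        Path tmp = Files.createTempFile(dir, "checkpoint", ".tmp");
        Files.write(tmp, Long.toString(value).getBytes(StandardCharsets.UTF_8));
        // Replacing via a move means a crash mid-write leaves only the temp file damaged.
        Files.move(tmp, target, StandardCopyOption.REPLACE_EXISTING);
    }

    /** Returns the stored value, or empty if nothing has been saved yet. */
    static OptionalLong load(Path target) throws IOException {
        if (!Files.exists(target)) {
            return OptionalLong.empty();
        }
        String text = new String(Files.readAllBytes(target), StandardCharsets.UTF_8).trim();
        return OptionalLong.of(Long.parseLong(text));
    }

    public static void main(String[] args) throws IOException {
        // Assumed location, for demonstration only.
        Path file = Files.createTempDirectory("pipeline-state").resolve("last-offset.txt");
        save(file, 42L);
        System.out.println(load(file).getAsLong());
    }
}
```

Storing the number as text keeps the file easy to inspect by hand, and returning an `OptionalLong` lets a fresh deployment distinguish "no checkpoint yet" from a saved value of zero.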
Upvotes: 0