lordcenzin
lordcenzin

Reputation: 359

Apache Beam / GCP Datawflow: read video/image files

I am struggling to understand how to create a pipeline which involves reading and manipulating a video/image file. Looking into the documentation, which is quite "essential" without examples and few comments, maybe the beam.io package should help, i.e. class LocalFileSystem

However I have no idea how to use it to create a working pipeline which reads and apply some transform (i.e. frame extraction with ffmpeg)

I am using using python, however if java is more documented i can switch.

Any example? any help? Thanks in advance

Upvotes: 3

Views: 1266

Answers (1)

enle lin
enle lin

Reputation: 1714

IMHO, you can specifying ffmpeg in the workers in order to use it for image/video processing. To upload the specified resources instead of default ones to the workers, use filesToStage pipeline option. To use this option, you should use Java SDK since it is not available in Python.

See this SO question for more details about use ffmpeg in pipeline and this question to have a overview about the process.

Upvotes: 2

Related Questions