user374374
user374374

Reputation: 343

ordering message pub/sub GCP

I am new to Dataflow and pub-sub tools in GCP.

Need to migrate current on prem process to GCP.

Current Process is as follows:

We have two types of data feeds

  1. Full Feed – its adhoc job – Size of full XML is ~100GB (Single XML – very complex one – Complete data – ETL Job process this xml and load it into ~60 tables)
  1. Delta Feed - Every 30 min need to process delta files(XML files – it will have only changes with in last 30 min)

I was asked to propose the design to Migrate this to GCP. Need to have two process in GCP as well full and delta. My proposed solution should be handle/suitable for both the feeds.

Initially I thought below design.

Pub/sub -> DataFlow -> mySQL/BigQuery

Then came to know that pub/sub will not give the guarantee to process the files in sequence/order. After doing some research learn that recently google introduced ordering key concept for pub/sub, which will make sure to process the messages in order. In google cloud docs it was mentioned that, this feature is in Beta.

I have two questions:

Upvotes: 0

Views: 1011

Answers (1)

Nick_Kh
Nick_Kh

Reputation: 5253

As was mentioned by @guillaume blaquiere, Beta product launching phase brings some restrictions but they are mostly related to the product support:

At beta, products or features are ready for broader customer testing and use. Betas are often publicly announced. There are no SLAs or technical support obligations in a beta release unless otherwise specified in product terms or the terms of a particular beta program. The average beta phase lasts about six months.

Commonly, Cloud Pub/Sub message ordering feature works as intended, once you have something for developers attention it is highly appreciated to send a report via Google Issue tracker.

Upvotes: 0

Related Questions