Claudia_S

Reputation: 11

Does data in Amazon S3 go over the public internet when I use a Glue job?

I'm using AWS services to create a data pipeline. I have data stored in an Amazon S3 bucket, and I plan to use a Glue crawler to crawl the data under a prefix and extract the metadata, then a Glue job to do the ETL and save the data in another bucket.

My question is: on which network do these services run and communicate with each other? Is it possible that the data will be moved from Amazon S3 to Glue over the public internet?

Is there any link to AWS documentation that explains which networks AWS services use when they transfer data between them?

Upvotes: 1

Views: 621

Answers (2)

klaudiuszb

Reputation: 180

Traffic between S3 and other AWS services doesn't traverse the public internet. However, S3 traffic does leave the VPC boundary and travels across the AWS network, using AWS's own IP addresses and routing.

To prevent this, you can use VPC endpoints to change the routing and make sure the traffic never leaves the VPC boundary.
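
As a rough illustration of the VPC endpoint approach, here is a minimal sketch that creates an S3 gateway endpoint with boto3's `create_vpc_endpoint`. It assumes your Glue job actually runs inside a VPC subnet (for example through a Glue connection); the VPC ID, route table ID, and region are hypothetical placeholders, and the two helper function names are my own, not part of any AWS API.

```python
def s3_gateway_endpoint_params(vpc_id, route_table_ids, region):
    """Build the arguments for EC2's CreateVpcEndpoint call (S3 gateway endpoint)."""
    return {
        "VpcEndpointType": "Gateway",
        "VpcId": vpc_id,
        # Regional S3 service name, e.g. com.amazonaws.eu-west-1.s3
        "ServiceName": f"com.amazonaws.{region}.s3",
        # Route tables that should get a route to S3 through the endpoint
        "RouteTableIds": list(route_table_ids),
    }


def create_s3_gateway_endpoint(vpc_id, route_table_ids, region):
    """Create the endpoint. Needs boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the helper above works without boto3

    ec2 = boto3.client("ec2", region_name=region)
    params = s3_gateway_endpoint_params(vpc_id, route_table_ids, region)
    return ec2.create_vpc_endpoint(**params)


# Hypothetical IDs, for illustration only.
params = s3_gateway_endpoint_params(
    "vpc-0123456789abcdef0", ["rtb-0123456789abcdef0"], "eu-west-1"
)
print(params["ServiceName"])
```

Calling `create_s3_gateway_endpoint(...)` with your real IDs would add the S3 route to the listed route tables, so subnets associated with those tables reach S3 through the endpoint.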

You can read more about VPC endpoints here.

Upvotes: 0

alabalistic

Reputation: 307

You need to grant explicit permission to any resource before it can access your S3 bucket.

IAM roles: create a role with a policy and attach that role to the AWS resource.

A bucket policy is another mechanism to grant access.

By default everything is private. Unless you grant access, the bucket is not accessible from the internet.
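
To make the bucket policy option concrete, here is a minimal sketch that builds a policy letting a single IAM role list and read a bucket. The bucket name, account ID, and role name are hypothetical, and `bucket_read_policy` is just an illustrative helper, so adjust the actions to whatever your job really needs.

```python
import json


def bucket_read_policy(bucket, role_arn):
    """Return a bucket policy that lets one IAM role list and read the bucket."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "AllowRoleToListBucket",
                "Effect": "Allow",
                "Principal": {"AWS": role_arn},
                "Action": "s3:ListBucket",
                # ListBucket applies to the bucket itself
                "Resource": f"arn:aws:s3:::{bucket}",
            },
            {
                "Sid": "AllowRoleToReadObjects",
                "Effect": "Allow",
                "Principal": {"AWS": role_arn},
                "Action": "s3:GetObject",
                # GetObject applies to the objects inside the bucket
                "Resource": f"arn:aws:s3:::{bucket}/*",
            },
        ],
    }


# Hypothetical bucket and role, for illustration only.
policy = bucket_read_policy(
    "my-source-bucket", "arn:aws:iam::123456789012:role/MyGlueJobRole"
)
print(json.dumps(policy, indent=2))
```

The resulting JSON can be pasted into the bucket's permissions in the console, or applied with boto3's `put_bucket_policy(Bucket=..., Policy=json.dumps(policy))`.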

Upvotes: -1
