AWS Marketplace has a number of different datasets available in its 'Data Products' category.
I'd like to find a way to programmatically parse this catalogue and extract some metadata for each of these data products, such as their descriptions, so I can aggregate it in a file.
I've tried the two code snippets below, but they seem to return only what the account is currently subscribed to, not everything that's available:
import boto3

def list_marketplace_data_products():
    # Marketplace Catalog API client
    client = boto3.client('marketplace-catalog', region_name='us-east-1')
    response = client.list_entities(
        Catalog='AWSMarketplace',
        EntityType='DataProduct'
    )
    return response['EntitySummaryList']
and
import boto3

def list_data_exchange_data_sets():
    # AWS Data Exchange client
    client = boto3.client('dataexchange', region_name='us-east-1')
    response = client.list_data_sets()
    return response['DataSets']
In both cases, EntitySummaryList and DataSets are empty.
Is there any way to do this without web scraping? I'm also somewhat familiar with the hack of calling 'describe-images' with AWS Marketplace as the owner, but that doesn't seem like a neat solution.
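For reference, the 'describe-images' hack I mean looks roughly like the sketch below. The function names `summarize_images` and `fetch_marketplace_amis` are just illustrative, and the fields I pull out (`ImageId`, `Name`, `Description`) are my own choice. As far as I can tell this lists AMIs owned by AWS Marketplace rather than data products, which is part of why it doesn't feel like the right tool here.

```python
def summarize_images(response):
    """Pull a few metadata fields out of a describe_images response dict."""
    return [
        {
            "id": image.get("ImageId"),
            "name": image.get("Name"),
            # Description is optional in the response, so default to ""
            "description": image.get("Description", ""),
        }
        for image in response.get("Images", [])
    ]


def fetch_marketplace_amis(region_name="us-east-1"):
    """Call EC2 describe_images with AWS Marketplace as the owner.

    Needs valid AWS credentials; the unfiltered result can be large.
    """
    import boto3  # imported lazily so summarize_images works without boto3

    ec2 = boto3.client("ec2", region_name=region_name)
    response = ec2.describe_images(Owners=["aws-marketplace"])
    return summarize_images(response)
```

Calling `fetch_marketplace_amis()` returns a list of small dicts that could be dumped to a file, which is the kind of output I'm after for data products.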