Need recommendation to create an API by aggregating data from multiple source APIs

Question

Before I start doing this I wanted to get advice from the community on the best and most efficient manner to go about doing it.

Here is what I want to do:

Ingest data from multiple API's which returns JSON
Store it in either S3 or DynamoDB
Modify the data to use my JSON structure
Pipe out the aggregate data as an API

The data will be updated twice a day, so I would pull in the data from the source APIs and put it through my pipeline twice a day.

So basically I want to create an API by aggregating data from multiple source APIs.

I've started playing with Lambda and created the following function using Python.

#https://stackoverflow.com/a/41765656
import requests
import json

def lambda_handler(event, context):
    #https://www.nylas.com/blog/use-python-requests-module-rest-apis/ USEFUL!!!
    #https://stackoverflow.com/a/65896274
    response = requests.get("https://remoteok.com/api")
    #print(response.json())
    return {
        'statusCode': 200,
        'body': response.json()
    }
    #https://stackoverflow.com/questions/63733410/using-lambda-to-add-json-to-dynamodb DYNAMODB

This works and returns a JSON response.

Here are my questions:

Should I store the data on S3 or DynamoDB?
Which AWS service should I use to aggregate the data into my JSON structure?
Which service should I use to publish the aggregate data as an API, API Gateway?

However, before I go further I would like to know what is the best way to go about doing this.

If you have experience with this I would love to hear from you.

Need recommendation to create an API by aggregating data from multiple source APIs

Answers (1)

Related Questions