Which architecture strategy offers a scalable and cost-effective solution that meets these requirements?

Posted by: Pdfprep Category: BDS-C00 Tags: , ,

A company receives data sets coming from external providers on Amazon S3. Data sets from different providers are dependent on one another. Data sets will drive at different and is no particular order.

A data architect needs to design a solution that enables the company to do the following:

• Rapidly perform cross data set analysis as soon as the data becomes available

• Manage dependencies between data sets that arrives at different times

Which architecture strategy offers a scalable and cost-effective solution that meets these requirements?
A . Maintain data dependency information in Amazon RDS for MySQ
B . Use an AWS Pipeline job to load an Amazon EMR Hive Table based on task dependencies and event notification triggers in Amazon S3
C . Maintain data dependency information in an Amazon DynamoDB table. Use Amazon SNS and event notification to publish data to a fleet of Amazon EC2 workers. Once the task dependencies have been resolved process the data with Amazon EMR
D . Maintain data dependency information in an Amazon ElasticCache Redis cluster. Use Amazon S3 event notifications to trigger an AWS Lambda function that maps the S3 object to Redis. Once the dependencies have been resolved process the data with Amazon EMR
E . Maintain data dependency information in an Amazon DynamoDB table. Use Amazon S3 event notifications to trigger an AWS Lambda function that maps the S3 object to the task associated with it in DynamoD
F . Once all task dependencies have been resolved process the data with Amazon EMR

Answer: C

Leave a Reply

Your email address will not be published.