Performance increase of Data Pipelines from S3 to Dynamodb

Written by lakindu | Published 2019/08/09
Tech Story Tags: pipeline | aws | performance | dynamodb | s3 | latest-tech-stories | software-development | programming

TLDR AWS data pipelines are one of the best mechanisms to transfer data from one storage to another storage with a different data type. While transferring data from pipelines, there are several techniques which can be used to optimize the process of copying data. In this article, the scenario would be copying 3 CSV format files which are stored in S3 bucket, to 3 Dynamodb tables. The performance is not up to what is expected. Even though we added an m4.large instance type as the EMR cluster, performance is lagging.via the TL;DR App

no story

Written by lakindu | Software Engineer
Published by HackerNoon on 2019/08/09