paint-brush
Performance increase of Data Pipelines from S3 to Dynamodbby@lakindu
1,927 reads
1,927 reads

Performance increase of Data Pipelines from S3 to Dynamodb

by Lakindu Gunasekara2mAugust 9th, 2019
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

AWS data pipelines are one of the best mechanisms to transfer data from one storage to another storage with a different data type. While transferring data from pipelines, there are several techniques which can be used to optimize the process of copying data. In this article, the scenario would be copying 3 CSV format files which are stored in S3 bucket, to 3 Dynamodb tables. The performance is not up to what is expected. Even though we added an m4.large instance type as the EMR cluster, performance is lagging.

Company Mentioned

Mention Thumbnail
featured image - Performance increase of Data Pipelines from S3 to Dynamodb
Lakindu Gunasekara HackerNoon profile picture
Lakindu Gunasekara

Lakindu Gunasekara

@lakindu

Software Engineer

About @lakindu
LEARN MORE ABOUT @LAKINDU'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Lakindu Gunasekara HackerNoon profile picture
Lakindu Gunasekara@lakindu
Software Engineer

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite