Additional Key Services - Analytics
Amazon Kinesis
Amazon Kinesis Firehose
loading massive amounts of streaming data into AWS
destinations: S3, Redshift, or Elasticsearch
for a Redshift destination, data is first stored in S3 and then loaded into Redshift
Amazon Kinesis Streams
analysing streaming data in real time
architecture: distribution of massive data into shards for further analysis
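The shard idea can be sketched in a few lines of Python. This is a local illustration only, not the Kinesis API: the function names (`shard_ranges`, `shard_for_key`, `distribute`) are hypothetical, and the even split of the hash key space is an assumption. It relies on the fact that Kinesis maps each record's partition key to a shard by taking the MD5 hash of the key as a 128-bit integer and finding the shard whose hash key range contains it.

```python
import hashlib

MAX_HASH = 2 ** 128  # MD5 produces a 128-bit value


def shard_ranges(num_shards):
    """Split the 128-bit hash key space into contiguous ranges (assumed even split)."""
    step = MAX_HASH // num_shards
    ranges = []
    for i in range(num_shards):
        start = i * step
        # The last shard absorbs any remainder so the whole space is covered.
        end = MAX_HASH - 1 if i == num_shards - 1 else (i + 1) * step - 1
        ranges.append((start, end))
    return ranges


def shard_for_key(partition_key, ranges):
    """Return the index of the shard whose hash key range contains MD5(partition_key)."""
    h = int(hashlib.md5(partition_key.encode("utf-8")).hexdigest(), 16)
    for idx, (start, end) in enumerate(ranges):
        if start <= h <= end:
            return idx
    raise ValueError("hash outside all shard ranges")


def distribute(records, num_shards):
    """Group (partition_key, data) pairs by shard index, preserving arrival order."""
    ranges = shard_ranges(num_shards)
    shards = {i: [] for i in range(num_shards)}
    for key, data in records:
        shards[shard_for_key(key, ranges)].append(data)
    return shards
```

Records with the same partition key always land in the same shard, which is why ordering is only preserved per shard; for example, `distribute([("user-1", "click"), ("user-2", "view")], 4)` returns a dict of four lists keyed by shard index.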
Amazon Kinesis Analytics
analysing streaming data with SQL
fairly new; as of the book's date, announced but not yet released
Amazon Elastic MapReduce (Amazon EMR)
storage options
Hadoop Distributed File System (HDFS)
EC2 instance storage or EBS used underneath
EC2 instance storage is faster, but not persistent
EMR filesystem (EMRFS)
data stored in S3
Use Cases
Log processing
Clickstream analysis
Genomics and life sciences
AWS Data Pipeline
best for regular batch processing