Please enable JavaScript.
Coggle requires JavaScript to display documents.
image, AZZAM Nidal Salima
Module: Big Data
Master1 S1 2022/2023
FPT -…
-
What
is a programming model for writing applications that can process Big Data in parallel on multiple nodes (cluster)
-
-
Why
-
This makes it well-suited for dealing with big data and data science applications, where data sets can be too large to process and analyze on a single computer.
is used because it allows for the efficient processing and generation of large data sets in a parallel and distributed manner.
Who
is used by programmers and data analysts who need to process and generate large data sets in a parallel and distributed manner.
was first introduced by Google in 2004, and popularized by Hadoop
-
Where
is typically used on a cluster of computers, often in a distributed computing environment such as a cloud computing platform.
When
is used when large data sets need to be processed and analyzed in a parallel and distributed manner.
-