Please enable JavaScript.
Coggle requires JavaScript to display documents.
Big Data, Nosql, Data grid/ cache products, Relational DB, NewSQL,…
Big Data
UpRising
-
Fall of relational model
-
-
-
-
Weak performance, evolutivity, flexibility
-
Storage
-
-
-
-
HDFS
-
DataNode
-
-
-
replication 2x locally, 1x elsewhere
-
-
-
-
-
HBase
-
-
-
Code
Creating Table
-
-
-
if (admin.tableExists(table.getTableName())) { admin.disableTable(table.getTableName()); admin.deleteTable(table.getTableName());
}
admin.createTable(table);
Put
-
p.addColumn(family1.getBytes(), "name".getBytes(), Bytes.toBytes("ahmed"));
p.addColumn(family1.getBytes(), "address".getBytes(),Bytes.toBytes("tunis"));
-
-
Get
-
-
System.out.println(Bytes.toString(r.getValue(family1.getBytes(), "name".getBytes())));
-
-
Usages (services)
-
-
Personnalisation
Who, why : page construct
-
Foundings
-
-
MVCC
-
Timestamping : data(lign,col,tstamp) big table
-
-
-
Map reduce
-
-
-
Itérer, Extraire, regroup, aggregate, generate
Map
treats (K,V), generates (K,Vs)
-
-
-
-
Spark
-
-
Architecture
Elements
Driver program (Context)
RDD Graph
-
-
Operations
Transformations
-
Variants
-
-
-
groupByKey : to be avoided, use reduceByKey
-
-
-
-
-
-
-
-
-
-
Actions
Output, saving, counting, input
Les actions produisent quelque chose qui peut être
inspecté directement au lieu de créer un autre RDD
-
-
-
-
Life cycle
-
count, collect... that can be parallelised
-
-
-
-
-
-
-
Nosql
-
-
-
Classification by
-
Data schema
-
-
-
Graph oriented
-
-
-
-
-
-
sharding, even though there might be solutions.
-
-
-
-
-
-
-
-