Please enable JavaScript.
Coggle requires JavaScript to display documents.
Azure DE Fundamentals - Coggle Diagram
Azure DE Fundamentals
File Storage
Delimited Text Files "CSV"
JSON
XML
BLOB
Binary Format
Used to store images and videos
Optimized File Formats
AVRO
Header stored as JSON
Data stored as Binary
Row based
ORC
Column based
Optimize read & write ops in hive
contains stripes of data
index
data
footer
data stats
Parquet
Column based
file contains row groups
chunks of data
metadata for each chunk
storing and processing nested data
efficient compression and encoding
Analytical Data Processing
OLAP Solutions
Optimized for Read Ops
Process
ETL
Aggregated
Measures
Dimensions
Data warehouses
Denormalization
Common Data Formats
Structured
Semi-Structured
Unstructured
Transactional Data Processing
OLTP Solutions
Optimized for Read & Write
ACID
Atomicity
each transaction is treated as a single unit, which succeeds completely or fails completely.
Consistency
transactions can only take the data in the database from one valid state to another.
Isolation
concurrent transactions cannot interfere with one another
Durability
when a transaction has been committed, it will remain committed.
Databases
Relational Databases
Tables
Columns & Rows
SQL
Non-Relational
Key-Value
Document
Graph