Please enable JavaScript.
Coggle requires JavaScript to display documents.
DE on Azure - Coggle Diagram
DE on Azure
Synapse Analytics
-
-
How ASA works?
1, Create Synapse analytics workspace
-
2, Working with files in a data lake
-
-
3, Ingest & transforming data with pipeline
create, run, manage pipelines (orchestrate)
-
4, Query & manipulating data with SQL
5, Process & analyze data with Spark
-
-
6, Explore data with Data Explorer
7, Integrated with other Azure services
-
-
-
-
-
-
pool types
-
Spark pool
-
Operations
-
work with data from various sources, including
-
-
-
-
-
-
-
-
-
-
-
Partition data files
-
More performance gains can be achieved when filtering data in queries by eliminating unnecessary disk IO
-
Table
-
metastore, a metadata layer that encapsulates relational abstractions over files.
external table
External tables are "loosely bound" to the underlying files and deleting the table does not delete the files
External tables are relational tables in the metastore that reference files in a data lake location that you specify.
managed table
Managed tables are "tightly-bound" to the files, and dropping a managed table deletes the associated files.
managed tables, for which the underlying data files are stored in an internally managed storage location associated with the metastore
-
-
-
Introduce DE on Azure
-
Data Operations
data Ingestion
-
enable secure, reliable access to data across multiple systems.
-
-
-
-
-
ADLS Gen2
Defination
stored in its natural format, usually as blobs or files
a comprehensive, massively scalable, secure, and cost-effective data lake solution
-
-
-
Azure Data Lake Storage Gen2 stores data in an HDFS compatible file system in an Azure Storage blob container.
To enable Azure Data Lake Storage Gen2 containers, you must turn on the Hierarchical namespace option.
-
-
-
-
-
-
-
-