Data documentation
Data can get complex, and documentation is often non-existent. We will look into two areas that can be documented with code. Documenting with code instead of GUI tools means the documentation can be stored and reviewed like any other code.
diagrams
The diagrams library is very useful for creating solution diagrams. A solution diagram is often a broad picture of the architecture and key components. It can be organized in a way that explains key interactions.
Here, we are creating a small example document using the diagrams package:
from diagrams import Cluster, Diagram
from diagrams.aws.analytics import Quicksight, EMR

with Diagram("Data Platform", show=False):
    with Cluster("Dev"):
        dashboards = Quicksight("Tableau")
        spark_clusters = [EMR("Notebook_cluster"), EMR("Jobs_cluster")]
        ...
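To make the "diagram as reviewable code" idea concrete without any extra dependencies, here is a minimal sketch that writes Graphviz DOT text by hand. The diagrams library relies on Graphviz for rendering, but this is not its API: the to_dot helper, its parameters, and the single edge between the jobs cluster and the dashboard are assumptions made up for illustration. The node and cluster names are borrowed from the example above.

```python
# Hypothetical helper, not part of the diagrams library: it emits Graphviz DOT
# text from plain Python data so the diagram source can be diffed and reviewed.

def to_dot(title, clusters, edges):
    """Return DOT source for named clusters of nodes plus directed edges."""
    lines = [f'digraph "{title}" {{']
    for index, (cluster_name, nodes) in enumerate(clusters.items()):
        # Graphviz draws a box only around subgraphs whose name starts with "cluster".
        lines.append(f"  subgraph cluster_{index} {{")
        lines.append(f'    label="{cluster_name}";')
        for node in nodes:
            lines.append(f'    "{node}";')
        lines.append("  }")
    for source, target in edges:
        lines.append(f'  "{source}" -> "{target}";')
    lines.append("}")
    return "\n".join(lines)


dot_source = to_dot(
    "Data Platform",
    {"Dev": ["Tableau", "Notebook_cluster", "Jobs_cluster"]},
    # Assumed interaction, for illustration only: the dashboard reads from the jobs cluster.
    [("Jobs_cluster", "Tableau")],
)
print(dot_source)
```

Because the whole diagram is ordinary text built from ordinary data, adding a component or an interaction shows up as a small, readable change in code review.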