In this article, I will guide you step by step to create dynamic and interactive visual documentation using Diagram-as-Code tools. Instead of static images, we will generate diagrams programmatically, ensuring they are always up-to-date and easy to maintain.
Diagram as Code is an approach that allows you to create diagrams through code instead of traditional graphic tools. Instead of manually building diagrams, you can write code in a text file to define the structure, components, and connections of your diagrams.
This code is then translated into graphical images, making it easier to integrate and document in software projects, where it is especially useful for creating and updating architectural and flow diagrams programmatically.
Diagrams is a 🐍Python library that implements the Diagram as Code approach, enabling you to create architectural infrastructure diagrams and other types of diagrams through code.
With Diagrams, you can easily define cloud infrastructure components (such as AWS
, Azure
, and GCP
), network elements, software services, and more, all with just a few lines of code.
I was currently using version '0.23.4'
for this tutorial.
!pip install diagrams=='0.23.4'
The library allows you to create architectural diagrams programmatically, using nodes to represent different infrastructure components and services.
Nodes in Diagrams represent components from different cloud service providers as well as other architectural elements. Here are the main categories of available nodes:
The Diagrams library allows you to use different nodes to represent various programming languages. These nodes are helpful for indicating in your diagrams if any part of your architecture utilizes scripts or components developed in a specific programming language.
Below, we will showcase all the available languages in the library. If any language is missing, you can add custom nodes by uploading the corresponding logo into a specific folder.
# Create the diagram object
with diagrams.Diagram("Programming Languages", show=False, filename="languages"):
# Get all the languages available in this library
= [item for item in dir(diagrams.programming.language) if item[0] != '_']
languages
# Divide the representation in two lines
= len(languages) // 2
mid_index = languages[:mid_index]
first_line = languages[mid_index:]
second_line
# Add nodes in the first row
= None
prev_node
for language in first_line:
= eval(f"diagrams.programming.language.{language}(language)")
current_node if prev_node is not None:
>> current_node
prev_node = current_node
prev_node
# Add nodes in the second row
= None
prev_node
for language in second_line:
= eval(f"diagrams.programming.language.{language}(language)")
current_node if prev_node is not None:
>> current_node
prev_node = current_node
prev_node
"languages.png") Image(
We can use Amazon nodes, which are organized into several categories, such as:
Next, we will represent one of these categories to visualize the available nodes within aws.database
.
from diagrams import Diagram
from IPython.display import Image
import diagrams.aws.database as aws_database
= []
database_components for item in dir(aws_database):
if item[0] != '_':
if not any(comp.startswith(item) or item.startswith(comp) for comp in database_components):
database_components.append(item)
with Diagram("AWS Database", show=False, filename="aws_database"):
= len(database_components) // 2
mid_index = database_components[:mid_index]
first_line = database_components[mid_index:]
second_line
= None
prev_node for item_database in first_line:
= eval(f"aws_database.{item_database}(item_database)")
current_node if prev_node is not None:
>> current_node
prev_node = current_node
prev_node
= None
prev_node for item_database in second_line:
= eval(f"aws_database.{item_database}(item_database)")
current_node if prev_node is not None:
>> current_node
prev_node = current_node
prev_node
"aws_database.png") Image(
Now, let’s create a simple blueprint that corresponds to importing a dataset and training a machine learning model on AWS.
from diagrams import Diagram, Cluster
from diagrams.aws.storage import S3
from diagrams.aws.analytics import Glue, Athena
import diagrams.aws.ml as ml
from diagrams.aws.integration import StepFunctions
from diagrams.aws.compute import Lambda
from diagrams.aws.network import APIGateway
from IPython.display import Image
with Diagram("AWS Data Processing Pipeline", show=False):
= Lambda('Get Raw Data')
lambda_raw # Buckets de S3
with Cluster("Data Lake"):
= S3("raw_data")
s3_rawData = S3("staging_data")
s3_stage = S3("data_capture")
s3_data_capture
= Athena("Athena")
athena >> athena
s3_rawData >> athena
s3_stage >> athena
s3_data_capture
# Step Functions Pipeline
with Cluster("Data Processing Pipeline"):
= StepFunctions("Pipeline")
step_functions
# Glue Jobs in Step Functions
with Cluster("Glue Jobs"):
= Glue("job_data_quality")
data_quality = Glue("job_data_transform")
transform = Glue("job_dataset_model")
dataset_preparation
# Define Step Functions Flows
>> data_quality >> transform >> dataset_preparation
step_functions >> data_quality
s3_rawData
# SageMaker for model training and deployment
with Cluster("SageMaker Model Deployment"):
= ml.SagemakerTrainingJob("job_train_model")
train_model = ml.SagemakerGroundTruth("job_evaluate_model")
eval_model = ml.SagemakerModel("model_enpoint")
endpoint
# API Gateway and Lambda for the endpoint
= APIGateway("API_gateway")
api_gateway = Lambda("invoke_endpoint")
lambda_fn
# Connection
>> s3_rawData
lambda_raw >> train_model >> eval_model >> endpoint
s3_stage >> lambda_fn >> api_gateway
endpoint >> s3_data_capture
endpoint >> train_model
dataset_preparation
"aws_data_processing_pipeline.png") Image(
Below are the link to all the code, if you find it useful, you can leave a star ⭐️ and follow me to receive notifications of new articles. This will help me grow in the tech community and create more content.
If you want to see how to implement a documentation site using this pipeline you can read the article I published in the following link
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
For attribution, please cite this work as
Mendez (2025, Jan. 10). Romina Mendez: Diagram-as-Code: Creating Dynamic and Interactive Documentation for Visual Content. Retrieved from https://r0mymendez.github.io/posts_en/2025-01-10-diagram-as-code-creating-dynamic-and-interactive-documentation-for-visual-content/
BibTeX citation
@misc{mendez2025diagram-as-code:, author = {Mendez, Romina}, title = {Romina Mendez: Diagram-as-Code: Creating Dynamic and Interactive Documentation for Visual Content}, url = {https://r0mymendez.github.io/posts_en/2025-01-10-diagram-as-code-creating-dynamic-and-interactive-documentation-for-visual-content/}, year = {2025} }