In this article, I will guide you step by step to create a dynamic documentation site, adaptable to any project, where you can connect your documentation to a database to extract and display data, ensuring the information is always up to date. We will also explore how to automate the entire process, from content generation to deployment in the cloud with AWS.
The solution includes support for charts
and diagrams
, continuous integration (CI/CD)
using a simple workflow in GitHub Actions
, and automatic deployment using Terraform
.
Let’s get started!
Documentation and its updates are an important process in many companies that develop software, often carried out using different tools, many of which are paid solutions.
Therefore, in recent times, the concept of “doc as code” has emerged.
This means using the same tools and workflows used in software development to manage
, version
, and deploy
documentation.
This approach not only allows for better tracking of the documentation but also facilitates its maintenance and ensures alignment with the same best practices used in software development, not just in the code but also in the documentation
.
For the development of these sites, it is essential to understand some practices and tools that allow us to implement this approach. Below is a detailed list of the most important aspects to cover in this tutorial.
MkDocs is a static site generator written in 🐍Python, designed specifically for documenting projects. Its goal is to simplify creating documentation using Markdown files, which are easy to write and read.
With minimal configuration, MkDocs converts Markdown files into a navigable and well-structured documentation website, making it ideal for developers and teams who want to keep their documentation up to date.
MkDocs Material is an advanced theme for MkDocs that follows Google’s Material Design guidelines.
Mermaid is a **JavaScript** library for creating diagrams and charts from text. By integrating with MkDocs Material, Mermaid allows you to generate visualizations such as flowcharts, entity-relationship diagrams, and other charts within the documentation without external tools.
Jinja is a library that allows embedding variables and data from Python dictionaries into HTML, making web pages dynamic. This library is commonly used for generating dynamic HTML and sending personalized emails.
Docusaurus is an open-source project developed by Meta in 2007 that simplifies the creation, deployment, and maintenance of documentation websites in a fast and efficient way. It allows the use of Markdown and MDX to write content, while its core built on React enables full customization of the styles to fit the specific needs of the project.
Additionally, Docusaurus supports Mermaid through the @docusaurus/theme-mermaid
plugin, enabling the inclusion of charts and diagrams directly within the documentation.
Diagram as Code is an approach that allows you to create diagrams through code, rather than using traditional graphic tools. Instead of manually building diagrams, you write code in a text file to define the structure, components, and connections of your diagrams.
This code is then translated into graphical images, making it easier to integrate and document in software projects. It’s especially useful for creating and updating architectural and flow diagrams programmatically.
As previously mentioned, Diagrams allows you to generate blueprints using the icons of major cloud technologies. The representation of these diagrams is done through nodes, and in our example, we’ll use all cloud-related nodes and AWS services.
For more details on how I created this, you can read my article about Diagram as Code, and the full implementation can be found in this repository:
In this use case, I will create a documentation site
for a machine learning project involving 🏥 hospital data.
The goal is to build an interactive documentation site using MkDocs initially and later migrate it to Docusaurus.
The site will include both static and dynamic components to meet specific requirements, such as embedding visual diagrams and updating data dynamically from a SQLite database.
Diagram as Code
) to illustrate the architecture of the machine learning pipeline effectively.For this reason our documentation site will have the following pages:
In this section, we’ll walk through the steps to set up a documentation project using MkDocs from scratch and explain its organized directory structure.
To get started, you’ll need to install the following 🐍Python libraries:
Install MkDocs and the Material bash pip install mkdocs mkdocs-material
Install additional libraries to enable dynamic content updating bash pip install aiosql pandas sqlite3 jinja2 shutil
—
Initialize the Project
Start by creating a new MkDocs project.
Run the following commands in your terminal:
bash mkdocs new mkdocs cd mkdocs
This command creates a basic MkDocs project with a default structure
.
Explore the Directory Structure
Once the MkDocs site is created, you need to add the following files and folders, as they are not included by default.
Remember, the links to the repository are provided at the end of this post for your reference, and each component will be explained in detail below.
📁 docs/
├── 📁 img/
├── `architecture.md`
├── `glossary.md`
├── `index.md`
├── `tables.md`
├── 📁 template/
│ ├── 📁 db/
│ │ ├── 📁 data/
│ │ │ ├── hospital.db
│ │ ├── 📁 queries/
│ ├── `architecture.md`
│ ├── `glossary.md`
│ ├── `index.md`
│ ├── `tables.md`
│ └── `update.py`
📁 infraestructure/
📁 github/
├── 📁 workflows/
│ ├── main.yml
📄 mkdocs.yml
Component | Directory | Description |
---|---|---|
Database (db ) |
db |
Contains the SQLite database (hospital.db ) and queries (metadata.sql , person.sql ) to manage dynamic data. Learn more about managing SQL queries in Python in my previous article: Python Projects with SQL. |
🖋️ Templates & Pages | template |
Markdown templates: index.md , tables.md , architecture.md , glossary.md . Supports Mermaid diagrams, embedded images, and database-driven content. |
🖼️ Static Content (docs ) |
docs |
Final site generated by update.py , including images (img/ ) and dynamic content populated from template . |
🌐 Infrastructure (infraestructure ) |
infraestructure |
Terraform scripts (main.tf , variables.tf ) to deploy an S3 bucket for documentation hosting. |
Configuring mkdocs.yml
Once we have our project structure set up, we will configure it step by step, starting with the mkdocs.yml file
.
This file defines the structure and settings for your documentation site.
Here’s how it should be structured:
mkdocs.yml
site_name: Hospital Documentation
nav:
- Home: index.md
- Synthea Tables: tables.md
- AWS Architectur: architecture.md
- Glossary: glossary.md
markdown_extensions:
- pymdownx.superfences:
custom_fences:
- name: mermaid
class: mermaid
theme:
name: material
In this configuration file, you can primarily see in the nav section the pages that will be accessible from the menu.
Then, we specify the Mermaid
extension, which will be explained in the next section.
Finally, the theme section applies the Material theme
, enabling styling and components available within this library.
As mentioned earlier, Mermaid is a JavaScript library for creating diagrams and charts from text. Below, we will see some examples. In our case, we will use it to generate an Entity Relationship Diagram (ERD) on the tables page of the documentation.
In the repository, you will be able to see how to construct this code based on the Entity Relationship Diagram (ERD) found in the official Synthea documentation. You can also check the example of the tables page in the following link: tables.md.
To enable dynamic content generation for our documentation site, we’ll use Jinja to process templates and replace placeholders with actual data. Below is a step-by-step breakdown:
Set Up a templates
Folder
Create a folder named templates
to store all Markdown files for the site.
These files should include placeholders.
For instance, in index.md
, you might have placeholders like {{database.version_date}}
and {{database.version}}
.
Utilize Placeholders
Placeholders are dynamic variables in the Markdown files.
These variables will be updated automatically using Python dictionaries to inject relevant data.
Generate Dynamic Content with update.py
Prepare your Markdown templates by identifying the sections where dynamic data is required.
Use a Python script (update.py
), available in my repository, to process the templates. The script performs the following tasks:
docs
folder, ready for rendering in MkDocs.from jinja2 import Template
def render_template(template_str, data):
"""Render the template with the data."""
= Template(template_str)
template return template.render(data)
# Data structure
= {
data_dict 'database': {
'version': 1,
'version_date': '2024-01-01'
}
}# Render the template
= render_template("Data updated to {{database.version_date}}", data_dict)
rendered_content print(rendered_content)
Data updated to 2024-10-01 10:20:30
By following these steps, you can automate the updating process for your documentation site, ensuring the content remains dynamic and relevant without manual edits.
In the next example, we will update the content in the tables.md
file to show an example of the persons
table from the database.
To do this, we will create a placeholder {{table.person}}
within the Markdown file.
The idea is to dynamically fetch the data from the persons
table, and then use the Jinja library along with pandas to convert the query results into a Markdown table format.
Here’s an example of how the tables.md
file looks with the placeholder:
#### Example Person Table
## Person Table
{{table.person}}
The process is as follows:
persons
table in the SQLite database to fetch relevant records.{{table.person}}
placeholder in the tables.md
file will be replaced by the generated Markdown table.import sqlite3
import pandas as pd
import aiosql
def get_queries():
= aiosql.from_path('template/db/queries', 'sqlite3')
sql return sql
def get_table_person(db_name):
"""Get a DataFrame from the PATIENTS table."""
= get_queries().get_example_patients.sql
query = sqlite3.connect(db_name)
connection
= pd.read_sql_query(query, connection)
df
connection.close()return df
def update_tables_file(template_path, output_dir, db_name):
"""Update the tables.md file"""
= get_table_person(db_name)
df # Convert the dataframe in markdown format
= {
data_dict 'table': {
'person': df.to_markdown(index=False)
}
}
print(get_table_person("hospital.db"))
This way, the documentation always reflects up-to-date data, displaying dynamic examples based on the actual content from the database.
docs/template
directory.update.py
: Populate dynamic content and generate the final files in docs/output
.mkdocs serve
to preview the site on localhost.mkdocs build
to generate a static site in the docs/
folder.In the following sections, I will provide detailed steps and insights on how to implement a documentation site using Docusaurus. This includes setup, customization, and deployment options.
To get started with Docusaurus, we follow a quick setup process, which is very similar to the steps we used for MkDocs but with different tools.
Create a New Docusaurus Project: First, install Node.js and run the following command to create a new Docusaurus site:
npx create-docusaurus@latest my-website classic
Install Mermaid Package: To enable Mermaid diagrams, install the required package:
npm install @docusaurus/theme-mermaid
Run the Development Server: Once installed, navigate to your project directory and run the development server:
cd my-website
npx docusaurus start
Visit the Site: Your site will be live locally at: http://localhost:3000
.
The configuration file docusaurus.config.js
is where we customize the title, theme, navigation, and enable features like Mermaid for diagram rendering.
Example snippet for enabling Mermaid:
.exports = {
moduletitle: 'Hospital Documentation',
tagline: 'Documentation for Hospital Data ML Project',
favicon: 'img/favicon.ico',
url: 'https://your-site-url.com',
markdown: {
mermaid: true, // Enable Mermaid diagrams
,
}themeConfig: {
navbar: {
title: 'Hospital Docs',
items: [
to: 'docs/', label: 'Home', position: 'left' },
{ to: 'docs/tables', label: 'Tables', position: 'left' },
{ to: 'docs/architecture', label: 'Architecture', position: 'left' },
{ to: 'docs/glossary', label: 'Glossary', position: 'left' },
{ ,
],
}footer: {
style: 'dark',
links: [
label: 'GitHub', href: 'https://github.com/your-repo' },
{ ,
],
},
}; }
To customize the homepage, we modify the src/components/HomepageFeatures/index.js
file.
Here, you can adjust the FeatureList object to update the features displayed on the homepage.
Just like in MkDocs, Docusaurus supports Markdown files for content, and we organize the structure as follows: 1.
Template Folder: Store your Markdown files in the docs/template
directory, and create a Python script (similar to update.py
) to fetch and populate dynamic data into these templates.
2.
Category File (__category__.json
): To manage the order of documents in the sidebar, create a __category__.json
file in each folder.
For example:
Architecture
├── architecture.md
├── img
└── __category__.json
__category__.json
Example: json { "label": "Architecture", "position": 2, "link": { "type": "generated-index", "description": "AWS Data Processing Blueprint" } }
To incorporate dynamic content, such as database tables, we use a 🐍Python script named update.py
, which you can find in the repository.
This script fetches data from a SQLite database and processes the Markdown files stored in the templates
folder.
It then updates these files with the fetched data and copies them into the docs
folder, preparing them for site rendering.
This workflow ensures that the content remains up-to-date and ready for deployment, following a similar approach to what we implemented with MkDocs.
docs/template
directory.npx docusaurus start
to preview the site.npx docusaurus build
to generate the static site.In this section, we will cover the deployment process for both MkDocs and Docusaurus using AWS S3 for hosting. While the deployment steps are the same for both tools, the installation processes differ, with MkDocs being Python-based and Docusaurus being JavaScript-based.
To deploy a static documentation site to AWS S3, we use Terraform to provision and configure the required resources.
The setup defines the S3 bucket, enables static website hosting, and configures public access with a bucket policy to allow read-only access.
You can find the main.tf
file in the repository.
index.html
and error.html
as the main and error documents.You can access the complete Terraform file and the corresponding configurations for deploying the site in the repository:
Terraform Configuration File: * mkdocs file * docusaurus file
GitHub Action Workflow for Automatic Deployment: A CI/CD pipeline to automate the deployment process is also included in the repository. * mkdocs file * docusaurus file
GitHub Actions Configuration Make sure to configure your AWS credentials in the GitHub repository secrets under Settings > Secrets > Actions. This will allow GitHub Actions to securely access your AWS account and perform actions like uploading files to S3 when you push changes to the
main
branch.
Below are the links to all the code to deploy your documentation site. If you find it useful, you can leave a star ⭐️ and follow me to receive notifications of new articles. This will help me grow in the tech community and create more content.
Both solutions are easy to implement, but in the following items, we can explore some differences, and what is the best solution depends on the context, knowledge, and complexity you may need to implement.
Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".
For attribution, please cite this work as
Mendez (2025, Jan. 10). Romina Mendez: Deploying Docs-as-Code on AWS: Building Dynamic Documentation Sites in MkDocs and Docusaurus. Retrieved from https://r0mymendez.github.io/posts_en/2025-01-10-deploying-docs-as-code-on-aws-building-dynamic-documentation-sites-in-mkdocs-and-docusaurus/
BibTeX citation
@misc{mendez2025deploying, author = {Mendez, Romina}, title = {Romina Mendez: Deploying Docs-as-Code on AWS: Building Dynamic Documentation Sites in MkDocs and Docusaurus}, url = {https://r0mymendez.github.io/posts_en/2025-01-10-deploying-docs-as-code-on-aws-building-dynamic-documentation-sites-in-mkdocs-and-docusaurus/}, year = {2025} }