Code-to-Doc: Automating Technical Documentation from Codebase
How industry-standard coding practices enable automatic customer documentation generation — and why those docs should be a by-product of quality code, not a separate effort.
TL;DR — Developers already document their code in READMEs, OpenAPI specs, e2e tests, error handlers, CLI help text, and commit messages. Code-to-Doc is a methodology that extracts that information and transforms it into customer documentation — release notes, API references, how-to guides, troubleshooting references, CLI references, and more — so docs can never drift from reality.
Scope & Expectations
This post focuses primarily on customer-facing technical documentation for SaaS products (API docs, CLI docs, how-to guides, troubleshooting, and release notes).
Code-to-Doc works best in new or evolving codebases where teams can adopt coding standards early.
In mature products, retrofitting existing code with structured annotations/comments can be difficult due to legacy patterns, migration effort, and developer adoption challenges.
This is not a “one-click replacement” for technical writers. It is a methodology to automate draft/reference documentation from code, with human curation for clarity, narrative, and information architecture.
Table of Contents
- The Documentation Problem
- What is Code-to-Doc?
- Code-to-Doc vs. Docs-as-Code
- The Codebase-to-Documentation Map
- The 8 Pillars of Automation
- Value Add by Writers
- The By-Product Mindset
- Getting Started
- Conclusion
- Further Reading
The Documentation Problem
Every software team faces the same challenge: documentation drifts from reality.
You ship a feature on Monday. The docs say one thing, the code does another. By next Monday, two more changes have landed, and the documentation is a historical artifact rather than a useful guide.
We’ve tried to solve this:
| Approach | Why It Falls Short |
|---|---|
| More writers | Doesn’t scale; still falls behind |
| Developers writing docs | Inconsistent, often neglected |
| Docs-as-Code | Better workflow, but still manual writing |
| Wiki pages or articles | Become graveyards of outdated information |
The fundamental problem is not process or tooling. It’s that documentation and code are separate artifacts that must be manually kept in sync.
What if they weren’t separate?
What is Code-to-Doc?
Code-to-Doc is a methodology where technical documentation is automatically generated from the codebase itself — not as an afterthought, but as a natural by-product of well-structured code that follows industry standards.
The Key Insight
Developers already document their code — just not in a format that reaches end users:
| What Developers Write | Where It Lives | What End Users Need |
|---|---|---|
| Feature descriptions | README files | Conceptual overviews |
| Config explanations | Inline comments | Configuration guides |
| Code samples | Integration tests | Usage examples |
| User workflow steps | E2E test scenarios | How-to guides |
| API specs and descriptions | OpenAPI files | API reference docs |
| Command definitions | CLI help text | CLI reference docs |
| Error messages | Exception handling | Troubleshooting reference |
| Log messages | Logging information | Log reference |
| Change descriptions | Commit messages | Release notes |
The information exists. It’s just trapped in the codebase.
Code-to-Doc extracts this information and transforms it into product technical documentation.
Code-to-Doc vs. Docs-as-Code
These are complementary, not competing approaches.
Use both together: Code-to-Doc generates the content; Docs-as-Code manages the workflow. One eases content creation, and the other streamlines the publishing pipeline.
The Codebase-to-Documentation Map
Here’s the complete mapping from code artifacts to the documentation your end users actually need:
Each mapping relies on coding standards that teams already adopt for code quality; draft documentation generation is the bonus.
Quick check: if your team already has OpenAPI, tests, commits, and CLI help text, you already have the raw material for Code-to-Doc.
If you’ve read this far, you’ve probably seen documentation drift firsthand. Now let’s move from the pain to practical, code-level solutions.
1. Conceptual Documentation from READMEs
The Source: README Files
Every well-maintained codebase has README files that explain what the code does. These are goldmines for conceptual documentation.
What developers write (in a service README):
# Authentication Service
## Overview
Handles user identity management with multiple auth methods.
## Key Concepts
### Authentication Methods
- **Username/Password**: Traditional login with bcrypt-hashed passwords
- **OAuth2**: Social login via GitHub, Google, GitLab
- **API Keys**: Machine-to-machine authentication
### Token Management
- Access tokens: Short-lived (15 min), used for API requests
- Refresh tokens: Long-lived (7 days), used to obtain new access tokens
What this generates (for end users):
# Authentication Concepts
## Overview
The platform supports multiple ways to authenticate users and services.
## Authentication Methods
| Method | Best For |
|--------|----------|
| Username/Password | Internal users, simple setups |
| OAuth2 Social Login | Developer products, reducing signup friction |
| API Keys | CI/CD pipelines, service-to-service |
## Understanding Tokens
| Token Type | Lifetime | Purpose |
|------------|----------|---------|
| Access Token | 15 minutes | API requests |
| Refresh Token | 7 days | Get new access tokens |
README Documentation Standard
To enable concept extraction, structure READMEs consistently:
# Service/Feature Name
## Overview
[2-3 sentences explaining what this is and why it exists]
## Key Concepts
[Define important terms and ideas users need to understand]
### Concept 1
[Explanation]
### Concept 2
[Explanation]
## How It Works
[High-level explanation of behavior]
## Getting Started
[Quick start steps]
## Configuration
[Key configuration options]
## Related
[Links to related services/features]
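Templates like this are machine-readable by design. As an illustrative sketch (the function and its parsing rules are assumptions, not a published tool), a dozen lines of Python can split a conforming README into named sections that a doc template can then render:

```python
import re

def parse_readme_sections(markdown: str) -> dict:
    """Split a README into {section heading: body} using '## ' headings.

    The top-level '# ' title is skipped; '### ' sub-headings stay inside
    their parent section's body so concept lists survive intact.
    """
    sections, current = {}, None
    for line in markdown.splitlines():
        match = re.match(r"^##\s+(.+)", line)
        if match:
            current = match.group(1).strip()
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {name: "\n".join(body).strip() for name, body in sections.items()}
```

With the sections in hand, a generator can pull "Overview" and "Key Concepts" into the conceptual page and drop internal-only sections entirely.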
2. API Reference from OpenAPI
The Source: OpenAPI Specification
OpenAPI specs define your REST APIs. When properly documented, they generate complete API reference documentation — and the tooling is already mature.
What developers write (in an OpenAPI spec):
openapi: 3.1.0
info:
title: User Management API
description: |
API for managing user accounts and authentication.
## Authentication
All endpoints require a Bearer token except `/auth/login`.
version: 2.1.0
paths:
/auth/login:
post:
operationId: login
summary: Authenticate user
description: |
Authenticates a user with email and password,
returning access and refresh tokens.
tags:
- Authentication
requestBody:
required: true
content:
application/json:
schema:
type: object
required: [email, password]
properties:
email:
type: string
description: User's email address
example: "user@example.com"
password:
type: string
description: User's password
example: "securepassword123"
responses:
'200':
description: Authentication successful
content:
application/json:
example:
access_token: "eyJhbGciOiJIUzI1NiIs..."
expires_in: 900
'401':
description: Invalid credentials
# ... additional endpoints follow the same pattern
What this generates:
- Complete API reference with all endpoints grouped by tag
- Request/response examples with schema validation
- Error code reference tables
- Interactive API explorer (Swagger UI / Redoc)
- Client SDK stubs (via OpenAPI Generator)
OpenAPI Documentation Standard
To maximise documentation generation, ensure every endpoint includes:
| Element | Required | Purpose |
|---|---|---|
| `summary` | Yes | Becomes the endpoint title |
| `description` | Yes | Becomes the endpoint documentation |
| `tags` | Yes | Groups endpoints in navigation |
| `parameters[].description` | Yes | Documents each parameter |
| `parameters[].example` | Yes | Shows example values |
| `responses[].description` | Yes | Documents each response |
| `responses[].example` | Yes | Shows example responses |
| `operationId` | Yes | Used for SDK method names |
Tooling: Redoc, Swagger UI, Stoplight, and ReadMe can all consume an OpenAPI spec and produce a polished API reference site with zero manual writing.
3. How-To Guides from E2E Tests
The Source: End-to-End Tests
E2E tests simulate real user workflows. When documented with user intent, they become step-by-step how-to guides.
What developers write (in E2E tests):
/**
* @feature Project Management
*/
describe('Project Management', () => {
/**
* @scenario Create a new project
* @goal Set up a new project to organize your work
*
* @prerequisites
* - Logged in with member or admin role
*
* @result Project is created and you're redirected to project page
*/
it('should create a new project', () => {
cy.visit('/dashboard');
cy.get('[data-cy=newProjectButton]').click();
cy.get('[data-cy=projectNameInput]').type('Website Redesign');
cy.get('[data-cy=visibilitySelect]').select('private');
cy.get('[data-cy=createButton]').click();
cy.get('[data-cy=successToast]').should('contain', 'created');
});
// ... additional tests follow the same pattern
});
What this generates (for end users):
# Create a New Project
Set up a new project to organize your work.
## Prerequisites
- Logged in with member or admin role
## Steps
1. From the dashboard, click **New Project**.
2. Enter your project name and description.
3. Choose visibility: **Private** or **Public**.
4. Click **Create Project**.
## Result
Project is created and you're taken to the project page.
Test Documentation Standard
To enable comprehensive how-to extraction, document tests with clear annotations and comments:
/**
* @scenario [What user is trying to accomplish]
* @goal [Why user wants to do this]
*
* @prerequisites
* - [What must be true before starting]
*
* @steps
* 1. [First action in user language]
* 2. [Second action]
*
* @result [What success looks like]
* @note [Optional: important caveats]
* @next_steps [Optional: what to do after]
*/
Note: The `@scenario`, `@goal`, `@steps`, and `@result` tags are custom conventions you define for your project — not built-in JSDoc tags. A simple parser (or an LLM-assisted extractor) reads these annotations or comments and produces Markdown output. The key is test coverage and consistency across your test suite.
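To make the parser idea concrete, here is a minimal sketch of such an extractor in Python (function names and output shape are illustrative assumptions, not a published tool):

```python
import re

def extract_annotations(comment: str) -> dict:
    """Parse custom @tag annotations out of a JSDoc-style comment block.

    Tags with inline text (@scenario, @goal, @result) become strings;
    tags followed by bullet lines (@prerequisites, @steps) collect those
    lines until the next tag appears.
    """
    tags, current = {}, None
    for raw in comment.splitlines():
        # Strip the comment delimiters (/**, *, */) from each line
        line = re.sub(r"^\s*/?\*+/?\s?", "", raw)
        match = re.match(r"@(\w+)\s*(.*)", line)
        if match:
            current = match.group(1)
            tags[current] = [match.group(2)] if match.group(2) else []
        elif current and line.strip():
            tags[current].append(line.strip())
    return {k: v[0] if len(v) == 1 else v for k, v in tags.items()}

def to_howto_markdown(tags: dict) -> str:
    """Render extracted tags as a how-to page skeleton."""
    as_list = lambda v: v if isinstance(v, list) else [v]
    out = [f"# {tags['scenario']}", "", tags.get("goal", "")]
    if "prerequisites" in tags:
        out += ["", "## Prerequisites"] + as_list(tags["prerequisites"])
    if "result" in tags:
        out += ["", "## Result"] + as_list(tags["result"])
    return "\n".join(out)
```

A CI step can run this over every `describe`/`it` block and commit the resulting Markdown alongside the docs.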
4. Code Examples from Integration Tests
The Source: Integration and API Tests
Integration tests exercise your public API with real HTTP calls. With light annotation, they become copy-pasteable code examples in your technical guides.
What developers write (annotated integration test):
class TestProjectAPI:
"""
@sdk_section Projects
"""
def test_create_project(self, client):
"""
@sdk_example Create a new project
@sdk_description Create a project with a name and visibility setting.
"""
response = client.post("/api/v2/projects", json={
"name": "Website Redesign",
"visibility": "private",
"description": "Q1 website overhaul"
})
assert response.status_code == 201
assert response.json()["name"] == "Website Redesign"
def test_list_projects(self, client):
"""
@sdk_example List all projects
@sdk_description Retrieve a paginated list of projects
you have access to.
"""
response = client.get("/api/v2/projects", params={
"page": 1,
"per_page": 20
})
assert response.status_code == 200
assert "items" in response.json()
What this generates (for end users):
## Create a new project
Create a project with a name and visibility setting.
```python
response = client.post("/api/v2/projects", json={
    "name": "Website Redesign",
    "visibility": "private",
    "description": "Q1 website overhaul"
})
# Response: 201 Created
# { "name": "Website Redesign", ... }
```

## List all projects

Retrieve a paginated list of projects you have access to.

```python
response = client.get("/api/v2/projects", params={
    "page": 1,
    "per_page": 20
})
# Response: 200 OK
# { "items": [...], "page": 1, "per_page": 20 }
```
5. Troubleshooting Reference from Error Handling
The Source: Error Messages and Exception Handling
Your code already defines what can go wrong. With structured error handling, this becomes troubleshooting documentation.
What developers write (structured error handling):
public enum AuthError {
/**
* @userMessage "The email or password you entered is incorrect."
* @troubleshooting
* - Verify the email address is correct
* - Check for typos in the password
* - Try "Forgot Password" to reset
* @commonCause Typos, forgotten passwords
*/
INVALID_CREDENTIALS("AUTH001", "Invalid credentials"),
/**
* @userMessage "Too many login attempts. Try again in {minutes} minutes."
* @troubleshooting
* - Wait for the lockout period
* - Use "Forgot Password" while waiting
* @commonCause Forgotten password, brute force attempt
*/
RATE_LIMITED("AUTH002", "Rate limit exceeded"),
/**
* @userMessage "Your session has expired. Please log in again."
* @troubleshooting
* - Log in again to get a new session
* - If this happens frequently, check your token refresh configuration
* @commonCause Idle timeout, clock skew between client and server
*/
SESSION_EXPIRED("AUTH003", "Session expired");
// ... additional errors follow the same pattern
}
What this generates (for end users):
# Troubleshooting Authentication Errors
## AUTH001: Invalid Credentials
**Message**: "The email or password you entered is incorrect."
**Common causes**: Typos, forgotten passwords.
**How to fix**:
1. Double-check your email address for typos
2. Verify Caps Lock is not enabled
3. Try "Forgot Password" to reset
---
## AUTH002: Too Many Login Attempts
**Message**: "Too many login attempts. Try again in X minutes."
**Common causes**: Forgotten password, automated brute-force protection.
**How to fix**:
1. Wait for the lockout period to expire
2. Use "Forgot Password" while waiting
---
## AUTH003: Session Expired
**Message**: "Your session has expired. Please log in again."
**Common causes**: Idle timeout, clock skew between client and server.
**How to fix**:
1. Log in again to get a new session
2. If this happens frequently, check your token refresh configuration
Error Documentation Standard
Structure error definitions with metadata:
/**
* @code [Unique error code, e.g. AUTH001]
* @userMessage [What to show the end user]
* @commonCause [Why this typically happens]
* @troubleshooting
* - [Step-by-step resolution actions]
* @securityNote [Optional: security implications]
*/
Note: Like the test annotations, `@userMessage`, `@troubleshooting`, and `@commonCause` are custom Javadoc-style conventions you define. A custom Doclet or annotation processor extracts these into structured data for doc generation.
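If a full Doclet feels heavyweight, a lightweight regex scan over the enum source can get you started. This sketch (the parsing rules are assumptions; a production pipeline would use a proper Java parser) pulls each error's code, user message, and troubleshooting steps:

```python
import re

# Matches a /** ... */ block followed by an enum constant like NAME("CODE", ...
JAVADOC_BLOCK = re.compile(
    r"/\*\*(?P<doc>.*?)\*/\s*(?P<name>[A-Z_]+)\(\"(?P<code>\w+)\"",
    re.DOTALL)

def extract_error_docs(java_source: str) -> list:
    """Pull @userMessage and @troubleshooting data out of a Java error
    enum by scanning the Javadoc block that precedes each constant."""
    records = []
    for match in JAVADOC_BLOCK.finditer(java_source):
        # Drop the leading ' * ' decoration from each Javadoc line
        doc = re.sub(r"^\s*\*\s?", "", match.group("doc"), flags=re.MULTILINE)
        message = re.search(r'@userMessage\s+"([^"]+)"', doc)
        steps = re.findall(r"^-\s+(.+)$", doc, flags=re.MULTILINE)
        records.append({
            "code": match.group("code"),
            "name": match.group("name"),
            "message": message.group(1) if message else "",
            "troubleshooting": steps,
        })
    return records
```

The resulting records render straight into the troubleshooting reference shown above, one section per error code.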
6. Log Reference from Log Messages
The Source: Logging Statements
Applications generate logs that operators and users see during installation, operation, and troubleshooting. Structured logging metadata turns these into diagnostic documentation.
What developers write (structured logging):
public class ConnectionService {
/**
* @troubleshooting
* - Verify the database server is running
* - Check connection string in configuration
* - Verify network connectivity and firewall rules
*/
public void handleConnectionFailed() {
log.error("CONN-001: Failed to connect after {} attempts", maxRetries);
}
/**
* @troubleshooting
* - Check for connection leaks in application code
* - Consider increasing pool size in configuration
* - Review long-running queries
*/
public void handlePoolLow() {
log.warn("CONN-002: Connection pool low. {} of {} available",
available, total);
}
}
What this generates (for end users):
# Log Message Reference
## CONN-001: Failed to connect
**Level**: ERROR
**Example**: `CONN-001: Failed to connect after 5 attempts`
**What it means**: The application cannot reach the database after multiple retries.
**Action required**: Immediate attention needed.
1. Verify the database server is running
2. Check connection string in configuration
3. Verify network connectivity and firewall rules
---
## CONN-002: Connection pool low
**Level**: WARN
**Example**: `CONN-002: Connection pool low. 2 of 20 available`
**What it means**: Most database connections are in use.
**Action required**: Monitor and investigate if persistent.
1. Check for connection leaks in application code
2. Consider increasing pool size in configuration
3. Review long-running queries
7. Release Notes from Commits
The Source: Conventional Commits
When developers write commits in a standard format, release notes write themselves. This is the most mature Code-to-Doc pipeline — tools like release-please and semantic-release handle it end to end.
What developers write (following Conventional Commits):
# Feature
git commit -m "feat(projects): add project templates
Users can now select from pre-built templates when creating a project.
Templates include Agile, Kanban, and Bug Tracking configurations.
Closes #234"
# Bug fix
git commit -m "fix(notifications): prevent duplicate email sends
Fixed race condition where rapid status changes could trigger
multiple notification emails for the same event.
Fixes #456"
# Breaking change
git commit -m "feat(api)!: require authentication for all endpoints
BREAKING CHANGE: All API endpoints now require authentication.
Previously, some read endpoints were publicly accessible.
Migration: Add Authorization header to all API requests.
See docs/migration/v3-auth.md for details.
Closes #789"
What this generates (via release-please):
# Release Notes — v3.2.0 (January 30, 2026)
## New Features
- **Project Templates** — Select from pre-built templates (Agile, Kanban,
Bug Tracking) when creating a project. (#234)
## Bug Fixes
- **Notifications** — Fixed duplicate emails on rapid status changes. (#456)
## Breaking Changes
- **API Authentication** — All endpoints now require authentication.
Previously, some read-only endpoints were public.
**Migration**: Add `Authorization: Bearer <token>` header to all requests.
See [Upgrading to v3.2](/docs/migration/v3-auth). (#789)
Setting Up Automated Release Notes
1. Enforce commit format with commitlint:
npm install -g @commitlint/cli @commitlint/config-conventional
echo "module.exports = {extends: ['@commitlint/config-conventional']}" > commitlint.config.js
2. Add a release-please GitHub Action (.github/workflows/release-please.yml):
name: Release Please
on:
push:
branches: [main]
jobs:
release-please:
runs-on: ubuntu-latest
steps:
- uses: google-github-actions/release-please-action@v4
with:
release-type: node
3. Customise changelog sections (release-please-config.json):
{
"packages": {
".": {
"changelog-sections": [
{ "type": "feat", "section": "New Features" },
{ "type": "fix", "section": "Bug Fixes" },
{ "type": "perf", "section": "Performance Improvements" },
{ "type": "docs", "hidden": true },
{ "type": "chore", "hidden": true }
]
}
}
}
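release-please handles all of this for you; purely to illustrate the mechanics, here is a sketch of how Conventional Commit subjects map to changelog sections (the function name and section labels are illustrative):

```python
import re

def group_commits(messages: list) -> dict:
    """Group Conventional Commit messages into changelog sections.

    Parses 'type(scope)!: description' subjects; a '!' marker or a
    BREAKING CHANGE footer routes a commit to Breaking Changes.
    """
    sections = {"Breaking Changes": [], "New Features": [], "Bug Fixes": []}
    subject_re = re.compile(
        r"^(?P<type>\w+)(\((?P<scope>[^)]+)\))?(?P<bang>!)?: (?P<desc>.+)")
    for msg in messages:
        match = subject_re.match(msg.splitlines()[0])
        if not match:
            continue  # not a conventional commit; skip
        entry = match.group("desc")
        if match.group("scope"):
            entry = f"**{match.group('scope')}**: {entry}"
        if match.group("bang") or "BREAKING CHANGE:" in msg:
            sections["Breaking Changes"].append(entry)
        elif match.group("type") == "feat":
            sections["New Features"].append(entry)
        elif match.group("type") == "fix":
            sections["Bug Fixes"].append(entry)
    return sections
```

Types like `docs` and `chore` simply fall through, mirroring the `hidden: true` entries in the config above.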
8. CLI Reference from Command Definitions
The Source: CLI Framework Help Text
Modern CLI frameworks (Click, Cobra, argparse, picocli, oclif) already require developers to write help strings for every command and option. This is a ready-made source for CLI reference documentation.
What developers write (Python Click example):
@cli.group()
def artifacts():
"""Manage artifacts in repositories.
Commands for searching, downloading, uploading,
and deleting artifacts across all accessible repositories.
"""
pass
@artifacts.command()
@click.argument('query', required=True)
@click.option('--repo', '-r', default=None,
help='Limit search to a specific repository.')
@click.option('--format', '-f',
type=click.Choice(['table', 'json', 'csv']),
default='table',
help='Output format for results.')
@click.option('--limit', '-l', type=int, default=50,
help='Maximum number of results to return (1–1000).')
def search(query, repo, format, limit):
"""Search for artifacts matching a query pattern.
Supports wildcards (*) and regular expressions.
Examples:

\b
myctl artifacts search "libs-release/*.jar"
myctl artifacts search --repo docker-local --format json "nginx*"
myctl artifacts search --limit 10 "*.tgz"
"""
pass
What developers write (Go Cobra example):
var searchCmd = &cobra.Command{
Use: "search [query]",
Short: "Search for artifacts matching a query pattern",
Long: `Search for artifacts across all accessible repositories.
Supports wildcards (*) and regular expressions.`,
Example: ` myctl artifacts search "libs-release/*.jar"
myctl artifacts search --repo docker-local --format json "nginx*"
myctl artifacts search --limit 10 "*.tgz"`,
Args: cobra.ExactArgs(1),
RunE: runSearch,
}
func init() {
searchCmd.Flags().StringVarP(&repo, "repo", "r", "",
"Limit search to a specific repository")
searchCmd.Flags().StringVarP(&format, "format", "f", "table",
"Output format: table, json, csv")
searchCmd.Flags().IntVarP(&limit, "limit", "l", 50,
"Maximum results to return (1-1000)")
}
What this generates (for end users):
# CLI Reference
## myctl artifacts search
Search for artifacts matching a query pattern.
Supports wildcards (`*`) and regular expressions.
### Usage
myctl artifacts search [OPTIONS] QUERY
### Arguments
| Argument | Required | Description |
|----------|----------|-------------|
| `QUERY` | Yes | Search pattern (supports wildcards and regex) |
### Options
| Option | Short | Default | Description |
|--------|-------|---------|-------------|
| `--repo` | `-r` | *(all)* | Limit search to a specific repository |
| `--format` | `-f` | `table` | Output format: `table`, `json`, `csv` |
| `--limit` | `-l` | `50` | Maximum results to return (1–1000) |
### Examples
# Search for all JAR files in libs-release
myctl artifacts search "libs-release/*.jar"
# Search in a specific repo, output as JSON
myctl artifacts search --repo docker-local --format json "nginx*"
# Return only the first 10 results
myctl artifacts search --limit 10 "*.tgz"
CLI Documentation Standard
Most CLI frameworks generate `--help` output automatically. To go further:

- Always write a `Long` description, not just a `Short` summary.
- Include `Example` blocks with realistic, copy-pasteable commands.
- Use consistent flag naming (`--format`, `--output`, `--verbose`) across all commands.
- Export help as structured data — many frameworks support JSON or man-page output that can be parsed into docs.

Tooling: Cobra has `doc.GenMarkdownTree()`, Click has `click-man` and `sphinx-click`, and oclif generates docs natively. These produce a complete CLI reference site from your command definitions.
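The same pattern works with nothing but the standard library. This sketch uses Python's `argparse` (the `myctl` command is hypothetical) to show the core idea behind all of these generators: wrap the help text the framework already produces in Markdown:

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """A stand-in parser; 'myctl' and its flags are hypothetical."""
    parser = argparse.ArgumentParser(
        prog="myctl artifacts search",
        description="Search for artifacts matching a query pattern.")
    parser.add_argument("query", help="Search pattern (wildcards and regex)")
    parser.add_argument("-r", "--repo",
                        help="Limit search to a specific repository")
    parser.add_argument("-l", "--limit", type=int, default=50,
                        help="Maximum results to return (1-1000)")
    return parser

def parser_to_markdown(parser: argparse.ArgumentParser) -> str:
    """Wrap the help text the framework already generates in Markdown."""
    return "\n".join([
        f"## {parser.prog}", "",
        parser.description or "", "",
        "```text", parser.format_help().rstrip(), "```",
    ])
```

Because the help strings live next to the option definitions, the generated page can never describe a flag that the CLI no longer accepts.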
The By-Product Mindset
The most important aspect of Code-to-Doc is philosophical, not technical.
The Wrong Pitch
“Follow these standards so we can generate documentation.”
This fails because:
- Developers don’t see documentation as their problem
- Standards feel like bureaucratic overhead
- Compliance is minimal and grudging
The Right Pitch
“Follow these industry standards for better code quality, faster onboarding, and easier debugging. As a by-product, customer technical documentation can be automated.”
This works because:
- Developers care about code quality
- Every standard has intrinsic engineering value
- Documentation accuracy becomes automatic
Value Add by Writers
Value for Each Audience
| Audience | Primary Benefit | Documentation Benefit |
|---|---|---|
| Developers | Better code, less grunt work | No manual doc writing |
| Tech Writers | Focus on high-value content | Accuracy guaranteed |
| End Users | Always-accurate docs | Better product experience |
| Leadership | Faster releases | Lower documentation costs |
Code-to-Doc is powerful, but it doesn’t replace all documentation. Some content requires human judgement, empathy, and experience that can’t be extracted from code.
| Documentation Type | Why It Needs a Human |
|---|---|
| Information Architecture & UX | Organizing automated snippets into a logical, searchable hierarchy that matches user mental models. |
| Pipeline Curation & Maintenance | Auditing automated output for clarity and ensuring the “extraction glue” evolves alongside the product. |
| Getting-started tutorials | Require a curated learning journey with a specific narrative arc |
| Conceptual “why” explanations | Code tells you what and how — writers explain why and when |
| Best practices & patterns | Come from experience, not from code structure |
| Architecture decision records | Capture trade-offs and context that code doesn’t encode |
| Video & visual content | Walkthroughs, diagrams, and demos need creative work |
| Use-case & solution guides | Combine multiple features into a real-world scenario |
The sweet spot: Code-to-Doc handles the reference and procedural documentation (what and how); writers handle the conceptual and tutorial content (why and learning journeys). This aligns with the Diátaxis documentation framework, which categorises docs into Tutorials, How-To Guides, Reference, and Explanation.
Getting Started
Read this far? Great. The next section is the practical part: what to implement first, and what can wait.
| Approach | Effort | Best For |
|---|---|---|
| Off-the-shelf tools (Swagger, Redoc, release-please, Cobra doc gen) | Medium | OpenAPI, release notes, CLI reference |
| Custom scripts (regex / AST parsers in CI) | High | Test annotations, error enums, config comments |
| LLM-assisted extraction (GPT / Claude in a CI step) | High | READMEs, tests, log messages, annotations, comments |
The key insight: you don’t need to build everything at once. Start with the off-the-shelf tools (release notes, API reference, CLI docs), then add custom extractors as the value becomes clear.
Quick Wins (Start Here)
- Adopt Conventional Commits → Automated release notes
  - `npm install -g @commitlint/cli @commitlint/config-conventional`
- Document your OpenAPI specs → API reference
  - Add `description` and `example` to every endpoint, parameter, and response
- Generate CLI docs from your framework → CLI reference
  - Cobra: `doc.GenMarkdownTree()`
  - Click: `sphinx-click` or `click-man`
  - oclif: built-in doc generation
- Add JSDoc annotations to E2E tests → How-to guides
  - `/** @scenario What user accomplishes @steps 1. ... 2. ... */`
- Structure your error / log enums → Troubleshooting reference
  - Add `@userMessage` and `@troubleshooting` to every error code
Conclusion
Documentation drifts when it’s separate from code. Code-to-Doc makes the codebase the single source of truth for user documentation.
Key Principles
- Standards first — Adopt coding standards for quality; docs follow as a by-product
- Extract, don’t duplicate — Information exists once, in the code
- Automate the reference layer — Let humans focus on tutorials and conceptual content
- Start small — Commit format and OpenAPI descriptions are day-one wins
The Result
- Release notes that write themselves from conventional commits
- API docs that can’t be wrong because they come from the spec
- CLI reference generated directly from command definitions
- How-to guides that match the actual UI because they come from tests
- Troubleshooting reference built from real error codes, resolutions, and logs
Code-to-Doc: Because the best documentation is the documentation that writes itself.
If you read all the way to the end, you care about where technical documentation is heading. If you’re working on modernizing docs, I’d be happy to discuss ideas or collaborate.
Further Reading
- Google Coding Style Guides
- Conventional Commits — Commit message standard
- OpenAPI Specification — REST API description format
- release-please — Automated release notes from commits
- Diátaxis Framework — A systematic approach to documentation authoring
- Swagger UI — OpenAPI-powered API reference docs
- Redoc — OpenAPI-powered API reference docs
- Cobra Doc Generation — CLI docs from Go Cobra
- sphinx-click — CLI docs from Python Click
- Cypress Best Practices — E2E testing patterns
Keywords
Code-to-Doc Docs-as-Code LLM OpenAPI Documentation Automation Conventional Commits CI/CD E2E Testing Technical Writing REST API GitHub Actions SaaS Software Documentation CLI Reference Troubleshooting Reference Diátaxis Framework Release Notes API Reference Markdown Developer Experience By-Product Mindset