6.3 Configuration Management

Table of Contents

Introduction

Configuration management on Linux is the practice of defining, recording, and enforcing how systems should be configured. Instead of changing servers manually, you describe the desired state in files, store those files in version control, and let tools apply them consistently. This is essential for reliability, repeatability, and scaling beyond a few machines.

Why Configuration Management Matters

When you manage one personal machine, you can log in and install packages or edit configuration files by hand. As soon as you manage multiple servers, manual work becomes slow, error prone, and difficult to track. Two servers that are supposed to be identical can silently drift apart. When something fails, you may not know what actually changed, or when.

Configuration management solves this by treating infrastructure as something you describe in code. Instead of thinking “I ran these commands yesterday,” you think “this server’s state is defined in this repository.” You can recreate a server from scratch, recover more quickly from failures, and be more confident that testing and production environments match.

Key idea: Configuration management replaces ad hoc manual changes with repeatable, version controlled definitions of system state.

Declarative State vs Imperative Steps

In configuration management there is an important distinction between describing “what” you want and “how” to do it.

In an imperative approach, you write step by step instructions. For example, you might write a shell script:

#!/bin/sh
apt update
apt install -y nginx
systemctl enable nginx
systemctl start nginx

This describes actions in sequence. If the system is already in the correct state, the script may still repeat work or fail.

In a declarative approach, you describe the final state you want. For example, you might say, “the nginx package must be installed and the nginx service must be enabled and running.” The tool then figures out what actions are needed to reach that state. If the state is already correct, it will do nothing. If something has drifted, it will correct it.

Most configuration management tools centered in this course use a declarative style. You define resources, such as packages, files, services, and users, along with their desired properties. The tool ensures that reality matches your definitions.

Idempotence and Convergence

Configuration management tools aim for idempotence. A change is idempotent if running it multiple times produces the same result as running it once. This is important because configuration tools usually run repeatedly over time, for example on a schedule or during deployments.

If you say that a file should have specific content, applying that definition again and again should leave the file in that same content without further side effects. This leads to convergence. Over time, as configuration is applied, every system should move closer to, and then remain at, the defined desired state.

Rule: An idempotent configuration can be safely applied multiple times. Each application leaves the system in the same desired state.

Infrastructure as Code and Version Control

Configuration management is a core part of the broader idea of infrastructure as code. Instead of relying on documentation and memory, you place the definitions of systems into text files. These files live in version control systems such as Git.

With configuration stored as code, you can:

Record history. You can see when configurations changed, who changed them, and why.

Review changes. You can use pull requests or merge requests so that peers review modifications before they affect servers.

Roll back. If a change breaks something, you can revert to an earlier version of the configuration.

Share knowledge. New team members can read the configuration repository and understand how systems are built.

Configuration management tools read this code and apply it to machines. Over time, the configuration repository becomes the primary source of truth for how your Linux infrastructure is set up.

Desired State and Drift

When you adopt configuration management, you establish a desired state for each system. This includes what packages should be installed, which services should be enabled, how configuration files should look, and even which users and groups should exist.

Over time, there is a risk of configuration drift. Drift occurs when the actual state of a system moves away from the desired state. This can happen if someone applies a manual change directly on a server, or if a one time script adjusts something without updating the configuration definitions.

Configuration management tools help you detect and correct drift. When they run, they compare the current system to the desired state and report or fix differences. In stricter setups, manual changes are avoided and all modifications must go through configuration code, which greatly reduces drift.

Abstraction Over Individual Commands

Manual configuration usually focuses on individual commands. You might think, “I need to run this apt command, then edit that file with nano.” Configuration management tools encourage you to think in terms of resources and relationships.

Instead of “run apt install nginx,” you define a resource that states that the nginx package must be present. Instead of “run systemctl enable nginx,” you define that the nginx service must be enabled and started.

These tools also manage dependencies for you. For example, a service might require a package to be installed and a configuration file to exist. You can express that relationship, and the tool will ensure things happen in the right order. This abstraction means you spend less time thinking about sequences of commands and more time describing the shape of the system.

Centralization and Scale

As the number of Linux systems grows, it becomes harder to manage each one individually. Configuration management tools allow you to define shared configurations that apply to many machines, and more specific configurations that apply to only a few.

You might have a base configuration that sets security policies, logging, and monitoring for all servers. On top of that, you might define web server roles, database roles, or specialized application roles. Each machine can be assigned one or more roles, and the tool will merge and apply the relevant configuration.

This centralization makes it possible to:

Onboard new servers quickly, with minimal manual intervention.

Keep fleets of machines consistent with each other.

Change settings across many hosts using a single, well reviewed change to configuration code.

Testing and Validation of Configuration

Because configuration is expressed as code, it becomes possible to test it. On Linux, you can stand up temporary virtual machines or containers, apply configuration, and verify that the results look correct before affecting production systems.

You can also perform static checks on configuration files. These might include syntax validation, style checks, or basic safety rules. Combined with automation in continuous integration pipelines, this reduces the chance that a typo or mistake will reach real servers.

This testing mindset treats configuration as something that can be developed and improved like application code. Over time, you build confidence that your configuration changes are safe and predictable.

Integration with Automation and DevOps

Configuration management plays a central role in automation and DevOps practices. When configuration is defined as code and applied automatically, it becomes easier to integrate it with other automated workflows.

For example, when you deploy a new version of an application, you can also adjust the configuration that goes with it. When you build or destroy infrastructure in the cloud, configuration tools can automatically shape the new servers to match your standards.

In a DevOps environment, application teams and operations teams often collaborate on configuration repositories. Both sides can see and understand how applications are configured on Linux systems, and both can propose improvements. This reduces the gap between writing code and running it.

Important: In DevOps, configuration management provides a shared, automated way to define how applications and infrastructure behave on Linux systems.

Summary

Configuration management on Linux is about expressing system configuration in reusable, version controlled definitions instead of performing ad hoc manual changes. It relies on concepts like declarative state, idempotence, convergence, and desired state versus drift. By centralizing configuration, enabling testing, and integrating with automation, configuration management becomes a foundation for reliable and scalable Linux operations. Subsequent chapters introduce specific tools and formats that implement these ideas in practice.

6.3.1 Ansible basics

6.3.2 Playbooks and inventories

6.3.3 Puppet overview

6.3.4 Chef overview