Backup monitoring and IT environment failure alerting (Python)

A tool designed to monitor the correctness of backups and key elements of the IT environment. This solution was created to address the real need to verify that backups are actually performed, complete, and recoverable – not just “configured.”

The system automatically analyzes backup results, detects irregularities, and notifies the administrator of issues before they lead to data loss or operational downtime.

Solution Scope

  • Automatic verification of backup execution (local/network)
  • Checking:
    – presence of backup files
    – data size and integrity
    – backup execution time
  • Detection of:
    – missing backups
    – outdated backups
    – data size anomalies
  • Backup process log analysis
  • Support for multiple data sources (servers, workstations, network resources)

Alerts and Response

  • Email notifications of errors and threshold violations
  • Optional integration with messaging apps (e.g., Telegram)
  • Clear messages containing:
    – scope of the problem
    – time of occurrence
    – indication of the potential cause
  • Data preparation for quick remediation

Architecture and Operation

  • Cyclically run tool (cron / Task Scheduler)
  • Logic separated from the environment configuration
  • Easily adaptable to various backup policies
  • Central logging of audit results
  • Designed for reliability and minimal environmental impact Production

Technologies

Python, file system/log analysis, SMTP (email notifications), instant messaging API (optional), Linux/Windows environments

See other projects