Playbook catalog
Remediation library: 270 playbooks. Each one is a YAML file with prechecks, steps, result verification, and a rollback policy; only commands from the playbook's own allowlist are executed.
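The allowlist rule can be illustrated with a minimal Python sketch. The real playbook schema is not shown in this catalog, so every field name below (`allowlist`, `prechecks`, `steps`, `verify`, `rollback`) is an assumption, as are the example commands and the choice to match on the executable name only.

```python
import shlex

# Hypothetical playbook shape: field names and commands are illustrative
# assumptions, not the actual schema used by the library.
PLAYBOOK = {
    "problem_class": "kube_kubelet_down_local",
    "risk": "medium",
    "allowlist": ["systemctl", "journalctl"],
    "prechecks": ["systemctl is-enabled kubelet"],
    "steps": ["systemctl restart kubelet"],
    "verify": ["systemctl is-active kubelet"],
    "rollback": {"policy": "none", "steps": []},
}


def disallowed_commands(playbook):
    """Return every command whose executable is not in the playbook's own allowlist."""
    allowed = set(playbook["allowlist"])
    commands = []
    for phase in ("prechecks", "steps", "verify"):
        commands.extend(playbook.get(phase, []))
    commands.extend(playbook.get("rollback", {}).get("steps", []))

    rejected = []
    for cmd in commands:
        tokens = shlex.split(cmd)
        # An empty command or an unknown executable is rejected.
        if not tokens or tokens[0] not in allowed:
            rejected.append(cmd)
    return rejected
```

A runner built this way would refuse to start a playbook for which `disallowed_commands` returns a non-empty list. A stricter check could match full argument patterns instead of only the executable name.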
Risk: low runs automatically; medium runs automatically, subject to policy; high runs only after operator approval.
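The risk gate can be sketched as a small Python function. The names `may_execute`, `policy_allows_medium`, and `operator_approved` are hypothetical, and the sketch assumes that "subject to policy" means a single yes/no policy decision for medium-risk playbooks.

```python
def may_execute(risk, policy_allows_medium=False, operator_approved=False):
    """Decide whether a playbook may run, given its risk level.

    Assumed semantics (not confirmed by the catalog):
      low    -> always runs automatically
      medium -> runs automatically only if the policy permits it
      high   -> runs only after explicit operator approval
    """
    if risk == "low":
        return True
    if risk == "medium":
        return policy_allows_medium
    if risk == "high":
        return operator_approved
    raise ValueError(f"unknown risk level: {risk!r}")
```

For example, `may_execute("high", policy_allows_medium=True)` stays `False` until an operator approves, since policy alone never unlocks a high-risk playbook in this sketch.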
Kubernetes and containers (50)
| Problem class | What it does | Risk |
|---|---|---|
| container_image_bloat | Prune unused container images and old exited containers | medium |
| service_down | Recover containerd | high |
| service_down | Recover containerd on Oracle Linux | high |
| coredns_unhealthy | CoreDNS: clear local cache + force pod restart | medium |
| coredns_unhealthy | Rollout-restart CoreDNS deployment | medium |
| etcd_alarm_active | Disarm etcd NOSPACE alarm after defrag | medium |
| etcd_db_pressure | Defragment etcd backend and disarm NOSPACE alarm | medium |
| etcd_member_down | Restart local etcd member | medium |
| k3s_disabled_addon_conflict | Diagnose and disable conflicting k3s addon | medium |
| k3s_agent_unable_to_join | Diagnose k3s-agent join failure and collect evidence | low |
| k3s_kubeconfig_drift | Restore k3s kubeconfig after CA rotation or cluster restore | high |
| k3s_server_token_corrupt | K3s: regenerate corrupted server token and restart cluster | high |
| k3s_sqlite_db_bloat | Compact k3s embedded SQLite backend (VACUUM) | high |
| kubernetes_api_connectivity_failure | Diagnose and repair Kubernetes API connectivity failure | high |
| kube_apiserver_crashloop | Reload kubelet to recover stuck kube-apiserver static pod | high |
| kube_apiserver_crashloop | Surgical restart of crash-looping kube-apiserver static pod via crictl | medium |
| kube_controller_manager_crashloop | Surgical restart of crash-looping kube-controller-manager via crictl | medium |
| kube_deployment_progress_deadline_exceeded | Undo stuck Kubernetes deployment rollout | high |
| kube_etcd_compaction_lag | Compact etcd at current revision and defragment (HIGH RISK) | high |
| kube_evicted_pods_accumulated | Delete pods stuck in Failed/Evicted phase across the cluster | low |
| kube_hpa_stuck | Diagnose stuck HPA (ScalingActive=False or metrics absent) | low |
| kube_image_pull_backoff | Diagnose ImagePullBackOff (manual_only, operator review) | low |
| kube_pod_init_container_failed | Diagnose imagePullSecrets behind ImagePullBackOff (read-only) | low |
| kube_ingress_controller_unhealthy | Rollout-restart ingress controller (ingress-nginx or traefik) | medium |
| kube_kubelet_down_local | Restart local kubelet | medium |
| kube_namespace_terminating_stuck | Force-strip finalizers off a stuck Terminating Namespace (HIGH RISK) | high |
| kube_node_disk_pressure_local | Clean container logs and images to relieve node disk pressure | high |
| kube_node_not_ready | Diagnose Kubernetes node NotReady state | low |
| kube_node_pressure | Diagnose Kubernetes node memory/CPU pressure conditions | low |
| kube_oom_killed_container | Diagnose containers killed by OOM | low |
| kube_pod_liveness_probe_failed | Force-delete a Pod stuck in Terminating (grace-period=0) | high |
| kube_pod_pending_unschedulable | Advisor: Kubernetes pod stuck Pending, scheduler can't place it | low |
| kube_pod_readiness_probe_failed | Diagnose Kubernetes pod readiness probe failures | low |
| kube_proxy_unhealthy | Recover kube-proxy daemonset via rollout restart | medium |
| kube_pv_orphaned | Release orphaned PV (reclaimPolicy=Retain) for re-binding | medium |
| kube_pvc_pending | Diagnose PVC stuck in Pending state | low |
| kube_replicaset_orphaned | Delete orphaned ReplicaSets with zero desired/ready replicas | medium |
| kube_scheduler_crashloop | Surgical restart of crash-looping kube-scheduler static pod via crictl | medium |
| kube_service_endpoint_empty | Diagnose empty Service endpoints (selector vs ready pods) | low |
| kube_statefulset_rollout_stuck | Diagnose StatefulSet rollout stuck (ordered update wedged) | low |
| kube_static_pod_crash | Restore crashed static pod manifest and restart kubelet | high |
| kube_static_pod_surgical_restart | Surgically restart a crashed static pod by moving its manifest | high |
| cert_expired | Emergency kubeadm cert renewal when API server is already down (CRITICAL) | high |
| cert_expiry_soon | kubeadm-style cert renewal with PKI backup (HIGH RISK) | high |
| service_down | Recover kubelet on ALT Linux / Astra Linux | medium |
| service_down | Recover kubelet on Oracle Linux | high |
| service_down | Recover kubelet on Ubuntu / Debian | medium |
| kubernetes_cni_failure | Recover standard Kubernetes CNI plugins on Oracle Linux x86_64 | high |
| kubernetes_cni_failure | Recover standard Kubernetes CNI plugins on Oracle Linux arm64 | high |
| kubernetes_control_plane_pressure | Diagnose Kubernetes control-plane pressure on Oracle Linux | medium |
PostgreSQL (20)
| Problem class | What it does | Risk |
|---|---|---|
| pg_extension_outdated | Update outdated PostgreSQL extension to default version | low |
| pg_stat_statements_bloat | Reset pg_stat_statements to reclaim shared memory | low |
| pg_table_autovacuum_disabled | Re-enable autovacuum for table that has it disabled | high |
| pg_2pc_stuck | Roll back orphan prepared transactions older than the freeze horizon | high |
| pg_archive_command_failing | Capture pg_stat_archiver evidence when archive_command keeps failing | low |
| pg_table_bloat | Run VACUUM ANALYZE on a bloated table to relieve autovacuum debt | medium |
| pg_autovacuum_disabled | Re-enable PostgreSQL autovacuum (cluster-wide) | high |
| pg_buffer_hit_ratio_low | Snapshot shared-buffers hit ratio and sizing for operator review | low |
| pg_checkpoint_too_frequent | Snapshot checkpoint tuning parameters when forced checkpoints dominate | low |
| pg_idle_in_txn | Terminate long-running idle-in-transaction sessions | high |
| pg_index_bloat | Snapshot bloated indexes for operator-scheduled REINDEX CONCURRENTLY | low |
| pg_lock_contention | Capture lock-wait tree when many backends are blocked on locks | low |
| pg_long_running_query_critical | Cancel active queries running longer than 30 minutes | high |
| pg_max_connections_reached | Advisor: PostgreSQL near max_connections, recommend pgbouncer / app pooling | low |
| pg_replica_idle_disconnect | Advisor: PostgreSQL streaming replica idle/disconnected | low |
| pg_replication_lag | Resume paused WAL replay on a lagging PostgreSQL standby | medium |
| pg_replication_slot_orphaned | Drop orphaned replication slots that pin WAL beyond 1 GiB | high |
| pg_temp_files_high | Advisor: PostgreSQL spilling to temp files, work_mem likely too small | low |
| pg_wal_fill_critical | Reclaim space on a near-full pg_wal partition | high |
| pg_xid_wraparound_risk | VACUUM FREEZE the database with the oldest XID horizon | high |
MySQL / MariaDB (9)
| Problem class | What it does | Risk |
|---|---|---|
| mariadb_galera_node_non_primary | Bootstrap MariaDB Galera primary node from non-Primary state | high |
| mariadb_health_failure | Recover MariaDB service health | high |
| mysql_health_failure | Recover MySQL service health | high |
| mysql_innodb_log_full | Capture InnoDB log stall evidence and current tuning values | low |
| mysql_long_running_query | Kill the top 3 long-running MySQL queries | high |
| mysql_max_connections_reached | Kill idle MySQL connections older than the threshold | medium |
| mysql_replication_io_thread_down | Restart MySQL replication IO thread | medium |
| mysql_replication_sql_thread_down | Skip one event and resume MySQL replication SQL thread | high |
| mysql_replication_lag_high | Diagnose MySQL/MariaDB replication lag and collect status | low |
ClickHouse (6)
| Problem class | What it does | Risk |
|---|---|---|
| clickhouse_max_memory_usage_exceeded | Relieve ClickHouse memory pressure by dropping server-side caches | medium |
| clickhouse_disk_full_data_dir | Drop the oldest ClickHouse partition to relieve disk pressure | high |
| clickhouse_mutations_stuck | Kill a stuck ClickHouse mutation | high |
| clickhouse_too_many_parts | OPTIMIZE TABLE FINAL on the most fragmented ClickHouse table | high |
| clickhouse_replica_max_queue_size | Unwedge a stuck ClickHouse replication queue by reinitializing the replica | medium |
| clickhouse_zookeeper_session_expired | Restart the ClickHouse replica session to recover ZooKeeper connectivity | medium |
Elasticsearch / Redis / MongoDB (3)
| Problem class | What it does | Risk |
|---|---|---|
| elasticsearch_circuit_breaker_tripped | Raise ES request and parent circuit-breaker limits (transient) | medium |
| elasticsearch_too_many_open_files | Raise Elasticsearch open-file limit via systemd drop-in | high |
| elasticsearch_unassigned_shards | Reroute unassigned ES shards (after consulting allocation/explain) | medium |
Web servers and proxies (35)
| Problem class | What it does | Risk |
|---|---|---|
| apache_modproxy_backend_failed | Recover Apache mod_proxy 502/503 by restarting local upstream | medium |
| apache_config_syntax_error | Diagnose Apache config syntax error (apache2ctl/httpd/httpd2 -t) | low |
| apache_excessive_500 | Diagnose Apache 500-flood (upstream app sick) | low |
| apache_health_failure | Recover Apache service health | medium |
| apache_health_failure | Recover Apache (httpd2) service health on ALT Linux | medium |
| apache_health_failure | Recover Apache service health on Astra Linux | medium |
| apache_health_failure | Recover Apache (httpd) service health on Oracle Linux | medium |
| apache_health_failure | Recover Apache (httpd-prefork) service health on openSUSE / SLES | medium |
| apache_keepalive_too_high | Diagnose Apache KeepAlive holding workers idle | low |
| apache_log_growing_fast | Force logrotate when Apache access.log grows >100 MB/min (DDoS / bot flood) | medium |
| apache_module_missing | Diagnose Apache LoadModule references whose .so is not on disk | low |
| apache_worker_mpm_overloaded | Restart Apache after MPM worker exhaustion / OOM | medium |
| apache_php_fpm_socket_unavailable | Recover Apache mod_proxy_fcgi when PHP-FPM socket is missing or unreadable | medium |
| apache_rate_limiting_excessive | Diagnose Apache mod_security / mod_evasive over-blocking | low |
| apache_segfault | Diagnose Apache child SIGSEGV (loaded module crash) | low |
| apache_ssl_cert_expiry_soon | Renew Let's Encrypt SSL cert for Apache and reload | high |
| cert_chain_broken | Capture broken TLS chain evidence for operator review | low |
| haproxy_health_failure | Recover HAProxy service health | medium |
| nginx_excessive_502_503 | Recover from excessive nginx 502/503 errors | medium |
| nginx_config_syntax_error | Diagnose nginx config syntax error (no auto-fix) | low |
| nginx_health_failure | Recover nginx health | medium |
| nginx_health_failure | Recover nginx health on ALT Linux | medium |
| nginx_health_failure | Recover nginx health on Astra Linux | medium |
| nginx_access_log_growing_fast | Force rotate nginx logs (USR1 + logrotate force, no nginx restart) | medium |
| nginx_no_active_listening | Diagnose and reload nginx when master has no listening socket | high |
| nginx_worker_oom | Restart nginx workers after OOM kill (preserve connections when possible) | medium |
| nginx_rate_limit_exceeded | Diagnose nginx limit_req saturation (no auto-tune) | low |
| nginx_ssl_cert_already_expired | Renew expired nginx TLS cert via certbot and reload nginx | high |
| nginx_ssl_cert_expiry_soon | Renew Let's Encrypt cert for nginx and reload | medium |
| nginx_upstream_failed | Diagnose nginx upstream failures (read-only, multi-tool) | low |
| nginx_worker_too_many_open_files | Raise nginx worker_rlimit_nofile after fd exhaustion | medium |
| php_fpm_health_failure | Recover PHP-FPM service health | high |
| tomcat_connector_threads_exhausted | Double Tomcat Connector maxThreads (capped at 1000) | high |
| tomcat_jdbc_connection_pool_exhausted | Double Tomcat JDBC connection pool size (maxActive / maxTotal) | high |
| tomcat_session_storage_full | Clear Tomcat session storage + stale temp files | medium |
Java / JVM (15)
| Problem class | What it does | Risk |
|---|---|---|
| java_class_loader_leak | Capture classloader leak evidence via jcmd VM.metaspace | low |
| java_deadlock_detected | Capture thread dump then restart Java service in deadlock | high |
| java_dns_caching_stale | Fix infinite JVM DNS cache by setting networkaddress.cache.ttl | medium |
| java_gc_overhead_limit | Rotate Java GC log (mv + runtime VM.log reconfig on JDK11+) | medium |
| java_gc_overhead_limit | Take heap dump then restart Java service hitting GC overhead limit | high |
| java_heap_oom | Collect Java heap dump for offline analysis | low |
| java_heap_oom | Capture Java heap dump with disk-space pre-flight and size-aware strategy | medium |
| java_high_thread_count | Collect read-only Java diagnostic bundle (threads + heap histogram + GC stats) | low |
| java_jfr_recording_stuck | Stop stuck JFR recording and dump flight data | medium |
| java_metaspace_oom | Diagnose Metaspace OOM and capture JVM flag evidence | low |
| java_native_memory_leak | Enable Native Memory Tracking + 5min diff snapshot | low |
| java_old_gen_full_consistent | Capture heap dump before restarting JVM with consistent Old Gen OOM | medium |
| java_safepoint_long_pause | Capture safepoint pause evidence and JIT counters | low |
| java_thread_blocked_on_lock | Capture thread dump for JVM with lock contention | low |
| java_truststore_corrupted | Rebuild $JAVA_HOME/lib/security/cacerts from system CA bundle | high |
Networking and DNS (23)
| Problem class | What it does | Risk |
|---|---|---|
| bind9_health_failure | Recover BIND9 service health | medium |
| bind9_health_failure | Recover BIND (named) service health on Oracle Linux | medium |
| dns_dnssec_failure | Capture DNSSEC validation failure evidence | low |
| dns_resolution_failure | Revalidate DNS stack | medium |
| dns_resolution_failure | Revalidate DNS stack on ALT Linux | medium |
| dns_resolution_failure | Revalidate DNS stack on Astra Linux | medium |
| dns_resolution_failure | Revalidate DNS stack on Oracle Linux | medium |
| conntrack_table_full | Bump nf_conntrack_max to avoid drops when the table fills up | medium |
| conntrack_table_full | Bump nf_conntrack_max on ALT Linux | medium |
| conntrack_table_full | Bump nf_conntrack_max on Astra Linux | medium |
| network_default_route_lost | Restart ALT etcnet to restore lost default route | high |
| network_default_route_lost | Restart Astra networking to restore lost default route | high |
| network_default_route_lost | Restart systemd-networkd to restore lost default route | high |
| network_default_route_lost | Reload NetworkManager to restore lost default route | high |
| network_iface_errors | Diagnose NIC errors via ethtool and interface statistics | low |
| network_link_flap | Reset a flapping network interface (link down/up cycle) | high |
| network_link_flap | Reset a flapping network interface on ALT Linux | high |
| network_link_flap | Reset a flapping network interface on Astra Linux | high |
| network_mtu_blackhole | Lower MTU on affected interface to resolve PMTU blackhole | medium |
| network_route_failure | Recheck network route | medium |
| network_route_failure | Recheck network route on ALT Linux | medium |
| network_route_failure | Recheck network route on Astra Linux | medium |
| network_route_failure | Recheck network route on Oracle Linux | medium |
Disks, filesystems, and storage (15)
| Problem class | What it does | Risk |
|---|---|---|
| cifs_mount_credential_failed | Refresh CIFS credential file and remount failed network share | high |
| disk_full | Clean up disk pressure | medium |
| disk_full | Clean up disk pressure on ALT Linux | medium |
| disk_full | Clean up disk pressure on Astra Linux | medium |
| fs_corruption_marker | Diagnose filesystem corruption markers (capture evidence, no fix) | low |
| fs_quota_exceeded | Report filesystem quota exceeded: identify user and paths | low |
| inode_exhaustion | Clean up inode pressure | medium |
| inode_exhaustion | Clean up inode pressure on ALT Linux | medium |
| inode_exhaustion | Clean up inode pressure on Astra Linux | medium |
| io_wait_sustained | Capture top IO-wait processes and disk stats (diagnostic only) | low |
| lvm_volume_inactive | Reactivate inactive LVM logical volume | medium |
| lvm_metadata_damage | Restore LVM metadata from archive backup | high |
| lvm_snapshot_full | Diagnose nearly-full LVM snapshot (log state, no auto-extend/auto-merge) | medium |
| lvm_thin_pool_full | Extend an LVM thin pool when the parent VG has free PE | medium |
| raid_array_recoverable | Re-add a removed mdadm array member | medium |
Memory and processes (13)
| Problem class | What it does | Risk |
|---|---|---|
| cgroup_oom_detected | Restart workload after cgroup OOM kill | high |
| core_service_inactive | Start auth/session service after dependency-cascade outage | high |
| core_service_inactive | Start critical network service after dependency-cascade outage | medium |
| cpu_pressure | Observe CPU pressure (no destructive action) | low |
| fd_exhaustion_process | Advisory: process near rlimit-NOFILE cap | low |
| fd_exhaustion_system | Raise fs.file-max via sysctl drop-in | medium |
| oom_detected | Recover from OOM pressure | high |
| oom_detected | Recover from OOM pressure on ALT Linux | high |
| oom_detected | Recover from OOM pressure on Astra Linux | high |
| oom_victim_recurring | Advisory: recurring OOM victim needs MemoryHigh tuning | low |
| swap_exhaustion | Advisory: swap entry saturated, OOM imminent | low |
| swap_thrashing | Relieve swap thrashing via swapoff/swapon cycle (capture top RAM users first) | high |
| zombie_process_buildup | Reap zombie processes by signalling their parents (SIGCHLD) | medium |
Services and systemd (30)
| Problem class | What it does | Risk |
|---|---|---|
| arp_table_overflow | Relieve ARP table overflow by raising neighbour GC thresholds | medium |
| boot_disk_full | Purge old kernels from /boot (Debian/Ubuntu/Astra) | high |
| boot_disk_full | Purge old kernels from /boot (Rocky/Alma/Oracle/openSUSE) | high |
| btrfs_scrub_errors | Start btrfs scrub to detect and repair filesystem errors | medium |
| system_cert_bundle_corrupt | Refresh system CA bundle (Debian update-ca-certificates / RHEL update-ca-trust) | medium |
| config_drift | Manual review of changed critical config file | low |
| config_missing | Restore missing critical config file from backup | medium |
| kube_containerd_down_local | Recover container runtime when kubelet sees no pods (multi-tool) | high |
| kube_daemonset_pod_crash | Rollout-restart kube-system DaemonSet | medium |
| dstate_processes | Capture D-state (uninterruptible IO wait) process evidence | low |
| filesystem_read_only_remount | Remount a kernel-forced read-only filesystem back to read-write | high |
| firewalld_reload_failure | Reload firewalld safely; restore zone config on failure | medium |
| transparent_hugepages_pressure | Diagnose transparent/explicit hugepages pressure (log state, no auto-tune) | low |
| journal_corrupted | Rotate corrupt active journal | low |
| nfs_mount_stale | Recover stale NFS mount via lazy unmount + remount | medium |
| pam_auth_failure | Refresh sssd cache + clear pam_tally2 lockouts | medium |
| sssd_realm_failure | Recover PBIS domain integration on Oracle Linux | high |
| postgresql_health_failure | Recover PostgreSQL service health | high |
| process_priority_misuse | Renice high-priority processes abusing CPU scheduling | medium |
| pvc_stuck | Diagnose stuck PersistentVolumeClaim and suggest remediation | low |
| resource_undersized | Recommendation: node hardware envelope is too small for its workload | low |
| service_crash_loop | Recover crash-looping service | high |
| service_crash_loop | Recover crash-looping service on ALT Linux | high |
| service_crash_loop | Recover crash-looping service on Astra Linux | high |
| service_down | Capture failure evidence then restart failed service | high |
| service_down | Restart failed service on ALT Linux | high |
| service_down | Capture failure evidence then restart failed service on Astra Linux | high |
| tcp_syn_flood | Mitigate SYN flood by enabling tcp_syncookies + bumping backlog | high |
| tmpfs_full | Find largest files on full tmpfs and clean stale temporaries | medium |
| sssd_realm_failure | Recover winbind domain integration on Oracle Linux | high |
Packages and updates (10)
| Problem class | What it does | Risk |
|---|---|---|
| dnf_module_conflict | Detect dnf module conflict; recommend module reset (no auto-fix) | low |
| package_manager_failure | Recover package manager | medium |
| package_manager_failure | Recover package manager (ALT Sisyphus) | medium |
| package_manager_failure | Recover package manager on Astra Linux | medium |
| package_manager_failure | Recover package manager on Oracle Linux | medium |
| package_manager_failure | Recover package manager (SUSE / zypper) | medium |
| package_state_inconsistent | dpkg --configure -a after half-applied apt upgrade (Debian/Ubuntu/Astra) | medium |
| package_state_inconsistent | dnf check + rpm verify after partial transaction (Rocky/Alma/Oracle/openSUSE) | medium |
| repository_mirror_failure | Advisory: package repository mirror unreachable | low |
| rhsm_subscription_unknown | Detect paid Red Hat subscription failure (alert only) | low |
Active Directory / domain (11)
| Problem class | What it does | Risk |
|---|---|---|
| ad_clock_skew_kerberos | Force chrony makestep to recover Kerberos auth after clock skew | medium |
| ad_clock_skew_kerberos | Force chrony makestep on ALT Linux to recover Kerberos auth | medium |
| ad_clock_skew_kerberos | Force chrony makestep on Astra Linux to recover Kerberos auth | medium |
| ad_clock_skew_kerberos | Restart systemd-timesyncd to recover Kerberos auth after clock skew | medium |
| ad_dns_srv_missing | Repair /etc/resolv.conf to restore AD DNS SRV record resolution | medium |
| ad_keytab_corrupt | Re-fetch Kerberos keytab for AD-joined host | high |
| ad_machine_password_expired | Re-join Active Directory domain after machine password expiry | high |
| ad_sssd_cache_corrupt | Purge SSSD cache and restart to recover AD user resolution | medium |
| java_runtime_failure | Clear Keycloak realm/user/keys cache via kcadm.sh | low |
| sssd_realm_failure | Recover SSSD realm integration | high |
| sssd_realm_failure | Recover SSSD realm integration on Oracle Linux | high |
Time and synchronization (7)
| Problem class | What it does | Risk |
|---|---|---|
| chrony_no_sources_synced | Recover chrony with no synced sources (makestep + restart) | low |
| service_down | Recover NTP time synchronization | medium |
| tcp_time_wait_exhausted | Enable tcp_tw_reuse to relieve TIME_WAIT socket exhaustion | medium |
| time_skew | Force NTP resync to correct clock skew (timesyncd) | medium |
| time_skew | Force NTP resync on ALT Linux | medium |
| time_skew | Force NTP resync on Astra Linux | medium |
| time_skew | Force NTP resync to correct clock skew (chrony) | medium |
Mail (2)
| Problem class | What it does | Risk |
|---|---|---|
| mail_health_failure | Recover mail stack health | high |
| mail_queue_stuck | Flush postfix mail queue (force retry) | medium |
Security and access (2)
| Problem class | What it does | Risk |
|---|---|---|
| selinux_denied | Diagnose recent SELinux AVC denials and recommend a fix | low |
| selinux_denied | Restore SELinux contexts for known service paths when drift is detected | low |
Astra Linux / ALT Linux (16)
| Problem class | What it does | Risk |
|---|---|---|
| alt_kernel_modules_failed | Rebuild initramfs after failed kernel module on ALT Linux | high |
| alt_rpm_bdb_corrupted | Recover ALT rpmdb Berkeley DB corruption | high |
| alt_rpm_bdb_severe_corrupt | Severe rpmdb recovery (Packages corrupt), rpm.org canonical procedure | high |
| alt_tcb_password_expired | Recover ALT TCB service account locked by password expiry | medium |
| alt_tcb_password_corrupt | Restore corrupted TCB shadow file from backup on ALT Linux | high |
| alt_apt_rpm_partial_state | Reconcile broken apt-rpm dependency state (ALT) | high |
| package_manager_failure | Recover stale apt-rpm and rpmdb locks on ALT Linux | medium |
| alt_control_facility_misconfig | Apply ALT control(8) facility state | medium |
| alt_etcnet_iface_misconfig | Restart an etcnet-managed interface (ALT) | medium |
| alt_fcron_job_failure | Restart fcron after repeated job failures (ALT) | low |
| alt_initramfs_corrupt | Rebuild initramfs with make-initrd (ALT) | high |
| alt_kernel_update_pending | Apply pending ALT kernel update via update-kernel | high |
| alt_sisyphus_repo_unreachable | Switch ALT sources.list to a reachable mirror | high |
| alt_sysconfig_drift | Restore /etc/sysconfig file from owning rpm package (ALT) | medium |
| astra_parsec_audit_health | Audit Astra PARSEC log presence and rotation | low |
| astra_mac_label_conflict | Diagnose Astra SE MAC/PARSEC label mismatch on critical paths | low |
RHEL / SUSE specifics (3)
| Problem class | What it does | Risk |
|---|---|---|
| oracle_health_failure | Diagnose Oracle Database health and restart listener/instance | high |
| suse_btrfs_snapshot_rollback_request | Diagnose btrfs/snapper state for manual rollback | high |
| suse_transactional_update_pending | Apply pending SUSE transactional-update batch | high |