Skip to content
Snippets Groups Projects
  1. Jan 22, 2024
    • Yuhao Zhang's avatar
      Offline control plane recover (#10660) · 0e971a37
      Yuhao Zhang authored
      * ignore_unreachable for etcd dir cleanup
      
      ignore_errors ignores errors occur within "file" module. However, when
      the target node is offline, the playbook will still fail at this task
      with node "unreachable" state. Setting "ignore_unreachable: true" allows
      the playbook to bypass offline nodes and move on to proceed recovery
      tasks on remaining online nodes.
      
      * Re-arrange control plane recovery runbook steps
      
      * Remove suggestion to manually update IP addresses
      
      The suggestion was added in 48a18284 4
      years ago. But a new task added 2 years ago, in
      ee0f1e9d, automatically update API
      server arg with updated etcd node ip addresses. This suggestion is no
      longer needed.
      Unverified
      0e971a37
  2. Jul 01, 2021
  3. Mar 24, 2021
  4. Feb 11, 2020
    • qvicksilver's avatar
      Fix recover-control-plane to work with etcd 3.3.x and add CI (#5500) · ac2135e4
      qvicksilver authored
      * Fix recover-control-plane to work with etcd 3.3.x and add CI
      
      * Set default values for testcase
      
      * Add actual test jobs
      
      * Attempt to satisty gitlab ci linter
      
      * Fix ansible targets
      
      * Set etcd_member_name as stated in the docs...
      
      * Recovering from 0 masters is not supported yet
      
      * Add other master to broken_kube-master group as well
      
      * Increase number of retries to see if etcd needs more time to heal
      
      * Make number of retries for ETCD loops configurable, increase it for recovery CI and document it
      Unverified
      ac2135e4
  5. Dec 04, 2019
  6. Apr 29, 2019
Loading