Commits · c75f394707dc3e92e7291e8e5751ef1f7f142c94 · Mirror / Kubespray

Dec 13, 2016

Address standalone kubelet config case · c75f3947

Bogdan Dobrelya authored Dec 13, 2016

Also place in global vars and do not repeat the kube_*_config_dir
and kube_namespace vars for better code maintainability and UX.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

c75f3947

Update main.yml · f52ed9f9
Bogdan Dobrelya authored Dec 12, 2016

f52ed9f9

Dec 12, 2016

Rework DNS stack to meet hostnet pods needs · 3117858d

Bogdan Dobrelya authored Nov 30, 2016

* For Debian/RedHat OS families (with NetworkManager/dhclient/resolvconf
  optionally enabled) prepend /etc/resolv.conf with required nameservers,
  options, and supersede domain and search domains via the dhclient/resolvconf
  hooks.

* Drop (z)nodnsupdate dhclient hook and re-implement it to complement the
  resolvconf -u command, which is distro/cloud provider specific.
  Update docs as well.

* Enable network restart to apply and persist changes and simplify handlers
  to rely on network restart only. This fixes DNS resolve for hostnet K8s
  pods for Red Hat OS family. Skip network restart for canal/calico plugins,
  unless https://github.com/projectcalico/felix/issues/1185

 fixed.

* Replace linefiles line plus with_items to block mode as it's faster.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>
Co-authored-by: Matthew Mosesohn <mmosesohn@mirantis.com>

3117858d

Make growpart only run on Azure · 5176e5c9
Alexander Block authored Dec 12, 2016

5176e5c9
Add growpart role to allow growing the root partition on CentOS · 9fd14cb6
Alexander Block authored Dec 09, 2016
```
At least the OS images from Azure do not grow the root FS automatically.
```
9fd14cb6
Disable fastestmirror on CentOS · 4e34803b
Alexander Block authored Dec 07, 2016
```
It actually slows down things dramatically when used in combination
with Ansible.
```
4e34803b
Remove requiretty from sudoers to actually make pipelining work · 7abcf6e0
Alexander Block authored Dec 09, 2016
```
Some systems (e.g. CentOS on Azure) have requiretty in sudoers which makes
pipelining fail.
```
7abcf6e0

Dec 09, 2016

Preconfigure DNS stack and docker early · a15d6267

Bogdan Dobrelya authored Dec 07, 2016



In order to enable offline/intranet installation cases:
* Move DNS/resolvconf configuration to preinstall role. Remove
  skip_dnsmasq_k8s var as not needed anymore.

* Preconfigure DNS stack early, which may be the case when downloading
  artifacts from intranet repositories. Do not configure
  K8s DNS resolvers for hosts /etc/resolv.conf yet early (as they may be
  not existing).

* Reconfigure K8s DNS resolvers for hosts only after kubedns/dnsmasq
  was set up and before K8s apps to be created.

* Move docker install task to early stage as well and unbind it from the
  etcd role's specific install path. Fix external flannel dependency on
  docker role handlers. Also fix the docker restart handlers' steps
  ordering to match the expected sequence (the socket then the service).

* Add default resolver fact, which is
  the cloud provider specific and remove hardcoded GCE resolver.

* Reduce default ndots for hosts /etc/resolv.conf to 2. Multiple search
  domains combined with high ndots values lead to poor performance of
  DNS stack and make ansible workers to fail very often with the
  "Timeout (12s) waiting for privilege escalation prompt:" error.

* Update docs.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

a15d6267

More granular control for download/upload images/binaries · fd9b2667

Bogdan Dobrelya authored Dec 09, 2016



Add upload tag allow users to exclude distributing images across nodes
when running with the download tag set.
Add related tags and update docs as well.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

fd9b2667

Changes according to code review · eb33f085
Alexander Block authored Dec 09, 2016

eb33f085

Bump kubedns version to 1.9 · 459bee6d

Matthew Mosesohn authored Dec 09, 2016

Version 1.9 has reduced verbosity for federation dns queries
which flood container logs.

459bee6d

Use proper style (spacing) for docker_storage_options · 8a5ba6b2
Alexander Block authored Dec 09, 2016

8a5ba6b2
Allow to specify docker storage driver · c3ec3ff9
Alexander Block authored Dec 07, 2016

c3ec3ff9

Add tags · 8cc84e13

Bogdan Dobrelya authored Dec 08, 2016



Add tags to allow more granular tasks filtering.
Add generator script for MD formatted tags found.
Add docs for tags how-to.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

8cc84e13

Add playbook and role to reset the cluster · 00ad1511
Alexander Block authored Dec 07, 2016
```
This deletes everything related to the cluster and allows to start from
scratch.
```
00ad1511

Convert docker_versioned_pkg dict keys to string · ee8d6ab4

Aleksandr Didenko authored Dec 09, 2016

This will allow to use '-e docker_version=1.12' in ansible playbook
execution. It's also backward-compatible and will work with floating
docker_version format in custom yaml files.

Closes #702

ee8d6ab4

Dec 07, 2016

Allow etcd_access_addresses to be more flexible · eec2ed58

Dan Bode authored Dec 01, 2016

The variale etcd_access_addresses is used to determine
how to address communication from other roles to
the etcd cluster.

It was set to the address that ansible uses to
connect to instance ({{ item }})s and not the
the variable:
  ip_access
which had already been created and could already
be overridden through the access_ip variable.

This change allows ansible to connect to a machine using
a different address than the one used to access etcd.

eec2ed58

Force hardlink for calico/canal certs · bfc9bcb8
Matthew Mosesohn authored Dec 07, 2016
```
Fixes: #669
```
bfc9bcb8

Change GCE sysctls placement and docs · f0f2b812

Bogdan Dobrelya authored Dec 07, 2016



Override GCE sysctl in /etc/sysctl.d/99-sysctl.conf instead of
the /etc/sysctl.d/11-gce-network-security.conf. It is recreated
by GCE, f.e. if gcloud CLI invokes some security related changes,
thus losing customizations we want to be persistent.

Update cloud providers firewall requirements in calico docs.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

f0f2b812

Fix possible problems with legacy calicoctl · c9290182

Aleksandr Didenko authored Dec 07, 2016

When running legacy calicoctl we do not specify calico hostname in
calico-node container thus we should not specify it in CNI config.

Also move 'legacy_calicoctl' set_fact task to the top.

c9290182

add cluster-signing to kube-controller-manager · 246c8209

fen4o authored Dec 07, 2016

kube-controller-manager's cluster signing cert and key points by default to not
existing `/etc/kubernetes/ca/ca.pem` and `/etc/kubernetes/ca/ca.key` [docs][1]

[1]: http://kubernetes.io/docs/admin/kube-controller-manager/#options

246c8209

Dec 06, 2016

Calico: fix peering with routers for new version · b0079ccd

Aleksandr Didenko authored Dec 06, 2016

In new `calicoctl` version nodes peering with routers is broken.
We need to use predictable node names for calico-node and the
same names in calico `bgpPeer` resources and CNI.

b0079ccd

Update calico-node systemd unit · f1d7af11

Aleksandr Didenko authored Dec 05, 2016

New calicoctl does not support --detach=false option, so we should
use a recommended way to run calico-node service:
http://docs.projectcalico.org/v2.0/usage/configuration/as-service

Closes #674, #675

f1d7af11

Fix ipv4 forwarding on GCE · 7a3a473c

Matthew Mosesohn authored Dec 05, 2016

ipv4 forwarding gets broken when restarting networking, which
breaks all networking for all pods.

7a3a473c

Dec 05, 2016
- Add dbus socket dir to kube-proxy · 2cdf7524
  Matthew Mosesohn authored Dec 05, 2016
  
  2cdf7524
Dec 03, 2016
- Docker Options Refactor · 8b5b27bb
  Chad Swenson authored Nov 04, 2016
  
  8b5b27bb
Dec 02, 2016
- Fail all nodes on error · dba20260
  ant31 authored Dec 02, 2016
  
  dba20260
Nov 29, 2016
- add basic azure support for kargo · bb55f68f
  Sebastian Melchior authored Nov 29, 2016
  
  bb55f68f
Nov 28, 2016

Set proxy_timeout to 10m in nginx.conf · 658543c9

Yuriy Taraday authored Nov 28, 2016

Fixes #655.

This is a teporary solution for long-polling idle connections to
apiserver. It will make Nginx not cut them for the duration of expected
timeout. It will also make Nginx extremely slow in realizing that there
is some issue with connectivity to apiserver as well, so it might not be
perfect permanent solution.

658543c9

Add advanced net check for DNS K8s app · b7692fad

Bogdan Dobrelya authored Sep 30, 2016



* Add an option to deploy K8s app to test e2e network connectivity
  and cluster DNS resolve via Kubedns for nethost/simple pods
  (defaults to false).
* Parametrize existing k8s apps templates with kube_namespace and
  kube_config_dir instead of hardcode.
* For CoreOS, ensure nameservers from inventory to be put in the
  first place to allow hostnet pods connectivity via short names
  or FQDN and hostnet agents to pass as well, if netchecker
  deployed.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

b7692fad

Nov 25, 2016

Tune dnsmasq/kubedns limits, replicas, logging · 2d18e192

Bogdan Dobrelya authored Nov 25, 2016



* Add dns_replicas, dns_memory/cpu_limit/requests vars for
dns related apps.
* When kube_log_level=4, log dnsmasq queries as well.
* Add log level control for skydns (part of kubedns app).
* Add limits/requests vars for dnsmasq (part of kubedns app) and
  dnsmasq daemon set.
* Drop string defaults for kube_log_level as it is int and
  is defined in the global vars as well.
* Add docs

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

2d18e192

Update calico/ctl image tag · ff7d489f

Aleksandr Didenko authored Nov 24, 2016

We no longer need to use v0.22.0 for calicoctl since Kargo has
support for new calicoctl CLI format.

Also fixing condition logic for calico pool task.

ff7d489f

Nov 24, 2016

Fix download dnsmasq image dependency on docker · aa447585

Bogdan Dobrelya authored Nov 24, 2016



When download_run_once with download_localhost is used, docker is
expected to be running on the delegate localhost. That may be not
the case for a non localhost delegate, which is the kube-master
otherwise. Then the dnsmasq role, had it been invoked early before
deployment starts, would fail because of the missing docker dependency.

* Fix that dependency on docker and do not pre download dnsmasq image
  for the dnsmasq role, if download_localhost is disabled.
* Remove become: false for docker CLI invocation because that's not
  the common pattern to allow users access docker CLI w/o sudo.
* Fix opt bin path hack for localhost delegate to ignore errors when
  it fails with "sudo password required" otherwise.
* Describe download_run_once with download_localhost use case in docs
  as well.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

aa447585

Nov 23, 2016

Ensure /etc/resolv.conf content for CoreOS · d208896c

Bogdan Dobrelya authored Nov 23, 2016

Use cloud-init config to replace /etc/resolv.conf with the
content for kubelet to properly configure hostnet pods.

Do not use systemd-resolved yet, see
https://coreos.com/os/docs/latest/configuring-dns.html


"Only nss-aware applications can take advantage of the
systemd-resolved cache. Notably, this means that statically
linked Go programs and programs running within Docker/rkt
will use /etc/resolv.conf only, and will not use the
systemd-resolve cache."

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

d208896c

Fix Calico jinja template (systemd) · 2c4b11f3
Artem Panchenko authored Nov 23, 2016

2c4b11f3

Fix nginx container download for download_run_once mode · d890d2f2

Bogdan Dobrelya authored Nov 23, 2016



W/o this patch, the "Download containers" task may be skipped
when running on the delegate node due to wrong "when" confition.
Then it fails to upload nginx image to the nodes as well.

Fix download nginx dependency so it always can be pushed to
nodes when download_run_once is enabled.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

d890d2f2

Nov 22, 2016

Set defaults for ansible_ssh_user · db03f174

Aleksandr Didenko authored Nov 22, 2016

When setting permission for containers download/upload dir we're
using `ansible_ssh_user`. But if playbook is executed without
user being explicitly set `ansible_ssh_user` may be undefined.
In such situations dir ownership will default to `ansible_user_id`

Closes: #644

db03f174

Allow pre-downloaded images to be used effectively · dff78f61

Bogdan Dobrelya authored Nov 22, 2016

According to http://kubernetes.io/docs/user-guide/images/

 :
By default, the kubelet will try to pull each image from the
specified registry. However, if the imagePullPolicy property
of the container is set to IfNotPresent or Never, then a local\
image is used (preferentially or exclusively, respectively).

Use IfNotPresent value to allow images prepared by the download
role dependencies to be effectively used by kubelet without pull
errors resulting apps to stay blocked in PullBackOff/Error state
even when there are images on the localhost exist.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

dff78f61

Download images as dependencies of roles · 66f27ed1

Bogdan Dobrelya authored Nov 21, 2016



Pre download all required container images as roles' deps.
Drop unused flannel-server-helper images pre download.
Improve pods creation post-install test pre downloaded busybox.
Improve logs collection script with kubectl describe, fix sudo/etcd/weave
commands.

Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com>

66f27ed1

Nov 21, 2016
- Fix conditional when setting loadbalancer_apiserver_localhost · 32a54534
  Paweł Skrzyński authored Nov 21, 2016
  
  32a54534