Commit Graph

420 Commits

Author SHA1 Message Date
Stefan Reimer 2ee31f60e2 Minor fluent-bit tuning 2020-10-01 12:32:21 -07:00
Stefan Reimer c6ae3d2461 Fluentd tuning 2020-10-01 10:14:04 -07:00
Stefan Reimer 21c6b0ea58 Fluentd tuning 2020-10-01 10:11:48 -07:00
Stefan Reimer 6003765dc9 Disable pipeline still cpu issues 2020-09-28 04:54:47 -07:00
Stefan Reimer 0b50dbcfbe Reenable fluentd ingest pipeline again 2020-09-28 04:45:39 -07:00
Stefan Reimer a5952f850d Make the kiam annotate namespace job optional 2020-09-18 16:18:59 +01:00
Stefan Reimer 4a918f6d83 Logging fixes for NOT using nameoverride 2020-09-18 16:12:52 +01:00
Stefan Reimer f753a1fc71 Slightly allow ArgoCD a bit more processing 2020-09-18 14:21:39 +01:00
Stefan Reimer 85837c1666 Bump argocd to 1.7.5 as 1.7.4 has a deadlock CPU issue 2020-09-18 13:09:18 +01:00
Stefan Reimer b4c2195eef Add EnvoyFilter to enable tcp keepalive for all Ingress Envoys 2020-09-17 22:25:09 +01:00
Stefan Reimer 182ae141a0 Revert TCP keepalive for fluentd listener 2020-09-17 19:44:34 +01:00
Stefan Reimer dd9e465ead Enable TCP keepalive for fluentd listener 2020-09-17 19:24:24 +01:00
Stefan Reimer 47455bf4f0 TCP keepalive tuning for Istio 2020-09-17 17:54:57 +01:00
Stefan Reimer d3c8c92f9a Set global meshpolicy to prevent upgrade to http2 by default 2020-09-16 16:50:48 +01:00
Stefan Reimer ddb51294c9 Another argocd resource tweak 2020-09-15 11:48:07 +01:00
Stefan Reimer 16bc828a0d Introduce resources for at least the argocd controller 2020-09-15 11:15:55 +01:00
Stefan Reimer 900863acae Docs update 2020-09-14 17:26:39 +01:00
Stefan Reimer ce5290591f fluent-bit version bump 2020-09-14 17:26:19 +01:00
Stefan Reimer 09d29f2704 New bootstrap flow 2020-09-14 16:06:53 +01:00
Stefan Reimer 3a97bbed31 Latest deploy bootstrap tweaks 2020-09-14 15:24:40 +01:00
Stefan Reimer c347c56764 Disable default poddisruptionbudgets, replace with individual todo 2020-09-11 18:21:00 +01:00
Stefan Reimer 4a405a0cbc Still double CRDs 2020-09-11 16:03:22 +01:00
Stefan Reimer fb5229613d Istio is really picky 2020-09-11 16:01:15 +01:00
Stefan Reimer f7ba0ffa33 Move ports > 1024 as we run non-root 2020-09-11 15:45:04 +01:00
Stefan Reimer 530934e603 Set JSON for access logs 2020-09-11 15:39:47 +01:00
Stefan Reimer aa664bec01 Remove xp settings 2020-09-11 15:32:14 +01:00
Stefan Reimer 7a93b34331 Re-enable JSON access logs 2020-09-11 15:28:51 +01:00
Stefan Reimer 33339dbe21 Re-enable access logs 2020-09-11 15:22:34 +01:00
Stefan Reimer 05d9e25f8d Remove deprecated fields for 1.7 2020-09-11 15:20:51 +01:00
Stefan Reimer 6f60ec1dd9 Remove deprecated fields for 1.7 2020-09-11 15:18:30 +01:00
Stefan Reimer e9c0d35695 Remove deprecated fields for 1.7 2020-09-11 15:15:53 +01:00
Stefan Reimer 203f236e23 Version bump Istio to 1.7.1 2020-09-11 15:06:38 +01:00
Stefan Reimer eba052f2f6 Remove double CRD for Istio 2020-09-11 14:42:25 +01:00
Stefan Reimer a09327f3f0 more istio cleanup 2020-09-11 12:37:22 +01:00
Stefan Reimer 5c64544dcb more istio cleanup 2020-09-11 12:32:46 +01:00
Stefan Reimer 72a2a40e81 more istio cleanup 2020-09-11 12:23:08 +01:00
Stefan Reimer 2f7693388e Minot istio tweaks 2020-09-11 12:08:58 +01:00
Stefan Reimer d13fc9d519 Fix math in resources calc 2020-09-11 11:07:49 +01:00
Stefan Reimer e56d0661d6 Make ES heap configurable, set resources accordingly 2020-09-11 11:00:51 +01:00
Stefan Reimer 4cea722fd4 Istio version bump to 1.6.9 2020-09-10 16:44:49 +01:00
Stefan Reimer 790badc1cc Add resources to Kiam 2020-09-10 14:22:47 +01:00
Stefan Reimer f99cb5b21b Another prometheus resources tweak to prevent being killed during restarts 2020-09-10 14:09:23 +01:00
Stefan Reimer db5e587070 Adjust and limit Prometheus resources 2020-09-10 14:01:28 +01:00
Stefan Reimer 71de050f9e ArgoCD version bump to 1.7.4 2020-09-10 13:44:48 +01:00
Stefan Reimer c9b830f727 Change log tag for audit log to not collide with regular tags 2020-09-09 20:59:03 +01:00
Stefan Reimer 122cf5bd52 Calico version bump to 3.16.1 2020-09-09 14:17:02 +01:00
Stefan Reimer 9e043a6241 Dont remove other fields for valid json 2020-09-08 15:41:20 +01:00
Stefan Reimer da503ab38c Fix fluentd parsing of json 2020-09-08 15:34:16 +01:00
Stefan Reimer a1af1a2753 Fix fluentd typo 2020-09-08 15:07:17 +01:00
Stefan Reimer 3b438711dc Update fluentd to latest quay.io image, add json parser for message 2020-09-08 15:05:31 +01:00
Stefan Reimer b7feeae83c Remove CRD property to fix OutofSync Argo 2020-09-08 13:44:31 +01:00
Stefan Reimer 9e0e819fd6 Handle empty message events 2020-09-08 13:40:09 +01:00
Stefan Reimer e09935a819 Add Lua functions to reassemble partial cri-o logs 2020-09-08 13:12:21 +01:00
Stefan Reimer 6b1b02a743 Fluent-bit version bump and support for api audit logs 2020-09-08 12:40:28 +01:00
Stefan Reimer 63537919a4 Move scrape username to its own secret as eck operator cleans up otherwise 2020-09-04 01:13:39 +01:00
Stefan Reimer 3fb65140af Enabled scraping etcd 2020-09-02 15:05:57 +01:00
Stefan Reimer 42b792bb4b More fluentd tuning 2020-08-27 01:13:34 +01:00
Stefan Reimer e2d560c881 Disable ingest pipeline until we know what breaks / jams in ES 2020-08-27 01:03:35 +01:00
Stefan Reimer 7f540d57db Revert ES fixes as servicemonitor is retarted 2020-08-26 23:02:47 +01:00
Stefan Reimer ea3432445e Hardcode es user for now 2020-08-26 22:50:51 +01:00
Stefan Reimer f9821762f7 fluentd / ES fixes 2020-08-26 18:13:21 +01:00
Stefan Reimer c78e9c04ce Fix default value 2020-08-25 14:46:22 +01:00
Stefan Reimer 74abf0fbb3 Make Istio Ingress hosts specific matching the cert 2020-08-25 14:45:56 +01:00
Stefan Reimer 31aa92a971 Revert default fluentd image, latest has issues 2020-08-24 11:38:47 +01:00
Stefan Reimer e4c478ed19 Increase default read-timeout for fluentd 2020-08-23 17:47:28 +01:00
Stefan Reimer 8b5d9ad785 Use quay.io fluentd-es image until we roll our own 2020-08-23 17:41:37 +01:00
Stefan Reimer 80867bd1c2 Fix default fluentd hostname 2020-08-23 15:50:14 +01:00
Stefan Reimer 2d58d73798 Remove Cri parser as it is already incl. upstream now 2020-08-22 19:24:58 +01:00
Stefan Reimer 93edcec5a2 Update docs 2020-08-22 18:27:31 +01:00
Stefan Reimer be346b592f Add fluent-bit support to kuberzero-logging, istio fixes 2020-08-22 18:27:18 +01:00
Stefan Reimer 47fa523694 Refactor argo apps factory 2020-08-21 20:39:55 +01:00
Stefan Reimer 3cfa3512e6 Switch istio ingress to http healthchecks, more tuning 2020-08-21 14:17:47 +01:00
Stefan Reimer 5dac264e17 Also apply improved healthcehck handling and draining to public ingress 2020-08-20 18:32:01 +01:00
Stefan Reimer bdc9687bc3 Apply graceful shutdown fixes interim like Contour 2020-08-20 17:38:18 +01:00
Stefan Reimer 89d765dc53 Add graceful shutdown to Ingress gateway, might need istio 1.7 to actually work though 2020-08-20 16:55:47 +01:00
Stefan Reimer 225526869e Set Istio idle timeout to 1h 2020-08-20 16:12:41 +01:00
Stefan Reimer c5e0187475 Set Istio idle timeout 2020-08-20 15:55:49 +01:00
Stefan Reimer d49ff51379 Disable default syncPolicy, use values instead 2020-08-20 11:40:08 +01:00
Stefan Reimer e782303703 Revert to default images 2020-08-18 13:13:30 +01:00
Stefan Reimer fbc203a2c9 Fix istio to service mapping 2020-08-18 12:45:15 +01:00
Stefan Reimer 279dde5ee2 Revert ot quay image, disable plugins 2020-08-18 12:36:56 +01:00
Stefan Reimer 4a6cbfbbcf Disable persistence by default 2020-08-18 12:08:49 +01:00
Stefan Reimer e6e0aa103b Add missing fluentd secrets 2020-08-18 11:58:37 +01:00
Stefan Reimer 777a0d7f94 Disable statefulset for fluentd being broken upstream 2020-08-18 11:41:09 +01:00
Stefan Reimer 12abcacdd9 Add fluentd to logging 2020-08-18 11:34:34 +01:00
Stefan Reimer fba3e8bfc4 Make old ECK resources optional 2020-08-17 13:12:07 +01:00
Stefan Reimer 1398484af8 Make argocd metrics work 2020-08-16 19:25:07 +01:00
Stefan Reimer 0db65bd060 Wire up prometheus metrics for argo-cd 2020-08-16 15:49:57 +01:00
Stefan Reimer e8afc6ddbb EBS-CSI version bump, reduce ArgoCD concurrency to reduce load spikes, sync from 180s to 300s 2020-08-15 23:37:45 +01:00
Stefan Reimer 4b734dc1bc Add cert-manager state handling for argo 2020-08-15 15:59:57 +01:00
Stefan Reimer 03bab16aa8 Exlude stateuful service objects to prevent double scrapes 2020-08-15 14:49:30 +01:00
Stefan Reimer aac2e235f8 Exlude stateuful service objects to prevent double scrapes 2020-08-15 14:45:43 +01:00
Stefan Reimer a6eab7d24b Add label for servicemonitor 2020-08-15 14:33:41 +01:00
Stefan Reimer 1ae1aac294 More logging fixes 2020-08-15 14:25:07 +01:00
Stefan Reimer 5595fff159 Fix optional prometheus support 2020-08-15 13:24:14 +01:00
Stefan Reimer 0e2e8502ed More logging fixes 2020-08-14 23:02:30 +01:00
Stefan Reimer 15605d0cef Adjust scrape internal for kiam to match others, and servicemonitor for agents 2020-08-14 22:31:34 +01:00
Stefan Reimer 55b0f02394 Add proper label for kiam servicemonitors 2020-08-14 17:39:05 +01:00
Stefan Reimer a9cdc7109e Add elastic-system ns to kubezero 2020-08-14 17:12:06 +01:00
Stefan Reimer 30f6432e59 Bugfix for prometheus service for calico 2020-08-14 17:10:25 +01:00
Stefan Reimer f11e98d69e Bugfix release for Calico, README updates 2020-08-14 17:05:25 +01:00
Stefan Reimer 4f4fa92b6b More logging fixes... ready for first trial 2020-08-14 15:52:10 +01:00
Stefan Reimer 7eabb00e72 Various logging fixes to get a first version of ES and Kibana running 2020-08-13 19:44:50 +01:00
Stefan Reimer f703a6d7e1 Add rabbitmq ingress gateway def 2020-08-11 15:09:48 +01:00
Stefan Reimer 3464bae762 Add logging as default ns to look for servicemonitors 2020-08-10 13:53:41 +01:00
Stefan Reimer 51bef938f6 Make sure nodeselector is a string 2020-08-10 13:28:45 +01:00
Stefan Reimer e4193dca73 make nodeselector for private ingress configurable 2020-08-10 13:20:36 +01:00
Stefan Reimer eb4b102291 Version bump of aws-ebs csi driver to 0.6.0 2020-08-10 12:33:53 +01:00
Stefan Reimer 36edd7bb73 Update all charts to use latest lib 2020-08-07 17:02:22 +01:00
Stefan Reimer f43e078c57 New istio naming schema for virtualservices 2020-08-06 19:07:06 +01:00
Stefan Reimer 08a73b6e82 Various deps updates, Istio to 1.6.7 2020-08-06 18:43:59 +01:00
Stefan Reimer bf6c856153 Extend shared library for naming functions 2020-08-06 17:21:27 +00:00
Stefan Reimer 58bf565359 Add default labels 2020-08-06 17:15:32 +00:00
Stefan Reimer d34536e9e2 Add latest docs 2020-08-06 12:38:40 +01:00
Stefan Reimer d532aaf0c9 Add optional istio ingress policies to metrics 2020-08-06 11:34:32 +00:00
Stefan Reimer 7cb0c4432c Add custom prometehus-operator settings 2020-08-06 11:52:16 +01:00
Stefan Reimer ae1d721a09 Controller and scheduler use self-signed certs 2020-08-05 15:58:37 +01:00
Stefan Reimer b541463829 Add cert-manager backup support in bootstrap, enable schedule and controller metrics 2020-08-05 15:42:15 +01:00
Stefan Reimer 07167f3908 ArgoCd version bump 2020-08-05 13:29:50 +01:00
Stefan Reimer ef38eef245 Finally fix go templating awkwardness 2020-08-05 01:05:05 +01:00
Stefan Reimer f81ba2c008 Volume features need more testing <1.17 2020-08-05 01:00:11 +01:00
Stefan Reimer ee6510a208 Fix Go template specialness 2020-08-05 00:56:31 +01:00
Stefan Reimer e25c125a43 Enable Volumesnapshot/resize, disable leader election of single instance 2020-08-05 00:50:26 +01:00
Stefan Reimer da9207bf79 Disable kubelet cadadvisor metrics 2020-08-04 14:45:42 +01:00
Stefan Reimer 8c0c48f37d Enable kube_proxy metrics as a trial 2020-08-04 10:08:10 +01:00
Stefan Reimer 874039f08c Add node_exporter relabel for adapter 2020-08-04 01:38:26 +01:00
Stefan Reimer 6707669929 Adapter config from kube-prometheus 2020-08-03 22:19:16 +01:00
Stefan Reimer 50bed0573d Adapter config from kube-prometheus 2020-08-03 22:15:05 +01:00
Stefan Reimer 61196f4ccc Disable default rules for the adapter 2020-08-03 21:29:24 +01:00
Stefan Reimer a838b2c108 Temp add custom prometheus url 2020-08-03 20:56:52 +01:00
Stefan Reimer 40e992d9c5 Temp add custom prometheus url 2020-08-03 20:52:57 +01:00
Stefan Reimer ee19bd42d8 Fix scopt for adapter rules 2020-08-03 18:53:18 +01:00
Stefan Reimer 6f23589fc5 Try default settings fro adapter 2020-08-03 18:47:11 +01:00
Stefan Reimer 0a0aef5d08 Disable unreachable metrics for now 2020-08-03 18:30:33 +01:00
Stefan Reimer f6da71451e Add istio for prometheus 2020-08-03 17:44:58 +01:00
Stefan Reimer 3682efcdf1 Frist mostly working version 2020-08-03 17:15:12 +01:00
Stefan Reimer 89cbcc9746 Add all the rules minus alertmanager 2020-08-03 17:01:39 +01:00
Stefan Reimer 161fbf9fa2 Config fixes, svc name fix 2020-08-03 16:34:57 +01:00
Stefan Reimer a49b01d768 Add istio support for metrics grafana 2020-08-03 16:24:32 +01:00
Stefan Reimer 7348f6dfa6 Add Grafana 2020-08-03 16:08:16 +01:00
Stefan Reimer 94d05241e3 Fix scope of prometheus options 2020-08-03 15:51:44 +01:00
Stefan Reimer 5002af7b58 Revert to default Prometheus version 2020-08-03 15:43:56 +01:00
Stefan Reimer a90ee78922 Enable operator on release ns 2020-08-03 13:57:14 +01:00
Stefan Reimer 9af637cb2e Enable operator on release ns 2020-08-03 13:50:32 +01:00
Stefan Reimer 546efb81fd Add basic Prometheus itself 2020-08-03 13:26:00 +01:00
Stefan Reimer 91c98e36a4 Enable node_exporter 2020-08-03 13:16:48 +01:00
Stefan Reimer 9afa4828be Latest deploy, add prometheus adapter to metrics 2020-08-03 13:06:07 +01:00
Stefan Reimer 0a0f54a316 Disable webhooks for now, latest operator 2020-07-31 01:32:44 +01:00
Stefan Reimer 794f249eeb Add webhooks and set tolerations 2020-07-31 01:18:07 +01:00
Stefan Reimer 6faf757e30 Let Argo take care of CRDs 2020-07-30 18:56:46 +01:00
Stefan Reimer a5142b523d Add montoring and logging NS to kubezero argo project 2020-07-30 18:27:43 +01:00
Stefan Reimer 585c3770ef Move metrics to monitoring NS 2020-07-30 18:26:11 +01:00
Stefan Reimer db26111872 Add draft metrics chart 2020-07-30 18:18:32 +01:00
Stefan Reimer a84fcf0a63 Initial metrics chart 2020-07-30 17:19:48 +01:00
Stefan Reimer 5aed60f21a minor bootstrap fix 2020-07-30 17:19:04 +01:00
Stefan Reimer 19b1fd61c9 Convert argo-cd ACL to DENY policy 2020-07-29 18:02:18 +01:00
Stefan Reimer 8f334fdb10 Apparently no patch levels in requirements 2020-07-29 15:12:06 +01:00
Stefan Reimer 4468cafd52 More cleanup, kiam doc update 2020-07-29 15:07:41 +01:00
Stefan Reimer 6b342e72db Calico cleanup, add efs-csi 2020-07-29 14:46:55 +01:00
Stefan Reimer 51c1f039f3 Tweaks for aws-ebs-csi-driver, added initial aws-efs-csi-driver 2020-07-24 15:40:24 +01:00
Stefan Reimer afcd0cbc5e Remove duplicate CRD 2020-07-24 12:31:22 +01:00
Stefan Reimer 7759977a4f Helm bugfixes 2020-07-24 12:24:21 +01:00
Stefan Reimer e878b9de0b Allow Argo-CD to listen on public ingress, add argo-cd istio policy 2020-07-24 11:07:25 +00:00
Stefan Reimer 853fdfe6f8 Add LoadBalancer support to private ingress 2020-07-24 10:31:01 +00:00
Stefan Reimer d0b0829812 Make private Ingress optional but enabled by default 2020-07-23 15:01:34 +00:00
Stefan Reimer 474a0b6a96 Make kubezero work on baremetal 2020-07-23 14:52:11 +00:00
Stefan Reimer 72b6330ba8 Another round of Istio fixes 2020-07-22 20:25:18 +01:00
Stefan Reimer 9231f402f0 More istio bootstrap tweaks 2020-07-22 20:11:34 +01:00
Stefan Reimer 5d9a50c369 Fix jsonpointers 2020-07-22 16:28:27 +01:00
Stefan Reimer c4a1427f8b Add more excludes to improve ArgoCD integration with Istio 2020-07-22 16:18:00 +01:00
Stefan Reimer d5ee713f73 Bugfix, move CRDs into plain manifest to help bootstrapping 2020-07-22 16:01:36 +01:00
Stefan Reimer e56bce1f41 Fix private ingress gateways affinity 2020-07-22 00:11:38 +01:00
Stefan Reimer 318e6094e2 More Istio 1.6 tuning 2020-07-22 00:08:19 +01:00
Stefan Reimer 050a56a3f8 Add KubeZero default Istio config 2020-07-21 18:30:21 +01:00
Stefan Reimer c6a7b972a5 Add sync waves to cert-manager 2020-07-16 14:09:00 +01:00
Stefan Reimer 63b60b655a Derp 2020-07-15 18:15:18 +01:00
Stefan Reimer 1afe82f831 More minor fixed for new Istio integration 2020-07-15 18:13:23 +01:00
Stefan Reimer fc2698d11f Make calico nodes a little more tolerant to CPU spikes 2020-07-15 17:13:40 +01:00
Stefan Reimer 3fc58d5c1e Set proper istio-operator values 2020-07-15 16:17:47 +01:00
Stefan Reimer 05fa490031 We skip Istio 1.5, hook into deploy flow 2020-07-15 15:21:49 +01:00
Stefan Reimer 66f6bab15b Make IPPool optional to fix bootstrap issues, doc fixes 2020-07-15 11:44:43 +01:00
Stefan Reimer 660fa06e21 Update docs 2020-07-14 20:49:20 +01:00
Stefan Reimer 03c4ce6405 Fix handling of CRDs for bootstrapping calico 2020-07-14 20:48:58 +01:00
Stefan Reimer 3551c07381 Revert istio-operator to 1.5 as we cannot skip versions during upgrades 2020-07-14 18:17:08 +01:00
Stefan Reimer d4f59253f8 Istio-operator to 1.6 2020-07-14 18:01:13 +01:00
Stefan Reimer a5ede1bea9 Add istio namespace to kubezero project 2020-07-14 17:41:27 +01:00
Stefan Reimer ca52f90153 Add Istio into kubezero, disabled by default 2020-07-14 17:39:12 +01:00
Stefan Reimer f1e78bc6b4 Fix local-path-provisioner, map istio-operator to control plane 2020-07-14 17:30:43 +01:00
Stefan Reimer a9d0a2967a Kiam minor version bump, first draft of istio chart 2020-07-14 16:58:14 +01:00
Stefan Reimer ac82ef5b77 Convert local-static-provisioner into Umbrella Helm chart 2020-07-09 16:17:23 +01:00
Stefan Reimer 340cbe07f1 Fix crd for ippool 2020-07-09 12:19:58 +01:00
Stefan Reimer 66880e08fa Map calico controller to controller nodes, add IPPool 2020-07-09 12:01:01 +01:00
Stefan Reimer c0b519dd92 missing default values 2020-07-08 21:03:55 +01:00
Stefan Reimer 418e40b343 Derp 2020-07-08 17:09:52 +01:00
Stefan Reimer 43dd389348 Disable prune for migration 2020-07-08 16:59:25 +01:00
Stefan Reimer d81e9839d6 Reenable pruning for calico 2020-07-08 15:17:35 +01:00
Stefan Reimer 1096509121 Improve app logic 2020-07-08 14:52:13 +01:00
Stefan Reimer 2e2d9e9476 Make calico migration explicit 2020-07-07 14:28:32 +01:00
Stefan Reimer 1fe510da6c Disable prune of retain is set 2020-07-07 13:54:45 +01:00
Stefan Reimer 8512b2db5f Calico migration bug fixes 2020-07-07 13:50:08 +01:00