Commit Graph

598 Commits

Author SHA1 Message Date
320dd670ca Add CoreDNS and etcd dashboards 2021-04-26 17:07:17 +02:00
10492add9b feat: add Kiam Grafana dashboard 2021-04-26 17:06:42 +02:00
2ab696c19e fix: fix naming of kubernetes grafana dashboards, make decompress work for all dashboards 2021-04-26 16:46:35 +02:00
421ab21a6b feat: use kube-mixin dashboards directly 2021-04-26 16:27:19 +02:00
9be889efed chore: update ES dashboard DS 2021-04-25 22:29:55 +02:00
8ddf39b383 feat: Add Grafana dashboards for logging 2021-04-25 22:19:06 +02:00
5af45ade23 fix: Fix datasource in Istio Grafana dashboards 2021-04-25 21:25:06 +02:00
af5092475d fix: adjust datasource and encoding of grafana dashboard tooling 2021-04-25 21:24:39 +02:00
4bf189a1e9 feat: kubezero-metrics version bump, new Grafana dashboard management tooling for KubeZero, add ability to for compressed dashboards in configmaps 2021-04-25 11:59:54 +02:00
a3d47cdb12 feat: Add Istio Grafana dashboards, enable metrics 2021-04-25 11:58:17 +02:00
a78ad7a7f9 feat: integrating metrics support for Istio with KubeZero metrics 2021-04-22 15:43:10 +02:00
1133902b5d fix: add missing standard labels to custom Istio resources 2021-04-22 12:00:13 +02:00
6e173070a8 feat: First version of KubeZero NATS module 2021-04-22 11:59:18 +02:00
63ecf315d7 chore: Rename timemachine to timecapsule 2021-04-21 16:13:40 +02:00
401f024be6 feat: introduce ingress proxy recommended hardening/uning settings 2021-04-20 16:33:45 +02:00
64dd6160cb feat: improved update strategy and timing to reduce 5XX during istio ingress deployments 2021-04-20 12:49:29 +02:00
fd35a46b66 fix: enable crds for aws-ebs-csi-driver to make snapshots work 2021-04-19 13:19:30 +02:00
f1cb2dbb66 feat: Map gemini controller to controller nodes, fix ebs storageclass, integrate timemachine into kubezero 2021-04-19 12:46:42 +02:00
b9c341a55b feat: First version of KubeZero Timemachine backup solutions 2021-04-19 11:31:28 +02:00
31a3848172 fix: re-add terminationtimeout settings for istio as new way doesnt seem to work yet 2021-04-16 13:55:04 +02:00
1b22720d4b chore: sync resources between public and private ingress 2021-04-16 13:49:55 +02:00
1f68cea76b fix: increase default memory limit to ingress envoy to 512MB, enable podDisruptionBudgets on demand 2021-04-16 13:41:31 +02:00
33133f359c refactor: Move Istio config to new place 2021-04-16 12:38:57 +02:00
321c2fe58b fix: Upgrade Istio to 1.9.3 due to various security issues upstream 2021-04-16 11:41:13 +02:00
adb54b7663 feat: first stab at Kubernetes 1.20 kubeadm config 2021-04-15 15:51:46 +02:00
882165cc58 fix: adjust deployment strategy to replace for Grafana because we enabled persistence 2021-04-15 15:16:28 +02:00
bb53c8cf35 feat: kube-prometheus-stack version bump, adjust filesytem alerts 2021-04-15 15:03:52 +02:00
f9dbcee502 feat: add runtimeclass for crio, reorg kubeadm for 1.20 2021-04-14 16:05:16 +02:00
f2d7d7821f fix: loosen kiam memory limits slightly to prevent OOM endless loops 2021-04-09 13:44:48 +02:00
4f9524c8b7 feat: add support for volumeAttributes to aws-efs-csi-driver to allow to disable buggy TLS encryption 2021-04-08 16:19:51 +02:00
c912860b60 chore: fluent-bit version bump to fix excessive logging 2021-04-07 12:00:53 +02:00
9a362607c1 fix: use evictionHard to reserve node memory to avoid systemd mess 2021-04-04 16:52:18 +02:00
2dc912cf9a chore: chart dep fix 2021-04-01 14:51:14 +02:00
a3eebeaf61 fix: Use latest livenessprobe for aws-efs-csi-driver to tackle memory leak 2021-04-01 14:46:36 +02:00
e758795467 feat: Version bump of aws-ebs-csi-driver 2021-04-01 14:15:43 +02:00
784ac2a4fb style: use quote function in kubeadm helm template 2021-04-01 12:35:56 +02:00
6536a655a4 feat: enable GenericEphemeralVolume feature gate 2021-03-30 16:18:46 +02:00
9391958a3a refactor: Unifi feature-gate handling in kubeadm chart 2021-03-30 14:50:37 +02:00
c1a1aea29f fix: Set Redis cluster proxy policy to PREFER_MASTER 2021-03-26 17:35:21 +01:00
1a1f5e7cd6 chore: Bump Istio version from 1.9.1 to 1.9.2 2021-03-26 17:34:43 +01:00
eca69f8b5f Reduce fluent-bit memory consumption under backpressure 2021-03-26 10:56:17 +01:00
4d015cc4c6 Update chart READMEs 2021-03-25 16:32:49 +01:00
ae5e5e1c5f Ensure we use our version of the aws-efs chart 2021-03-24 13:17:27 +01:00
9f0e8a422c Ensure we use our version of the aws-efs chart 2021-03-24 13:08:17 +01:00
383fafce43 Ensure we use our version of the aws-efs chart 2021-03-24 13:06:31 +01:00
2adde8f713 Ensure we use our version of the aws-efs chart 2021-03-24 13:05:36 +01:00
6300826394 Fix resources location 2021-03-24 12:21:03 +01:00
019b5b0ac4 Fix resources location 2021-03-24 12:17:19 +01:00
48d3d269e7 Add custom support for nodeaffinity and resources to aws-efs-csi-driver 2021-03-24 12:11:47 +01:00
6ea7500e41 ArgoCD version bump 2021-03-23 16:21:14 +01:00
92a3bc06a3 Redis proxy upgrade to match Istio 1.9 2021-03-22 17:00:54 +01:00
f85d842267 Add more resources to metrics 2021-03-22 12:05:02 +01:00
27c1be4085 Add more resources to metrics 2021-03-22 11:41:26 +01:00
a2355df60f Remove unsupported resources from aws-efs 2021-03-22 11:17:56 +01:00
13aaae8a54 Version bump aws-efs-csi-driver 2021-03-22 11:14:27 +01:00
1b3dbe36eb Version bump of kube-prometheus stack 2021-03-22 10:23:27 +01:00
ec87dd7dcc Some tool tweaks 2021-03-19 16:16:13 +01:00
1018270620 Add nodeAffinity to all logging components, add resources to fluent-bit, tuning 2021-03-19 16:15:58 +01:00
419d43cf9f Kubelet tuning 2021-03-18 14:31:10 +01:00
3f204b5e04 Bugfixes for control plane, proper calico cpu requests 2021-03-17 17:29:44 +01:00
de2602c3d5 Updates for etcd 1.19 2021-03-15 11:51:56 +01:00
6fe69c9a38 Bump kiam-server memory limit 2021-03-11 09:05:26 +01:00
64a0736dfd Bump ECK operator to 1.4.1 2021-03-11 09:00:47 +01:00
cde586a0df Reduce fluentd chunk size and increase retry timeout 2021-03-10 10:44:51 +01:00
22a1e8d171 Update fluentd.patch for helm chart 2021-03-10 10:34:17 +01:00
e666d1079a Upgrade fluentd to use new upstream helm and image 2021-03-10 10:32:12 +01:00
0d6d22b0d4 Disable metadata via kubelet for now 2021-03-09 10:33:40 +01:00
9dc2881f15 Remove unnecessary docker mount 2021-03-07 12:47:06 +01:00
05b2edf089 Switch fluent-bit to use kubelet rather than kube-api 2021-03-07 12:38:53 +01:00
e991e7247a Initial aws-node-termination still disabled, local-volume tweaks for new tag layout 2021-03-05 18:18:45 +01:00
50ffcf28eb Version upgrade ES/Kibana and Fluentbit, various tunings 2021-03-05 16:53:02 +01:00
532710b77b remove cpu limit for aws-iam-auth, enable cpufs kubelet feature flag 2021-03-05 14:00:00 +01:00
f38fe4f790 More request tuning for aws-ebs-csi 2021-03-05 13:58:54 +01:00
4a9eb00f9d aws-ebs-csi-driver version bump, remove cpu limts 2021-03-05 10:32:42 +01:00
6e85de3722 remove default cpu limmits for kiam 2021-03-05 10:22:54 +01:00
ef3e8f4535 aws-ebs-csi-driver version bump introducing readiness probes 2021-03-03 10:59:12 +01:00
3df6229722 remove patch left overs 2021-03-02 11:37:02 +01:00
65eabacf49 Slightly increase cpu limits for aws-ebs 2021-03-02 11:32:00 +01:00
948764eca7 Version bump charts 2021-03-02 11:28:13 +01:00
80ea9488f6 aws-ebs-csi-driver version bump and resource limits 2021-03-02 11:22:34 +01:00
cbeb7b9704 Istio version bump due to security release 2021-03-02 10:33:12 +01:00
9531073c36 Prometheus-stack version bump 2021-02-26 22:25:43 +01:00
491057ed65 Minor version bump of aws-ebs-csi-driver to update livenessprobe 2021-02-26 01:18:32 +01:00
c3a36a2d7d Fix gateway protocol 2021-02-26 00:35:21 +01:00
97ec77f3b7 Update ingress default config 2021-02-26 00:24:12 +01:00
b6e92ceba2 Upgrade Istio to 1.9 2021-02-25 23:44:33 +01:00
4a7f7f8187 Reduce loglevel for efs driver 2021-02-25 00:23:50 +01:00
3758a86553 Version bump for aws-efs-csi-driver, use upstream helm chart 2021-02-25 00:17:50 +01:00
d858146a1d Version bump for aws-ebs-csi driver, enable volume resize, snapshot, patch for loglevel and leader election 2021-02-24 20:36:34 +01:00
064012d083 Version bump of ArgoCD required for Kube > 1.18 latest charts 2021-02-24 00:10:14 +01:00
1218033166 Further tuning of fluentd throughput 2021-02-22 21:34:45 +01:00
eb4f22c5c2 Fix kubelet config 2021-02-22 21:32:41 +01:00
62a8f82f01 Version bump cert-manager 2021-02-22 21:32:12 +01:00
d969e53d40 Make kubeadm config work on bare-metal, minor tuning 2021-02-22 14:41:32 +01:00
8e8f747686 Kubeadm chart for 1.19, improved tooling 2021-02-12 11:04:16 +00:00
19d10828f6 README updates 2021-01-26 13:47:33 +00:00
fc45e7fd0b Istio minor version bump 2021-01-26 12:54:56 +00:00
9ca8920387 Fix changed key for kiam 2021-01-21 13:35:20 +00:00
7587564da0 Version bump for aws-ebs-csi and kiam, ES bugfix bump, fluentd tuning 2021-01-21 12:31:06 +00:00
d28e18766a CI/CD tools update 2021-01-21 10:53:53 +00:00
adefd7433b Reduce logLevel of prometheus adapter 2021-01-20 15:31:00 +00:00
da6a1fdf51 Reduce loglevel of prometheus adapter 2021-01-20 15:22:28 +00:00
a26b652690 Allow custom memory overwrites for ES cluster 2021-01-18 17:18:30 +00:00
d7091434db Add basic mapping for aws-iam-auth 2021-01-11 20:41:12 +00:00
ce7645cb57 Split out crds for aws-iam-authenticator 2021-01-04 18:13:36 +00:00
4fe40a1345 Add aws-iam-authenticator support 2021-01-04 14:56:41 +00:00
924310ca5b Remove stable repo 2021-01-03 16:33:13 +01:00
67f1157848 Integrate and patch prometheus-stack chart to customize alerts 2020-12-17 16:46:15 -08:00
4892d6c073 Switch to gp3 as default EBS class, version bump for metrics components 2020-12-17 15:36:23 -08:00
fdcb6f7e6f Remove repositories to make argo happy 2020-12-17 12:24:12 -08:00
214bfec2a4 Remove repositories to make argo happy 2020-12-17 12:22:48 -08:00
38b2d56da9 Re-add fluentd chart until we migrate off 2020-12-17 12:17:19 -08:00
521bb2a5c1 Istio version bump, ingress terminationgraceperiod patch, aws-ebs version bump 2020-12-16 03:40:14 -08:00
79dc6e9413 EBS driver version bump 2020-12-10 07:06:31 -08:00
3820858046 More logging tuning 2020-12-10 06:44:58 -08:00
89dc890c74 More logging tuning 2020-12-10 06:36:26 -08:00
a8314d4074 Lua fix fluent-bit 2020-12-08 07:15:00 -08:00
77a7ba2ed6 Integrare fluent-bit into logging to allow better config 2020-12-08 07:05:25 -08:00
f78c382be6 Use upstream released chart for aws-ebs-csi 2020-12-07 15:01:40 -08:00
5909fcd841 Fix empty CRDs, only deploy eck-operator if needed 2020-12-07 13:06:00 -08:00
835aae9df8 Re-enable geoip lookups 2020-12-07 04:33:33 -08:00
b30b41ab15 Disable CRDs from eck-operator defaults 2020-12-05 14:16:33 -08:00
a31a945094 Adjust argo ingnores for latest eck webhooks 2020-12-05 14:08:40 -08:00
2a56489273 ECK fixes for Kube 1.18, Redis cluster support incl. Enyoy proxy 2020-12-04 06:05:35 -08:00
33495c83de Add helm version check to bootstrap.sh 2020-12-03 02:04:08 -08:00
8fcbcb680b Minor version bump for redis, added redis-cluster support 2020-12-02 07:23:17 -08:00
83b9b566db Switch all metrics logs to json 2020-12-02 06:24:07 -08:00
89780039fc Fix service names in metrics 2020-12-02 04:30:17 -08:00
ee83391296 Add alertmanager istio config for metrics, metrics values reorg 2020-12-02 03:53:19 -08:00
a510dd06d9 More fixes and upgrade docs 2020-12-01 07:46:04 -08:00
2be387b87b ArgoCd naming fixes 2020-11-30 09:30:06 -08:00
0e7a2e70d6 More fixes 2020-11-30 04:13:52 -08:00
b0f53257ac cert-manager version bump, local-path-provisioner fixes 2020-11-30 11:34:44 +00:00
59ff3cb015 Add local-path-provisioner, re-org bootstrap 2020-11-30 01:52:11 -08:00
4b48da5935 Metrics update 2020-11-28 23:54:40 +00:00
e27692430e More bugfixes, ingress certs 2020-11-28 15:01:20 -08:00
09d2b52f74 More fixes 2020-11-27 08:19:44 -08:00
10db0f09d0 Add missing .helmignore 2020-11-26 15:31:40 -08:00
052efd077c Latest fixes, fluent-bit version bump 2020-11-26 09:37:10 -08:00
c8a903110f More fixes now adding ArgoCD 2020-11-26 05:21:10 -08:00
ec6d7a4d11 Another argo tweak 2020-11-24 07:29:38 -08:00
5b317db251 Bug fixes and argo tweaks 2020-11-24 07:18:14 -08:00
32ed7cf3a0 Revert Kube version check to make argo work 2020-11-24 06:51:48 -08:00
cd24b9fa1a First try adding argoCD day 2 2020-11-24 06:44:57 -08:00
35b1570d18 Update of various components, new aroless bootstrap working 2020-11-21 04:24:57 -08:00
cd0e559678 First steps of argoless bootstrap 2020-11-03 12:51:57 +00:00
b6929002dc Minor version bump for prometheus-stack, remove default CPU limit 2020-10-27 14:13:52 +00:00
53b638da5e Update docs, bump argo-cd parallel jobs 2020-10-27 11:54:44 +00:00
9e76512fcc Remove argocd from control plane 2020-10-21 14:18:02 +01:00
74c47d7391 Enable json logs for argo-cd finally 2020-10-21 13:29:49 +01:00
d7006faa60 Bump argo-cd chart version 2020-10-21 13:14:23 +01:00
4fb425676d Bump argo-cd version 2020-10-21 13:12:23 +01:00
8874c9869d Revert more prometheus-adapter config 2020-10-21 13:05:08 +01:00
72a917bdae Revert prometheus adapter changes 2020-10-21 12:51:15 +01:00
44d08c7abc More EFS fixes, cert-manager version bump 2020-10-21 04:37:33 -07:00
19d915cb92 Adjust prometheus URLs 2020-10-09 18:41:43 -07:00
509f8d59fb First stab at new prometheus charts 2020-10-09 17:58:44 -07:00
05993ab6b0 Cleanup 2020-10-09 12:38:20 -07:00
5781494eda Minor tweak to aws efs upate tooling 2020-10-09 11:15:19 -07:00
fea850afcc Actually update the default version of aws ebs to 0.7.0 2020-10-09 11:14:51 -07:00
004503d633 AWS EBS driver version bump 2020-10-09 10:53:32 -07:00
d7132ca90c Revert minimal kube version due to issues with argocd 2020-10-09 07:43:05 -07:00
959d61ef66 Add multi PV support to EFS 2020-10-09 07:30:25 -07:00
54335c4c0a Update EFS tooling to track releases 2020-10-08 07:52:34 -07:00
4285db835d Typo 2020-10-07 09:11:22 -07:00
a951e7d9a0 New Lua function to nest entries into kube.<namespace>.* 2020-10-07 09:09:24 -07:00
cb3c6a93ba fluent-bit tag improvements 2020-10-05 17:27:58 -07:00
b0286ff858 Add some spaces 2020-10-05 09:03:47 -07:00
846d7d2d87 More logging fixes, try to decode json at the source 2020-10-05 09:01:50 -07:00
42f8a5a0b5 Disable json logging, crashed Argo 2020-10-05 08:43:18 -07:00
31f86360d9 Revert ArgoCd 1.7.7 2020-10-05 08:27:37 -07:00
baa9b69265 Latest argocd 2020-10-05 04:31:00 -07:00
5854468f09 Derp 2020-10-05 04:09:03 -07:00
c556df65ff Updated helm-docs, fluentd SSL handled by Istio, ES&Istio tuning 2020-10-05 03:50:23 -07:00
4aeb23d8cc Disable borken json parsing for now 2020-10-02 14:46:07 -07:00
bbd6d25429 Disable borken json parsing for now 2020-10-02 14:41:40 -07:00
1aba6fcbe6 Fix the warning due to double CRDs 2020-10-02 10:44:15 -07:00
cd5b38bb6c Istio version bump, make http10 support optional, enable redis,mysql protocol support 2020-10-02 10:38:09 -07:00
4cb3bd01c5 Minor fluent-bit tuning 2020-10-01 12:32:21 -07:00
84a80f3b97 Fluentd tuning 2020-10-01 10:14:04 -07:00
ea2391a212 Fluentd tuning 2020-10-01 10:11:48 -07:00
fad0597302 Disable pipeline still cpu issues 2020-09-28 04:54:47 -07:00
8de44f18d4 Reenable fluentd ingest pipeline again 2020-09-28 04:45:39 -07:00
d30ca895ec Make the kiam annotate namespace job optional 2020-09-18 16:18:59 +01:00
0939405c7a Logging fixes for NOT using nameoverride 2020-09-18 16:12:52 +01:00
2c600c2fd0 Slightly allow ArgoCD a bit more processing 2020-09-18 14:21:39 +01:00
8af14e3e8e Bump argocd to 1.7.5 as 1.7.4 has a deadlock CPU issue 2020-09-18 13:09:18 +01:00
df20d07d10 Add EnvoyFilter to enable tcp keepalive for all Ingress Envoys 2020-09-17 22:25:09 +01:00
d61752703e Revert TCP keepalive for fluentd listener 2020-09-17 19:44:34 +01:00
bcf8093b84 Enable TCP keepalive for fluentd listener 2020-09-17 19:24:24 +01:00
ec18529956 TCP keepalive tuning for Istio 2020-09-17 17:54:57 +01:00
a0873631c4 Set global meshpolicy to prevent upgrade to http2 by default 2020-09-16 16:50:48 +01:00
0b2b5acff7 Another argocd resource tweak 2020-09-15 11:48:07 +01:00
628a7e7ac9 Introduce resources for at least the argocd controller 2020-09-15 11:15:55 +01:00
93723e6a6a Docs update 2020-09-14 17:26:39 +01:00
c9a5691acf fluent-bit version bump 2020-09-14 17:26:19 +01:00
2171a4211e New bootstrap flow 2020-09-14 16:06:53 +01:00
f9770ce483 Latest deploy bootstrap tweaks 2020-09-14 15:24:40 +01:00
189899c296 Disable default poddisruptionbudgets, replace with individual todo 2020-09-11 18:21:00 +01:00
94a0db6a80 Still double CRDs 2020-09-11 16:03:22 +01:00