SB 5.1 - OCI - Deploy/Kubernetes/Monitoring failing #321
-
Env details ` MSG: AnsibleUndefinedVariable: 'dict object' has no attribute 'graylog' ` Could you please share how to define the group graylog in the ansible inventory host file.? |
Beta Was this translation helpful? Give feedback.
Replies: 17 comments 16 replies
-
You can add a dummy host to get around this error as below
|
Beta Was this translation helpful? Give feedback.
-
After adding this it has gone past the previous failure..but now getting error at ` STDOUT: Release "prometheus-operator" does not exist. Installing it now. STDERR: Error: failed to install CRD crds/crd-alertmanager.yaml: unable to recognize "": no matches for kind "CustomResourceDefinition" in version "apiextensions.k8s.io/v1beta1" MSG: non-zero return code failed: [localhost] (item=cassandra-jmx-exporter) => { STDOUT: Release "cassandra-jmx-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1" MSG: non-zero return code failed: [localhost] (item=elasticsearch-exporter) => { STDOUT: Release "elasticsearch-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1" MSG: non-zero return code failed: [localhost] (item=logger) => { STDOUT: Release "logger" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1" MSG: non-zero return code changed: [localhost] => (item=blackbox-exporter) => { STDOUT: Release "blackbox-exporter" has been upgraded. Happy Helming! failed: [localhost] (item=additional-scrape-configs) => { STDOUT: Release "additional-scrape-configs" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1" MSG: non-zero return code failed: [localhost] (item=alertrules) => { STDOUT: Release "alertrules" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "PrometheusRule" in version "monitoring.coreos.com/v1" MSG: non-zero return code failed: [localhost] (item=kafka-topic-exporter) => { STDOUT: Release "kafka-topic-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: [unable to recognize "": no matches for kind "PrometheusRule" in version "monitoring.coreos.com/v1", unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1"] MSG: non-zero return code failed: [localhost] (item=prometheus-redis-exporter) => { STDOUT: Release "prometheus-redis-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: [unable to recognize "": no matches for kind "Role" in version "rbac.authorization.k8s.io/v1beta1", unable to recognize "": no matches for kind "RoleBinding" in version "rbac.authorization.k8s.io/v1beta1"] MSG: non-zero return code failed: [localhost] (item=processing-kafka-exporter) => { STDOUT: Release "processing-kafka-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: [unable to recognize "": no matches for kind "PrometheusRule" in version "monitoring.coreos.com/v1", unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1"] MSG: non-zero return code failed: [localhost] (item=json-path-exporter) => { STDOUT: Release "json-path-exporter" does not exist. Installing it now. STDERR: Error: unable to build kubernetes objects from release manifest: unable to recognize "": no matches for kind "ServiceMonitor" in version "monitoring.coreos.com/v1" MSG: non-zero return code changed: [localhost] => (item=oauth2-proxy) => { STDOUT: Release "oauth2-proxy" has been upgraded. Happy Helming! kubectl --namespace=monitoring get pods -l "app=oauth2-proxy" PLAY RECAP ********************************************************************* ` |
Beta Was this translation helpful? Give feedback.
-
does any other deployment job depends on Deploy/K8S/Monitoring? |
Beta Was this translation helpful? Give feedback.
-
@gandham-santhosh does it mean we cant enabled service monitoring as of today? |
Beta Was this translation helpful? Give feedback.
-
i tried with release-5.1.0 and job failed with the below error
The status of all pods
|
Beta Was this translation helpful? Give feedback.
-
@gandham-santhosh the problem here is that when we have two k8s cluster ( core and flink)
|
Beta Was this translation helpful? Give feedback.
-
how will the flink cluster know about the crd ServiceMonitor |
Beta Was this translation helpful? Give feedback.
-
@ddevadat Please run Deploy/DataPipeline/Monitoring Also, I hope the Deploy/DataPipeline/BootstrapMinimal job is already ran. If not, please run that first before that. |
Beta Was this translation helpful? Give feedback.
-
tried deploying from release-5.1.0 but it giving the below kind of errors
|
Beta Was this translation helpful? Give feedback.
-
@santhosh-tg i tried running this from But getting the below error
|
Beta Was this translation helpful? Give feedback.
-
Still errored
|
Beta Was this translation helpful? Give feedback.
-
its failing for the below
|
Beta Was this translation helpful? Give feedback.
-
here are the list of pods
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
uninstall helped..job is successfull now. I still run from https://github.com/santhosh-tg/sunbird-data-pipeline/tree/release-5.1.0-monitoring-fix Also do we need to run the Deploy/DP/Logging ? |
Beta Was this translation helpful? Give feedback.
-
@santhosh-tg i couldnt run deploy/DP/logging with the same branch as above
|
Beta Was this translation helpful? Give feedback.
uninstall helped..job is successfull now. I still run from https://github.com/santhosh-tg/sunbird-data-pipeline/tree/release-5.1.0-monitoring-fix
Also do we need to run the Deploy/DP/Logging ?