Monitoring with Prometheus (and metrics-server) #3

bradenmacdonald · 2022-12-06T18:57:41Z

Use helm subcharts to deploy Prometheus and Grafana onto the chart.

TBD: Can we have them auto-detect and start monitoring Open edX instances on the cluster as they get deployed using Tutor, like how the ingress controller works?

antoviaque · 2022-12-07T20:09:12Z

@bradenmacdonald Thanks for creating this task!

@lpm0073 was it the one you are taking on?

lpm0073 · 2022-12-07T20:26:31Z

yes, this is me

antoviaque · 2022-12-07T20:29:06Z

@lpm0073 Alright, I'm assigning the issue to you then, if that works :)

antoviaque · 2023-01-11T14:04:18Z

@lpm0073 Could you post a status update on this task here? This way we could follow & discuss here async, ahead of the next meeting.

antoviaque · 2023-01-16T12:58:43Z

@lpm0073 Are you still interested in this task?

antoviaque · 2023-02-08T09:22:43Z

Recap from the meeting: @lpm0073 is now unblocked to work on this ticket, based (if I understood correctly) on the autoscaling work from @jfavellar90 in #2.

felipemontoya · 2023-02-21T17:15:27Z

@lpm0073 could you give us an update on this? are you interested in pursuing this still?

lpm0073 · 2023-02-27T13:50:23Z

i'm beginning today. i'll start here, focusing on the Karpenter dependencies, which include:

Kubecost effectively shares the same set of dependencies, so i'll use this as a guide for scaffolding purposes. Separately, when running on AWS EKS, these supporting systems will benefit from kube-proxy and coredns, so i'll look into whether we can detect if these are running, and if not then try to at least echo something to the console.
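A minimal sketch of the detection idea above, assuming the names of the workloads in `kube-system` have already been collected (for example from `kubectl get deployments,daemonsets -n kube-system -o name`, with the `kind/` prefix stripped). The function names and the warning text are hypothetical and not part of the chart:

```python
# Hypothetical helper: the add-on names come from the comment above,
# everything else (function names, message wording) is an assumption.
EXPECTED_ADDONS = ("kube-proxy", "coredns")


def missing_addons(workload_names, expected=EXPECTED_ADDONS):
    """Return the expected add-ons that are absent from workload_names."""
    present = set(workload_names)
    return [name for name in expected if name not in present]


def report(workload_names):
    """Print one warning per missing add-on and return the missing names."""
    missing = missing_addons(workload_names)
    for name in missing:
        print(f"WARNING: {name} does not appear to be running in kube-system")
    return missing


if __name__ == "__main__":
    # Example input only; a real check would read the names from the cluster.
    report(["aws-node", "coredns"])
```

This only compares names, so it would not notice an add-on that exists but is unhealthy.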

bradenmacdonald · 2023-02-27T19:08:37Z

@lpm0073 Thanks! Note that metrics-server and VPA are already being worked on in #17

lpm0073 · 2023-02-27T23:37:22Z

Question to the group: aside from the helm charts, there are a few AWS resources that need to exist, and need to be provided to the helm chart:

IAM role for service accounts
EC2 instance profile
EC2 tagging role and policy attachment

I have these Terraform scripts. how should i incorporate these into this repo?

bradenmacdonald · 2023-02-28T19:23:28Z

@lpm0073 I put some DigitalOcean example Terraform Scripts in https://github.com/openedx/openedx-k8s-harmony/tree/main/infra-example ; you could rename that folder to infra-example-do and create a new infra-example-aws folder with the AWS terraform.

lpm0073 · 2023-03-06T11:29:38Z

confirming that pr #17 takes care of metrics-server and vpa dependencies for this issue.

felipemontoya · 2023-03-07T17:47:03Z

After PR#17 is merged we will need to create a new PR with the helm charts for grafana and prometheus

adzuci · 2023-03-21T17:50:45Z

Hi there! Though I don't have a lot of context currently, I wanted to mention that 2U has looked into the way FairwindsOps lets chart users toggle the installation of Prometheus on and off via the prometheus-metrics.installPrometheusServer flag in https://github.com/FairwindsOps/charts/tree/master/stable/insights-agent

I would be interested in talking more about this over Slack or on a call, if others would be too.
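For illustration, here is a rough sketch of how this kind of toggle resolves when it is wired up as a Helm dependency `condition`: the first listed path that resolves to a boolean in the parent values decides, and if no path resolves the subchart stays enabled. This is a simplified model (it ignores `tags`), and the `prometheus.enabled` / `grafana.enabled` paths are hypothetical, not taken from the FairwindsOps chart or from this repo:

```python
def lookup(values, dotted_path):
    """Walk a nested dict along a dotted path; return None if a key is missing."""
    node = values
    for key in dotted_path.split("."):
        if not isinstance(node, dict) or key not in node:
            return None
        node = node[key]
    return node


def is_enabled(dependency, values):
    """First condition path that resolves to a boolean wins; default is enabled."""
    for path in dependency.get("condition", "").split(","):
        path = path.strip()
        if not path:
            continue
        result = lookup(values, path)
        if isinstance(result, bool):
            return result
    return True


# Hypothetical dependency list and user-supplied values.
dependencies = [
    {"name": "prometheus", "condition": "prometheus.enabled"},
    {"name": "grafana", "condition": "grafana.enabled"},
]
values = {"prometheus": {"enabled": False}}

enabled = [d["name"] for d in dependencies if is_enabled(d, values)]
print(enabled)
```

With these example values only the `grafana` entry remains enabled, since its condition path is not set at all.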

antoviaque · 2023-04-04T16:24:23Z

@adzuci Did you get the information and discussions you wanted during the last meeting about this?

Note that @bradenmacdonald has also created a dedicated task to follow-up on topics of interest for you and 2U, at #28 - comments there are welcomed! Or during the meeting later today.

felipemontoya · 2023-04-18T17:43:20Z

@lpm0073 we are getting close to finishing the autoscaling part of the charts, which was closely related to this.

Are you interested in pursuing this further?

felipemontoya · 2023-05-02T17:07:25Z

@lpm0073 following the meeting we will split this ticket into two less ambiguous issues. Please comment if you have a different opinion

felipemontoya · 2023-05-30T03:25:54Z

Now that the issues for grafana and prometheus have been created we can go ahead and close this.

antoviaque assigned lpm0073 Dec 7, 2022

antoviaque mentioned this issue Feb 8, 2023

Karpenter? #7

Closed

antoviaque moved this to Backlog in DevOps Working Group Feb 8, 2023

antoviaque added this to DevOps Working Group Feb 8, 2023

antoviaque moved this from Backlog to In Progress in DevOps Working Group Feb 8, 2023

bradenmacdonald changed the title ~~Monitoring with Prometheus and Grafana~~ Monitoring with Prometheus and metrics-server Mar 21, 2023

MoisesGSalas mentioned this issue Mar 27, 2023

Identify next steps until we can run this #26

Closed

felipemontoya changed the title ~~Monitoring with Prometheus and metrics-server~~ Monitoring with Prometheus (and metrics-server) May 2, 2023

This was referenced May 30, 2023

Adding a subchart to host grafana as a global service #37

Closed

Adding a subchart to include prometheus as a data source for grafana #38

Closed

felipemontoya closed this as completed May 30, 2023

github-project-automation bot moved this from In Progress to Done in DevOps Working Group May 30, 2023
