Thanks a lot. I use Prometheus as the main part of our observability stack but have never spent enough time to clearly understand how it works. Thanks to your videos it will be easier to learn, with information from the best source!
Thank you! Someone had asked on ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-7uy_yovtyqw.html as well, here's my current setup (a bit evolved from this first video I did on the channel):
- Everything is 100% recorded and produced/edited on Linux
- Recording software: OBS Studio
- Video editing software: DaVinci Resolve Studio
- Chart animations: built on motioncanvas.io (basically reimplemented rate()/irate()/increase() and other stuff from previous videos in there, with animations)
- Camera: Sony A6400 (connected via Elgato Cam Link to record 4K directly into OBS)
- Mic: Deity VO-7U
- Teleprompter: Desview T3 (this doesn't fit natively on my Sony cam lens, so I built a custom contraption to mount them together)
- Key lights: 2x Elgato Key Light
- Background: Walimex Pro Roll-Up Panel
- Background lighting: 2x NEEWER 660 PRO RGB LED
- Script writing: Google Docs
- Teleprompter app: Nano Teleprompter (nice because you can sync it with your Google Docs for script writing!)
- ffmpeg to convert the 30 FPS of PNGs produced by Motion Canvas into a video file
- Thumbnails: Krita for cutting out shots of myself, a custom GatsbyJS + Puppeteer setup for composing all the thumbnails in HTML/CSS and then automatically screenshotting and uploading them to YouTube (so I can replace them all at once with a new design if I need to)
Very informative videos! Kudos! I'm looking for easy-to-understand and easy-to-implement tutorials/resources about "prometheus at scale for k8s clusters in production", either using thanos/cortex or prometheus federation. It would be great if you could make a few videos on this topic, because I think if anyone can make this topic easy to understand, it's you.
Thanks for this amazing job! I have a question though. How, technically, is the Prometheus server able to scrape the status codes of a server like nginx? As far as I know, that is not something nginx exposes through a client library.
See the part of the video at ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-STVMGrYIlfg.html where I talk about targets that don't have native Prometheus instrumentation. For software (or even hardware devices) that don't expose native Prometheus metrics, you would use a so-called exporter - an agent process sitting next to the thing you want to monitor that gets the metrics from the target and that Prometheus can scrape. For example, for nginx you would use the nginx-exporter: github.com/nginxinc/nginx-prometheus-exporter
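To make the exporter idea a bit more concrete, here is a minimal Python sketch (not the actual nginx-prometheus-exporter code) of the translation step an exporter performs: it parses text in the format of nginx's stub_status page and renders it in the Prometheus text exposition format. The sample input, the function names, and the metric names are illustrative assumptions. A real exporter would fetch the status page over HTTP and serve the result on its own /metrics endpoint for Prometheus to scrape. Note that stub_status only reports connection and request totals, not per-status-code counts.

```python
import re

# Sample text in the format of nginx's stub_status page (values are made up).
SAMPLE_STUB_STATUS = """Active connections: 291
server accepts handled requests
 16630948 16630948 31070465
Reading: 6 Writing: 179 Waiting: 106
"""

# Illustrative metric names and types, chosen for this sketch.
METRIC_TYPES = {
    "nginx_connections_active": "gauge",
    "nginx_connections_accepted": "counter",
    "nginx_connections_handled": "counter",
    "nginx_http_requests_total": "counter",
}


def parse_stub_status(text: str) -> dict:
    """Extract a few numbers from stub_status-style text."""
    active = re.search(r"Active connections:\s*(\d+)", text)
    totals = re.search(r"^\s*(\d+)\s+(\d+)\s+(\d+)\s*$", text, re.MULTILINE)
    if active is None or totals is None:
        raise ValueError("unexpected stub_status format")
    accepted, handled, requests = (int(n) for n in totals.groups())
    return {
        "nginx_connections_active": int(active.group(1)),
        "nginx_connections_accepted": accepted,
        "nginx_connections_handled": handled,
        "nginx_http_requests_total": requests,
    }


def render_exposition(values: dict) -> str:
    """Render the parsed values as Prometheus text exposition lines."""
    lines = []
    for name, value in values.items():
        lines.append(f"# TYPE {name} {METRIC_TYPES[name]}")
        lines.append(f"{name} {value}")
    return "\n".join(lines) + "\n"


# This string is what the exporter would return when Prometheus scrapes it.
metrics_page = render_exposition(parse_stub_status(SAMPLE_STUB_STATUS))
```

The point of the split is that the monitored software keeps its own status format, and only the exporter needs to know how to speak both formats.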
Thanks for this amazing job. I have a question though. When you add the sum by(path), does that mean that the expression will return the sum of all the values of the time series? I don't quite get this part.
Yes, the sum() aggregator aggregates across multiple time series, adding together the individual values of the input series to produce fewer output series. In this case, the "by(path)" preserves the "path" label, so the individual paths are not aggregated over, but are still present in the result: you get one summed series per distinct "path" value. That's similar to the behavior of GROUP BY in SQL.
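As a rough illustration of that grouping behavior, here is a small Python sketch that mimics what sum by(path) does to a single snapshot of series values. This is not how Prometheus implements it; the sum_by helper and the sample series are made up for the example.

```python
from collections import defaultdict


def sum_by(series, by):
    """Sum sample values, grouping by the given label names only."""
    totals = defaultdict(float)
    for labels, value in series:
        # Only the preserved labels form the group key; all other
        # labels (instance, status, ...) are aggregated away.
        key = tuple((name, labels.get(name, "")) for name in by)
        totals[key] += value
    return dict(totals)


# Hypothetical per-series values, e.g. per-second request rates.
series = [
    ({"path": "/api", "instance": "a", "status": "200"}, 3.0),
    ({"path": "/api", "instance": "b", "status": "200"}, 2.0),
    ({"path": "/login", "instance": "a", "status": "500"}, 1.0),
]

result = sum_by(series, by=["path"])
```

Three input series collapse into two output series, one per path: the two "/api" series are added together (5.0), while "/login" stays on its own (1.0).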
Hi Sir, I have a question. Is this tool suitable for maintenance planning? If yes, would you please explain a bit how Prometheus helps with planning maintenance work? Many thanks.
Maintenance planning sounds like a mostly unrelated task, so it's not really what Prometheus was designed for (that would be systems monitoring). However, you can temporarily silence alerts in the Alertmanager during known maintenance windows, so that no notifications are sent for them.
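For the silencing part, here is a hedged Python sketch that only builds the kind of JSON payload that, to my knowledge, Alertmanager's v2 HTTP API accepts for creating a silence (POST /api/v2/silences). It does not send anything. The build_silence helper, the label values, and the author and comment strings are hypothetical, so check the field names against the API documentation for your Alertmanager version before relying on them.

```python
from datetime import datetime, timedelta, timezone


def build_silence(matchers: dict, start: datetime, duration: timedelta,
                  created_by: str, comment: str) -> dict:
    """Build a silence payload covering a maintenance window."""
    return {
        # Each matcher selects alerts whose label equals the given value.
        "matchers": [
            {"name": name, "value": value, "isRegex": False, "isEqual": True}
            for name, value in matchers.items()
        ],
        "startsAt": start.isoformat(),
        "endsAt": (start + duration).isoformat(),
        "createdBy": created_by,
        "comment": comment,
    }


# Hypothetical two-hour maintenance window for one machine.
window_start = datetime(2030, 1, 15, 22, 0, tzinfo=timezone.utc)
silence = build_silence(
    {"instance": "db1.example.com:9100"},
    window_start,
    timedelta(hours=2),
    created_by="ops-team",
    comment="Planned database maintenance",
)
```

Alerts matching the silence would still fire and be visible, but notifications for them would be suppressed until the window ends.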
@PromLabs Another question: is Prometheus able to integrate with Pronto? I can't find a support article on this. So far, I have only found Prometheus integrations with SAP, MAXIMO and Oracle. Would you please advise? Cheers.
@HaziHaznidaUsolludin Sorry, I've never heard of Pronto before. If you have specific usage and integration questions like this one, I would recommend using one of the open community channels like the Prometheus Users mailing list (see prometheus.io/community/).
I’m struggling to figure out how to get cpu utilization and memory for Kubernetes jobs. Jobs don’t run indefinitely like a web application does. Additionally, the applications don’t support incoming HTTP requests. What is a good way to capture these metrics?
Hi! Since the question is a bit more involved and not directly related to the video, I'd recommend asking it on one of the public community channels (especially the users mailing list is really good at getting quick replies from people): prometheus.io/community/
@PromLabs please do an introductory video to metrics. In all honesty it’s a bit overwhelming and I think it would be great to see how Prometheus helps you get common metrics like cpu utilization, memory, etc. Understanding how Prometheus gathers metrics like this would be awesome. Thanks for your response!
Awesome video! Your channel is one of the best for getting started with Prometheus & Grafana. It really helped me get my first hands-on experience and motivated me to get my hands dirty!