Service Metrics
4 min
feature in development and not yet available documentation may change introduction a technical dashboard is added, mainly for the support team, which shows the metrics of the different containers and performance of the servers this allows a visual analysis of the current performance and can show the need for upscaling before there are actual issues it is also possible to review previous data to investigate root causes and unexpected behaviors of the containers technology prometheus & grafana stack grafana metrics dashboards grafana is an open source analytics and visualisation platform in mantsu it is used for technical monitoring of the services and infrastructure operational mes reporting will be handled by the reporting application it connects to prometheus as its primary data source and provides service health dashboards (cpu, memory, latency, error rates) infrastructure level visibility across all kubernetes nodes custom alerting rules with visual threshold indicators prometheus metrics collection prometheus is an open source time series database and monitoring system it scrapes metrics endpoints exposed by each service at regular intervals and stores them for querying and alerting prometheus is the de facto standard for kubernetes native monitoring and integrates natively with grafana prometheus alert manager alert manager is part of the prometheus ecosystem it receives alerts fired by prometheus when metric thresholds are breached and routes them to the appropriate notification channel (email, slack, teams, or similar) this enables the mantsu service team to take proactive action before issues affect production operations
