Metrics and tracingΒΆ
- setup graphite as docker service
- send simple events
- technical events vs business events
- SImple KPI reporting on top of graphite
- use that to monitor SLA violations
Done. There are more complex approaches, dtrace, new relic etc. But these require a (close to full time) dedicated DevOps staff to get real value, and the value is on the few percent from our core ops - that is when a few percent win is a Full time staff cost its worth doing.
dtrace and python - use it to track a program Similar to new relic? https://docs.python.org/dev/howto/instrumentation.html