Logging, monitoring, and alerting
One of the most important aspects of operating an OpenStack cloud is logging, monitoring and alerting (LMA). Since OpenStack isn’t your legacy bare metal based infrastructure platform it requires a different approach. The traditional LMA methods tend to fall short when considering the scale and elasticity of an OpenStack environment. Additionally, the old binary methods of alerting “this service is down”, “the resource is at 95%”, or even “file system full” messages do not deliver the depth of operational information really required to know the health of an OpenStack cloud.
While there are many different tools to actually monitor log events and create alerts from the systems that run the OpenStack infrastructure, in this chapter, we will focus more on the architectural principles that will help you choose logging, monitoring, and alerting tools.
No matter what is used to do the logging, monitoring, and alerting, from an architectural standpoint, the solution...