Logging, monitoring, and alerting
Logging, monitoring, and alerting (LMA) is one of the most important aspects of operating an OpenStack cloud. Because OpenStack is not a legacy bare-metal infrastructure platform, it requires a different approach. Traditional LMA methods tend to fall short given the scale and elasticity of an OpenStack environment. Additionally, the old binary methods of alerting ("this service is down", "the resource is at 95%", or even "filesystem full" messages) do not deliver the depth of operational information required to know the health of an OpenStack cloud. Since a cloud is an amalgamation of resources that are shared across a platform, its health is determined by the services it provides for network, storage, and compute rather than by the individual health of the underlying hardware components. In a properly configured HA architecture, OpenStack can withstand multiple underlying failures of infrastructure and only experience a decrease...
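To make the contrast with binary alerting concrete, here is a minimal sketch of service-level health evaluation. It is illustrative only: the `Service` record, the `min_nodes` threshold, and the three health states are hypothetical names chosen for this example, not part of any OpenStack API. The idea is that losing a node behind a redundant service is reported as a degradation of that service, not as an outage.

```python
from dataclasses import dataclass
from enum import Enum


class Health(Enum):
    HEALTHY = "healthy"
    DEGRADED = "degraded"
    DOWN = "down"


@dataclass
class Service:
    """A cloud service (compute, network, storage) backed by redundant nodes."""
    name: str
    total_nodes: int    # nodes deployed behind the service
    healthy_nodes: int  # nodes currently passing their checks
    min_nodes: int = 1  # assumed minimum needed to keep serving requests


def service_health(svc: Service) -> Health:
    """Grade one service by how much of its redundancy is left."""
    if svc.healthy_nodes < svc.min_nodes:
        return Health.DOWN
    if svc.healthy_nodes < svc.total_nodes:
        return Health.DEGRADED
    return Health.HEALTHY


def cloud_health(services: list[Service]) -> Health:
    """The cloud is only as healthy as its least healthy service."""
    states = [service_health(s) for s in services]
    if Health.DOWN in states:
        return Health.DOWN
    if Health.DEGRADED in states:
        return Health.DEGRADED
    return Health.HEALTHY


if __name__ == "__main__":
    services = [
        Service("compute", total_nodes=3, healthy_nodes=2),  # one node lost
        Service("network", total_nodes=3, healthy_nodes=3),
        Service("storage", total_nodes=3, healthy_nodes=3),
    ]
    for svc in services:
        print(f"{svc.name}: {service_health(svc).value}")
    print(f"cloud: {cloud_health(services).value}")
```

In this sketch a single failed compute node marks the compute service, and therefore the cloud, as degraded while it keeps serving requests. A per-host binary check would only have reported that one machine is down, without saying whether the service it backs is still available.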