Recipe Difficulty: Medium
Python Version: 2.7
Operating System: Any
Log files can quickly become quite sizable due to the level of detail and time frame preserved. As you may have noticed, the CSV report from the prior recipe can easily become too large for our spreadsheet application to open or browse efficiently. Rather than analyzing this data in a spreadsheet, one alternative would be to load the data into a database.
Splunk is a platform that incorporates a NoSQL database with an ingestion and query engine, making it a powerful analysis tool. Its database operates in a manner like Elasticsearch or MongoDB, permitting the storage of documents or structured records. Because of this, we do not need to provide records with a consistent key-value mapping to store them in the database. This is what makes NoSQL databases so useful for log analysis, as log formats...