Standardizing the date format
The date time format in the logs is different depending on the location. In the Canadian logs, the format is the standard ISO 8601 in the format YYYY-MM-DD. The logs coming from the USA use the format MM-DD-YYYY. In this recipe, we will append a new column with a standard format to unify both dates.
Getting ready
We will use the resulting CSV file from the previous recipe that receives and transforms logs in the following format:
[<Timestamp>] - SALE - PRODUCT: <product id> - PRICE: <price>
Each line will represent a sale log.
We will use the parse
module. We should install the module, adding it to our requirements.txt
file as follows:
$ echo "parse==1.14.0" >> requirements.txt
$ pip install -r requirements.txt
In the GitHub repository, there are some log files to process with the following structure:
sale_logs/
OH
logs.txt
ON
logs.txt
The code can be found in the...