With the concepts we've learned, we are going to build a malware detector with artificial neural networks. Let's begin by identifying our dataset. By now, you are familiar with the steps required to build a machine learning models. For this model, we are going to use the Portable Executable (PE) files as feeding data. As you noticed in the previous chapter, we installed a Python library called pefile. It is time to use it in this model. Once we install pefile with the Python package installation manager PIP, we can start using pefile to extract information from any PE file. In order to gather information from a PE file, import the os and pefile libraries. The os library allows you to interface with the underlying operating system that Python is running on.
As a demonstration, we are going to download a malware PE file delivered by...