Processing large datasets calls for a slightly different view of reliability. A small percentage of erroneous records is common in such datasets, and the acceptable error tolerance can only be defined by business rules. Large datasets are also generally processed on a network of computers, where failures are more common than on a single computer. In this section, we will look at the following aspects of error handling:
- Input data errors
- Processing failures
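
To make the idea of an error tolerance level concrete, the following is a minimal Python sketch, not tied to any particular framework. The function name `process_with_tolerance`, the `max_error_rate` parameter, and the sample records are assumptions made for illustration only. Bad records are set aside rather than aborting the run, and the job fails only when the share of bad records exceeds the threshold that a business rule would supply.

```python
from typing import Callable, Iterable, List, Tuple


def process_with_tolerance(
    records: Iterable[str],
    parse: Callable[[str], int],
    max_error_rate: float,
) -> Tuple[List[int], List[str]]:
    """Parse records, set bad ones aside, and fail if too many are bad.

    max_error_rate is a hypothetical business-defined threshold in [0, 1].
    """
    parsed: List[int] = []
    rejected: List[str] = []
    for record in records:
        try:
            parsed.append(parse(record))
        except ValueError:
            # An input data error: keep the raw record for later inspection.
            rejected.append(record)

    total = len(parsed) + len(rejected)
    error_rate = len(rejected) / total if total else 0.0
    if error_rate > max_error_rate:
        raise RuntimeError(
            f"error rate {error_rate:.2%} exceeds tolerance {max_error_rate:.2%}"
        )
    return parsed, rejected


# One malformed record out of four is within a 50% tolerance.
good, bad = process_with_tolerance(["1", "2", "x", "4"], int, max_error_rate=0.5)
```

Keeping the rejected records, rather than silently dropping them, makes it possible to review them afterwards and to decide whether the tolerance level itself needs adjusting.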