Big data overview
Big data has garnered an immense following in the data industry and with SQL Server's entry, it is a new ball game altogether! In this chapter, we will explore the big data phenomenon, along with the part SQL Server plays in it.
Big data processing brings several challenges. The 5Vs of big data present those challenges:
- Volume: This represents the quantity of the data. Just imagine that, during 2020, the digital universe is expected to reach the size of 40 ZB (zettabytes). The challenge here is to store, replicate, and consume such a huge volume. The next year, the volume will be larger again.
- Veracity: This represents the quality of the data, which can vary greatly. Not every information system or source of data produces the same quality data. Data can be incomplete and have missing information or it can be inconsistent, and while processing such datasets, you must implement the data cleansing process to overcome such challenges.
- Variety:...