The sheer volume of collected data can cause problems. With the accumulation of more and more data, managing and moving the data along with its underlying big data infrastructure becomes increasingly difficult. The rise of cloud providers has facilitated the ability to move applications to the data. Multiple sources of data result in increased volumes, velocity, and variety. The following are some common computer-generated data sources:
- Application server logs: Application logs and games
- Clickstream logs: From website clicks and browsing
- Sensor data: Weather, water, wind energy, and smart grids
- Images and videos: Traffic and security cameras
Computer-generated data can vary from semi-structured logs to unstructured binaries. This data source can produce pattern-matching or correlations in data that generate recommendations for social networking and online gaming in particular. You can also use computer-generated data to track applications or service behavior...