Twitter is an excellent source of random data. Given the volume and variety of data, we can readily come up with example (and real) problems to solve. In our case, we're going to build a serverless processing system by sipping off the public twitter stream. Our example system will have the following workflow:
- Read a tweet with cat or dog images from the Twitter firehose
- Place messages on an SQS queue.
- Worker processes will read those image URLs off the queue and perform image recognition.
While this example can be a bit contrived, the concepts demonstrated are true to life. We'll use the AWS Rekognition service to perform image recognition and labeling of any cat or dog images we find. Rekognition is quite fast at what it does, but it's easy to imagine processing images with a much slower service. In that case,...