Let's start the discussion from the beginning. What does a background subtraction process look like? Consider the following image:
The previous image represents the background scene. Now, let's introduce a new object into this scene:
As we can see, there is a new object in the scene. So, if we compute the difference between this image and our background model, you should be able to identify the location of the TV remote:
The overall process looks like this: