The technical purpose of Q-learning is to discover solutions for a type of optimization problem called an MDP.
When we talk about states and the actions that we can take from states, we are discussing concepts developed in the context of MDPs (and the Markov chains and other state space models that they are derived from).