Every application has a finite capacity to serve requests. That capacity describes how well the application performs and whether it continues to meet its objectives as load increases.
For many web applications, this capacity is expressed as the number of requests served in a given period of time, for example requests per second.
Making the right design decisions when you build your web application is very important, because those decisions directly affect the scalability of your service. Strike a balance: your approach should account for the services themselves, the infrastructure they run on, and any anticipated need for scaling.
Performance and scalability are two different characteristics of a system. Performance deals with the throughput the system delivers, whereas scalability deals with sustaining the desired throughput as the number of users or transactions grows.
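The distinction can be made concrete with a little arithmetic. The following is a minimal sketch, not a standard benchmark: the function names `throughput` and `scaling_efficiency`, and all the measurements, are hypothetical and chosen only for illustration. It treats performance as requests served per second, and scalability as how closely throughput keeps pace when the number of users grows.

```python
def throughput(requests_completed: int, elapsed_seconds: float) -> float:
    """Performance: requests served per second over a measurement window."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return requests_completed / elapsed_seconds


def scaling_efficiency(base_users: int, base_rps: float,
                       scaled_users: int, scaled_rps: float) -> float:
    """Scalability: throughput growth divided by load growth.

    A value of 1.0 means throughput grew in step with the number of users;
    lower values mean the system fell behind as load increased.
    """
    load_growth = scaled_users / base_users
    throughput_growth = scaled_rps / base_rps
    return throughput_growth / load_growth


# Hypothetical measurements, each taken over a 60-second window.
base_rps = throughput(6_000, 60.0)     # 100 users
scaled_rps = throughput(45_000, 60.0)  # 1,000 users

efficiency = scaling_efficiency(100, base_rps, 1_000, scaled_rps)
print(f"baseline: {base_rps} req/s, scaled: {scaled_rps} req/s")
print(f"scaling efficiency: {efficiency}")
```

With these made-up numbers, load grows tenfold while throughput grows only 7.5 times, so the efficiency is 0.75. The system performs well at low load but does not scale linearly.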