In this chapter, we learned how to connect datasets using Proc SQL instead of data steps. We explored the various types of joins that Proc SQL offers for this purpose. Having reviewed the pros and cons of connecting datasets in Proc SQL and in data steps, we found that the data step method requires the datasets to be sorted first. Data step merging can therefore be a good alternative for smaller datasets, but it may lead to processing delays on large datasets because of the sorting requirement.
We also reviewed how to create data subsets and summarize data. We worked through an example that used the WHERE, GROUP BY, and HAVING clauses together to highlight the role of each clause. In previous chapters, we touched upon the concept of Dictionary tables and Columns. In this chapter, we looked at an exhaustive list of options available to leverage...