Chapter 4
Data transfers direct from the data provider must be stored in a highly secure environment because they often contain sensitive data that is necessary to have during transformation, but will be removed before the data is loaded into the warehouse, mart, or lake.
Maintaining modular, systematically named code files, rather than one long code file, allows the opportunity for manual rollback. This makes troubleshooting to repair variables easier if a problem is identified after processing.
The main advantage of naming these variables
COST1
,COST2
, and so on up toCOST10
is that they could be easily declared in an array by using the rangeCOST1-COST10
. The main disadvantage of renaming these variables with names such asCOST1
andCOST2
is that the variable names would become unintuitive and documentation would be needed in order to understand what they mean.Formats can be created in SAS using
PROC FORMAT
and saved outside of the data step code. In contrast,...