Technical requirements
All the code is available in this book’s GitHub repository https://github.com/PacktPublishing/Practical-Machine-Learning-on-Databricks and is self-contained. To execute the notebooks, you can import the code repository directly into your Databricks workspace using repos. We discussed repos in our previous chapters.
This chapter also assumes that you have a preliminary understanding of what user-defined functions are in Apache Spark. You can read more about them here: https://docs.databricks.com/en/udf/index.html.