- Which variables of the heart dataset are suitable for regression?
Actually, all of them. But the ideal ones are those that are real-valued.
- Does the scaling of the data change the distribution of the data?
No. The distribution remains the same. Statistical metrics such as the mean and variance may change, but the distribution remains the same.
- What is the main difference between supervised and unsupervised dimensionality reduction methods?
Supervised algorithms use the target labels, while unsupervised algorithms do not need that information.
- When is it better to use batch-based dimensionality reduction?
When you have very large datasets.