This case-study examines the patterns, symmetries, associations and causality in a rare but devastating disease, amyotrophic lateral sclerosis (ALS). A major clinically relevant question in this biomedical study is: What patient phenotypes can be automatically and reliably identified and used to predict the change of the ALSFRS slope over time?. This problem aims to explore the data set by unsupervised learning.
- Load and prepare the data.
- Perform summary and preliminary visualization.
- Train a k-Means model on the data, experiment at least two different k values, and explain which k value is a better choice.
- Evaluating the model performance by report the center of clusters.
- Visualize the final clustering result.
Submit Python code, report that explains the k experiment, performance evaluation, and visualizations.