Using the data from the previous assignment:
Required:
- Define your dependent and independent variables: pick the column you want to predict (Y) and a column to use as your feature (X)
- Y should be something that has a hope of being linear in X. In the last assignment we saw that not much correlated with tomorrow’s percent change, so that would be a bad Y. Try predicting low, high, volume, etc.
- Split your data into training and testing sets. Use 20% of the data for testing
- Train (fit) a Linear Regression model
- Report the squared error (for example, the mean squared error on the test set)
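One way the required steps could fit together is sketched below. It assumes scikit-learn is the library in use, and it generates made-up `high` and `low` values as stand-in data so the snippet runs on its own; those arrays, the seed, and the choice of predicting low from high are illustrative assumptions, not part of the assignment. Swap in columns from your own data.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Hypothetical stand-in data. With a real DataFrame `df` you might use
# X = df[["high"]].to_numpy() and y = df["low"].to_numpy() instead.
rng = np.random.default_rng(0)
high = rng.uniform(100.0, 200.0, size=200)
low = 0.97 * high + rng.normal(0.0, 1.0, size=200)

X = high.reshape(-1, 1)  # scikit-learn expects a 2-D feature array
y = low

# Hold out 20% of the rows for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Fit the model on the training rows only.
model = LinearRegression()
model.fit(X_train, y_train)

# Score on the held-out rows using the mean squared error.
y_pred = model.predict(X_test)
mse = mean_squared_error(y_test, y_pred)
print(f"slope={model.coef_[0]:.3f}, intercept={model.intercept_:.3f}, test MSE={mse:.3f}")
```

Note that `train_test_split` shuffles rows by default. For time-ordered price data you may prefer `shuffle=False`, so the test set is the most recent 20% of rows.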
For the heck of it if you want:
- Use pyplot to plot your X and Y values, along with your prediction
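A minimal sketch of the optional plot, again using made-up stand-in data and assuming scikit-learn for the fit. The variable names and the output filename `regression_fit.png` are hypothetical choices for illustration.

```python
import matplotlib.pyplot as plt
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical stand-in data; swap in your own X and Y columns.
rng = np.random.default_rng(0)
x = rng.uniform(100.0, 200.0, size=200)
y = 0.97 * x + rng.normal(0.0, 1.0, size=200)

model = LinearRegression().fit(x.reshape(-1, 1), y)

# Sort x so the prediction is drawn as a clean left-to-right line.
x_line = np.sort(x)
y_line = model.predict(x_line.reshape(-1, 1))

fig, ax = plt.subplots()
ax.scatter(x, y, s=10, alpha=0.6, label="actual")         # raw X and Y values
ax.plot(x_line, y_line, color="red", label="prediction")  # fitted line
ax.set_xlabel("X (feature)")
ax.set_ylabel("Y (target)")
ax.legend()
fig.savefig("regression_fit.png")  # or plt.show() in an interactive session
```

If the points hug the red line, the pair of columns you picked is a reasonable candidate for a linear model. A shapeless cloud suggests trying a different Y.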