Linear regression

 

Using Preview the document from the previous assignment:

Required:

  • Define your dependent and independent variables (pick the column you want to predict and a column for your feature(X))
    • This should be something that has a hope of being linear, from the last assignment we saw not much correlated with tomorrow’s percent change. So that would be a bad Y. Try predicting low,high,volume, etc
  • Split your data into training and testing sets. Use 20% of the data for testing
  • Train(fit) a Linear Regression model
  • Report the square error

For the heck of it if you want:

Use pyplot to plot your X and Y values, along with your prediction