Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create HoustonJ_sub2.csv #160

Merged
merged 1 commit into from
Jan 29, 2017
Merged

Create HoustonJ_sub2.csv #160

merged 1 commit into from
Jan 29, 2017

Conversation

HoustonJ2013
Copy link
Contributor

submitted the prediction result.

@kwinkunks kwinkunks merged commit c363cb7 into seg:master Jan 29, 2017
@kwinkunks
Copy link
Member

Hello! Thanks for this. I laughed the the 'brutal grid search' comment. XGBoost is worth it in the end though: this was pretty dang good and scores 0.584. Cheers!

@kwinkunks
Copy link
Member

Oh, btw, I haven't taken the time yet to look at this in detail, but you might want to double-check that your final model is trained with all the training data. My quick glance made me think you might just have the last model from that 'brutal' loop --- and it only has a subset for cross-validation. The model might (should) improve if you retrain it before making the prediction. Does that make sense? (I hope I'm not misreading your code).

@kwinkunks
Copy link
Member

Hello again. I got a chance to try this:

y_pred = clf.fit(X, y).predict(X_test)
np.save('y_pred.npy', y_pred)

This vector scores 0.600 with your hyperparameters, so I'll use that as your high score.

If you're still in the top 10 on Wednesday, I'll also be scoring all these stochastic models with the media score from 100 realizations, using different random seeds. See issue #114

@HoustonJ2013
Copy link
Contributor Author

HoustonJ2013 commented Jan 30, 2017 via email

@kwinkunks
Copy link
Member

Gresat story, thanks for sharing. The next few years will be very exciting! I'm so glad you were able to get involved in this contest.

You might like to know about the Slack group 'Software Underground' -- a chat group for about 250 geoscience/code folks all over the world. Please join if you like at http://swung.rocks/. Lots of machine learning chat, lots of Pythonistas. Including another chap (Gram Ganssle) who's interested in reproducing Ben Bougher's work.

Cheers! Matt

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants