Favourite data science platform?

Anyone using things like H2O and similar products? What do you like and dislike?

Uber GMie27 Jul 13, 2017

For non-tree ML models, use Spark mllib. For tree model, use XGBoost. Stay away from H2O.

ARM hpQW77 OP Jul 13, 2017

Thanks for the info. Any specific reasons you dislike the H2o platform? Isn't it just a user friendly interface to spark?

Uber GMie27 Jul 13, 2017

No, it's scalability and reliability are no match for Spark.

Microsoft rainier1 Jul 13, 2017

keras

Comcast goldenmonk Jul 16, 2017

Dislike H2o. Spark is nice on distributed data and for pipeline work, but the modeling in mllib kind of stinks. SAS while hated by open sources is really excellent for the modeling, but stay away from it for "deployment"