Has anyone been assigned a task that requires you to build a model, where the available training data isn’t great (lots of nulls, features not correlated with target)? What do you do when you just can’t achieve good performance metrics on a model?
Lots of data cleanup
You are talking about 90% of the real world DS problems😛
Does every data scientist have a therapist on standby or what
You might already know, but it will help to understand how the data was collected. If you know what the underlying distribution looks like, you can run a Monte Carlo simulation to simulate your responses and use that to build your model. Always know how much random effect is affecting your dataset before you simulate your responses. Extensive EDA combined with domain knowledge will help you do wonders. Good luck
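Not necessarily this commenter's exact setup, but a rough sketch of the Monte Carlo idea, assuming you already have a reasonable guess at the feature distributions and noise level from how the data was collected. The distributions, coefficients, and `noise_sd` below are made-up placeholders:

```python
# Sketch: simulate responses from assumed distributions, then see how well a model
# can possibly do at that noise level. All numbers here are illustrative placeholders.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(42)
n = 5_000

# Simulate features from the assumed underlying distributions
x1 = rng.normal(loc=50, scale=10, size=n)   # e.g. a roughly normal measurement
x2 = rng.exponential(scale=2.0, size=n)     # e.g. a skewed, count-like feature

# Simulate responses with a chosen amount of irreducible noise
noise_sd = 5.0
y = 0.8 * x1 - 1.5 * x2 + rng.normal(scale=noise_sd, size=n)

X = np.column_stack([x1, x2])
model = LinearRegression()

# Cross-validated R^2 on the simulated data acts as a ceiling: even a correctly
# specified model can't beat what the noise level allows.
scores = cross_val_score(model, X, y, cv=5, scoring="r2")
print(f"Simulated-data R^2: {scores.mean():.3f} ± {scores.std():.3f}")
```

If the best score on simulated data at a realistic noise level is close to what you're seeing on the real data, the limit is the signal-to-noise ratio, not your model.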
What methods are you using to figure out how much random effect is affecting your dataset?
This means you do not have adequate data. You need to go out and get better data.
How to predict stock prices?
Talk to the stakeholders and explain the situation
Have you tried feature engineering? Maybe you can derive new features from the existing ones that correlate better with the target.
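For what it's worth, a minimal sketch of that idea, assuming a tabular dataset read with pandas. The file name and columns (`price`, `qty`, `signup_date`, `target`) are hypothetical stand-ins for whatever is in your data:

```python
# Derive candidate features from existing columns and rank them by how strongly
# they correlate with the target. Column names are placeholders.
import numpy as np
import pandas as pd

df = pd.read_csv("training_data.csv")  # hypothetical file

# Common derived features: interactions, ratios, and dates turned into numbers
df["revenue"] = df["price"] * df["qty"]
df["price_per_unit"] = df["price"] / df["qty"].replace(0, np.nan)
df["signup_month"] = pd.to_datetime(df["signup_date"]).dt.month

# Rank raw and derived numeric features by absolute correlation with the target
numeric = df.select_dtypes("number").drop(columns=["target"])
corr = numeric.corrwith(df["target"]).abs().sort_values(ascending=False)
print(corr.head(10))
```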
I'd probably break down the features that go into the model and how they're derived from the data. Just univariate tests showing predictive value would be enough to demonstrate feasibility. After that, figure out next steps. From the sound of it, you should come up with a plan to improve data collection. If that seems like a dead end, then look to other avenues to add value to the business.
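One way those univariate checks could look, as a sketch rather than a prescription: score each numeric feature on its own against the target using `f_regression` and `mutual_info_regression` from scikit-learn. The file and column names are placeholders, and the median imputation is just to keep the example self-contained:

```python
# Univariate feature screening: each feature is scored independently against the
# target, which is a quick way to show whether there's any signal to build on.
import pandas as pd
from sklearn.feature_selection import f_regression, mutual_info_regression

df = pd.read_csv("training_data.csv")  # hypothetical file

# Numeric features only, with nulls filled so the tests can run
X = df.drop(columns=["target"]).select_dtypes("number")
X = X.fillna(X.median())
y = df["target"]

f_stat, p_values = f_regression(X, y)               # linear association per feature
mi = mutual_info_regression(X, y, random_state=0)   # also picks up nonlinear signal

report = pd.DataFrame(
    {"f_stat": f_stat, "p_value": p_values, "mutual_info": mi},
    index=X.columns,
).sort_values("mutual_info", ascending=False)
print(report)
```

If nothing shows meaningful signal here, that's concrete evidence for the stakeholder conversation about improving data collection.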
Garbage in, garbage out