Symantecyoulearn

Full Stack interview questions

Hi Folks, I am looking for resources to practice real-time . full stack interview questions related to Data engineering.. i.e; ( Data modeling, product sense, data visualization, python coding ) for real world business cases Any pointers would help,, Thank you !

Add a comment
Google 🐇 . . . Jun 19, 2018

what does full stack data engineering mean?!?!

C.H. Robinson iGUX33 Jun 19, 2018

This is the exact outline for the FB data engineer interview. Is that what you’re interviewing for?

Symantec youlearn OP Jun 24, 2018

Yes it for the FB Data ENgg role

C.H. Robinson iGUX33 Jun 24, 2018

The interview prep material they give you does a pretty good job of outlining what to expect. I’m not aware of any good end to end practice material per se. It’s not the contrived problem solving that you tend to see on Leetcode and given for the SWE interviews. If you have some experience in this space it should feel much more like a conversation with a colleague and reflect a problem you may actually be solving in real life. I would encourage you not to overthink the problems. Time is limited and there’s a lot to go through so it tends to be fairly straight forward. You’ll talk about some important metrics, look at a basic table structure, write some SQL around it like “calculate the average value for X”, “find the count of X that satisfies Y condition”. For the coding you’ll be working with a similar set of data except instead of it being a table it’d be a stream. You’ll calculate one of those metrics using code off the stream of data instead of SQL and a table.

Microsoft bitterbyte Jun 19, 2018

If you’re referring to the total journey, from problem definition, data acquisition, feature engineering, training, evaluation, and deployment, I recommend the following book. It’s got an interesting theoretical journey of a problem and how data is obtained and used properly as well as the journey of visualizing, discovering errors and correcting for them, etc. The domain is in prediction of housing costs in various neighborhoods for a real estate company. The data is totally unclean, the problem ill-defined, and progresses through many stages to build a functioning pipeline that isn’t just a mess of “I tried this and that.” http://shop.oreilly.com/product/0636920052289.do

Symantec youlearn OP Jun 24, 2018

Isn’t this the book for machine learning concepts?