Hi Folks, I am looking for resources to practice real-time . full stack interview questions related to Data engineering.. i.e; ( Data modeling, product sense, data visualization, python coding ) for real world business cases Any pointers would help,, Thank you !
This is the exact outline for the FB data engineer interview. Is that what youâre interviewing for?
Yes it for the FB Data ENgg role
The interview prep material they give you does a pretty good job of outlining what to expect. Iâm not aware of any good end to end practice material per se. Itâs not the contrived problem solving that you tend to see on Leetcode and given for the SWE interviews. If you have some experience in this space it should feel much more like a conversation with a colleague and reflect a problem you may actually be solving in real life. I would encourage you not to overthink the problems. Time is limited and thereâs a lot to go through so it tends to be fairly straight forward. Youâll talk about some important metrics, look at a basic table structure, write some SQL around it like âcalculate the average value for Xâ, âfind the count of X that satisfies Y conditionâ. For the coding youâll be working with a similar set of data except instead of it being a table itâd be a stream. Youâll calculate one of those metrics using code off the stream of data instead of SQL and a table.
If youâre referring to the total journey, from problem definition, data acquisition, feature engineering, training, evaluation, and deployment, I recommend the following book. Itâs got an interesting theoretical journey of a problem and how data is obtained and used properly as well as the journey of visualizing, discovering errors and correcting for them, etc. The domain is in prediction of housing costs in various neighborhoods for a real estate company. The data is totally unclean, the problem ill-defined, and progresses through many stages to build a functioning pipeline that isnât just a mess of âI tried this and that.â http://shop.oreilly.com/product/0636920052289.do
Isnât this the book for machine learning concepts?
what does full stack data engineering mean?!?!