I have an interview scheduled at Amazon. They are looking for below requirement. “The ideal candidate is an independent Data Engineer who can source data, cleanse, analyze, refine, enrich, model, present, automate and document our business data pipelines. “ I am really not sure what type of candidate they are looking for and what I should prepare for. Any pointers would be helpful.
Perhaps sql will be examined
Google it and do your research to understand the process. Let us know how it went! Good luck!
For our team, out data engineers must be know SQL, from built in functions to fast running sql. Also you must know how to build scalable data etl pipelines, how to optimize DB and schema design. How to look and analyze underlying data to provide business insights and do data quality checks. A plus would be knowing one big data tool like spark. AWS like redshift and Emr is also a plus
I bet they want to a candidate who is really good with data analysis