Can someone share interesting Data Engineering project examples on YouTube, Medium, GitHub, Kaggle, etc. I'm looking for best practices and inspiration. Thanks! T.C. 150k
stack exchange updates their full db every once for a while. Could make an interesting work.
GitHub
One time, back before Elasticsearch had X-Pack as a basic feature, I downloaded all the IP spaces of all the publically facing Elastic stacks. Pulled the data. Elastic turned on X-Pack for free after that. Fun times, fun finds. Do something similar, mass data aggregation from tens of thousands of different sources.