fifilura 5 hours ago
No joins in that article? The comments here smell of "real engineers use the command line". But I am not sure they ever actually worked with analysing data more than using it as a log parser. Yes, Hadoop is 2014. These days you obviously don't set up a Hadoop cluster. You use the managed service your cloud provider offers (BigQuery or AWS Athena, for example). Or map your data into DuckDB or use Polars if it is small.
ziml77 3 hours ago
> But I am not sure they ever actually worked with analysing data more than using it as a log parser.

It really feels that way. Real data analysis involves a lot more than just grepping logs. And the reason to be wary of starting out unprepared for that kind of analysis is that migrating to a better solution later is a nightmare.