Remix clone Hacker News


	▲	rennokki 5 hours ago
		> Uses jq for TB json files
		> Hadoop: bro
		> Spark: bro
		> hive: bro
		> data team: bro
	▲	f311a 2 hours ago
		jq is very convenient, even if your files are larger than 100 GB. I often need to extract one field from huge JSON Lines files, so I just pipe them through jq to get the results. It's slower than a purpose-built pipeline, but implementing proper data processing would take more time.
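For readers who have not done this kind of one-off extraction, here is a minimal Python sketch of the same idea: stream a JSON Lines input one record at a time and pull out a single field, so memory use stays flat regardless of file size. The function name `extract_field` and the sample field `user` are made up for illustration and do not come from the comment above. Unlike jq's `.user` filter, which prints `null` when the key is absent, this sketch skips such records.

```python
import io
import json
from typing import Any, Iterable, Iterator


def extract_field(lines: Iterable[str], field: str) -> Iterator[Any]:
    """Yield `field` from each JSON Lines record, reading one line at a time."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        # Assumption: records missing the field are skipped, not reported.
        if isinstance(record, dict) and field in record:
            yield record[field]


if __name__ == "__main__":
    # Hypothetical sample data standing in for a huge file on disk.
    sample = io.StringIO(
        '{"id": 1, "user": "a"}\n'
        "\n"
        '{"id": 2}\n'
        '{"id": 3, "user": "b"}\n'
    )
    print(list(extract_field(sample, "user")))  # prints ['a', 'b']
```

Because an open file object is itself an iterable of lines, the same function can be handed `open(path, encoding="utf-8")` to process a file that does not fit in memory.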
	▲	anonymoushn 2 hours ago
		are those tools known for their fast json parsers?