You are viewing a single comment's thread from:

RE: August Hive Data - Word Count, Effort, Reward, Rambling

Neat way to express the data. I love this kind of stuff. If you do scale these back, I'd be really interested to see maybe quarterly data sets instead of monthly, if that's not a hassle and you think it's worth exploring. Of course, if that doesn't interest you then no worries!

Sort:  

The two months of data I've got stored locally so far is about 5.7GB - its an enormous data set. I have to chunk the queries to avoid overloading HiveSQL at the moment, so quarterly would be another chunking exercise ;P

I know there are other solutions to grabbing the data - but at the moment, I really want to focus on my stuff, and not productionising data. If I manage to get my own hive witness node up and running and have the data available locally, then that might be another story all together :)

Looks like the entire history of the chain is something like 5-600GB :D Gonna take a while to sync!

Oh dang haha yeah that totally makes sense! Following your own motivation is definitely better than dealing with that!