Open Source Big Data Tools


Big data is a promising field that is attracting a lot of attention from industry. Various tools can help you process data when you are doing big data analytics.

The best-known characteristics of big data are Volume, Velocity, Variety, Veracity, and Value. The following big data tools try to tackle one or more of these characteristics.

Apache Flink

Apache Flink is an open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.
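
To give a feel for what stream processing means, here is a minimal sketch in plain Python. It does not use the Flink API. It only illustrates the idea of counting events in fixed, non-overlapping time windows and emitting each result as soon as its window closes. The function name, the sample events, and the assumption that events arrive in timestamp order are all made up for illustration.

```python
def tumbling_window_counts(events, window_seconds):
    """Yield (window_start, counts) each time a fixed-size window closes.

    `events` is an iterable of (timestamp_seconds, key) pairs, assumed
    to arrive in timestamp order (a simplification for this sketch).
    """
    current_start = None
    counts = {}
    for timestamp, key in events:
        # Align the timestamp to the start of the window it belongs to.
        window_start = (timestamp // window_seconds) * window_seconds
        if current_start is not None and window_start != current_start:
            # A new window has begun, so emit the finished one.
            yield current_start, counts
            counts = {}
        current_start = window_start
        counts[key] = counts.get(key, 0) + 1
    if current_start is not None:
        # Emit the last, still-open window when the input ends.
        yield current_start, counts


# Hypothetical click events: (timestamp in seconds, page name).
events = [(1, "home"), (4, "cart"), (7, "home"), (12, "home"), (14, "cart")]
results = list(tumbling_window_counts(events, window_seconds=10))
for window_start, counts in results:
    print(window_start, counts)
```

A real stream processor also has to deal with late or out-of-order events, state that survives failures, and work spread across many machines, which this sketch leaves out.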

Apache Spark

Apache Spark is a fast and general engine for large-scale data processing.

Apache Storm

Apache Storm is a free and open source distributed realtime computation system. 

Apache Hadoop

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing.
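
Hadoop is best known for the MapReduce programming model. The sketch below imitates the three steps of a word count (map, shuffle, reduce) in plain Python on a single machine. It does not use any Hadoop API, and the function names and sample lines are made up for illustration. On a real cluster each step would run in parallel across many nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Combine all values for one key into a single result."""
    return word, sum(counts)


# Hypothetical input lines standing in for a large distributed file.
lines = ["big data tools", "big data analytics"]
mapped = [pair for line in lines for pair in map_phase(line)]
word_counts = dict(reduce_phase(word, counts) for word, counts in shuffle(mapped).items())
print(word_counts)
```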

Apache Kafka

Apache Kafka is used for building real-time data pipelines and streaming apps.


