Open Source BigData Platforms

Product Name Description
Hadoop Used for Large-Scale Processing data.
Apache Spark This is one of the big data tools
Apache Storm This is for processing unbounded data stream.
Cassandra From Apache, it is a distributed type database to manage large set of data across servers
RapidMiner Used for Data Science Activities.
MongoDB Open source NoSQL database
R Programming Tool Big Data Analytics on Statistical Analysis of Data.
Neo4j Graph data in Big Data Analytics
Apache SAMOA Big Data Tool used for distributed streaming algorithms for Big Data mining
HPCC
  • Thor: For Batch Oriented manipulation and analytics.
  • Roxie: For Real-Time Data Delivery and analytics.