140 Hadoop HDFS Reviews

I appreciate the high data availability and the data processing speed. There is no latency compared to other file systems, and you don't need to worry about node failures. I am one of the early users of HDFS, and the features it has gained over time are amazing, making the system holistically more robust. Review collected by and hosted on G2.com.
Lack of real-time analysis. We need extensive processing and data cleaning before we can do any analytics. It would have been great if there were some built-in intelligence that allowed at least high-level analytics.

Hadoop HDFS was developed using a distributed file system design. It holds large amounts of data and provides easy access. Hadoop stores huge data files across multiple machines. This is the best thing about Hadoop HDFS.
There is nothing to dislike. I would recommend Hadoop HDFS to enterprises.
Hadoop HDFS is the best solution for enterprises. It is an enterprise-grade solution.
It scales well with large datasets and provides high throughput for storing large files. It is open source, and the community is vibrant and helpful in resolving issues. Plugins are available in multiple languages for working with HDFS.
It's not fully HA even with a 3-node HA deployment. If both NameNode servers go down, the whole HDFS cluster goes down. Throughput decreases when there are a lot of small files. The data is also not stored evenly across all the DataNodes, though I'm not sure whether that is related to our deployment configuration.
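The small-file complaint above can be made concrete with a rough back-of-the-envelope sketch. It assumes the NameNode tracks one in-memory object per file and one per block, a 128 MiB block size (the default `dfs.blocksize` in Hadoop 2 and later), and roughly 150 bytes of heap per object. The 150-byte figure is a commonly quoted rule of thumb, not a measured value, and the function names are hypothetical.

```python
import math

BLOCK_SIZE = 128 * 1024**2  # 128 MiB, default dfs.blocksize in Hadoop 2+
BYTES_PER_OBJECT = 150      # ASSUMPTION: rough rule of thumb, not a measurement


def namenode_objects(num_files: int, file_size: int) -> int:
    """Count metadata objects: one per file plus one per block of each file."""
    blocks_per_file = math.ceil(file_size / BLOCK_SIZE)
    return num_files * (1 + blocks_per_file)


def estimated_heap_bytes(num_files: int, file_size: int) -> int:
    """Estimate NameNode heap needed for the metadata (illustrative only)."""
    return namenode_objects(num_files, file_size) * BYTES_PER_OBJECT


MIB, GIB = 1024**2, 1024**3

# The same 1 TiB of data stored two ways.
small = namenode_objects(num_files=1024**2, file_size=1 * MIB)  # 1 MiB files
large = namenode_objects(num_files=1024, file_size=1 * GIB)     # 1 GiB files

print(f"1 MiB files: {small:,} objects")
print(f"1 GiB files: {large:,} objects")
```

Under these assumptions, the same volume of data needs a couple of hundred times more metadata objects when it is stored as 1 MiB files rather than 1 GiB files, which is one plausible reason throughput drops as small files accumulate.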

The best part about Hadoop HDFS is that it replicates data, and its architecture is easy to learn and implement. It helps process larger files in a short period of time, which helps us improve performance.
Managing or developing complex applications can be challenging. The main issue is that it supports only batch processing, not real-time data processing, and it is not suitable for small amounts of data.
HDFS ensures high fault tolerance through data replication across multiple nodes in the cluster, typically maintaining three copies of each data block. This replication strategy guarantees data availability in case of node failures, as HDFS automatically redirects requests to alternative replicas, thus preserving data integrity and reliability.
HDFS is primarily designed for batch processing and may not be the best choice for low-latency or real-time processing needs. Its data replication and block-based storage model introduce inherent delays in data access and processing, which diminish its suitability for real-time analytics or interactive queries.
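The replication behaviour described in this review can be illustrated with a small simulation. This is a minimal sketch, not HDFS code: it places replicas on random distinct nodes rather than using HDFS's rack-aware placement policy, and the node names and function names are made up for illustration.

```python
import random

REPLICATION_FACTOR = 3  # the "three copies of each data block" from the review


def place_blocks(num_blocks, nodes, replication=REPLICATION_FACTOR, seed=0):
    """Assign each block to `replication` distinct nodes.

    ASSUMPTION: uniform random placement, a simplification of the
    rack-aware policy a real cluster would use.
    """
    rng = random.Random(seed)
    return {b: set(rng.sample(nodes, replication)) for b in range(num_blocks)}


def readable_blocks(placement, failed_nodes):
    """Return the ids of blocks with at least one replica on a live node."""
    return {b for b, replicas in placement.items() if replicas - failed_nodes}


def raw_storage_bytes(logical_bytes, replication=REPLICATION_FACTOR):
    """Raw disk consumed once every block is stored `replication` times."""
    return logical_bytes * replication


nodes = [f"dn{i}" for i in range(1, 7)]  # six hypothetical DataNodes
placement = place_blocks(num_blocks=100, nodes=nodes)

# With three replicas on distinct nodes, losing any two nodes
# always leaves at least one copy of every block.
survivors = readable_blocks(placement, failed_nodes={"dn1", "dn2"})
print(f"{len(survivors)} of {len(placement)} blocks still readable")
```

The trade-off is storage: under this model, 1 TB of logical data occupies 3 TB of raw disk, as `raw_storage_bytes` shows.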

It completely changed our processing speed. As a distributed file system, it enables parallel processing, which solved our problem of processing terabytes of data through horizontal scaling.
It is a very costly setup, as it needs nodes, each of which is basically a machine with its own CPU, so it takes a lot of money to set up the cluster. There is also the small-file problem: storing small files is problematic because of how the metadata is handled.

HDFS has been around for a while now. There is a plethora of documentation and community support available.
Interacting with Kerberos-authenticated HDFS can be esoteric and hard to understand initially.

Fault tolerance: when a DataNode fails, another replica can be used, since the replication factor is 3.
It supports only batch processing; it cannot produce real-time output.

Hadoop is a widely used open-source framework for storing and processing large data sets because of its scalability, cost efficiency, fault tolerance, etc.
Hadoop has some disadvantages, such as its complexity, steep learning curve, and high hardware requirements.

Hadoop is very fast and flexible with data, which helps when implementing and improving data quality.
In my experience, Hadoop has no disadvantages.