What Is Hadoop
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:
Hadoop Core, our flagship sub-project, provides a distributed filesystem (HDFS) and support for the MapReduce distributed computing metaphor.
HBase builds on Hadoop Core to provide a scalable, distributed database.
Hadoop is a an open source Apache project written in Java and designed to provide users with two things:
a distributed file system (HDFS) and a method for distributed computation.
It’s based on Google’s published Google File System and MapReduce concept which discuss how to build a framework capable of executing intensive computations across tons of computers.
Something that might, you know, be helpful in building a giant search index. Read the Hadoop project description and wiki for more information and background on Hadoop.