What Is Hadoop

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:

Hadoop Core, our flagship sub-project, provides a distributed filesystem (HDFS) and support for the MapReduce distributed computing metaphor.

HBase builds on Hadoop Core to provide a scalable, distributed database.


Hadoop is a an open source Apache project written in Java and designed to provide users with two things:
 a distributed file system (HDFS) and a method for distributed computation.

 It’s based on Google’s published Google File System and MapReduce concept which discuss how to build a framework capable of executing intensive computations across tons of computers. 

Something that might, you know, be helpful in building a giant search index. Read the Hadoop project description and wiki for more information and background on Hadoop.

有人有興趣嗎?
arrow
arrow
    全站熱搜

    Oscar 發表在 痞客邦 留言(2) 人氣()