Hadoop Distributed File System (HDFS) is an Apache project. It's a file system which is used to store the initial and 'reduced' data once the data is processed using MapReduce. Google File System (GFS) was the database created by Google initially to store the website indexing data for the search engine.
HDFS is based on GFS.
个人认为,GFS与HDFS的不同是由于“专”与“通”的区别。众所周知,Hadoop是一个开源软件/框架,在设计之初就考虑到了用户(面向世界上的所有个人、企业)在需求上的差异,比如数据密集型(如淘宝的数据存储)、计算密集型(百度的PR算法)、混合型等等。而GFS在设计之初就对目标比较明确,都是Google的嘛,因此GFS可以对其主要功能进行性能上的优化。
具体的请参考附件,因为有图发不上来,只能用附件。
附件:
gfs和hdfs文件系统的区别.docx (127.56 KB)
收起