The Baidu File System (BFS) is a distributed file system designed to support real-time applications. Like many other distributed file systems, BFS is highly fault-tolerant. Unlike most of them, BFS provides low read/write latency while maintaining high throughput. Together with Galaxy and Tera, BFS supports many real-time products at Baidu, including the Baidu webpage database, the Baidu incremental indexing system, and the Baidu user behavior analysis system.
Features
Continuous availability
The Nameserver is implemented as a Raft group, so there is no single point of failure.
High throughput
A high-performance data engine maximizes I/O utilization.
Low latency
Global load balancing and slow-node detection.
Linear scalability
Supports multi-data-center deployment and up to 10,000 data nodes.
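To make the continuous-availability claim concrete, the following minimal sketch shows the majority arithmetic that any Raft group relies on: the group keeps working as long as a majority of its members can still reach each other. The function names `raft_quorum` and `tolerated_failures` are hypothetical helpers for illustration and are not part of BFS; the group sizes shown are examples, not BFS's actual Nameserver configuration.

```python
def raft_quorum(group_size: int) -> int:
    """Smallest majority of a Raft group with `group_size` voting members."""
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    return group_size // 2 + 1


def tolerated_failures(group_size: int) -> int:
    """Members that can fail while a majority is still reachable."""
    return group_size - raft_quorum(group_size)


if __name__ == "__main__":
    # Illustrative group sizes only (assumption, not taken from BFS).
    for n in (1, 3, 5):
        print(f"members={n} quorum={raft_quorum(n)} "
              f"tolerated_failures={tolerated_failures(n)}")
```

Under this arithmetic a single-member group tolerates no failures, which is why a replicated Nameserver needs at least three members before the loss of one node stops being an outage.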