uber / kraken
- четверг, 7 марта 2019 г. в 00:17:26
Go
P2P docker registry capable of distributing TBs of data in seconds
Kraken is a P2P-powered docker registry that focuses on scalability and availability. It is designed for docker image management, replication and distribution in a hybrid cloud environment. With pluggable backend support, Kraken can easily integrate into existing docker registry setups as the distribution layer.
Kraken has been in production at Uber since early 2018. In our busiest cluster, Kraken distributes more than 1 million blobs per day, including 100k 1G+ blobs. At its peak production load, Kraken distributes 20K 100MB-1G blobs in under 30 sec.
Below is the visualization of a small Kraken cluster at work:
Following are some highlights of Kraken:
The high level idea of Kraken is to have a small number of dedicated hosts seed content to a network of agents running on each host in the cluster. A central component, the tracker, will orchestrate all participants in the network to form a pseudo-random regular graph. Such a graph has high connectivity and small diameter, so all participants in a reasonably sized cluster can reach > 80% of max upload/download speed in theory, and performance doesn't degrade much as the blob size and cluster size increase.
The following data is from a test where a 3G Docker image with 2 layers is downloaded by 2600 hosts concurrently (5200 blob downloads), with 300MB/s speed limit on all agents (using 5 trackers and 5 origins):
All Kraken components can be deployed as docker containers. To build the docker images:
$ make images
To start a herd container (which contains origin, tracker, build-index and proxy) and two agent containers with development configuration:
$ make devcluster
Docker-for-Mac is required for making dev-cluster work on your laptop. For more information on devcluster, please check out devcluster README. For information about how to configure and use Kraken, please refer to the documentation.
Dragonfly cluster has one or a few "supernodes" that coordinates transfer of every 4MB chunk of data in the cluster. While the supernode would be able to make optimal decisions, the throughput of the whole cluster is limited by the processing power of one or a few hosts, and the performance would degrade linearly as either blob size or cluster size increases.
Kraken's tracker only helps orchestrate the connection graph, and leaves negotiation of actual data transfer to individual peers, so Kraken scales better with large blobs. On top of that, Kraken is HA and supports cross cluster replication, both are required for a reliable hybrid cloud setup.
Kraken was initially built with a BitTorrent driver, however we ended up implementing our own P2P driver based on BitTorrent protocol to allow for tighter integration with storage solutions and more control over performance optimizations.
Kraken's problem space is slightly different than what BitTorrent was designed for. Kraken's goal is to reduce global max download time and communication overhead in a stable environment, while BitTorrent was designed for an unpredictable and adversarial environment, so it needs to preserve more copies of scarce data and defend against malicious or bad behaving peers.
Despite the differences, we re-examine Kraken's protocol from time to time, and if it's feasible, we hope to make it compatible with BitTorrent again.
docker pull
. To actually speed up docker pull
, consider
switching to Makisu to improve layer reusability at build time, or
tweak compression ratios, as docker pull
spends most of the time on data decompression.latest
tag) is allowed, however a few things will not work: tag
lookups immediately afterwards will still return the old value due to Nginx caching, and replication
probably won't trigger. We are working on supporting this functionality better. If you need tag
mutation support right now, please reduce cache interval of build-index component. If you also need
replication in a multi-cluster setup, please consider setting up another docker registry as Kraken's
backend.Please check out our guide.
To contact us, please join our Slack channel.