Incremental Updates of HDFS Data Using HDFS Snapshots and DistCp - Qiita

[Cloudera Engineering Blog](https://blog.cloudera.com/blog/2015/12/distcp-performance-improvements-in-apache-hadoop/...