Abstract
Deep learning based artificial intelligence has made many breakthroughs. The training process of deep learning usually requires a lot of data. The availability of big data, especially privacy-sensitive data, is impeding the application of deep learning. Collecting the data may cause big privacy concerns. Some privacy-preserving deep learning methods have emerged in academia and industry. In this paper, we propose BDML+, a decentralized framework based on consortium Blockchain for privacy-preserving distributed deep learning. It focuses on statistical challenges such as different data distributions and data amounts among participants. Statistical challenges are tackled by several techniques. In the first block, a small amount of publicly shared data and a bootstrap warm-up model are given. During the training process, the local training epochs are automatically adjusted with an adaptive boosting method to prevent local training from non-convergence or overfitting. Besides, factors such as local data amount, the base block and the number of training steps are considered to avoid integrating parameter weights with large divergence. The experimental results show that BDML+ has strong adaptability to various data distributions and data amounts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
General Data Protection Regulation (GDPR). https://gdpr-info.eu/.
- 2.
A simple Blockchain in Python. https://gihub.com/dvf/blockchain/.
- 3.
The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/.
References
Abadi, M., et al.: Tensorflow: a system for large-scale machine learning. In: USENIX Symposium on Operating Systems Design and Implementation (OSDI), pp. 265–283 (2016)
Abadi, M., Chu, A., et al.: Deep learning with differential privacy. In: ACM SIGSAC Conference on Computer and Communications Security (CCS), pp. 308–318 (2016)
Androulaki, E., Barger, A., et al.: Hyperledger fabric: a distributed operating system for permissioned blockchains. In: Proceedings of the 30th EuroSys Conference, pp. 1–15 (2018)
Ben-Nun, T., Hoefler, T.: Demystifying parallel and distributed deep learning: an in-depth concurrency analysis. ACM Comput. Surv. (CSUR) 52(4), 1–43 (2019)
Bonawitz, K., Eichner, H., et al.: Towards federated learning at scale: system design. In: Proceedings of the Conference on Systems and Machine Learning (SysML) (2019)
Bottou, L., Curtis, F.E., Nocedal, J.: Optimization methods for large-scale machine learning. Siam Rev. 60(2), 223–311 (2018)
Casino, F., Dasaklis, T.K., Patsakis, C.: A systematic literature review of blockchain-based applications: current status, classification and open issues. Telemat. Inf. 36, 55–81 (2019)
Chilimbi, T., Suzue, Y., Apacible, J., Kalyanaraman, K.: Project adam: building an efficient and scalable deep learning training system. In: Proceedings of OSDI 2014, pp. 571–582 (2014)
Gentry, C.: Fully homomorphic encryption using ideal lattices. In: Proceedings of the 41th Annual ACM Symposium on Theory of Computing (STOC), pp. 169–178 (2009)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
Huang, L., Yin, Y., Fu, Z., Zhang, S., Deng, H., Liu, D.: Loadaboost: loss-based adaboost federated machine learning on medical data. arXiv preprint arXiv:1811.12629 (2018)
Krizhevsky, A., Hinton, G.: Learning Multiple Layers of Features from Tiny Images (2009)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)
LeCun, Y., Bottou, L., Bengio, Y., et al.: Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, pp. 2278–2324 (1998)
Liu, T.Y., et al.: Distributed machine learning: foundations, trends, and practices. In: 26th International Conference on World Wide Web Companion (WWW), pp. 913–915 (2017)
McMahan, B., Moore, E., et al.: Communication-efficient learning of deep networks from decentralized data. In: Artificial Intelligence and Statistics (AISTATS), pp. 1273–1282 (2017)
Nakamoto, S.: Bitcoin: A Peer-to-Peer Electronic Cash System (2008)
Phong, L.T., et al.: Privacy-preserving deep learning via additively homomorphic encryption. IEEE Trans. Inf. Forens. Secur. 13(5), 1333–1345 (2018)
Shokri, R., Shmatikov, V.: Privacy-preserving deep learning. In: ACM SIGSAC Conference on Computer and Communications Security (CCS), pp. 1310–1321 (2015)
Smith, V., Chiang, C.K., Sanjabi, M., Talwalkar, A.S.: Federated multi-task learning. In: Advances in Neural Information Processing Systems (NeurIPS), pp. 4424–4434 (2017)
Wang, Q., et al.: BDML: blockchain-based distributed machine learning for model training and evolution. In: Symposium on Foundations and Applications of Blockchain (FAB) (2019)
Weng, J., et al.: Deepchain: auditable and privacy-preserving deep learning with blockchain-based incentive. IEEE Trans. Dependable Secur. Comput. PP(99), 1 (2019)
Yang, Q., Liu, Y., Chen, T., et al.: Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. 10(2), 12 (2019)
Zhang, W., Wang, Q., Li, M.: Medical image collaborative training based on multi-blockchain. In: International Conference on Bioinformatics and Biomedicine (BIBM) (2019)
Zhao, Y., Li, M., Lai, L., Suda, N., Civin, D., Chandra, V.: Federated learning with non-IID data. arXiv preprint arXiv:1806.00582 (2018)
Zhu, T., Philip, S.Y.: Applying differential privacy mechanism in artificial intelligence. In: International Conference on Distributed Computing Systems (ICDCS), pp. 1601–1609 (2019)
Zhuo, H., et al.: Federated reinforcement learning. arXiv preprint arXiv:1901.08277 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Li, M., Wang, Q., Zhang, W. (2020). Blockchain-Based Distributed Machine Learning Towards Statistical Challenges. In: Zheng, Z., Dai, HN., Fu, X., Chen, B. (eds) Blockchain and Trustworthy Systems. BlockSys 2020. Communications in Computer and Information Science, vol 1267. Springer, Singapore. https://doi.org/10.1007/978-981-15-9213-3_42
Download citation
DOI: https://doi.org/10.1007/978-981-15-9213-3_42
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-9212-6
Online ISBN: 978-981-15-9213-3
eBook Packages: Computer ScienceComputer Science (R0)