Blockchain-Based Distributed Machine Learning Towards Statistical Challenges

  • Conference paper
Blockchain and Trustworthy Systems (BlockSys 2020)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1267)

Abstract

Deep-learning-based artificial intelligence has achieved many breakthroughs, but training deep models usually requires large amounts of data. The limited availability of such data, especially privacy-sensitive data, impedes the application of deep learning, because collecting it raises serious privacy concerns. Several privacy-preserving deep learning methods have emerged in academia and industry. In this paper, we propose BDML+, a decentralized framework based on a consortium blockchain for privacy-preserving distributed deep learning. BDML+ focuses on statistical challenges such as differing data distributions and data amounts among participants, and tackles them with several techniques. First, the first block provides a small amount of publicly shared data and a bootstrap warm-up model. Second, during training, the number of local training epochs is adjusted automatically with an adaptive boosting method, which keeps local training from failing to converge or from overfitting. Third, factors such as the local data amount, the base block, and the number of training steps are taken into account so that parameter weights with large divergence are not integrated. The experimental results show that BDML+ adapts well to various data distributions and data amounts.
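
The abstract names the factors used when integrating parameters (local data amount, base block, number of training steps) and an automatic adjustment of local epochs, but it does not give the formulas. The Python sketch below is therefore only an illustration under stated assumptions: the `LocalUpdate` fields, the `merge_weight` rule, and the loss-driven `adjust_epochs` heuristic are all hypothetical, and the heuristic is a simple stand-in for the adaptive boosting method, not a reproduction of it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LocalUpdate:
    """One participant's contribution (all fields are illustrative)."""
    params: List[float]  # flattened model parameters after local training
    num_samples: int     # size of the participant's local dataset
    base_block: int      # height of the block whose model was the starting point
    num_steps: int       # local training steps taken from that starting point


def merge_weight(update: LocalUpdate, head_block: int, max_steps: int) -> float:
    """Hypothetical weighting rule, NOT the formula from the paper.

    More local data raises the weight. A stale base block, or more local
    steps than max_steps, lowers it, since both tend to increase divergence.
    """
    staleness = max(head_block - update.base_block, 0)
    step_penalty = min(1.0, max_steps / max(update.num_steps, 1))
    return update.num_samples * step_penalty / (1 + staleness)


def aggregate(updates: List[LocalUpdate], head_block: int,
              max_steps: int = 100) -> List[float]:
    """Weighted average of the participants' parameter vectors."""
    weights = [merge_weight(u, head_block, max_steps) for u in updates]
    total = sum(weights)
    if total == 0:
        raise ValueError("no update carries a positive weight")
    dim = len(updates[0].params)
    return [
        sum(w * u.params[i] for w, u in zip(weights, updates)) / total
        for i in range(dim)
    ]


def adjust_epochs(epochs: int, prev_loss: float, curr_loss: float,
                  min_epochs: int = 1, max_epochs: int = 20) -> int:
    """Assumed heuristic: train longer while the loss still improves
    (possible non-convergence), back off when it worsens (possible
    overfitting). The result is clamped to [min_epochs, max_epochs]."""
    if curr_loss < prev_loss:
        return min(epochs + 1, max_epochs)
    return max(epochs - 1, min_epochs)


if __name__ == "__main__":
    fresh = LocalUpdate([1.0, 2.0], num_samples=100, base_block=5, num_steps=50)
    stale = LocalUpdate([3.0, 6.0], num_samples=100, base_block=4, num_steps=50)
    print(aggregate([fresh, stale], head_block=5))
    print(adjust_epochs(5, prev_loss=0.9, curr_loss=0.7))
```

In this sketch an update trained from an older block counts for less than an equally sized update trained from the current head, so the merged parameters lean toward the fresher contribution.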

Notes

  1. General Data Protection Regulation (GDPR). https://gdpr-info.eu/.

  2. A simple Blockchain in Python. https://github.com/dvf/blockchain/.

  3. The MNIST database of handwritten digits. http://yann.lecun.com/exdb/mnist/.

Corresponding author

Correspondence to Mei Li.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Li, M., Wang, Q., Zhang, W. (2020). Blockchain-Based Distributed Machine Learning Towards Statistical Challenges. In: Zheng, Z., Dai, HN., Fu, X., Chen, B. (eds) Blockchain and Trustworthy Systems. BlockSys 2020. Communications in Computer and Information Science, vol 1267. Springer, Singapore. https://doi.org/10.1007/978-981-15-9213-3_42

  • DOI: https://doi.org/10.1007/978-981-15-9213-3_42

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-9212-6

  • Online ISBN: 978-981-15-9213-3

  • eBook Packages: Computer Science, Computer Science (R0)
