{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,7,8]],"date-time":"2024-07-08T00:52:20Z","timestamp":1720399940789},"reference-count":41,"publisher":"MDPI AG","issue":"15","license":[{"start":{"date-parts":[[2020,7,22]],"date-time":"2020-07-22T00:00:00Z","timestamp":1595376000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"the National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U1831118"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"The SKA (Square Kilometer Array) radio telescope will become the most sensitive telescope by correlating a huge number of antenna nodes to form a vast array of sensors in a region over one hundred kilometers. Faceting, the wide-field imaging algorithm, is a novel approach towards solving image construction from sensing data where earth surface curves cannot be ignored. However, the traditional processor of cloud computing, even if the most sophisticated supercomputer is used, cannot meet the extremely high computation performance requirement. In this paper, we propose the design and implementation of high-efficiency FPGA (Field Programmable Gate Array) -based hardware acceleration of the key algorithm, faceting in SKA by focusing on phase rotation and gridding, which are the most time-consuming phases in the faceting algorithm. Through the analysis of algorithm behavior and bottleneck, we design and optimize the memory architecture and computing logic of the FPGA-based accelerator. The simulation and tests on FPGA are done to confirm the acceleration result of our design and it is shown that the acceleration performance we achieved on phase rotation is 20\u00d7 the result of the previous work. 
We then further designed and optimized an efficient microstructure of loop unrolling and pipeline for the gridding accelerator, and the designed system simulation was done to confirm the performance of our structure. The result shows that the acceleration ratio is 5.48 compared to the result tested on software in gridding parts. Hence, our approach enables efficient acceleration of the faceting algorithm on FPGAs with high performance to meet the computational constraints of SKA as a representative vast sensor array.","DOI":"10.3390\/s20154070","type":"journal-article","created":{"date-parts":[[2020,7,22]],"date-time":"2020-07-22T11:31:28Z","timestamp":1595417488000},"page":"4070","source":"Crossref","is-referenced-by-count":2,"title":["Accelerating Faceting Wide-Field Imaging Algorithm with FPGA for SKA Radio Telescope as a Vast Sensor Array"],"prefix":"10.3390","volume":"20","author":[{"ORCID":"http:\/\/orcid.org\/0000-0001-7383-0812","authenticated-orcid":false,"given":"Yuefeng","family":"Song","sequence":"first","affiliation":[{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, China"}]},{"given":"Yongxin","family":"Zhu","sequence":"additional","affiliation":[{"name":"Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China"},{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"}]},{"given":"Tianhao","family":"Nan","sequence":"additional","affiliation":[{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, China"}]},{"given":"Junjie","family":"Hou","sequence":"additional","affiliation":[{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, China"}]},{"given":"Sen","family":"Du","sequence":"additional","affiliation":[{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, 
China"}]},{"given":"Shijin","family":"Song","sequence":"additional","affiliation":[{"name":"School of Microelectronics, Shanghai Jiao Tong University, Shanghai 200240, China"}]}],"member":"1968","published-online":{"date-parts":[[2020,7,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Schilizzi, R.T. (2004, January 28). The square kilometer array. Proceedings of the Ground-based Telescopes, Glasgow, UK.","DOI":"10.1117\/12.551206"},{"key":"ref_2","unstructured":"Turner, W., Cornwell, T., McPherson, A., and Diamond, P. (2014). Ska Phase 1 System (Level 1) Requirements Specification, SKA Org.. Tech. Rep. SKA-TEL-SKO-0000008."},{"key":"ref_3","unstructured":"Stergiopoulou, A. (2016). Combining E-ELT HIRES Instrument and SKA to Probe the Chemical Enrichment by the First Stars. [Ph.D. Thesis, Observatory Astronomy Uppsala University]."},{"key":"ref_4","unstructured":"Dewdney, P.E., Turner, W., Millenaar, R., McCool, R., Lazio, J., and Cornwell, T.J. (2020, July 21). SKA1 system baseline design. SKA-TELSKO-0000002 Rev 2016; 3. Available online: https:\/\/www.skatelescope.org\/wp-content\/uploads\/2013\/03\/SKA-TEL-SKO-DD-001-1_BaselineDesign1.pdf."},{"key":"ref_5","unstructured":"Kogan, L., and Greisen, E.W. (2009). Faceted Imaging in AIPS. AIPS Memo, National Radio Astronomical Observatory."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Tayara, H., Ham, W., and Chong, K.T. (2016). A Real-Time Marker-Based Visual Sensor Based on a FPGA and a Soft Core Processor. Sensors, 16.","DOI":"10.3390\/s16122139"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Huang, J., Zhou, G., Zhou, X., and Zhang, R. (2018). A New FPGA Architecture of FAST and BRIEF Algorithm for On-Board Corner Detection and Matching. 
Sensors, 18.","DOI":"10.3390\/s18041014"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1109\/MC.2014.235","article-title":"An end-to-end computing model for the square kilometre array","volume":"47","author":"Jongerius","year":"2014","journal-title":"Computer"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Nan, T., Zhu, Y., Li, W., Chen, X., Song, Y., and Hou, J. (2019, January 10\u201312). An FPGA-based Hardware Acceleration for Key Steps of Facet Imaging Algorithm. Proceedings of the Smartcloud Meeting, Tokyo, Japan.","DOI":"10.1109\/SmartCloud.2019.00025"},{"key":"ref_10","unstructured":"Veenboer, B., Petschow, M., and Romein, J.W. (June, January 29). Image-domain gridding on graphics processors. Proceedings of the IEEE International Parallel & Distributed Processing Symposium (IPDPS), Orlando, FL, USA."},{"key":"ref_11","unstructured":"Cornwell, T.J., Golap, K., and Bhatnagar, S. (2005, January 23). Wide field imaging problems in radio astronomy. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, USA."},{"key":"ref_12","unstructured":"Perley, R.A. (1999). Synthesis Imaging in Radio Astronomy II ASP Conference Series, NRAO\/NMIMT."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"A87","DOI":"10.1051\/0004-6361\/201731474","article-title":"Faceting for direction-dependent spectral deconvolution","volume":"611","author":"Tasse","year":"2018","journal-title":"Astron. Astrophys."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"647","DOI":"10.1109\/JSTSP.2008.2005290","article-title":"The Noncoplanar Baselines Effect in Radio Interferometry: The W-Projection Algorithm","volume":"2","author":"Cornwell","year":"2008","journal-title":"IEEE J. Sel. Top. Signal Process."},{"key":"ref_15","unstructured":"Humphreys, B., and Cornwell, T. (2020, July 21). Analysis of Convolutional Resampling Algorithm Performance. 
Available online: https:\/\/www.skatelescope.org\/uploaded\/59116_132_Memo_Humphreys.pdf."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Cornwell, T.J., Voronkov, M.A., and Humphreys, B. (2012, January 19). Wide field imaging for the Square Kilometre Array. Proceedings of the SPIE\u2014The International Society for Optical Engineering, San Diego, CA, USA.","DOI":"10.1117\/12.929336"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1071\/AS07033","article-title":"Science with the Australian Square Kilometre Array Pathfinder","volume":"24","author":"Johnston","year":"2007","journal-title":"Publ. Astron. Soc. Aust."},{"key":"ref_18","unstructured":"(2019, December 20). SKAO Frequently Asked Questions. Available online: https:\/\/skatelescope.org\/frequently-asked-questions\/."},{"key":"ref_19","unstructured":"Bolton, R., Malan, F., Nijboer, R., Scaife, A., and SDP Architecture Group (2020, July 21). SDP Meeting at ASTRON Netherland. Available online: http:\/\/www.astron.nl\/~broekema\/papers\/SDP-PDR\/PDR01%20System%20Architecture.pdf."},{"key":"ref_20","unstructured":"TOP500 LIST (2019, June 30). TOP10. Available online: https:\/\/www.top500.org\/lists\/2019\/06\/."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.ascom.2017.04.001","article-title":"Calibration of LOFAR data on the cloud","volume":"19","author":"Sabater","year":"2017","journal-title":"Astron. Comput."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Wang, Y., Yang, J., Guo, X., and Qu, Z. (2019). Satellite edge computing for the internet of things in aerospace. 
Sensors, 19.","DOI":"10.3390\/s19204375"},{"key":"ref_23","first-page":"317","article-title":"Chapter 11\u2014Cloud application development","volume":"10","author":"Marinescu","year":"2013","journal-title":"Cloud Comput."},{"key":"ref_24","first-page":"1","article-title":"Analysis of convolutional resampling algorithm performance","volume":"132","author":"Humphreys","year":"2001","journal-title":"SKA Memo"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Romein, J.W. (2012, January 25\u201329). An efficient work-distribution strategy for gridding radio telescope data on GPUs. Proceedings of the 26th ACM International Conference on Supercomputing (ICS), Dwingeloo, The Netherlands.","DOI":"10.1145\/2304576.2304620"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1016\/j.ascom.2016.05.004","article-title":"Faster GPU-based convolutional gridding via thread coarsening","volume":"16","author":"Merry","year":"2016","journal-title":"Astron. Comput."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"424","DOI":"10.1016\/j.chinastron.2018.12.001","article-title":"Research on parallel algorithms for uv-faceting Imaging","volume":"43","author":"Lao","year":"2019","journal-title":"Chin. Astron. Astrophys."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Veenboer, B., and Romein, J.W. (2019, January 13). Radio-astronomical Imaging: FPGAs vs GPUs. Proceedings of the European Conference on Parallel Processing, Berlin, Germany.","DOI":"10.1007\/978-3-030-29400-7_36"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"840","DOI":"10.1016\/j.sysarc.2011.03.005","article-title":"Resource allocation robustness in multi-core embedded systems with inaccurate information","volume":"57","author":"Li","year":"2011","journal-title":"J. Syst. Archit."},{"key":"ref_30","unstructured":"Li, J., Qiu, M., Niu, J., Gao, W., Zong, Z., and Qin, X. (September, January 31). 
Feedback Dynamic Algorithms for Preemptable Job Scheduling in Cloud Systems. Proceedings of the IEEE\/WIC\/ACM International Conference on Web Intelligence & Intelligent Agent Technology IEEE Computer Society, Toronto, ON, Canada."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Vestias, M., and Neto, H. (2014, January 2\u20134). Trends of CPU, GPU and FPGA for high-performance computing. Proceedings of the 2014 24th International Conference on Field Programmable Logic and Applications (FPL) IEEE, Munich, Germany.","DOI":"10.1109\/FPL.2014.6927483"},{"key":"ref_32","first-page":"383","article-title":"Imaging in Radio Astronomy II","volume":"Volume 180","author":"Taylor","year":"1999","journal-title":"A Collection of Lectures from the Sixth NRAO\/NMIMT Synthesis Imaging Summer School, ASP Conference Series"},{"key":"ref_33","unstructured":"Muscat, D. (2014). High-Performance Image Synthesis for Radio Interferometry. arXiv."},{"key":"ref_34","first-page":"50","article-title":"Research on Wide-field Imaging Technologies for Low-frequency Radio Array","volume":"58","author":"Lao","year":"2017","journal-title":"Acta Astron. Sin."},{"key":"ref_35","unstructured":"Iupikov, O.A., Ivashina, M.V., and Smirnov, O.M. (2011, January 11\u201315). Reducing the complexity of the beam calibration models of phased-array radio telescopes. Proceedings of the 5th European Conference on IEEE Antennas and Propagation (EUCAP), Rome, Italy."},{"key":"ref_36","unstructured":"SKA-ScienceDataProcessor (2019, November 12). Algorithm-Reference Library. Available online: https:\/\/github.com\/SKA-ScienceDataProcessor\/algorithm-reference-library."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Meher, P.K., and Park, S.Y. (2019). Design of Cascaded CORDIC Based on Precise Analysis of Critical Path. Electronics, 8.","DOI":"10.3390\/electronics8040382"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Fan, Y.-C., Liu, Y.-C., and Chu, C.-A. (2019). 
Efficient CORDIC Iteration Design of LiDAR Sensors\u2019 Point-Cloud Map Reconstruction Technology. Sensors, 19.","DOI":"10.3390\/s19245412"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Qiao, W., Du, J., Fang, Z., Wang, L., Lo, M., Chang, M.-C.F., and Cong, J. (May, January 29). High-Throughput Lossless Compression on Tightly Coupled CPU-FPGA Platforms. Proceedings of the 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), Boulder, CO, USA.","DOI":"10.1109\/FCCM.2018.00015"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Wu, Q., Zhu, Y., Wang, X., Li, M., and Masoumi, A. (May, January 30). Exploring High Efficiency Hardware Accelerator for the Key Algorithm of Square Kilometer Array Telescope Data Processing. Proceedings of the 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM).","DOI":"10.1109\/FCCM.2017.32"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1109\/TNS.2014.2304691","article-title":"High Performance FPGA-Based DMA Interface for PCIe","volume":"61","author":"Kavianipour","year":"2012","journal-title":"IEEE Trans. 
Nuclear Sci."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/15\/4070\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,1]],"date-time":"2024-07-01T06:56:30Z","timestamp":1719816990000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/15\/4070"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,22]]},"references-count":41,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2020,8]]}},"alternative-id":["s20154070"],"URL":"https:\/\/doi.org\/10.3390\/s20154070","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,22]]}}}