{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,30]],"date-time":"2024-10-30T22:15:12Z","timestamp":1730326512214,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,11,17]]},"DOI":"10.1145\/3581807.3581808","type":"proceedings-article","created":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T00:02:28Z","timestamp":1684800148000},"page":"1-6","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Multi-Scale Channel Attention for Chinese Scene Text Recognition"],"prefix":"10.1145","author":[{"ORCID":"http:\/\/orcid.org\/0000-0002-2815-8817","authenticated-orcid":false,"given":"Haiqing","family":"Liao","sequence":"first","affiliation":[{"name":"Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen University of Technology, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-6298-846X","authenticated-orcid":false,"given":"Xia","family":"Du","sequence":"additional","affiliation":[{"name":"Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen University of Technology, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-2476-7746","authenticated-orcid":false,"given":"Yun","family":"Wu","sequence":"additional","affiliation":[{"name":"Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen University of Technology, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-5901-0778","authenticated-orcid":false,"given":"Da-Han","family":"Wang","sequence":"additional","affiliation":[{"name":"Fujian Key Laboratory of Pattern Recognition and Image Understanding, Xiamen University of Technology, China"}]}],"member":"320","published-online":{"date-parts":[[2023,5,22]]},"reference":[{"key":"e_1_3_2_1_1_1","first-page":"1","article-title":"Reading Text in the Wild with Convolutional Neural Networks","volume":"1842","author":"Jaderberg Max","year":"2016","unstructured":"Jaderberg , Max , \" Reading Text in the Wild with Convolutional Neural Networks ..\" International Journal of Computer Vision abs\/1412 . 1842 ( 2016 ): 1 - 20 . Jaderberg, Max, \"Reading Text in the Wild with Convolutional Neural Networks..\" International Journal of Computer Vision abs\/1412.1842 (2016): 1-20.","journal-title":"International Journal of Computer Vision abs\/1412"},{"key":"e_1_3_2_1_2_1","volume-title":"IEEE","author":"Shi Baoguang","year":"2017","unstructured":"Shi , Baoguang , \"Icdar2017 competition on reading chinese text in the wild (rctw-17).\" 2017 14th iapr international conference on document analysis and recognition (ICDAR). Vol. 1 . IEEE , 2017 . Shi, Baoguang, \"Icdar2017 competition on reading chinese text in the wild (rctw-17).\" 2017 14th iapr international conference on document analysis and recognition (ICDAR). Vol. 1. IEEE, 2017."},{"key":"e_1_3_2_1_3_1","volume-title":"IEEE","author":"Zhang Rui","year":"2019","unstructured":"Zhang , Rui , \" Icdar 2019 robust reading challenge on reading chinese text on signboard.\" 2019 international conference on document analysis and recognition (ICDAR) . IEEE , 2019. Zhang, Rui, \"Icdar 2019 robust reading challenge on reading chinese text on signboard.\" 2019 international conference on document analysis and recognition (ICDAR). IEEE, 2019."},{"key":"e_1_3_2_1_4_1","volume-title":"Datasets, Baselines, and an Empirical Study.\" arXiv preprint arXiv:2112.15093","author":"Chen Jingye","year":"2021","unstructured":"Chen , Jingye , \" Benchmarking Chinese Text Recognition : Datasets, Baselines, and an Empirical Study.\" arXiv preprint arXiv:2112.15093 ( 2021 ). Chen, Jingye, \"Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study.\" arXiv preprint arXiv:2112.15093 (2021)."},{"key":"e_1_3_2_1_5_1","volume-title":"2231-2239","author":"Lee Chen-Yu","year":"2016","unstructured":"Lee , Chen-Yu , and Simon Osindero . \"Recursive Recurrent Nets With Attention Modeling For Ocr In The Wild .\" abs\/1603.03101 ( 2016 ): 2231-2239 . Lee, Chen-Yu, and Simon Osindero. \"Recursive Recurrent Nets With Attention Modeling For Ocr In The Wild.\" abs\/1603.03101 (2016): 2231-2239."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Shi Baoguang \"ASTER: An Attentional Scene Text Recognizer with Flexible Rectification..\" 41 (2019): 2035-2048. Shi Baoguang \"ASTER: An Attentional Scene Text Recognizer with Flexible Rectification..\" 41 (2019): 2035-2048.","DOI":"10.1109\/TPAMI.2018.2848939"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Luo Canjie \"MORAN: A Multi-Object Rectified Attention Network for scene text recognition..\" 90 (2019): 109-118. Luo Canjie \"MORAN: A Multi-Object Rectified Attention Network for scene text recognition..\" 90 (2019): 109-118.","DOI":"10.1016\/j.patcog.2019.01.020"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"Yang Maoke \"Denseaspp for semantic segmentation in street scenes.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. Yang Maoke \"Denseaspp for semantic segmentation in street scenes.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.","DOI":"10.1109\/CVPR.2018.00388"},{"key":"e_1_3_2_1_9_1","unstructured":"Jaderberg Max Karen Simonyan and Andrew Zisserman. \"Spatial transformer networks.\" Advances in neural information processing systems 28 (2015). Jaderberg Max Karen Simonyan and Andrew Zisserman. \"Spatial transformer networks.\" Advances in neural information processing systems 28 (2015)."},{"key":"e_1_3_2_1_10_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition.","author":"Hu Jie","year":"2018","unstructured":"Hu , Jie , Li Shen , and Gang Sun . \"Squeeze-and-excitation networks.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2018 . Hu, Jie, Li Shen, and Gang Sun. \"Squeeze-and-excitation networks.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2018."},{"key":"e_1_3_2_1_11_1","first-page":"1457","article-title":"End-to-end scene text recognition","volume":"2011","author":"Wang Kai","year":"2011","unstructured":"Wang , Kai , \" End-to-end scene text recognition .\" International Conference on Computer Vision 2011 ( 2011 ): 1457 - 1464 . Wang, Kai, \"End-to-end scene text recognition.\" International Conference on Computer Vision 2011 (2011): 1457-1464.","journal-title":"International Conference on Computer Vision"},{"key":"e_1_3_2_1_12_1","volume-title":"3304-3308","author":"Wang Tao","year":"2012","unstructured":"Wang , Tao , \"End-to-end text recognition with convolutional neural networks.\" ( 2012 ): 3304-3308 . Wang, Tao, \"End-to-end text recognition with convolutional neural networks.\" (2012): 3304-3308."},{"key":"e_1_3_2_1_13_1","volume-title":"3999-4004","author":"Liu Xinhao","year":"2016","unstructured":"Liu , Xinhao , \" Scene Text Recognition With Cnn Classifier And Wfst- Based Word Labeling .\" ( 2016 ): 3999-4004 . Liu, Xinhao, \"Scene Text Recognition With Cnn Classifier And Wfst-Based Word Labeling.\" (2016): 3999-4004."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.cviu.2016.01.002","article-title":"Enhancing Energy Minimization Framework for Scene Text Recognition with Top-Down Cues","volume":"03128","author":"Mishra Anand","year":"2016","unstructured":"Mishra , Anand , \" Enhancing Energy Minimization Framework for Scene Text Recognition with Top-Down Cues .\" Computer Vision and Image Understanding abs\/1601 . 03128 ( 2016 ): 30 - 42 . Mishra, Anand, \"Enhancing Energy Minimization Framework for Scene Text Recognition with Top-Down Cues.\" Computer Vision and Image Understanding abs\/1601.03128 (2016): 30-42.","journal-title":"Computer Vision and Image Understanding abs\/1601"},{"key":"e_1_3_2_1_15_1","first-page":"569","volume-title":"International Conference on Computer Vision 2013 (2013)","author":"Phan Trung Quy","unstructured":"Phan , Trung Quy , \" Recognizing Text with Perspective Distortion in Natural Scenes .\" International Conference on Computer Vision 2013 (2013) : 569 - 576 . Phan, Trung Quy, \"Recognizing Text with Perspective Distortion in Natural Scenes.\" International Conference on Computer Vision 2013 (2013): 569-576."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Yao Cong \"Strokelets: A Learned Multi-scale Representation for Scene Text Recognition.\" IEEE Conference on Computer Vision and Pattern Recognition (2014): 4042-4049. Yao Cong \"Strokelets: A Learned Multi-scale Representation for Scene Text Recognition.\" IEEE Conference on Computer Vision and Pattern Recognition (2014): 4042-4049.","DOI":"10.1109\/CVPR.2014.515"},{"key":"e_1_3_2_1_17_1","volume-title":"2956-2964","author":"Gordo Albert","year":"2015","unstructured":"Gordo , Albert . \" Supervised Mid-Level Features For Word Image Representation .\" abs\/1410.5224 ( 2015 ): 2956-2964 . Gordo, Albert. \"Supervised Mid-Level Features For Word Image Representation.\" abs\/1410.5224 (2015): 2956-2964."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"crossref","unstructured":"Baek Jeonghun \"What is wrong with scene text recognition model comparisons? dataset and model analysis.\" Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2019. Baek Jeonghun \"What is wrong with scene text recognition model comparisons? dataset and model analysis.\" Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2019.","DOI":"10.1109\/ICCV.2019.00481"},{"key":"e_1_3_2_1_19_1","volume-title":"2298-2304","author":"Shi Baoguang","year":"2016","unstructured":"Shi , Baoguang , Xiang Bai , and Cong Yao . \"An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition.\" IEEE transactions on pattern analysis and machine intelligence 39.11 ( 2016 ): 2298-2304 . Shi, Baoguang, Xiang Bai, and Cong Yao. \"An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition.\" IEEE transactions on pattern analysis and machine intelligence 39.11 (2016): 2298-2304."},{"key":"e_1_3_2_1_20_1","unstructured":"Simonyan Karen and Andrew Zisserman. \"Very deep convolutional networks for large-scale image recognition.\" arXiv preprint arXiv:1409.1556 (2014). Simonyan Karen and Andrew Zisserman. \"Very deep convolutional networks for large-scale image recognition.\" arXiv preprint arXiv:1409.1556 (2014)."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-020-01411-1"},{"key":"e_1_3_2_1_22_1","unstructured":"Goodfellow Ian \"Generative adversarial nets.\" Advances in neural information processing systems 27 (2014). Goodfellow Ian \"Generative adversarial nets.\" Advances in neural information processing systems 27 (2014)."},{"key":"e_1_3_2_1_23_1","volume-title":"Springer","author":"Wang Wenjia","year":"2020","unstructured":"Wang , Wenjia , \" Scene text image super-resolution in the wild.\" European Conference on Computer Vision . Springer , Cham , 2020 . Wang, Wenjia, \"Scene text image super-resolution in the wild.\" European Conference on Computer Vision. Springer, Cham, 2020."},{"key":"e_1_3_2_1_24_1","volume-title":"Degradation aware scene text recognition supervised by a pluggable super-resolution unit.\" European Conference on Computer Vision","author":"Mou Yongqiang","year":"2020","unstructured":"Mou , Yongqiang , \" Plugnet : Degradation aware scene text recognition supervised by a pluggable super-resolution unit.\" European Conference on Computer Vision . Springer , Cham , 2020 . Mou, Yongqiang, \"Plugnet: Degradation aware scene text recognition supervised by a pluggable super-resolution unit.\" European Conference on Computer Vision. Springer, Cham, 2020."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"crossref","unstructured":"Zhan Fangneng Hongyuan Zhu and Shijian Lu. \"Spatial fusion gan for image synthesis.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2019. Zhan Fangneng Hongyuan Zhu and Shijian Lu. \"Spatial fusion gan for image synthesis.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2019.","DOI":"10.1109\/CVPR.2019.00377"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","unstructured":"Yang Mingkun \"Symmetry-constrained rectification network for scene text recognition.\" Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2019. Yang Mingkun \"Symmetry-constrained rectification network for scene text recognition.\" Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2019.","DOI":"10.1109\/ICCV.2019.00924"},{"key":"e_1_3_2_1_27_1","volume-title":"Towards accurate text recognition in natural images.\" Proceedings of the IEEE international conference on computer vision","author":"Cheng Zhanzhan","year":"2017","unstructured":"Cheng , Zhanzhan , \"Focusing attention : Towards accurate text recognition in natural images.\" Proceedings of the IEEE international conference on computer vision . 2017 . Cheng, Zhanzhan, \"Focusing attention: Towards accurate text recognition in natural images.\" Proceedings of the IEEE international conference on computer vision. 2017."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"crossref","unstructured":"Yang Xiao \"Learning to read irregular text with attention mechanisms.\" IJCAI. Vol. 1. No. 2. 2017. Yang Xiao \"Learning to read irregular text with attention mechanisms.\" IJCAI. Vol. 1. No. 2. 2017.","DOI":"10.24963\/ijcai.2017\/458"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.24792"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"He Kaiming \"Deep residual learning for image recognition.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. He Kaiming \"Deep residual learning for image recognition.\" Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_1_31_1","volume-title":"855-868","author":"Graves Alex","year":"2008","unstructured":"Graves , Alex , \"A novel connectionist system for unconstrained handwriting recognition.\" IEEE transactions on pattern analysis and machine intelligence 31.5 ( 2008 ): 855-868 . Graves, Alex, \"A novel connectionist system for unconstrained handwriting recognition.\" IEEE transactions on pattern analysis and machine intelligence 31.5 (2008): 855-868."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Zhang Yaping \"Sequence-to-sequence domain adaptation network for robust text image recognition.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2019. Zhang Yaping \"Sequence-to-sequence domain adaptation network for robust text image recognition.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2019.","DOI":"10.1109\/CVPR.2019.00285"},{"key":"e_1_3_2_1_33_1","unstructured":"Bahdanau Dzmitry Kyunghyun Cho and Yoshua Bengio. \"Neural machine translation by jointly learning to align and translate.\" arXiv preprint arXiv:1409.0473 (2014). Bahdanau Dzmitry Kyunghyun Cho and Yoshua Bengio. \"Neural machine translation by jointly learning to align and translate.\" arXiv preprint arXiv:1409.0473 (2014)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Cho Kyunghyun \"Learning phrase representations using RNN encoder-decoder for statistical machine translation.\" arXiv preprint arXiv:1406.1078 (2014). Cho Kyunghyun \"Learning phrase representations using RNN encoder-decoder for statistical machine translation.\" arXiv preprint arXiv:1406.1078 (2014).","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_3_2_1_35_1","unstructured":"Chen Liang-Chieh \"Rethinking atrous convolution for semantic image segmentation.\" arXiv preprint arXiv:1706.05587 (2017). Chen Liang-Chieh \"Rethinking atrous convolution for semantic image segmentation.\" arXiv preprint arXiv:1706.05587 (2017)."},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"crossref","unstructured":"Chen Liang-Chieh \"Deeplab: Semantic image segmentation with deep convolutional nets atrous convolution and fully connected crfs.\" IEEE transactions on pattern analysis and machine intelligence 40.4 (2017): 834-848. Chen Liang-Chieh \"Deeplab: Semantic image segmentation with deep convolutional nets atrous convolution and fully connected crfs.\" IEEE transactions on pattern analysis and machine intelligence 40.4 (2017): 834-848.","DOI":"10.1109\/TPAMI.2017.2699184"},{"key":"e_1_3_2_1_37_1","volume-title":"IEEE","author":"Sun Yipeng","year":"2019","unstructured":"Sun , Yipeng , \" ICDAR 2019 competition on large-scale street view text with partial labeling-RRC-LSVT.\" 2019 International Conference on Document Analysis and Recognition (ICDAR) . IEEE , 2019. Sun, Yipeng, \"ICDAR 2019 competition on large-scale street view text with partial labeling-RRC-LSVT.\" 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2019."},{"key":"e_1_3_2_1_38_1","volume-title":"IEEE","author":"Chng Chee Kheng","year":"2019","unstructured":"Chng , Chee Kheng , \"Icdar2019 robust reading challenge on arbitrary-shaped text-rrc-art.\" 2019 International Conference on Document Analysis and Recognition (ICDAR) . IEEE , 2019 . Chng, Chee Kheng, \"Icdar2019 robust reading challenge on arbitrary-shaped text-rrc-art.\" 2019 International Conference on Document Analysis and Recognition (ICDAR). IEEE, 2019."},{"issue":"3","key":"e_1_3_2_1_39_1","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1007\/s11390-019-1923-y","article-title":"A large chinese text dataset in the wild","volume":"34","author":"Yuan Tai-Ling","year":"2019","unstructured":"Yuan , Tai-Ling , \" A large chinese text dataset in the wild .\" Journal of Computer Science and Technology 34 . 3 ( 2019 ): 509 - 521 . Yuan, Tai-Ling, \"A large chinese text dataset in the wild.\" Journal of Computer Science and Technology 34.3 (2019): 509-521.","journal-title":"Journal of Computer Science and Technology"},{"key":"e_1_3_2_1_40_1","volume-title":"An imperative style, high-performance deep learning library.\" Advances in neural information processing systems 32","author":"Paszke Adam","year":"2019","unstructured":"Paszke , Adam , \" Pytorch : An imperative style, high-performance deep learning library.\" Advances in neural information processing systems 32 ( 2019 ). Paszke, Adam, \"Pytorch: An imperative style, high-performance deep learning library.\" Advances in neural information processing systems 32 (2019)."},{"key":"e_1_3_2_1_41_1","volume-title":"an adaptive learning rate method.\" arXiv preprint arXiv:1212.5701","author":"Zeiler Matthew D","year":"2012","unstructured":"Zeiler , Matthew D . \" Adadelta : an adaptive learning rate method.\" arXiv preprint arXiv:1212.5701 ( 2012 ). Zeiler, Matthew D. \"Adadelta: an adaptive learning rate method.\" arXiv preprint arXiv:1212.5701 (2012)."},{"key":"e_1_3_2_1_42_1","volume-title":"Semantics enhanced encoder-decoder framework for scene text recognition.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition","author":"Qiao Zhi","year":"2020","unstructured":"Qiao , Zhi , \" Seed : Semantics enhanced encoder-decoder framework for scene text recognition.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition . 2020 . Qiao, Zhi, \"Seed: Semantics enhanced encoder-decoder framework for scene text recognition.\" Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2020."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1162\/tacl_a_00051"}],"event":{"name":"ICCPR 2022: 2022 11th International Conference on Computing and Pattern Recognition","acronym":"ICCPR 2022","location":"Beijing China"},"container-title":["Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3581807.3581808","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,23]],"date-time":"2023-05-23T00:26:49Z","timestamp":1684801609000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3581807.3581808"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,17]]},"references-count":43,"alternative-id":["10.1145\/3581807.3581808","10.1145\/3581807"],"URL":"https:\/\/doi.org\/10.1145\/3581807.3581808","relation":{},"subject":[],"published":{"date-parts":[[2022,11,17]]},"assertion":[{"value":"2023-05-22","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}