{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,7,6]],"date-time":"2024-07-06T00:16:03Z","timestamp":1720224963474},"reference-count":45,"publisher":"World Scientific Pub Co Pte Ltd","issue":"06","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2024,5]]},"abstract":" As the demand for recognizing irregular text in natural scenes increases, people are increasingly realizing the value of such applications, such as license plate recognition systems, image search, handwriting recognition, and autonomous driving, which are profoundly changing our lives in the field of text recognition. Recent studies have shown that the recognition of curved text and perspective text has become an important challenge in the field of text recognition, and the correction of curved text is a key step to achieve accurate recognition. However, current methods use strained text image correction methods, resulting in poor recognition accuracy when recognizing curved text. Therefore, we propose an end-to-end framework called Scene Text Recognizer with Appearance-Flow rectification (SterAF), which includes a correction network and a recognition network. Specifically, the framework\u2019s steps are as follows: first, the input text image is deformed through an appearance flow-based correction network to adaptively warp the text image, to prevent irregular and unnatural deformations of the text image. Second, a sequence-to-sequence recognition network predicts the sequence of characters in the corrected text image to accurately recognize the text in the image. Through subjective and objective experiments, our SterAF model has shown excellent performance in both qualitative and quantitative experiments. 
","DOI":"10.1142\/s0218001424500113","type":"journal-article","created":{"date-parts":[[2024,4,25]],"date-time":"2024-04-25T07:39:35Z","timestamp":1714030775000},"source":"Crossref","is-referenced-by-count":0,"title":["SterAF: A Scene Text Recognizer with Appearance-Flow Rectification"],"prefix":"10.1142","volume":"38","author":[{"ORCID":"http:\/\/orcid.org\/0009-0006-9163-0594","authenticated-orcid":false,"given":"Chunyan","family":"Liao","sequence":"first","affiliation":[{"name":"School of Foreign Languages, Wuhan Textile University, Wuhan, P.\u00a0R.\u00a0China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-7275-5064","authenticated-orcid":false,"given":"Chenghu","family":"Du","sequence":"additional","affiliation":[{"name":"School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, P.\u00a0R.\u00a0China"}]},{"ORCID":"http:\/\/orcid.org\/0009-0000-7251-0710","authenticated-orcid":false,"given":"Yating","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Foreign Languages, Wuhan Textile University, Wuhan, P.\u00a0R.\u00a0China"}]},{"ORCID":"http:\/\/orcid.org\/0009-0000-0096-6153","authenticated-orcid":false,"given":"Yanbao","family":"Tan","sequence":"additional","affiliation":[{"name":"School of Foreign Languages, Wuhan Textile University, Wuhan, P.\u00a0R.\u00a0China"}]}],"member":"219","published-online":{"date-parts":[[2024,6,8]]},"reference":[{"key":"S0218001424500113BIB001","doi-asserted-by":"publisher","DOI":"10.1109\/JIOT.2022.3224947"},{"key":"S0218001424500113BIB002","first-page":"1","author":"Yu F.","year":"2023","journal-title":"Vis. Comput."},{"key":"S0218001424500113BIB003","first-page":"1","volume-title":"2022 6th Int. Conf. Computing Methodologies and Communication (ICCMC)","author":"Kamisetty V. N. S. 
R.","year":"2022"},{"key":"S0218001424500113BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/I2C2.2017.8321962"},{"key":"S0218001424500113BIB005","doi-asserted-by":"publisher","DOI":"10.1109\/RIVF51545.2021.9642128"},{"key":"S0218001424500113BIB006","doi-asserted-by":"publisher","DOI":"10.1109\/ICTer51097.2020.9325428"},{"key":"S0218001424500113BIB007","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2007.4377043"},{"key":"S0218001424500113BIB008","doi-asserted-by":"publisher","DOI":"10.3390\/s21051919"},{"key":"S0218001424500113BIB009","doi-asserted-by":"publisher","DOI":"10.3390\/electronics10222780"},{"key":"S0218001424500113BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2011.6126402"},{"key":"S0218001424500113BIB011","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2012.6247990"},{"key":"S0218001424500113BIB012","first-page":"3304","volume-title":"Proc. 21st Int. Conf. Pattern Recognition (ICPR2012)","author":"Wang T.","year":"2012"},{"key":"S0218001424500113BIB013","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-16865-4_3"},{"key":"S0218001424500113BIB014","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0823-z"},{"key":"S0218001424500113BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.543"},{"key":"S0218001424500113BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2018.2848939"},{"key":"S0218001424500113BIB017","first-page":"28","author":"Jaderberg M.","year":"2015","journal-title":"Adv. Neural Inf. Process. 
Syst."},{"key":"S0218001424500113BIB018","doi-asserted-by":"publisher","DOI":"10.1016\/j.firesaf.2022.103547"},{"key":"S0218001424500113BIB019","doi-asserted-by":"publisher","DOI":"10.1109\/TMM.2022.3152367"},{"key":"S0218001424500113BIB020","doi-asserted-by":"publisher","DOI":"10.1016\/j.jvcir.2023.103980"},{"key":"S0218001424500113BIB021","doi-asserted-by":"publisher","DOI":"10.1109\/TCE.2023.3306206"},{"key":"S0218001424500113BIB022","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46493-0_18"},{"key":"S0218001424500113BIB023","doi-asserted-by":"publisher","DOI":"10.1109\/ICPR48806.2021.9412775"},{"key":"S0218001424500113BIB024","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2018.2825107"},{"key":"S0218001424500113BIB025","doi-asserted-by":"publisher","DOI":"10.1109\/TAI.2021.3116216"},{"key":"S0218001424500113BIB026","doi-asserted-by":"publisher","DOI":"10.1016\/0031-3203(95)00030-4"},{"key":"S0218001424500113BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.452"},{"key":"S0218001424500113BIB028","first-page":"3304","volume-title":"Proc. 21st Int. Conf. 
Pattern Recognition (ICPR2012)","author":"Wang T.","year":"2012"},{"key":"S0218001424500113BIB029","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"S0218001424500113BIB031","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.254"},{"key":"S0218001424500113BIB032","doi-asserted-by":"publisher","DOI":"10.1007\/s10032-004-0134-3"},{"key":"S0218001424500113BIB033","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2013.221"},{"key":"S0218001424500113BIB034","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2015.7333942"},{"key":"S0218001424500113BIB035","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.76"},{"key":"S0218001424500113BIB036","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2014.07.008"},{"key":"S0218001424500113BIB037","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.102"},{"key":"S0218001424500113BIB038","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2339814"},{"key":"S0218001424500113BIB039","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.515"},{"key":"S0218001424500113BIB040","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0793-6"},{"key":"S0218001424500113BIB041","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10593-2_34"},{"key":"S0218001424500113BIB042","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298914"},{"key":"S0218001424500113BIB043","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2646371"},{"key":"S0218001424500113BIB044","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.452"},{"key":"S0218001424500113BIB045","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.245"},{"issue":"2","key":"S0218001424500113BIB046","first-page":"3","volume":"1","author":"Yang X.","year":"2017","journal-title":"Int. J. Conf. Artif. 
Intell."}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001424500113","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,5]],"date-time":"2024-07-05T05:06:10Z","timestamp":1720155970000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001424500113"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5]]},"references-count":45,"journal-issue":{"issue":"06","published-print":{"date-parts":[[2024,5]]}},"alternative-id":["10.1142\/S0218001424500113"],"URL":"https:\/\/doi.org\/10.1142\/s0218001424500113","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5]]}}}