default search action
WACV 2023: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, Waikoloa, HI, USA, January 2-7, 2023. IEEE 2023, ISBN 978-1-6654-9346-8
- Vivek Trivedy, Longin Jan Latecki:
CNN2Graph: Building Graphs for Image Classification. 1-11 - Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel:
Token Pooling in Vision Transformers for Image Classification. 12-21 - Yuting Wang, Ricardo Guerrero, Vladimir Pavlovic:
D2F2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain Adaptation. 22-31 - Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben Baruch, Asaf Noy:
ML-Decoder: Scalable and Versatile Classification Head. 32-41 - Andres Palechor, Annesha Bhoumik, Manuel Günther:
Large-Scale Open-Set Classification Protocols for ImageNet. 42-51 - George Adaimi, David Mizrahi, Alexandre Alahi:
Composite Relationship Fields with Transformers for Scene Graph Generation. 52-64 - Yutong Bai, Angtian Wang, Adam Kortylewski, Alan L. Yuille:
CoKe: Contrastive Learning for Robust Keypoint Detection. 65-74 - Quentin Bouniot, Angélique Loesch, Amaury Habrard, Romaric Audigier:
Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient? 75-84 - Tyler LaBonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi:
Scaling Novel Object Detection with Weakly Supervised Detection Transformers. 85-96 - Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu:
Dense Prediction with Attentive Feature Aggregation. 97-106 - Chull Hwan Song, Jooyoung Yoon, Shunghyun Choi, Yannis Avrithis:
Boosting vision transformers for image retrieval. 107-117 - Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness:
Is your noise correction noisy? PLS: Robustness to label noise with two stage detection. 118-127 - Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua:
Two-level Data Augmentation for Calibrated Multi-view Detection. 128-136 - Soufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey, Eric Granger:
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained Videos. 137-146 - Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari:
LAVA:Label-efficient Visual Learning and Adaptation. 147-156 - Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay:
GEMS: Scene Expansion using Generative Models of Graphs. 157-166 - Mingjie Wang, Hao Cai, Yong Dai, Minglun Gong:
Dynamic Mixture of Counter Network for Location-Agnostic Crowd Counting. 167-177 - Teppei Kurita, Yuhi Kondo, Legong Sun, Yusuke Moriuchi:
Simultaneous Acquisition of High Quality RGB Image and Polarization Information using a Sparse Polarization Sensor. 178-188 - Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi:
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings. 189-197 - Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu:
Lossy Image Compression with Quantized Hierarchical VAEs. 198-207 - Jitesh Jain, Yuqian Zhou, Ning Yu, Humphrey Shi:
Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand. 208-217 - Pedro Figueirêdo, Avinash Paliwal, Nima Khademi Kalantari:
Frame Interpolation for Dynamic Scenes with Implicit Flow Encoding. 218-228 - Jiwan Hur, Jae Young Lee, Jaehyun Choi, Junmo Kim:
I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field Images. 229-238 - B. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra:
Burst Reflection Removal using Reflection Motion Aggregation Cues. 239-248 - Tai-Yin Chiu, Danna Gurari:
Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style Transfer. 249-258 - Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura:
Panoptic-aware Image-to-Image Translation. 259-268 - Abhishek Jha, Soroush Seifi, Tinne Tuytelaars:
SimGlim: Simplifying glimpse based active visual reconstruction. 269-278 - Lorenzo Luzi, Carlos Ortiz Marrero, Nile Wynar, Richard G. Baraniuk, Michael J. Henry:
Evaluating generative networks using Gaussian mixtures of image features. 279-288 - Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell:
More Control for Free! Image Synthesis with Semantic Diffusion Guidance. 289-299 - James F. Mullen Jr., Divya Kothandaraman, Aniket Bera, Dinesh Manocha:
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes. 300-310 - Takafumi Iwaguchi, Hiroshi Kawasaki:
Surface normal estimation from optimized and distributed light sources using DNN-based photometric stereo. 311-320 - David Hart, Michael Whitney, Bryan S. Morse:
Interpolated SelectionConv for Spherical Images and Surfaces. 321-330 - Yingnan Ma, Chenqiu Zhao, Anup Basu, Xudong Li:
RAST: Restorable Arbitrary Style Transfer via Multi-restoration. 331-340 - Cameron Gordon, Shin-Fang Chng, Lachlan E. MacDonald, Simon Lucey:
On Quantizing Implicit Neural Representations. 341-350 - Lydia Lindner, Alexander Effland, Filip Ilic, Thomas Pock, Erich Kobler:
Lightweight Video Denoising using Aggregated Shifted Window Attention. 351-360 - Junming Chen, Meirui Jiang, Qi Dou, Qifeng Chen:
Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer. 361-370 - Shahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi:
CTrGAN: Cycle Transformers GAN for Gait Transfer. 371-381 - Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha:
SALAD : Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection. 382-391 - Thomas Westfechtel, Hao-Wei Yeh, Qier Meng, Yusuke Mukuta, Tatsuya Harada:
Backprop Induced Feature Weighting for Adversarial Domain Adaptation with Iterative Label Distribution Alignment. 392-401 - Md Mahmudur Rahman, Rameswar Panda, Mohammad Arif Ul Alam:
Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous Learning. 402-411 - Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue:
Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis. 412-422 - Giulio Mattolin, Luca Zanella, Elisa Ricci, Yiming Wang:
ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing. 423-433 - Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang:
Improving Diversity with Adversarially Learned Transformations for Domain Generalization. 434-443 - Donald Shenaj, Eros Fanì, Marco Toldo, Debora Caldarola, Antonio Tavera, Umberto Michieli, Marco Ciccone, Pietro Zanuttigh, Barbara Caputo:
Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning. 444-454 - Matthew R. Keaton, Ram J. Zaveri, Gianfranco Doretto:
CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation. 455-466 - Swati Jindal, Xin Eric Wang:
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection. 467-477 - Vibashan VS, Poojan Oza, Vishal M. Patel:
Towards Online Domain Adaptive Object Detection. 478-488 - Kyusik Cho, Suhyeon Lee, Hongje Seong, Euntai Kim:
Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing. 489-498 - Fabrizio J. Piva, Daan de Geus, Gijs Dubbelman:
Empirical Generalization Study: Unsupervised Domain Adaptation vs. Domain Generalization Methods for Semantic Segmentation in the Wild. 499-508 - Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva:
Intra-Source Style Augmentation for Improved Domain Generalization. 509-519 - Jinyu Yang, Jingjing Liu, Ning Xu, Junzhou Huang:
TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation. 520-530 - Sungsu Hur, Inkyu Shin, Kwanyong Park, Sanghyun Woo, In So Kweon:
Learning Classifiers of Prototypes and Reciprocal Points for Universal Domain Adaptation. 531-540 - Michael Essich, Markus Rehmann, Cristóbal Curio:
Auxiliary Task-Guided CycleGAN for Black-Box Model Domain Adaptation. 541-550 - Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi:
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds. 551-560 - Brent Griffin:
Mobile Robot Manipulation using Pure Object Detection. 561-571 - Driton Salihu, Eckehard G. Steinbach:
SGPCR: Spherical Gaussian Point Cloud Representation and its Application to Object Registration and Retrieval. 572-581 - Min Seok Lee, Seok Woo Yang, Sung Won Han:
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation. 582-591 - Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
PointInverter: Point Cloud Reconstruction and Editing via a Generative Model with Shape Priors. 592-601 - Maximilian Pittner, Alexandru Condurache, Joel Janai:
3D-SpLineNet: 3D Traffic Line Detection using Parametric Spline Representations. 602-611 - Jinlong Li, Runsheng Xu, Jin Ma, Qin Zou, Jiaqi Ma, Hongkai Yu:
Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather. 612-622 - Dusan Malic, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof:
SAILOR: Scaling Anchors via Insights into Latent Object Representation. 623-632 - Zhimin Chen, Longlong Jing, Liang Yang, Yingwei Li, Bing Li:
Class-Level Confidence Based 3D Semi-Supervised Learning. 633-642 - Minghan Zhu, Lingting Ge, Panqu Wang, Huei Peng:
MonoEdge: Monocular 3D Object Detection Using Local Perspectives. 643-652 - Minmin Yang, Jiajing Chen, Senem Velipasalar:
Cross-Modality Feature Fusion Network for Few-Shot 3D Point Cloud Classification. 653-662 - Anas Mahmoud, Jordan S. K. Hu, Steven L. Waslander:
Dense Voxel Fusion for 3D Object Detection. 663-672 - Nagma S. Khan, Kazumine Ogura, Eric Cosatto, Masayuki Ariyoshi:
Real-time Concealed Weapon Detection on 3D Radar Images for Walk-through Screening System. 673-681 - Daeun Lee, Jinkyu Kim:
Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic Weight Average and Contextual Ground Truth Sampling. 682-691 - Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong:
Far3Det: Towards Far-Field 3D Detection. 692-701 - Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren:
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation. 702-712 - Simon Niklaus, Ping Hu, Jiawen Chen:
Splatting-based Synthesis for Video Frame Interpolation. 713-723 - Kyungmin Jo, Gyumin Shim, Sanghun Jung, Soyoung Yang, Jaegul Choo:
CG-NeRF: Conditional Generative Neural Radiance Fields for 3D-aware Image Synthesis. 724-733 - Nikola Popovic, Ritika Chakraborty, Danda Pani Paudel, Thomas Probst, Luc Van Gool:
Spatially Multi-conditional Image Generation. 734-743 - Min Woo Kim, Nam Ik Cho:
WHFL: Wavelet-Domain High Frequency Loss for Sketch-to-Image Translation. 744-754 - David Dadon, Ohad Fried, Yacov Hel-Or:
DDNeRF: Depth Distribution Neural Radiance Fields. 755-763 - Hanbit Lee, Youna Kim, Sang-Goo Lee:
Multi-scale Contrastive Learning for Complex Scene Generation. 764-774 - Pol Caselles, Eduard Ramon, Jaime García, Xavier Giró-i-Nieto, Francesc Moreno-Noguer, Gil Triginer:
SIRA: Relightable Avatars from a Single Image. 775-784 - Aditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, René Vidal:
Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene Generation. 785-794 - Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang:
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields. 795-805 - Kai-En Lin, Yen-Chen Lin, Wei-Sheng Lai, Tsung-Yi Lin, Yi-Chang Shih, Ravi Ramamoorthi:
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image. 806-815 - Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano:
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields. 816-825 - Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando De la Torre, Steven Song, Aayush Prakash, Daeil Kim:
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance. 826-836 - Inwoo Hwang, Junho Kim, Young Min Kim:
Ev-NeRF: Event Based Neural Radiance Field. 837-847 - Chaerin Kong, Dong Hyeon Jeon, Ohjoon Kwon, Nojun Kwak:
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation. 848-857 - Samia Shafique, Bailey Kong, Shu Kong, Charless C. Fowlkes:
Creating a Forensic Database of Shoeprints from Online Shoe-Tread Photos. 858-868 - Safa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell:
Can Shadows Reveal Biometric Informationƒ. 869-879 - Qiaomu Miao, Minh Hoai, Dimitris Samaras:
Patch-level Gaze Distribution Prediction for Gaze Following. 880-889 - Vikrant Nagpure, Kenji Okuma:
Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze Estimation. 890-899 - Siamul Karim Khan, Patrick Tinsley, Adam Czajka:
DeformIrisNet: An Identity-Preserving Model of Iris Texture Deformation. 900-908 - Haidong Zhu, Zhaoheng Zheng, Ram Nevatia:
Gait Recognition Using 3-D Human Body Shape Inference. 909-918 - Wes Robbins, Steven Zhou, Aman Bhatta, Chad Mello, Vítor Albiero, Kevin W. Bowyer, Terrance E. Boult:
CAST: Conditional Attribute Subsampling Toolkit for Fine-grained Evaluation. 919-929 - Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu:
Physically Plausible Animation of Human Upper Body from a Single Image. 930-939 - Manh Huynh, Gita Alaghband:
Online Adaptive Temporal Memory with Certainty Estimation for Human Trajectory Prediction. 940-949 - Igor Vozniak, Philipp Müller, Lorena Hell, Nils Lipp, Ahmed Abouelazm, Christian Müller:
Context-empowered Visual Attention Prediction in Pedestrian Scenarios. 950-960 - Akshay Agarwal, Nalini K. Ratha, Afzel Noore, Richa Singh, Mayank Vatsa:
Misclassifications of Contact Lens Iris PAD Algorithms: Is it Gender Bias or Environmental Conditions? 961-970 - André Brasil Vieira Wyzykowski, Anil K. Jain:
Synthetic Latent Fingerprint Generator. 971-980 - Andreas Specker, Mickael Cormier, Jürgen Beyerer:
UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval. 981-990 - Takahiro Toizumi, Koichi Takahashi, Masato Tsukada:
Segmentation-free Direct Iris Localization Networks. 991-1000 - Ahmed Tawfik Aboukhadra, Jameel Malik, Ahmed Elhayek, Nadia Robertini, Didier Stricker:
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision. 1001-1010 - Yuxin Tian, Shawn D. Newsam, Kofi Boakye:
Fashion Image Retrieval with Text Feedback by Additive Attention Compositional Learning. 1011-1021 - Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose:
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval. 1022-1031 - Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:
Text-Guided Object Detector for Multi-modal Video Question Answering. 1032-1042 - Srikanth Malla, Chiho Choi, Isht Dwivedi, Joon Hee Choi, Jiachen Li:
DRAMA: Joint Risk Localization and Captioning in Driving. 1043-1052 - Ryugo Morita, Zhiqiang Zhang, Man M. Ho, Jinjia Zhou:
Interactive Image Manipulation with Complex Text Instructions. 1053-1062 - Konstantin Kobs, Michael Steininger, Andreas Hotho:
InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images. 1063-1072 - Tzu-Jui Julius Wang, Jorma Laaksonen, Tomas Langer, Heikki Arponen, Tom E. Bishop:
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision. 1073-1083 - Abhishek Jha, Badri N. Patro, Luc Van Gool, Tinne Tuytelaars:
Barlow constrained optimization for Visual Question Answering. 1084-1093 - Jason Armitage, Leonardo Impett, Rico Sennrich:
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues. 1094-1103 - Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira:
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation. 1104-1113 - Jihyeon Lee, Woo-Young Kang, Eun-Sol Kim:
Dense but Efficient VideoQA for Intricate Compositional Reasoning. 1114-1123 - Ukyo Honda, Taro Watanabe, Yuji Matsumoto:
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning. 1124-1134 - Bhavin Jawade, Deen Dayal Mohan, Naji Mohamed Ali, Srirangaraj Setlur, Venu Govindaraju:
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings. 1135-1144 - Mark Hubenthal, Suren Kumar:
Image-Text Pre-Training for Logo Recognition. 1145-1154 - Sahithya Ravi, Aditya Chinchure, Leonid Sigal, Renjie Liao, Vered Shwartz:
VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge. 1155-1165 - Jonas Theiner, Ralph Ewerth:
TVCalib: Camera Calibration for Sports Field Registration in Soccer. 1166-1175 - Yue Qiu, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh:
3D Change Localization and Captioning from Dynamic Scans of Indoor Scenes. 1176-1185 - Donghao Qiao, Farhana H. Zulkernine:
Adaptive Feature Fusion for Cooperative Perception using LiDAR Point Clouds. 1186-1195 - Hanzhe Teng, Dimitrios Chatziparaschis, Xinyue Kan, Amit K. Roy-Chowdhury, Konstantinos Karydis:
Centroid Distance Keypoint Detector for Colored Point Clouds. 1196-1205 - Linh Trinh, Phuong Pham, Hoang Trinh, Nguyen Bach, Dung Nguyen, Giang Nguyen, Huy Nguyen:
PP4AV: A benchmarking Dataset for Privacy-preserving Autonomous Driving. 1206-1215 - Mohamed El Banani, Ignacio Rocco, David Novotný, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham:
Self-supervised Correspondence Estimation via Multiview Registration. 1216-1225 - Jeonghyun Kim, Kaichun Mo, Minhyuk Sung, Woontack Woo:
Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing. 1226-1235 - Chenxi Lola Deng, Enzo Tartaglione:
Compressing Explicit Voxel Grid Representations: fast NeRFs become also small. 1236-1245 - Renrui Zhang, Liuhui Wang, Ziyu Guo, Jianbo Shi:
Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis. 1246-1255 - Kazuto Nakashima, Yumi Iwashita, Ryo Kurazume:
Generative Range Imaging for Learning Scene Priors of 3D LiDAR Data. 1256-1266 - Marwane Hariat, Antoine Manzanera, David Filliat:
Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions. 1267-1276 - David Deng, Avideh Zakhor:
RSF: Optimizing Rigid Scene Flow From 3D Point Clouds Without Labels. 1277-1286 - Xingyi Li, Wenxuan Wu, Xiaoli Z. Fern, Fuxin Li:
Improving the Robustness of Point Convolution on k-Nearest Neighbor Neighborhoods with a Viewpoint-Invariant Coordinate Transform. 1287-1297 - Tunhou Zhang, Mingyuan Ma, Feng Yan, Hai Li, Yiran Chen:
: Joint Point Interaction-Dimension Search for 3D Point Cloud. 1298-1307 - Abhishek Aich, Shasha Li, Chengyu Song, M. Salman Asif, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury:
Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks. 1308-1318 - William Theisen, Daniel Gonzalez Cedre, Zachariah Carmichael, Daniel Moreira, Tim Weninger, Walter J. Scheirer:
Motif Mining: Finding and Summarizing Remixed Image Content. 1319-1328 - Håkon Hukkelås, Frank Lindseth:
DeepPrivacy2: Towards Realistic Full-Body Anonymization. 1329-1338 - Chuqiao Li, Zhiwu Huang, Danda Pani Paudel, Yabin Wang, Mohamad Shahbazi, Xiaopeng Hong, Luc Van Gool:
A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials. 1339-1349 - Radhika Dua, Seongjun Yang, Yixuan Li, Edward Choi:
Task Agnostic and Post-hoc Unseen Distribution Detection. 1350-1359 - Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le, Huy H. Nguyen, Isao Echizen:
Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently. 1360-1368 - Umur A. Ciftci, Gokturk Yuksek, Ilke Demir:
My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization. 1369-1379 - Nathan Drenkow, Max Lennon, I-Jeng Wang, Philippe Burlina:
Do Adaptive Active Attacks Pose Greater Risk Than Static Attacks? 1380-1389 - Jian Jiang, Oya Çeliktutan:
Neural Weight Search for Scalable Task Incremental Learning. 1390-1399 - Thanh Vu, Yanqi Zhou, Chunfeng Wen, Yueqi Li, Jan-Michael Frahm:
Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search. 1400-1410 - Tianhong Li, Lijie Fan, Yuan Yuan, Hao He, Yonglong Tian, Rogério Feris, Piotr Indyk, Dina Katabi:
Addressing Feature Suppression in Unsupervised Visual Representations. 1411-1420 - Hamed Behzadi Khormuji, José Oramas:
A Protocol for Evaluating Model Interpretation Methods from Visual Explanations. 1421-1429 - Håkon Hukkelås, Morten Smebye, Rudolf Mester, Frank Lindseth:
Realistic Full-Body Anonymization with Surface-Guided GANs. 1430-1440 - Tim Lebailly, Tinne Tuytelaars:
Global-Local Self-Distillation for Visual Representation Learning. 1441-1450 - Pavel Suma, Giorgos Tolias:
Large-to-small Image Resolution Asymmetry in Deep Metric Learning. 1451-1460 - Matthew Watson, Bashar Awwad Shiekh Hasan, Noura Al Moubayed:
Learning How to MIMIC: Using Model Explanations to Guide Deep Learning Training. 1461-1470 - Johannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll:
The Box Size Confidence Bias Harms Your Object Detector. 1471-1480 - Mikolaj Sacha, Dawid Rymarczyk, Lukasz Struski, Jacek Tabor, Bartosz Zielinski:
ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts. 1481-1492 - Niccolò Cavagnero, Luca Robbiano, Barbara Caputo, Giuseppe Averta:
FreeREA: Training-Free Evolution-based Architecture Search. 1493-1502 - Zhewen Yu, Christos-Savvas Bouganis:
SVD-NAS: Coupling Low-Rank Approximation and Neural Architecture Search. 1503-1512 - Tomoki Uchiyama, Naoya Sogi, Koichiro Niinuma, Kazuhiro Fukui:
Visually explaining 3D-CNN predictions for video classification with an adaptive occlusion sensitivity analysis. 1513-1522 - Jaspreet Singh, Chandan Singh, Ankur Rana:
Orthogonal Transforms For Learning Invariant Representations In Equivariant Neural Networks. 1523-1530 - Shentong Mo, Zhun Sun, Chao Li:
Representation Disentanglement in Generative Models with Contrastive Learning. 1531-1540 - Rishabh Patra, Ramya Hebbalaguppe, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig:
Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning. 1541-1549 - Mitsuhiro Mabuchi, Tetsuya Ishikawa:
Patch-based Privacy Preserving Neural Network for Vision Tasks. 1550-1559 - Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik:
PreViTS: Contrastive Pretraining with Video Tracking Supervision. 1560-1570 - Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool:
Efficient Visual Tracking with Exemplar Transformers. 1571-1581 - Martin Engilberge, Weizhe Liu, Pascal Fua:
Multi-view Tracking Using Weakly Supervised Human Motion Prediction. 1582-1592 - Jonás Serých, Jirí Matas:
Planar Object Tracking via Weighted Optical Flow. 1593-1602 - Minjung Kim, MyeongAh Cho, Sangyoun Lee:
Feature Disentanglement Learning with Switching and Aggregation for Video-based Person Re-Identification. 1603-1612 - Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi:
Body Part-Based Representation Learning for Occluded Person Re-Identification. 1613-1623 - Djebril Mekhazni, Maximilien Dufau, Christian Desrosiers, Marco Pedersoli, Eric Granger:
Camera Alignment and Weighted Contrastive Learning for Domain Adaptation in Video Person ReID. 1624-1633 - Daniel Davila, Dawei Du, Bryon Lewis, Christopher Funk, Joseph Van Pelt, Roderic Collins, Kellie Corona, Matt S. Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp:
MEVID: Multi-view Extended Videos with Identities for Video Person Re-Identification. 1634-1643 - Thomas Kreutz, Max Mühlhäuser, Alejandro Sánchez Guinea:
Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings with Multivariate Occupancy Time Series. 1644-1653 - Keivan Nalaie, Rong Zheng:
AttTrack: Online Deep Attention Transfer for Multi-object Tracking. 1654-1663 - Takanori Asanomi, Kazuya Nishimura, Ryoma Bise:
Multi-Frame Attention with Feature-Level Warping for Drone Crowd Tracking. 1664-1673 - Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan:
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video. 1674-1683 - Lucas Jaffe, Avideh Zakhor:
Gallery Filter Network for Person Search. 1684-1693 - Xiaoyu Xiang, Jon Morton, Fitsum A. Reda, Lucas D. Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan P. Allebach:
HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars. 1694-1704 - Jeya Maria Jose Valanarasu, Vishal M. Patel:
Fine-Context Shadow Detection using Shadow Removal. 1705-1714 - Haoyu Ren, Yi Fan, Stephen Huang:
Robust Real-world Image Enhancement Based on Multi-Exposure LDR Images. 1715-1723 - Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Alan C. Bovik, Hongkai Yu:
Pik-Fix: Restoring and Colorizing Old Photos. 1724-1734 - Dario Fuoli, Martin Danelljan, Radu Timofte, Luc Van Gool:
Fast Online Video Super-Resolution with Deformable Attention Pyramid. 1735-1744 - Jonghwa Yim, Minjae Kim:
Style-Guided Inference of Transformer for High-resolution Image Synthesis. 1745-1755 - Hue Nguyen, Diep Tran, Khoi Nguyen, Rang Nguyen:
PSENet: Progressive Self-Enhancement Network for Unsupervised Extreme-Light Image Enhancement. 1756-1765 - Eugene Lee, Lien-Feng Hsu, Evan Chen, Chen-Yi Lee:
Cross-Resolution Flow Propagation for Foveated Video Super-Resolution. 1766-1775 - Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless C. Fowlkes:
GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding. 1776-1786 - Jooyeol Yun, Sanghyeon Lee, Minho Park, Jaegul Choo:
iColoriT: Towards Propagating Local Hints to the Right Region in Interactive Colorization by Leveraging Vision Transformer. 1787-1796 - Charles Laroche, Andrés Almansa, Matias Tassano:
Deep Model-Based Super-Resolution with Non-uniform Blur. 1797-1808 - Mrinmoy Sen, Sai Pradyumna Chermala, Nazrinbanu Nurmohammad Nagori, Venkat Peddigari, Praful Mathur, B. H. Pawan Prasad, Moon-Hwan Jeong:
SHARDS: Efficient SHAdow Removal using Dual Stage Network for High-Resolution Images. 1809-1817 - Youngin Cho, Junsoo Lee, Soyoung Yang, Juntae Kim, Yeojeong Park, Haneol Lee, Mohammad Azam Khan, Daesik Kim, Jaegul Choo:
Guiding Users to Where to Give Color Hints for Efficient Interactive Sketch Colorization via Unsupervised Region Prioritization. 1818-1827 - Youngrae Kim, Jinsu Lim, Hoonhee Cho, Minji Lee, Dongman Lee, Kuk-Jin Yoon, Ho-Jin Choi:
Efficient Reference-based Video Super-Resolution (ERVSR): Single Reference Image Is All You Need. 1828-1837 - Stavros Tsogkas, Fengjia Zhang, Allan D. Jepson, Alex Levinshtein:
Efficient Flow-Guided Multi-frame De-fencing. 1838-1847 - Marcos V. Conde, Florin-Alexandru Vasluianu, Javier Vazquez-Corral, Radu Timofte:
Perceptual Image Enhancement for Smartphone Real-Time Applications. 1848-1858 - Ting-Wei Wu, Jia-Hong Huang, Joseph Lin, Marcel Worring:
Expert-defined Keywords Improve Interpretability of Retinal Image Captioning. 1859-1868 - Kun Han, Shanlin Sun, Xiangyi Yan, Chenyu You, Hao Tang, Junayed Naushad, Haoyu Ma, Deying Kong, Xiaohui Xie:
Diffeomorphic Image Registration with Neural Velocity Field. 1869-1879 - Ali Mirzazadeh, Florian Dubost, Maxwell Pike, Krish Maniar, Max Zuo, Christopher Lee-Messer, Daniel L. Rubin:
ATCON: Attention Consistency for Vision Models. 1880-1889 - Florian Dubost, Erin Hong, Siyi Tang, Nandita Bhaskhar, Christopher Lee-Messer, Daniel L. Rubin:
Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing. 1890-1899 - Huy H. Nguyen, Trung-Nghia Le, Junichi Yamagishi, Isao Echizen:
Analysis of Master Vein Attacks on Finger Vein Recognition Systems. 1900-1908 - Xiaofei Huang, Michael Wan, Lingfei Luan, Bethany Tunik, Sarah Ostadabbas:
Computer Vision to the Rescue: Infant Postural Symmetry Estimation from Incongruent Annotations. 1909-1917 - Kechun Liu, Beibin Li, Wenjun Wu, Caitlin J. May, Oliver Chang, Stevan Knezevich, Lisa M. Reisch, Joann G. Elmore, Linda G. Shapiro:
VSGD-Net: Virtual Staining Guided Melanocyte Detection on Histopathological Images. 1918-1927 - Bowen Song, Liyue Shen, Lei Xing:
PINER: Prior-informed Implicit Neural Representation Learning for Test-time Adaptation in Sparse-view CT Reconstruction. 1928-1937 - Junmo Cho, Seungjae Han, Eun-Seo Cho, Kijung Shin, Young-Gyu Yoon:
Robust and Efficient Alignment of Calcium Imaging Data through Simultaneous Low Rank and Sparse Decomposition. 1938-1947 - Jiyoon Shin, Jungwoo Lee:
MRI Imputation based on Fused Index- and Intensity-Registration. 1948-1957 - Joshua Peters, Léo Lebrat, Rodrigo Santa Cruz, Aaron Nicolson, Gregg Belous, Salamata Konate, Parnesh Raniga, Vincent Doré, Pierrick Bourgeat, Jurgen Mejan-Fripp, Clinton Fookes, Olivier Salvado:
DBCE : A Saliency Method for Medical Deep Learning Through Anatomically-Consistent Free-Form Deformations. 1958-1968 - Zekai Chen, Devansh Agarwal, Kshitij Aggarwal, Wiem Safta, Mariann Micsinai Balan, Kevin Brown:
Masked Image Modeling Advances 3D Medical Image Analysis. 1969-1979 - Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le:
EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification. 1980-1989 - Ella Lan:
Performer: A Novel PPG-to-ECG Reconstruction Transformer for a Digital Biomarker of Cardiovascular Disease Detection. 1990-1998 - Puria Azadi Moghadam, Sanne Van Dalen, Karina C. Martin, Jochen K. Lennerz, Stephen Yip, Hossein Farahani, Ali Bashashati:
A Morphology Focused Diffusion Probabilistic Model for Synthesis of Histopathology Images. 1999-2008 - Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara:
Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization. 2009-2018 - Lukas Mehl, Azin Jahedi, Jenny Schmalfuss, Andrés Bruhn:
M-FUSE: Multi-frame Fusion for Scene Flow Estimation. 2019-2028 - Khurram Azeem Hashmi, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal:
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection. 2029-2039 - Jasdeep Singh, Subrahmanyam Murala, G. Sankara Raju Kosuru:
Lightweight Network For Video Motion Magnification. 2040-2049 - Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri, Giovanni Bellitto, Morteza Moradi, Concetto Spampinato, Kevin McGuinness:
TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation. 2050-2059 - Rémi Marsal, Florian Chabot, Angélique Loesch, Hichem Sahbi:
BrightFlow: Brightness-Change-Aware Unsupervised Learning of Optical Flow. 2060-2069 - Tarun Kalluri, Deepak Pathak, Manmohan Chandraker, Du Tran:
FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation. 2070-2081 - Digbalay Bose, Rajat Hebbar, Krishna Somandepalli, Haoyang Zhang, Yin Cui, Kree Cole-McLaughlin, Huisheng Wang, Shrikanth Narayanan:
MovieCLIP: Visual Scene Recognition in Movies. 2082-2091 - Florian Hofherr, Lukas Koestler, Florian Bernard, Daniel Cremers:
Neural Implicit Representations for Physical Parameter Inference from a Single Video. 2092-2102 - Florian Kadner, Tobias Thomas, David Hoppe, Constantin A. Rothkopf:
Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts. 2103-2113 - Boris Chen, Amir Ziai, Rebecca S. Tucker, Yuchen Xie:
Match Cutting: Finding Cuts with Smooth Visual Transitions. 2114-2124 - David Osowiechi, Gustavo Adolfo Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers:
TTTFlow: Unsupervised Test-Time Training with Normalizing Flow. 2125-2126 - Michael Schelling, Pedro Hermosilla, Timo Ropinski:
Weakly-Supervised Optical Flow Estimation for Time-of-Flight. 2134-2143 - Chaerin Min, Tae Hyun Kim, Jongwoo Lim:
Meta-Learning for Adaptation of Deep Optical Flow Networks. 2144-2153 - Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, Yoichi Sato:
Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos. 2154-2162 - Hong Xuan, Xi Stephen Chen:
Dissecting Deep Metric Learning Losses for Image-Text Retrieval. 2163-2172 - Takayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto:
Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding Memory. 2173-2183 - Cagri Gungor, Adriana Kovashka:
Complementary Cues from Audio Help Combat Noise in Weakly-Supervised Object Detection. 2184-2193 - Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan:
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution. 2194-2204 - Chunjin Song, Yuchi Zhang, Willis Peng, Parmis Mohaghegh, Bastian Wandt, Helge Rhodin:
AudioViewer: Learning to Visualize Sounds. 2205-2215 - Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale. 2216-2225 - Zudi Lin, Erhan Bas, Kunwar Yashraj Singh, Gurumurthy Swaminathan, Rahul Bhotika:
Relaxing Contrastiveness in Multimodal Representation Learning. 2226-2235 - Arda Senocak, Junsik Kim, Tae-Hyun Oh, Dingzeyu Li, In So Kweon:
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding. 2236-2246 - Youshan Zhang, Jialu Li:
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds. 2247-2256 - Maxime Burchi, Radu Timofte:
Audio-Visual Efficient Conformer for Robust Speech Recognition. 2257-2266 - Prateksha Udhayanan, Suryateja BV, Parth Laturia, Dev Chauhan, Darshan Khandelwal, Stefano Petrangeli, Balaji Vasan Srinivasan:
Recipe2Video: Synthesizing Personalized Videos from Recipe Texts. 2267-2276 - Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade, Srirangaraj Setlur, Venu Govindaraju:
Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization. 2277-2286 - Arpit Garg, Cuong Nguyen, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro:
Instance-Dependent Noisy Label Learning via Graphical Modelling. 2287-2297 - Menelaos Kanakis, Thomas E. Huang, David Brüggemann, Fisher Yu, Luc Van Gool:
Composite Learning for Robust and Effective Dense Predictions. 2298-2307 - HyunJae Lee, Gihyeon Lee, Junhwan Kim, Sungjun Cho, Dohyun Kim, Donggeun Yoo:
Improving Multi-fidelity Optimization with a Recurring Learning Rate for Hyperparameter Tuning. 2308-2317 - Hongjun Choi, Eun Som Jeon, Ankita Shukla, Pavan K. Turaga:
Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study. 2318-2327 - Ivan Lopes, Tuan-Hung Vu, Raoul de Charette:
Cross-task Attention Mechanism for Dense Multi-task Learning. 2328-2337 - Dupati Srikar Chandra, Sakshi Varshney, P. K. Srijith, Sunil Gupta:
Continual Learning with Dependency Preserving Hypernetworks. 2338-2347 - Souvik Kundu, Sairam Sundaresan, Massoud Pedram, Peter A. Beerel:
FLOAT: Fast Learnable Once-for-All Adversarial Training for Tunable Trade-off between Accuracy and Robustness. 2348-2357 - Geethu Miriam Jacob, Vishal Agarwal, Björn Stenger:
Online Knowledge Distillation for Multi-task Learning. 2358-2367 - Timon Höfer, Benjamin Kiefer, Martin Messmer, Andreas Zell:
HyperPosePDF Hypernetworks Predicting the Probability Distribution on SO(3). 2368-2378 - Ahmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy, Adrien Courtois, Gabriele Facciolo:
Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information. 2379-2388 - Olivier Risser-Maroix, Benjamin Chamand:
What can we Learn by Predicting Accuracy? 2389-2398 - Eva Feillet, Grégoire Petit, Adrian Popescu, Marina Reyboz, Céline Hudelot:
AdvisIL - A Class-Incremental Learning Advisor. 2399-2408 - Daehyun Ahn, Hyungjun Kim, Taesu Kim, Eunhyeok Park, Jae-Joon Kim:
Searching for Robust Binary Neural Networks via Bimodal Parameter Perturbation. 2409-2418 - Ben Usman, Dina Bashkirova, Kate Saenko:
RIFT: Disentangled Unsupervised Image Translation via Restricted Information Flow. 2419-2428 - Gourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel:
Enabling ISPless Low-Power Computer Vision. 2429-2438 - Benoit Brummer, Christophe De Vleeschouwer:
On the Importance of Denoising when Learning to Compress Images. 2439-2447 - Khanh Quoc Dinh, Kwang Pyo Choi:
End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes. 2448-2457 - Nithin C. Babu, Vignesh Kannan, Rajiv Soundararajan:
No Reference Opinion Unaware Quality Assessment of Authentically Distorted Images. 2458-2467 - Marcin Sendera, Marcin Przewiezlikowski, Konrad Karanowski, Maciej Zieba, Jacek Tabor, Przemyslaw Spurek:
HyperShot: Few-Shot Learning by Kernel HyperNetworks. 2468-2477 - Rakshith Subramanyam, Mark Heimann, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan:
Contrastive Knowledge-Augmented Meta-Learning for Few-Shot Classification. 2478-2486 - Hao Ding, Changchang Sun, Hao Tang, Dawen Cai, Yan Yan:
Few-shot Medical Image Segmentation with Cycle-resemblance Attention. 2487-2496 - Nitish Mital, Ezgi Özyilkan, Ali Garjani, Deniz Gündüz:
Neural Distributed Image Compression with Cross-Attention Feature Alignment. 2497-2506 - Huanle Zhang, Hamed Pirsiavash, Xin Liu:
MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification. 2507-2516 - Hao-Wei Chen, Ting-Hsuan Liao, Hsuan-Kung Yang, Chun-Yi Lee:
Pixel-Wise Prediction based Visual Odometry via Uncertainty Estimation. 2517-2527 - Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa:
Universal Deep Image Compression via Content-Adaptive Optimization with Adapters. 2528-2537 - Dahye Kim, Jungin Park, Jiyoung Lee, Seongheon Park, Kwanghoon Sohn:
Language-free Training for Zero-shot Video Grounding. 2538-2547 - Soma Kajiyama, Taihe Piao, Ryo Kawahara, Takahiro Okabe:
Separating Partially-Polarized Diffuse and Specular Reflection Components under Unpolarized Light Sources. 2548-2557 - Alper Kayabasi, Gülin Tüfekci, Ilkay Ulusoy:
Elimination of Non-Novel Segments at Multi-Scale for Few-Shot Segmentation. 2558-2566 - Jihyun Kim, Seong-Hun Jeong, Kyeongbo Kong, Suk-Ju Kang:
An Unified Framework for Language Guided Image Completion. 2567-2577 - Abhishek Aich, Kuan-Chuan Peng, Amit K. Roy-Chowdhury:
Cross-Domain Video Anomaly Detection without Target Domain Adaptation. 2578-2590 - Marco Rudolph, Tom Wehrbein, Bodo Rosenhahn, Bastian Wandt:
Asymmetric Student-Teacher Networks for Industrial Anomaly Detection. 2591-2601 - Julia Hornauer, Vasileios Belagiannis:
Heatmap-based Out-of-Distribution Detection. 2602-2611 - Paul Bergmann, David Sattlegger:
Anomaly Detection in 3D Point Clouds using Deep Geometric Descriptors. 2612-2622 - Wonwoo Cho, Jeonghoon Park, Jaegul Choo:
Training Auxiliary Prototypical Classifiers for Explainable Anomaly Detection in Medical Image Segmentation. 2623-2632 - Hanqiu Deng, Zhaoxiang Zhang, Shihao Zou, Xingyu Li:
Bi-directional Frame Interpolation for Unsupervised Video Anomaly Detection. 2633-2642 - Samuel Wilson, Tobias Fischer, Niko Sünderhauf, Feras Dayoub:
Hyperdimensional Feature Fusion for Out-of-Distribution Detection. 2643-2653 - Keval Doshi, Yasin Yilmaz:
Towards Interpretable Video Anomaly Detection. 2654-2663 - Seongheon Park, Hanjae Kim, Minsu Kim, Dahye Kim, Kwanghoon Sohn:
Normality Guided Multiple Instance Learning for Weakly Supervised Video Anomaly Detection. 2664-2673 - Changhwa Park, Junho Yim, Eunji Jun:
Mutual Learning for Long-Tailed Recognition. 2674-2683 - Xiangyi Yan, Junayed Naushad, Shanlin Sun, Kun Han, Hao Tang, Deying Kong, Haoyu Ma, Chenyu You, Xiaohui Xie:
Representation Recovering for Self-Supervised Pre-training on Medical Images. 2684-2694 - Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang, Yu-Chiang Frank Wang:
Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond. 2695-2704 - Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault, Stéphane Canu:
Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning. 2705-2715 - Prakash Chandra Chhipa, Richa Upadhyay, Gustav Grund Pihlgren, Rajkumar Saini, Seiichi Uchida, Marcus Liwicki:
Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images. 2716-2726 - Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor:
Motion Aware Self-Supervision for Generic Event Boundary Detection. 2727-2738 - So Hasegawa, Masayuki Hiromoto, Akira Nakagawa, Yuhei Umeda:
Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning. 2739-2748 - Suhong Moon, Domas Buracas, Seunghyun Park, Jinkyu Kim, John F. Canny:
An Embedding-Dynamic Approach to Self-Supervised Learning. 2749-2757 - Samarth Sinha, Peter V. Gehler, Francesco Locatello, Bernt Schiele:
TeST: Test-time Self-Training under Distribution Shift. 2758-2768 - Wei-Chi Chen, Wei-Ta Chu:
SSSD: Self-Supervised Self Distillation. 2769-2776 - Shentong Mo, Zhun Sun, Chao Li:
Multi-level Contrastive Learning for Self-Supervised Vision Transformers. 2777-2786 - Srikrishna Jaganathan, Maximilian Kukla, Jian Wang, Karthik Shetty, Andreas K. Maier:
Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion. 2787-2797 - Salman Mohamadi, Gianfranco Doretto, Donald A. Adjeroh:
FUSSL: Fuzzy Uncertain Self Supervised Learning. 2798-2807 - Atsuyuki Miyai, Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa:
Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation. 2808-2817 - Michael Mu, Sreyasee Das Bhattacharjee, Junsong Yuan:
Self-Supervised Distilled Learning for Multi-modal Misinformation Identification. 2818-2827 - Jiho Jang, Seonhoon Kim, KiYoon Yoo, Chaerin Kong, Jangho Kim, Nojun Kwak:
Self-Distilled Self-supervised Representation Learning. 2828-2838 - Yifan Xu, Pourya Shamsolmoali, Eric Granger, Claire Nicodeme, Laurent Gardes, Jie Yang:
TransVLAD: Multi-Scale Attention-Based Global Descriptors for Visual Geo-Localization. 2839-2848 - Sungho Chun, Sungbum Park, Ju Yong Chang:
Learnable Human Mesh Triangulation for 3D Human Pose and Shape Estimation. 2849-2858 - Stefan Thalhammer, Timothy Patten, Markus Vincze:
COPE: End-to-end trainable Constant Runtime Object Pose Estimation. 2859-2869 - Christian Grund, Julian Tanke, Juergen Gall:
ElliPose: Stereoscopic 3D Human Pose Estimation by Fitting Ellipsoids. 2870-2880 - Snehal Bhayani, Torsten Sattler, Viktor Larsson, Janne Heikkilä, Zuzana Kukelova:
Partially calibrated semi-generalized pose from hybrid point correspondences. 2881-2890 - Arthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan Stanciulescu, Arnaud de La Fortelle:
ImPosing: Implicit Pose Encoding for Efficient Visual Localization. 2891-2901 - Moritz Einfalt, Katja Ludwig, Rainer Lienhart:
Uplift and Upsample: Efficient 3D Human Pose Estimation with Uplifting Transformers. 2902-2912 - Xiaohan Zhang, Waqas Sultani, Safwan Wshah:
Cross-View Image Sequence Geo-localization. 2913-2922 - Cheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao, Ke Zhang, Zhongyu Jiang, Jenq-Neng Hwang, Cheng-Hao Kuo:
CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations. 2923-2932 - Seongyeong Lee, Hansoo Park, Dong Uk Kim, Jihyeon Kim, Muhammadjon Boboev, Seungryul Baek:
Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation. 2933-2943 - Lauri Suomela, Jussi Kalliola, Atakan Dag, Harry Edelman, Joni-Kristian Kämäräinen:
Benchmarking Visual Localization for Autonomous Navigation. 2944-2954 - István Sárándi, Alexander Hermans, Bastian Leibe:
Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats. 2955-2965 - Jaehoon Ko, Kyusun Cho, Daewon Choi, Kwangrok Ryoo, Seungryong Kim:
3D GAN Inversion with Pose Optimization. 2966-2975 - Erwin Wu, Hayato Nishioka, Shinichi Furuya, Hideki Koike:
Marker-removal Networks to Collect Precise 3D Hand Data for RGB-based Estimation and its Application in Piano. 2976-2985 - Michaël Soumm, Adrian Popescu, Bertrand Delezoide:
Vis2Rec: A Large-Scale Visual Dataset for Visit Recommendation. 2986-2996 - Amar Ali-bey, Brahim Chaib-draa, Philippe Giguère:
MixVPR: Feature Mixing for Visual Place Recognition. 2997-3006 - Porter Jenkins, Kyle Armstrong, Stephen Nelson, Siddhesh Gotad, J. Stockton Jenkins, Wade Wilkey, Tanner Watts:
CountNet3D: A 3D Computer Vision Approach to Infer Counts of Occluded Objects. 3007-3016 - Zhaoshuo Li, Wei Ye, Dilin Wang, Francis X. Creighton, Russell H. Taylor, Ganesh Venkatesh, Mathias Unberath:
Temporally Consistent Online Depth Estimation in Dynamic Scenes. 3017-3026 - Kensuke Taguchi, Shogo Morita, Yusuke Hayashi, Wataru Imaeda, Hironobu Fujiyoshi:
Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth Completion. 3027-3035 - Kohei Yamashita, Yuto Enyo, Shohei Nobuhara, Ko Nishino:
nLMVS-Net: Deep Non-Lambertian Multi-View Stereo. 3036-3045 - Gustav Bredell, Ertunc Erdil, Bruno Weber, Ender Konukoglu:
Wiener Guided DIP for Unsupervised Blind Image Deconvolution. 3046-3055 - Ching-Ya Chiu, Yu-Ting Wu, I-Chao Shen, Yung-Yu Chuang:
360MVSNet: Deep Multi-view Stereo Network with 360° Images for Indoor Scene Reconstruction. 3056-3065 - Markus Plack, Clara Callenberg, Monika Schneider, Matthias B. Hullin:
Fast Differentiable Transient Rendering for Non-Line-of-Sight Reconstruction. 3066-3075 - Andra Petrovai, Sergiu Nedevschi:
MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmentation. 3076-3085 - Christian Sormann, Emanuele Santellani, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer:
DELS-MVS: Deep Epipolar Line Search for Multi-View Stereo. 3086-3095 - Antoni Rosinol, John J. Leonard, Luca Carlone:
Probabilistic Volumetric Fusion for Dense Monocular SLAM. 3096-3104 - Lu Sang, Bjoern Haefner, Xingxing Zuo, Daniel Cremers:
High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF. 3105-3114 - Chi-Han Peng, Jiayao Zhang:
High-Resolution Depth Estimation for 360° Panoramas through Perspective and Panoramic Depth Images Registration. 3115-3124 - Berk Kaya, Suryansh Kumar, Carlos Eduardo Porto de Oliveira, Vittorio Ferrari, Luc Van Gool:
Multi-View Photometric Stereo Revisited. 3125-3134 - Hari Santhanam, Nehal Doiphode, Jianbo Shi:
Automated Line Labelling: Dataset for Contour Detection and 3D Reconstruction. 3135-3144 - Mohammad Farazi, Wenhui Zhu, Zhangsihao Yang, Yalin Wang:
Anisotropic Multi-Scale Graph Convolutional Network for Dense Shape Correspondence. 3145-3154 - Stefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit:
Automatically Annotating Indoor Images with CAD Models via RGB-D Scans. 3155-3163 - Daan de Geus, Gijs Dubbelman:
Intra-Batch Supervision for Panoptic Segmentation on High-Resolution Images. 3164-3172 - David Brüggemann, Christos Sakaridis, Prune Truong, Luc Van Gool:
Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse Conditions. 3173-3183 - Kazuya Nishimura, Ryoma Bise:
Weakly Supervised Cell-Instance Segmentation with Two Types of Weak Labels by Single Instance Pasting. 3184-3193 - Dipam Goswami, René Schuster, Joost van de Weijer, Didier Stricker:
Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segmentation. 3194-3203 - Kazuki Endo, Masayuki Tanaka, Masatoshi Okutomi:
Semantic Segmentation of Degraded Images Using Layer-Wise Feature Adjustor. 3204-3212 - Matthias Rottmann, Marco Reese:
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification. 3213-3222 - Loic Themyr, Clément Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler:
Full Contextual Attention for Multi-resolution Transformers in Semantic Segmentation. 3223-3232 - Subba Reddy Oota, Vijay Rowtula, Shahid Saleem Mohammed, Minghsun Liu, Manish Gupta:
WSNet: Towards An Effective Method for Wound Image Segmentation. 3233-3242 - Bruno Sauvalle, Arnaud de La Fortelle:
Autoencoder-based background reconstruction and foreground segmentation with background noise estimation. 3243-3254 - Fengyi Shen, Zador Pataki, Akhil Gurram, Ziyuan Liu, He Wang, Alois C. Knoll:
LoopDA: Constructing Self-loops to Adapt Nighttime Semantic Segmentation. 3255-3265 - Sandra Kara, Hejer Ammar, Florian Chabot, Quoc Cuong Pham:
Image Segmentation-based Unsupervised Multiple Objects Discovery. 3276-3285 - Shubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Kumar Yogamani, Fatih Porikli:
X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation. 3286-3296 - Sumin Lee, Sangmin Woo, Yeonju Park, Muhammad Adi Nugroho, Changick Kim:
Modality Mixer for Multi-modal Action Recognition. 3297-3306 - Jeffrey Byrne, Greg Castañón, Zhongheng Li, Gil J. Ettinger:
Fine-grained Activities of People Worldwide. 3307-3318 - Dasom Ahn, Sangwon Kim, Hyunsu Hong, ByoungChul Ko:
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition. 3319-3328 - Gueter Josmy Faure, Min-Hung Chen, Shang-Hong Lai:
Holistic Interaction Transformer Network for Action Detection. 3329-3339 - Yue Qiu, Yoshiki Nagasaki, Kensho Hara, Hirokatsu Kataoka, Ryota Suzuki, Kenji Iwata, Yutaka Satoh:
VirtualHome Action Genome: A Simulated Spatio-Temporal Scene Graph Dataset with Consistent Relationship Labels. 3340-3349 - Jong-Hyeon Seon, Jaedong Hwang, Jonghwan Mun, Bohyung Han:
Stop or Forward: Dynamic Layer Skipping for Efficient Action Recognition. 3350-3359 - Dawei Du, Ameya Shringi, Anthony Hoogs, Christopher Funk:
Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition. 3360-3369 - Ketul Shah, Anshul Shah, Chun Pong Lau, Celso M. de Melo, Rama Chellappa:
Multi-View Action Recognition using Contrastive Learning. 3370-3380 - Tanay Agrawal, Michal Balazia, Philipp Müller, François Brémond:
Multimodal Vision Transformers with Forced Attention for Behavior Analysis. 3381-3391 - Min-Seok Kang, Dongoh Kang, Hansaem Kim:
Efficient Skeleton-Based Action Recognition via Joint-Mapping strategies. 3392-3401 - Samrudhdhi B. Rangrej, Kevin J. Liang, Tal Hassner, James J. Clark:
GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction. 3402-3412 - Siqi Deng, Yuanjun Xiong, Meng Wang, Wei Xia, Stefano Soatto:
Harnessing Unrecognizable Faces for Improving Face Recognition. 3413-3422 - Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong:
IFQA: Interpretable Face Quality Assessment. 3433-3442 - Felix Rosberg, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund:
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping. 3443-3452 - Sangjin Park, Dae Ha Kim, Byung Cheol Song:
Fine Gaze Redirection Learning with Gaze Hardness-aware Transformation. 3453-3462 - Frank Yu, Sid Fels, Helge Rhodin:
Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching. 3463-3472 - Philipp Terhörst, Malte Ihlefeld, Marco Huber, Naser Damer, Florian Kirchbuchner, Kiran B. Raja, Arjan Kuijper:
QMagFace: Simple and Accurate Quality-Aware Face Recognition. 3473-3483 - Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:
FaceOff: A Video-to-Video Face Swapping System. 3484-3493 - Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens:
Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss. 3494-3503 - Chirag Raman, Charlie Hewitt, Erroll Wood, Tadas Baltrusaitis:
Mesh-Tension Driven Expression-Based Wrinkles for Synthetic Faces. 3504-3514 - Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien P. C. Valentin, Roberto Cipolla, Jingjing Shen:
DigiFace-1M: 1 Million Digital Face Images for Face Recognition. 3515-3524 - Stathis Galanakis, Baris Gecer, Alexandros Lattas, Stefanos Zafeiriou:
3DMM-RF: Convolutional Radiance Fields for 3D Face Modeling. 3525-3536 - Yang Zhang, Simao Herdade, Kapil Thadani, Eric Dodds, Jack Culpepper, Yueh-Ning Ku:
Unifying Margin-Based Softmax Losses in Face Recognition. 3537-3546 - Sahng-Min Yoo, Tae-Min Choi, Jae-Woo Choi, Jong-Hwan Kim:
FastSwap: A Lightweight One-Stage Framework for Real-Time Face Swapping. 3547-3556 - Youssef Dawoud, Arij Bouazizi, Katharina Ernst, Gustavo Carneiro, Vasileios Belagiannis:
Knowing What to Label for Few Shot Microscopy Image Cell Segmentation. 3557-3566 - Zongyi Liu:
A Deep Neural Framework to Detect Individual Advertisement (Ad) from Videos. 3567-3576 - Junfei Xiao, Yutong Bai, Alan L. Yuille, Zongwei Zhou:
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification. 3577-3589 - Rohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni:
OutfitTransformer: Learning Outfit Representations for Fashion Recommendation. 3590-3598 - Puneet Mathur, Rajiv Jain, Ashutosh Mehra, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu Natarajan, Quan Hung Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu:
LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents. 3599-3609 - Qianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki:
Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation. 3610-3618 - Tom van Sonsbeek, Xiantong Zhen, Dwarikanath Mahapatra, Marcel Worring:
Probabilistic Integration of Object Level Annotations in Chest X-ray Classification. 3619-3629 - Pushpendu Ghosh, Nancy Wang, Promod Yenigalla:
D-Extract: Extracting Dimensional Attributes From Product Images. 3630-3638 - Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi:
Generative Colorization of Structured Mobile Web Pages. 3639-3648 - Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier:
The Fully Convolutional Transformer for Medical Image Segmentation. 3649-3658 - Yuzhi Shi, Mijung Kim, Yeongnam Chae:
Multi-scale Cell-based Layout Representation for Document Understanding. 3659-3668 - Axel De Nardin, Silvia Zottin, Matteo Paier, Gian Luca Foresti, Emanuela Colombi, Claudio Piciarelli:
Efficient few-shot learning for pixel-precise handwritten document layout analysis. 3669-3677 - Juan A. Rodriguez, David Vázquez, Issam H. Laradji, Marco Pedersoli, Pau Rodríguez:
OCR-VQGAN: Taming Text-within-Image Generation. 3678-3687 - Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas, Jürgen Kreyling, Gesche Blume-Werry:
Tracking Growth and Decay of Plant Roots in Minirhizotron Images. 3688-3697 - Scott Workman, Armin Hadzic, Muhammad Usman Rafique:
Handling Image and Label Resolution Mismatch in Remote Sensing. 3698-3707 - Rebbapragada V. C. Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N. Balasubramanian:
ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection. 3708-3717 - Daniel Steininger, Andreas Trondl, Gerardus Croonen, Julia Simon, Verena Widhalm:
The CropAndWeed Dataset: a Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation. 3718-3727 - Dabing Yu, Qingwu Li, Xiaolin Wang, Zhiliang Zhang, Yixi Qian, Chang Xu:
DSTrans: Dual-Stream Transformer for Hyperspectral Image Restoration. 3728-3738 - Byeolyi Han, Tae-Hyun Oh:
Learning Few-shot Segmentation from Bounding Box Annotations. 3739-3748 - Karim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang, Juergen Beyerer:
Towards Discriminative and Transferable One-Stage Few-Shot Object Detectors. 3749-3758 - Prasad P. Iyer, Saaketh Desai, Sadhvikas Addamane, Rémi Dingreville, Igal Brener:
Learning incoherent light emission steering from metasurfaces using generative models. 3759-3766 - Francesco Luzi, Aneesh Gupta, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof:
Transformers For Recognition In Overhead Imagery: A Reality Check. 3767-3776 - Leon Amadeus Varga, Martin Messmer, Nuri Benbarka, Andreas Zell:
Wavelength-aware 2D Convolutions for Hyperspectral Imaging. 3777-3786 - Maofeng Tang, Konstantinos Georgiou, Hairong Qi, Cody Champion, Marc Bosch:
Semantic Segmentation in Aerial Imagery Using Multi-level Contrastive Learning with Local Consistency. 3787-3796 - Antoine Vanderschueren, Christophe De Vleeschouwer:
Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training? 3797-3806 - Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal:
Learning Attention Propagation for Compositional Zero-Shot Learning. 3817-3826 - Trevor Ortega, Thomas Nelson, Skyler Crane, Josh Myers-Dean, Scott Wehrwein:
Computer Vision for International Border Legibility. 3827-3836 - Lukasz Struski, Tomasz Danel, Marek Smieja, Jacek Tabor, Bartosz Zielinski:
SONGs: Self-Organizing Neural Graphs. 3837-3846 - Guojun Wu, Xin Zhang, Ziming Zhang, Yanhua Li, Xun Zhou, Christopher G. Brinton, Zhenming Liu:
Learning Lightweight Neural Networks via Channel-Split Recurrent Convolution. 3847-3857 - Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly:
SPIQ: Data-Free Per-Channel Static Input Quantization. 3858-3867 - Thomas Verelst, Paul K. Rubenstein, Marcin Eichner, Tinne Tuytelaars, Maxim Berman:
Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations. 3868-3878 - Ju He, Adam Kortylewski, Alan L. Yuille:
CORL: Compositional Representation Learning for Few-Shot Classification. 3879-3888 - Jiayun Wang, Yubei Chen, Stella X. Yu, Brian Cheung, Yann LeCun:
Compact and Optimal Deep Learning with Recurrent Parameter Generators. 3889-3899 - Grégoire Petit, Adrian Popescu, Hugo Schindler, David Picard, Bertrand Delezoide:
FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning. 3900-3909 - Tobias Riedlinger, Matthias Rottmann, Marius Schubert, Hanno Gottschalk:
Gradient-Based Quantification of Epistemic Uncertainty for Deep Object Detectors. 3910-3920 - Deep Patel, P. S. Sastry:
Adaptive Sample Selection for Robust Learning under Label Noise. 3921-3931 - Yilin Ji, Daniel Kästner, Oliver Wirth, Christian Wressnegger:
Randomness is the Root of All Evil: More Reliable Evaluation of Deep Active Learning. 3932-3941 - Yue Liu, Christos Matsoukas, Fredrik Strand, Hossein Azizpour, Kevin Smith:
PatchDropout: Economizing Vision Transformers Using Patch Dropout. 3942-3951 - Bo Sun, Jason Kuen, Zhe Lin, Philippos Mordohai, Simon Chen:
PRN: Panoptic Refinement Network. 3952-3962 - Fang Chen, Gourav Datta, Souvik Kundu, Peter A. Beerel:
Self-Attentive Pooling for Efficient Deep Learning. 3963-3972 - Xiangyu Chen, Qinghao Hu, Kaidong Li, Cuncong Zhong, Guanghui Wang:
Accumulated Trivial Attention Matters in Vision Transformers on Small Datasets. 3973-3981 - Ragav Sachdeva, Andrew Zisserman:
The Change You Want to See. 3982-3991 - Xiwen Dengxiong, Yu Kong:
Ancestor Search: Generalized Open Set Recognition via Hyperbolic Side Information Learning. 3992-4001 - Chang Chen, Jiaming Zhang, Kailun Yang, Kunyu Peng, Rainer Stiefelhagen:
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers. 4002-4011 - Zhanwen Chen, Saed Rezayi, Sheng Li:
More Knowledge, Less Bias: Unbiasing Scene Graph Generation with Explicit Ontological Adjustment. 4012-4021 - Njuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji:
AFPSNet: Multi-Class Part Parsing based on Scaled Attention and Feature Fusion. 4022-4031 - Zhiwei Lin, Zengyu Yang, Yongtao Wang:
Foreground Guidance and Multi-Layer Feature Fusion for Unsupervised Object Discovery with Transformers. 4032-4042 - Shin-I Cheng, Yu-Jie Chen, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee:
Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model. 4043-4051 - Phuoc-Hieu Le, Quynh Le, Rang Nguyen, Binh-Son Hua:
Single-Image HDR Reconstruction by Multi-Exposure Generation. 4052-4061 - Michail Christos Doukas, Stylianos Ploumpis, Stefanos Zafeiriou:
Dynamic Neural Portraits. 4062-4072 - Jie An, Tao Li, Hao-Zhi Huang, Jinwen Ma, Jiebo Luo:
Is Bigger Always Better? An Empirical Study on Efficient Architectures for Style Transfer and Beyond. 4073-4083 - Aradhya Neeraj Mathur, Anish Madan, Ojaswa Sharma:
SLI-pSp: Injecting Multi-Scale Spatial Layout in pSp. 4084-4093 - Sameer Malik, Rajiv Soundararajan:
Semi-Supervised Learning for Low-light Image Restoration through Quality Assisted Pseudo-Labeling. 4094-4103 - Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan, Biplab Banerjee:
Contrastive Learning of Semantic Concepts for Open-set Cross-domain Retrieval. 4104-4113 - Sachin Chhabra, Hemanth Venkateswara, Baoxin Li:
Generative Alignment of Posterior Probabilities for Source-free Domain Adaptation. 4114-4123 - Zijian Wang, Yadan Luo, Zi Huang, Mahsa Baktashmotlagh:
FFM: Injecting Out-of-Domain Knowledge via Factorized Frequency Modification. 4124-4133 - Yifan Lu, Gurkirt Singh, Suman Saha, Luc Van Gool:
Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection. 4134-4145 - Tianle Chen, Mahsa Baktashmotlagh, Zijian Wang, Mathieu Salzmann:
Center-aware Adversarial Augmentation for Single Domain Generalization. 4146-4154 - Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano:
Self-Distillation for Unsupervised 3D Domain Adaptation. 4155-4166 - Vikash Kumar, Rohit Lal, Himanshu Patil, Anirban Chakraborty:
CoNMix for Source-free Single and Multi-target Domain Adaptation. 4167-4177 - Jitender Maurya, Keyur Ruganathbhai Ranipa, Osamu Yamaguchi, Tomoyuki Shibata, Daisuke Kobayashi:
Domain Adaptation using Self-Training with Mixup for One-Stage Object Detection. 4178-4187 - Sofia Broomé, Ernest Pokropek, Boyu Li, Hedvig Kjellström:
Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition. 4188-4198 - Aadarsh Sahoo, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das:
Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation. 4199-4208 - Gaurav Bhatt, Vineeth N. Balasubramanian:
Learning Style Subspaces for Controllable Unpaired Domain Translation. 4209-4218 - Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Tianrui Liu, Shijian Lu, Liang Pan:
TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection. 4219-4228 - Gopi Krishna Erabati, Helder Araújo:
Li3DeTr: A LiDAR based 3D Detection Transformer. 4239-4248 - Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, Xiangyang Xue:
ImpDet: Exploring Implicit Fields for 3D Object Detection. 4249-4259 - Heming Du, Xin Yu, Farookh Khadeer Hussain, Mohammad Ali Armin, Lars Petersson, Weihao Li:
Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. 4260-4269 - Xuepeng Shi, Zhixiang Chen, Tae-Kyun Kim:
Multivariate Probabilistic Monocular 3D Object Detection. 4270-4279 - Ruixin Liu, Zhihao Guan, Zejian Yuan, Ao Liu, Tong Zhou, Tang Kun, Erlong Li, Chao Zheng, Shuqi Mei:
Learning to Detect 3D Lanes by Shape Matching and Embedding. 4280-4288 - Debtanu Gupta, Shubh Maheshwari, Sai Shashank Kalakonda, Manasvi Vaidyula, Ravi Kiran Sarvadevabhatla:
DSAG: A Scalable Deep Framework for Action-Conditioned Multi-Actor Full Body Motion Synthesis. 4289-4297 - Pavel Solovev, Taras Khakhulin, Denis Korzhenkov:
Self-improving Multiplane-to-layer Images for Novel View Synthesis. 4298-4307 - Zirui An, Jingbo Yu, Runtao Liu, Chuang Wang, Qian Yu:
SketchInverter: Multi-Class Sketch-Based Image Generation via GAN Inversion. 4308-4318 - Decai Chen, Peng Zhang, Ingo Feldmann, Oliver Schreer, Peter Eisert:
Recovering Fine Details for Neural Implicit Surface Reconstruction. 4319-4328 - Verica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll:
Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation. 4329-4339 - Weijian Deng, Yumin Suh, Xiang Yu, Masoud Faraki, Liang Zheng, Manmohan Chandraker:
Split to Learn: Gradient Split for Multi-Task Human Image Analysis. 4340-4349 - Devansh Gupta, Aditya Saini, Sarthak Bhagat, Shagun Uppal, Rishi Raj Jain, Drishti Bhasin, Ponnurangam Kumaraguru, Rajiv Ratn Shah:
A Suspect Identification Framework using Contrastive Relevance Feedback. 4350-4358 - Anil Kunchala, Mélanie Bouroche, Bianca Schoen-Phelan:
Towards A Framework for Privacy-Preserving Pedestrian Analysis. 4359-4369 - Thao Minh Le, Vuong Le, Sunil Gupta, Svetha Venkatesh, Truyen Tran:
Guiding Visual Question Answering with Attention Priors. 4370-4379 - Aditay Tripathi, Anand Mishra, Anirban Chakraborty:
Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing. 4380-4389 - Kohei Uehara, Tatsuya Harada:
K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition. 4390-4398 - Zineng Tang, Jaemin Cho, Jie Lei, Mohit Bansal:
PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. 4399-4409 - Zehranaz Canfes, M. Furkan Atasoy, Alara Dirik, Pinar Yanardag:
Text and Image Guided 3D Avatar Generation and Manipulation. 4410-4420 - Yuxiao Chen, Jianbo Yuan, Long Zhao, Tianlang Chen, Rui Luo, Larry Davis, Dimitris N. Metaxas:
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching. 4421-4429 - Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar:
Watching the News: Towards VideoQA Models that can Read. 4430-4439 - Mingda Zhang, Rebecca Hwa, Adriana Kovashka:
How to Practice VQA on a Resource-limited Target Domain. 4440-4449 - Zhihong Pan, Xin Zhou, Hao Tian:
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation. 4450-4460 - Lokender Tiwari, Brojeshwar Bhowmick:
GarSim: Particle Based Neural Garment Simulator. 4461-4470 - Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar:
IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes. 4471-4480 - Yu Feng, Patrick Hansen, Paul N. Whatmough, Guoyu Lu, Yuhao Zhu:
Fast and Accurate: Video Enhancement Using Sparse Depth. 4481-4489 - Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Zhijie Shen, Yao Zhao:
Complementary Bi-directional Feature Compression for Indoor 360° Semantic Segmentation with Self-distillation. 4490-4499 - Guofeng Mei, Fabio Poiesi, Cristiano Saltori, Jian Zhang, Elisa Ricci, Nicu Sebe:
Overlap-guided Gaussian Mixture Models for Point Cloud Registration. 4500-4509 - Petros Tzathas, Petros Maragos, Anastasios Roussos:
3D Neural Sculpting (3DNS): Editing Neural Signed Distance Functions. 4510-4519 - Chengzhi Wu, Xuelei Bi, Julius Pfrommer, Alexander Cebulla, Simon Mangold, Jürgen Beyerer:
Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial Application Case on Autonomous Disassembly. 4520-4529 - Zhihao Zheng, Xiaowen Ying, Zhen Yao, Mooi Choo Chuah:
Robustness of Trajectory Prediction Models Under Map-Based Attacks. 4530-4539 - Saehyung Lee, Hyungyu Lee:
Inducing Data Amplification Using Auxiliary Datasets in Adversarial Training. 4540-4549 - Kazuya Kakizaki, Kazuto Fukuchi, Jun Sakuma:
Certified Defense for Content Based Image Retrieval. 4550-4559 - Avishag Shapira, Alon Zolfi, Luca Demetrio, Battista Biggio, Asaf Shabtai:
Phantom Sponges: Exploiting Non-Maximum Suppression to Attack Deep Object Detectors. 4560-4569 - Hanxiao Tan, Helena Kotthaus:
Explainability-Aware One Point Attack for Point Cloud Neural Networks. 4570-4579 - Xingqian Xu, Shant Navasardyan, Vahram Tadevosyan, Andranik Sargsyan, Yadong Mu, Humphrey Shi:
Image Completion with Heterogeneously Filtered Spectral Hints. 4580-4590 - Yuan Zhao, Bo Liu, Ming Ding, Baoping Liu, Tianqing Zhu, Xin Yu:
Proactive Deepfake Defence via Identity Watermarking. 4591-4600 - Lei Fan, Ying Wu:
Avoiding Lingering in Learning Active Recognition by Adversarial Disturbance. 4601-4610 - Gaurav Kumar Nayak, Ruchit Rawal, Anirban Chakraborty:
DE-CROP: Data-efficient Certified Robustness for Pretrained Classifiers. 4611-4620 - Ke Xu, Yao Xiao, Zhaoheng Zheng, Kaijie Cai, Ram Nevatia:
PatchZero: Defending against Adversarial Patch Attacks by Detecting and Zeroing the Patch. 4621-4630 - Fahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo:
CFL-Net: Image Forgery Localization Using Contrastive Learning. 4631-4640 - Rui Yang, Duc Minh Vo, Hideki Nakayama:
Indirect Adversarial Losses via an Intermediate Distribution for Training GANs. 4641-4650 - Rahul Venkatesh, Eric Wong, Zico Kolter:
Adversarial robustness in discontinuous spaces via alternating sampling & descent. 4651-4660 - Taras Rumezhak, Francisco Girbal Eiras, Philip H. S. Torr, Adel Bibi:
RANCER: Non-Axis Aligned Anisotropic Certification with Randomized Smoothing. 4661-4669 - Thanh Nguyen-Duc, Trung Le, He Zhao, Jianfei Cai, Dinh Phung:
Adversarial local distribution regularization for knowledge distillation. 4670-4679 - Baoping Liu, Bo Liu, Ming Ding, Tianqing Zhu, Xin Yu:
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection. 4680-4689 - Likun Zhang, Yahong Chen, Ang Li, Binghui Wang, Yiran Chen, Fenghua Li, Jin Cao, Ben Niu:
Interpreting Disparate Privacy-Utility Tradeoff in Adversarial Learning via Attribute Correlation. 4690-4698 - Shruti Agarwal, Liwen Hu, Evonne Ng, Trevor Darrell, Hao Li, Anna Rohrbach:
Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion. 4699-4708 - Sumedha Singla, Nihal Murali, Forough Arabshahi, Sofia Triantafyllou, Kayhan Batmanghelich:
Augmentation by Counterfactual Explanation -Fixing an Overconfident Classifier. 4709-4719 - Enis Simsar, Umut Kocasari, Ezgi Gülperi Er, Pinar Yanardag:
Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs. 4720-4729 - Hanxiao Tan:
Visualizing Global Explanations of Point Cloud DNNs. 4730-4739 - Taojiannan Yang, Linjie Yang, Xiaojie Jin, Chen Chen:
Revisiting Training-free NAS Metrics: An Efficient Training-based Method. 4740-4749 - Yunhao Ge, Zhi Xu, Yao Xiao, Gan Xin, Yunkui Pang, Laurent Itti:
Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization. 4750-4758 - Monu Verma, Priyanka Lubal, Santosh Kumar Vipparthi, Mohamed Abdel-Mottaleb:
RNAS-MER: A Refined Neural Architecture Search with Hybrid Spatiotemporal Operations for Micro-Expression Recognition. 4759-4768 - Lena Heidemann, Maureen Monnet, Karsten Roscher:
Concept Correlation and Its Effects on Concept-Based Models. 4769-4777 - Yuqiao Xian, Jinrui Yang, Fufu Yu, Jun Zhang, Xing Sun:
Graph-Based Self-Learning for Robust Person Re-identification. 4778-4787 - Fan Yang, Shigeyuki Odashima, Shoichi Masui, Shan Jiang:
Hard to Track Objects with Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching Space. 4788-4797 - Wen Guo, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer:
Back to MLP: A Simple Baseline for Human Motion Prediction. 4798-4808 - Mustansar Fiaz, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan:
SAT: Scale-Augmented Transformer for Person Search. 4809-4818 - Gozde Sahin, Laurent Itti:
HOOT: Heavy Occlusions in Object Tracking Benchmark. 4819-4828 - Adhiraj Ghosh, Kuruparan Shanmugalingam, Wen-Yan Lin:
Relation Preserving Triplet Mining for Stabilising the Triplet Loss in Re-identification Systems. 4829-4838 - Jeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-Yan Yeung:
Detection Recovery in Online Multi-Object Tracking with Sparse Graph Tracker. 4839-4848 - Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu:
MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark. 4849-4858 - Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu:
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking. 4859-4869 - Luca Piano, Filippo Gabriele Pratticò, Alessandro Sebastian Russo, Lorenzo Lanari, Lia Morra, Fabrizio Lamberti:
Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification. 4870-4880 - Furkan Kinli, Doga Yilmaz, Baris Özcan, Furkan Kiraç:
Modeling the Lighting in Scenes as Style for Auto White-Balance Correction. 4892-4902 - Mohit Lamba, M. V. A. Suhas Kumar, Kaushik Mitra:
Real-Time Restoration of Dark Stereo Images. 4903-4913 - Mehmet Kerim Yücel, Valia Dimaridou, Bruno Manganelli, Mete Ozay, Anastasios Drosou, Albert Saà-Garriga:
LRA&LDRA: Rethinking Residual Predictions for Efficient Shadow Detection and Removal. 4914-4924 - Quan H. Nguyen, William J. Beksi:
Single Image Super-Resolution via a Dual Interactive Implicit Neural Network. 4925-4934 - Akash Gupta, Sudhir Kumar Singh, Amit K. Roy-Chowdhury:
Joint Video Rolling Shutter Correction and Super-Resolution. 4935-4944 - Jinsu Yoo, Taehoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae Hyun Kim:
Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution. 4945-4954 - Bo Zhou, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka:
DSFormer: A Dual-domain Self-supervised Transformer for Accelerated Multi-contrast MRI Reconstruction. 4955-4964 - Anup Kumar Gupta, Rupesh Kumar, Lokendra Birla, Puneet Gupta:
RADIANT: Better rPPG estimation using signal embeddings and Transformer. 4965-4975 - Ajay Jaiswal, Tianlong Chen, Justin F. Rousseau, Yifan Peng, Ying Ding, Zhangyang Wang:
Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances. 4976-4985 - Georg Wölflein, In Hwa Um, David J. Harrison, Ognjen Arandjelovic:
HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial Networks. 4986-4996 - Xin Liu, Brian L. Hill, Ziheng Jiang, Shwetak N. Patel, Daniel McDuff:
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Cardiac Measurement. 4997-5006 - Alexander Hustinx, Fabio Hellmann, Ömer Sümer, Behnam Javanmardi, Elisabeth André, Peter Krawitz, Tzung-Chien Hsieh:
Improving Deep Facial Phenotyping for Ultra-rare Disorder Verification Using Model Ensembles. 5007-5017 - Lokendra Birla, Sneha Shukla, Anup Kumar Gupta, Puneet Gupta:
ALPINE: Improving Remote Heart Rate Estimation using Contrastive Learning. 5018-5027 - Yan Yang, Md. Zakir Hossain, Eric A. Stone, Shafin Rahman:
Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression Prediction. 5028-5037 - Xin Jin, Longhai Wu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, Cheul-Hee Hahm:
Enhanced Bi-directional Motion Estimation for Video Frame Interpolation. 5038-5046 - Wenjie Yin, Hang Yin, Kim Baraka, Danica Kragic, Mårten Björkman:
Dance Style Transfer with Cross-modal Transformer. 5047-5056 - Yonghu Chen, Dongchen Zhu, Wenjun Shi, Guanghui Zhang, Tianyu Zhang, Xiaolin Zhang, Jiamao Li:
MFCFlow: A Motion Feature Compensated Multi-Frame Recurrent Network for Optical Flow Estimation. 5057-5066 - Jerin Geo James, Devansh Jain, Ajit Rajwade:
GlobalFlowNet: Video Stabilization using Deep Distilled Global Motion Estimates. 5067-5076 - Stefano Savian, Pietro Morerio, Alessio Del Bue, Andrea A. Janes, Tammam Tillo:
Towards Equivariant Optical Flow Estimation with Deep Learning. 5077-5086 - Apoorva Agarwal, Rishabh Dabral, Arjun Jain, Ganesh Ramakrishnan:
Skew-Robust Human-Object Interactions in Videos. 5087-5096 - Valéry Dewil, Adrien Courtois, Mariano Rodríguez, Thibaud Ehret, Nicola Brandonisio, Denis Bujoreanu, Gabriele Facciolo, Pablo Arias:
Video joint denoising and demosaicing with recurrent CNNs. 5097-5108 - Yumeng Wang, Bo Xu, Ziwen Li, Han Huang, Cheng Lu, Yandong Guo:
Video Object Matting via Hierarchical Space-Time Semantic Guidance. 5109-5118 - Shengyu Feng, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar, Subarna Tripathi:
Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs. 5119-5128 - Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee:
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation. 5129-5138 - Huaizu Jiang, Erik G. Learned-Miller:
DCVNet: Dilated Cost Volume Networks for Fast Optical Flow. 5139-5146 - Tanvir Mahmud, Diana Marculescu:
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization. 5147-5156 - Xinchi Zhou, Dongzhan Zhou, Wanli Ouyang, Hang Zhou, Di Hu:
SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance. 5157-5166 - Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:
Audio-Visual Face Reenactment. 5167-5176 - Jielin Qiu, Franck Dernoncourt, Trung Bui, Zhaowen Wang, Ding Zhao, Hailin Jin:
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos. 5177-5187 - Xinchi Zhou, Dongzhan Zhou, Di Hu, Hang Zhou, Wanli Ouyang:
Exploiting Visual Context Semantics for Sound Source Localization. 5188-5197 - Anchit Gupta, Rudrabha Mukhopadhyay, Sindhu Balachandra, Faizan Farooq Khan, Vinay P. Namboodiri, C. V. Jawahar:
Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronization. 5198-5207 - Tianwei Ni, Kiana Ehsani, Luca Weihs, Jordi Salvador:
Towards Disturbance-Free Visual Mobile Manipulation. 5208-5220 - Darshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi:
Unsupervised Audio-Visual Lecture Segmentation. 5221-5230 - Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana, Sparsh Mittal, R. Sai Chandra Teja, Rekha Singhal:
GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks. 5231-5240 - Shih-Yun Chu, Ming-Sui Lee:
MT-DETR: Robust End-to-end Multimodal Detection with Confidence Fusion. 5241-5250 - Dan Liu, Xi Chen, Chen Ma, Xue Liu:
Hyperspherical Quantization: Toward Smaller and More Accurate Models. 5251-5261 - Gobinda Saha, Kaushik Roy:
Saliency Guided Experience Packing for Replay in Continual Learning. 5262-5272 - Shiv Ram Dubey, Satish Kumar Singh, Bidyut Baran Chaudhuri:
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNs. 5273-5282 - Matthew Dutson, Yin Li, Mohit Gupta:
Spike-Based Anytime Perception. 5283-5293 - Ze Wang, Yue Lu, Qiang Qiu:
Meta-OLE: Meta-learned Orthogonal Low-Rank Embedding. 5294-5303 - Varad Pimpalkhute, Shruti Kunde, Rekha Singhal:
GEMS: Generating Efficient Meta-Subnets. 5304-5312 - Sayan Nag, Mayukh Bhattacharyya, Anuraag Mukherjee, Rohit Kundu:
Serf: Towards better training of deep neural networks using log-Softplus ERror activation Function. 5313-5322 - Shaogang Ren, Hongliang Fei, Dingcheng Li, Ping Li:
Learning Latent Structural Relations with Message Passing Prior. 5323-5332 - Brandon Smart, Gustavo Carneiro:
Bootstrapping the Relationship Between Images and Their Clean and Noisy Labels. 5333-5343 - Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautière, Auke J. Wiggers:
Boosting neural video codecs by exploiting hierarchical redundancy. 5344-5353 - Noor Fathima Ghouse, Jens Petersen, Guillaume Sautière, Auke J. Wiggers, Reza Pourreza:
A neural video codec with spatial rate-distortion control. 5354-5363 - Sizhuo Ma, Paul Mos, Edoardo Charbon, Mohit Gupta:
Burst Vision Using Single-Photon Cameras. 5364-5374 - Xinyu Jiang, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Duoqian Miao:
Few-shot Object Detection via Improved Classification Features. 5375-5384 - Ze Huang, Li Sun, Cheng Zhao, Song Li, Songzhi Su:
EventPoint: Self-Supervised Interest Point Detection and Description for Event-based Camera. 5385-5394 - Qi Rao, Xin Yu, Shant Navasardyan, Humphrey Shi:
Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline. 5395-5404 - Zhihong Pan, Baopu Li, Dongliang He, Wenhao Wu, Errui Ding:
Effective Invertible Arbitrary Image Rescaling. 5405-5414 - Ojas Kishorkumar Shirekar, Anuj Singh, Hadi Jamali Rad:
Self-Attention Message Passing for Contrastive Few-Shot Learning. 5414-5425 - Abhinav Java, Shripad V. Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy:
One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text. 5426-5435 - Fengyuan Yang, Ruiping Wang, Xilin Chen:
Semantic Guided Latent Parts Embedding for Few-Shot Learning. 5436-5446 - Seyed Ehsan Marjani Bajestani, Giovanni Beltrame:
Event-based RGB sensing with structured light. 5447-5456 - Xinyu Li, Yanyi Zhang, Jianbo Yuan, Hanlin Lu, Yibo Zhu:
Discrete Cosin TransFormer: Image Modeling From Frequency Domain. 5457-5467 - Kihyuk Sohn, Jinsung Yoon, Chun-Liang Li, Chen-Yu Lee, Tomas Pfister:
Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types. 5468-5479 - Tomás Vojír, Jirí Matas:
Image-Consistent Detection of Road Anomalies as Unpredictable Patches. 5480-5489 - Aitor Artola, Yannis Kolodziej, Jean-Michel Morel, Thibaud Ehret:
GLAD: A Global-to-Local Anomaly Detector. 5490-5499 - Mohamed Yousef, Marcel Ackermann, Unmesh Kurup, Tom E. Bishop:
No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection. 5500-5509 - Mu Cai, Yixuan Li:
Out-of-distribution Detection via Frequency-regularized Generative Models. 5510-5519 - Jingyang Zhang, Nathan Inkawhich, Randolph Linderman, Yiran Chen, Hai Li:
Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments. 5520-5529 - Kamalakar Vijay Thakare, Yash Raghuwanshi, Debi Prosad Dogra, Heeseung Choi, Ig-Jae Kim:
DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network. 5530-5539 - Genki Osada, Tsubasa Takahashi, Budrul Ahsan, Takashi Nishide:
Out-of-Distribution Detection with Reconstruction Error and Typicality-based Penalty. 5540-5552 - Toshimichi Aota, Lloyd Teh Tzer Tong, Takayuki Okatani:
Zero-shot versus Many-shot: Unsupervised Texture Anomaly Detection. 5553-5561 - Srijan Das, Michael S. Ryoo:
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints. 5562-5572 - Haopeng Li, Qiuhong Ke, Mingming Gong, Tom Drummond:
Progressive Video Summarization via Multimodal Self-supervised Learning. 5573-5582 - Amani Almalki, Longin Jan Latecki:
Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic Radiographs. 5583-5592 - Yangsong Zhang, Subhankar Roy, Hongtao Lu, Elisa Ricci, Stéphane Lathuilière:
Cooperative Self-Training for Multi-Target Adaptive Semantic Segmentation. 5593-5602 - Sutanu Bera, Prabir Kumar Biswas:
Self Supervised Low Dose Computed Tomography Image Denoising Using Invertible Network Exploiting Inter Slice Congruence. 5603-5612 - Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke:
Self-supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation. 5613-5622 - Leonardo Tadeu Lopes, Daniel Carlos Guimarães Pedronette:
Self-Supervised Clustering based on Manifold Learning and Graph Convolutional Networks. 5623-5632 - Justin Lazarow, Kihyuk Sohn, Chen-Yu Lee, Chun-Liang Li, Zizhao Zhang, Tomas Pfister:
Unifying Distribution Alignment as a Loss for Imbalanced Semi-supervised Learning. 5633-5642 - Mustafa Taha Koçyigit, Timothy M. Hospedales, Hakan Bilen:
Accelerating Self-Supervised Learning via Efficient Training Strategies. 5643-5653 - Hao Zhang, Xin Chen, Heming Jing, Yingbin Zheng, Yuan Wu, Cheng Jin:
ETR: An Efficient Transformer for Re-ranking in Visual Place Recognition. 5654-5663 - Yintong Wang, Lili Chen, Jiamao Li, Xiaolin Zhang:
HandGCNFormer: A Novel Topology-Aware Transformer Network for 3D Hand Pose Estimation. 5664-5673 - Guowei Li, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Tianyu Zhang, Xiaolin Zhang, Jiamao Li:
SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation. 5674-5683 - Qi Feng, Kun He, He Wen, Cem Keskin, Yuting Ye:
Rethinking the Data Annotation Process for Multiview 3D Pose Estimation with Active Learning and Self-Training. 5684-5693 - Bruce R. Muller, William A. P. Smith:
Self-supervised Relative Pose with Homography Model-fitting in the Loop. 5694-5703 - Shih-Po Lee, Niraj Prakash Kini, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang:
HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar. 5704-5713 - Kyung-Min Jin, Byoung-Sung Lim, Gun-Hee Lee, Tae-Kyung Kang, Seong-Whan Lee:
Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos. 5714-5723 - Rong Wang, Wei Mao, Hongdong Li:
Interacting Hand-Object Pose Estimation via Dense Mutual Attention. 5724-5734 - Pedro Castro, Tae-Kyun Kim:
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers. 5735-5744 - Huan Liu, Zhixiang Chi, Yuanhao Yu, Yang Wang, Jun Chen, Jin Tang:
Meta-Auxiliary Learning for Future Depth Prediction in Videos. 5745-5754 - Haoyi Zhu:
X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views. 5755-5764 - Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li:
Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem. 5765-5775 - Runkai Zhao, Heng Wang, Chaoyi Zhang, Weidong Cai:
PointNeuron: 3D Neuron Reconstruction via Geometry and Topology Learning of Point Clouds. 5776-5786 - Ukcheol Shin, Kwanyong Park, Byeong-Uk Lee, Kyunghyun Lee, In So Kweon:
Self-supervised Monocular Depth Estimation from Thermal Images via Adversarial Multi-spectral Adaptation. 5787-5796 - Xingyu Chen, Thomas H. Li, Ruonan Zhang, Ge Li:
Frequency-Aware Self-Supervised Monocular Depth Estimation. 5797-5806 - Avinash Nittur Ramesh, Fabio Giovanneschi, María A. González-Huici:
SIUNet: Sparsity Invariant U-Net for Edge-Aware Depth Completion. 5807-5816 - Nitin Bansal, Pan Ji, Junsong Yuan, Yi Xu:
Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth. 5817-5828 - Andrea Pilzer, Yuxin Hou, Niki A. Loppi, Arno Solin, Juho Kannala:
Expansion of Visual Hints for Improved Generalization in Stereo Matching. 5829-5838 - Jamie Watson, Sara Vicente, Oisin Mac Aodha, Clément Godard, Gabriel J. Brostow, Michael Firman:
Heightfields for Efficient Scene Reconstruction for AR. 5839-5849 - Ashutosh Agarwal, Chetan Arora:
Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention. 5850-5859 - Andrea Conti, Matteo Poggi, Stefano Mattoccia:
Sparsity Agnostic Depth Completion. 5860-5869 - Nan Qiao, Yuyin Sun, Chong Liu, Lu Xia, Jiajia Luo, Ke Zhang, Cheng-Hao Kuo:
Human-in-the-Loop Video Semantic Segmentation Auto-Annotation. 5870-5880 - Georgy Ponimatkin, Nermin Samet, Yang Xiao, Yuming Du, Renaud Marlet, Vincent Lepetit:
A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation. 5881-5892 - Sharat Agarwal, Saket Anand, Chetan Arora:
Reducing Annotation Effort by Identifying and Labeling Contextually Diverse Classes for Semantic Segmentation Under Domain Shift. 5893-5902 - Heejo Kong, Gun-Hee Lee, Suneung Kim, Seong-Whan Lee:
Pruning-Guided Curriculum Learning for Semi-Supervised Semantic Segmentation. 5903-5912 - Minhyeok Lee, Suhwan Cho, Seunghoon Lee, Chaewon Park, Sangyoun Lee:
Unsupervised Video Object Segmentation via Prototype Memory Network. 5913-5923 - Lang Peng, Zhirong Chen, Zhangjie Fu, Pengpeng Liang, Erkang Cheng:
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs. 5924-5932 - Yating Zhou, Wenjing Li, Ge Yang:
SCTS: Instance Segmentation of Single Cells Using a Transformer-Based Semantic-Aware Model and Space-Filling Augmentation. 5933-5942 - Peri Akiva, Kristin J. Dana:
Single Stage Weakly Supervised Semantic Segmentation of Complex Scenes. 5943-5954 - Aneesh Rangnekar, Christopher Kanan, Matthew J. Hoffman:
Semantic Segmentation with Active Semi-Supervised Learning. 5955-5966 - Anurag Das, Yongqin Xian, Yang He, Zeynep Akata, Bernt Schiele:
Urban Scene Semantic Segmentation with Low-Cost Coarse Annotation. 5967-5976 - Chihiro Noguchi, Toshihiro Tanizawa:
Ego-Vehicle Action Recognition based on Semi-Supervised Contrastive Learning. 5977-5987 - Lin Sui, Chen-Lin Zhang, Lixin Gu, Feng Han:
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector. 5988-5997 - Gurkirt Singh, Vasileios Choutas, Suman Saha, Fisher Yu, Luc Van Gool:
Spatio-Temporal Action Detection Under Large Motion. 5998-6007 - Jing Yang, Jie Shen, Yiming Lin, Yordan Hristov, Maja Pantic:
FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection. 6008-6016 - Jianxiong Zhou, Ying Wu:
Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization. 6017-6026 - Anqi Zhu, Qiuhong Ke, Mingming Gong, James Bailey:
Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition. 6027-6036 - Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee:
Intention-Conditioned Long-Term Human Egocentric Action Anticipation. 6037-6046 - Tae-Kyung Kang, Gun-Hee Lee, Kyung-Min Jin, Seong-Whan Lee:
Action-aware Masking Network with Group-based Attention for Temporal Action Localization. 6047-6056 - Zeyun Zhong, David Schneider, Michael Voit, Rainer Stiefelhagen, Jürgen Beyerer:
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation. 6057-6066 - Shruti S. Phutke, Subrahmanyam Murala:
Nested Deformable Multi-head Attention for Facial Image Inpainting. 6067-6076 - Nhat Le, Khanh Nguyen, Quang D. Tran, Erman Tjiputra, Bac Le, Anh Nguyen:
Uncertainty-aware Label Distribution Learning for Facial Expression Recognition. 6077-6086 - Chen-Hao Liao, Wen-Cheng Chen, Hsuan-Tung Liu, Yi-Ren Yeh, Min-Chun Hu, Chu-Song Chen:
Domain Invariant Vision Transformer Learning for Face Anti-spoofing. 6087-6096 - Aidan Boyd, Patrick Tinsley, Kevin W. Bowyer, Adam Czajka:
CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning-Based Synthetic Face Detection. 6097-6106 - Yasheng Sun, Jiangke Lin, Hang Zhou, Zhiliang Xu, Dongliang He, Hideki Koike:
ReEnFP: Detail-Preserving Face Reconstruction by Encoding Facial Priors. 6107-6117 - Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Zafari, Moktari Mostofa, Nasser M. Nasrabadi:
A Quality Aware Sample-to-Sample Comparison for Face Recognition. 6118-6127 - Chao-Han Huck Yang, I-Te Danny Hung, Yi-Chieh Liu, Pin-Yu Chen:
Treatment Learning Causal Transformer for Noisy Image Classification. 6128-6139 - Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Tianlong Ma, Xingjiao Wu, Cheng Jin:
Modeling Stroke Mask for End-to-End Text Erasing. 6140-6148 - Boon Peng Yap, Beng Koon Ng:
Cut-Paste Consistency Learning for Semi-Supervised Lesion Segmentation. 6149-6158 - Thomas Stegmüller, Behzad Bozorgtabar, Antoine Spahr, Jean-Philippe Thiran:
ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification. 6159-6168 - Gaurav Patel, Jan P. Allebach, Qiang Qiu:
Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition. 6169-6179 - Britty Baby, Daksh Thapar, Mustafa Chasmai, Tamajit Banerjee, Kunal Dargan, Ashish Suri, Subhashis Banerjee, Chetan Arora:
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments. 6180-6190 - Moein Heidari, Amirhossein Kazerouni, Milad Soltany Kadarvish, Reza Azad, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof:
HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation. 6191-6201 - Shivasankaran V. P, Muhammad Yusuf Hassan, Mayank Singh:
LineEX: Data Extraction from Scientific Line Charts. 6202-6210 - Md Mostafijur Rahman, Radu Marculescu:
Medical Image Segmentation via Cascaded Attention Decoding. 6211-6220 - Dehua Zheng, Xiaochen Zheng, Laurence T. Yang, Yuan Gao, Chenlu Zhu, Yiheng Ruan:
MFFN: Multi-view Feature Fusion Network for Camouflaged Object Detection. 6221-6231 - Sayak Nag, Orpaz Goldstein, Amit K. Roy-Chowdhury:
Semantics Guided Contrastive Learning of Transformers for Zero-shot Temporal Activity Detection. 6232-6242 - Junshi Xia, Naoto Yokoya, Bruno Adriano, Clifford Broni-Bediako:
OpenEarthMap: A Benchmark Dataset for Global High-Resolution Land Cover Mapping. 6243-6253 - Yong Wu, Shekhor Chanda, Mehrdad Hosseinzadeh, Zhi Liu, Yang Wang:
Few-Shot Learning of Compact Models via Task-Specific Meta Distillation. 6254-6263 - Zicheng Pan, Xiaohan Yu, Miaohua Zhang, Yongsheng Gao:
SSFE-Net: Self-Supervised Feature Enhancement for Ultra-Fine-Grained Few-Shot Class Incremental Learning. 6264-6273 - Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva:
One-Shot Synthesis of Images and Segmentation Masks. 6274-6283 - Debabrata Pal, Shirsha Bose, Biplab Banerjee, Yogananda V. Jeppu:
MORGAN: Meta-Learning-based Few-Shot Open-Set Recognition via Generative Adversarial Network. 6284-6293 - Ashutosh Kulkarni, Subrahmanyam Murala:
Aerial Image Dehazing with Attentive Deformable Transformers. 6294-6303 - Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le:
Few-shot Object Counting with Similarity-Aware Feature Enhancement. 6304-6313 - He-Yen Hsieh, Ding-Jie Chen, Cheng-Wei Chang, Tyng-Luh Liu:
Aggregating Bilateral Attention for Few-Shot Instance Localization. 6314-6323 - Nathan Elias:
Deep Learning Methodology for Early Detection and Outbreak Prediction of Invasive Species Growth. 6324-6332 - Alvaro Gómez, Gregory Randall, Gabriele Facciolo, Rafael Grompone von Gioi:
Improving the Pair Selection and the Model Fusion Steps of Satellite Multi-View Stereo Pipelines. 6333-6342 - Ankit Jha, Shirsha Bose, Biplab Banerjee:
GAF-Net: Improving the Performance of Remote Sensing Image Fusion using Novel Global Self and Cross Attention Learning. 6343-6352 - Marcelo Gennari Do Nascimento, Victor Adrian Prisacariu, Roger Fawcett, Martin Langhammer:
HyperBlock Floating Point: Generalised Quantization Scheme for Gradient and Inference Computation. 6353-6362 - Minseok Seo, Hakjin Lee, Yongjin Jeon, Junghoon Seo:
Self-Pair: Synthesizing Changes from Single Source for Object Change Detection in Remote Sensing Imagery. 6363-6372 - Ke Li, Dengxin Dai, Luc Van Gool:
Jointly Learning Band Selection and Filter Array Design for Hyperspectral Imaging. 6373-6383 - Evangelos Moschos, Alisa Kugusheva, Paul Coste, Alexandre Stegner:
Computer Vision for Ocean Eddy Detection in Infrared Imagery. 6384-6393 - JunKyu Lee, Blesson Varghese, Hans Vandierendonck:
ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy. 6394-6403 - Swati Bhugra, Vinay Kaushik, Amit Gupta, Brejesh Lall, Santanu Chaudhury:
AnoLeaf: Unsupervised Leaf Disease Segmentation via Structurally Robust Generative Inpainting. 6404-6413 - Sieger Falkena, Hadi Jamali Rad, Jan van Gemert:
LAB: Learnable Activation Binarizer for Binary Neural Networks. 6414-6423 - Cuong Pham, Tuan Hoang, Thanh-Toan Do:
Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks. 6424-6432 - Saptarshi Sinha, Hiroki Ohashi:
Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition. 6433-6442 - Shwai He, Chenbo Jiang, Daize Dong, Liang Ding:
SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution. 6443-6452 - Hanyu Peng, Weiguo Pian, Mingming Sun, Ping Li:
Dynamic Re-weighting for Long-tailed Semi-supervised Learning. 6453-6463 - Hai Lan, Xihao Wang, Hao Shen, Peidong Liang, Xian Wei:
Couplformer: Rethinking Vision Transformer with Coupling Attention. 6464-6473 - Kuan-Ying Lee, Yuanyi Zhong, Yu-Xiong Wang:
Do Pre-trained Models Benefit Equally in Continual Learning? 6474-6482 - Amélie Gruel, Jean Martinet, Bernabé Linares-Barranco, Teresa Serrano-Gotarredona:
Performance comparison of DVS data spatial downscaling methods using Spiking Neural Networks. 6483-6491 - Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin:
Pushing the Efficiency Limit Using Structured Sparse Convolutions. 6492-6502 - Bo Zhao, Hakan Bilen:
Dataset Condensation with Distribution Matching. 6503-6512 - Molly O'Brien, Brett Wolfinger, Julia V. Bukowski, Mathias Unberath, Aria Pezeshk, Greg Hager:
Mapping DNN Embedding Manifolds for Network Generalization Prediction. 6513-6522 - Shreyansh Jain, Koteswar Rao Jerripothula:
Federated Learning for Commercial Image Sources. 6523-6532
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.