Submissions from 2022
Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub
End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub
Fusion and Orthogonal Projection for Improved Face-Voice Association, Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, and Alessio Del Bue
An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub
Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub
TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction, Numan Saeed, Ikboljon Sobirov, Roba Al Majzoub, and Mohammad Yaqub
Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu
TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting, Muhammad Hamza Sharif, Dmitry Demidov, Asif Hanif, Mohammad Yaqub, and Min Xu
Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub
Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub
Moving objects segmentation using generative adversarial modeling, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung
Unsupervised moving object segmentation using background subtraction and optimal adversarial noise sample search, Maryam Sultana, Arif Mahmood, and Soon Ki Jung
Self-Distilled Vision Transformer for Domain Generalization, Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, and Fahad Shahbaz Khan
Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan
Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik
An Investigation into Whitening Loss for Self-supervised Learning, Xi Weng, Lei Huang, Lei Zhao, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan
PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds, Aoran Xiao, Jiaxing Huang, Dayan Guan, Kaiwen Cui, Shijian Lu, and Ling Shao
Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection, Jin Xie, Rao Muhammad Anwer, Hisham Cholakkal, Jing Nie, Jiale Cao, Jorma Laaksonen, and Fahad Shahbaz Khan
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision, Yun Xing, Dayan Guan, Jiaxing Huang, and Shijian Lu
Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies, Muhammad Zaigham Zaheer, Jin Ha Lee, Arif Mahmood, Marcella Astrid, and Seung-Ik Lee
Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee
Learning Enriched Features for Fast Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao
Deep RGB-D Saliency Detection Without Depth, Yuan Fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, and Xiangjian He
Submissions from 2021
Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan
Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang
Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah
From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao
Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub
Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao
Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao
Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan
Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah
OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer
Learning to fuse asymmetric feature maps in Siamese trackers, Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, and Jianbing Shen
A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi
Tensor pooling-driven instance segmentation framework for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi
Unsupervised anomaly instance segmentation for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi
Synthesizing the Unseen for Zero-Shot Object Detection, Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman Khan, Syed Waqas Zamir, and Fahad Shahbaz Khan
Efficient CNN building blocks for encrypted data, Nayna Jain, Karthik Nandakumar, Nalini K. Ratha, Sharath U. Pankanti, and Uttam Kumar
CryptInfer: enabling encrypted inference on skin lesion images for melanoma detection, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar
Deep Gaussian Processes for Few-shot Segmentation, Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, and Fahad Shahbaz Khan
Towards open world object detection, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian
Dynamically Decoding Source Domain Knowledge for Domain Generalization, Cuicui Kang and Karthik Nandakumar
CpT: Convolutional point transformer for 3D point cloud processing, Chaitanya Kaul, Joshua Mitton, Hang Dai, and Roderick Murray-Smith
Focusnet++: Attentive aggregated transformations for efficient and accurate medical image segmentation, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar
Penalizing small errors using an adaptive logarithmic loss, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar
The Ninth Visual Object Tracking VOT2021 Challenge Results, Fahad Shahbaz Khan
Adversarially robust deepfake media detection using fused convolutional neural network predictions, Sohail Ahmed Khan, Alessandro Artusi, and Hang Dai
Video transformer for deepfake detection with incremental learning, Sohail Ahmed Khan and Hang Dai
Incremental Object Detection via Meta-Learning, K. J. Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian
Edge detail analysis of wear particles, Mohammad Shakeel Laghari, Ahmed Hassan, and Mubashir Noman
Understanding more about human and machine attention in deep neural networks, Qiuxia Lai, Salman Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, and Ling Shao
Anchor-free 3D single stage detector with mask-guided attention for point cloud, Jiale Li, Hang Dai, Ling Shao, and Yong Ding
From voxel to point: IoU-guided 3D object detection for point cloud with voxel-to-point decoder, Jiale Li, Hang Dai, Ling Shao, and Yong Ding
P2V-RCNN: point to voxel feature learning for 3D object detection from point clouds, Jiale Li, Yu Sun, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao
C4AV: learning cross-modal representations from transformers, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding
M3DSSD: monocular 3D single stage object detector, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding
Multi-modal Transformers Excel at Class-agnostic Object Detection, Muhammad Maaz, Hanoona Bangalath Rasheed, Salman Hameed Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Ming-Hsuan Yang
SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali
Discriminative region-based multi-label zero-shot learning, Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Mubarak Shah
On Generating Transferable Targeted Perturbations, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli
Intriguing properties of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang
On improving adversarial transferability of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Fahad Shahbaz Khan, and Fatih Porikli
Cancelable biometrics vault: A secure key-binding biometric cryptosystem based on chaffing and winnowing, Osama Ouda, Karthik Nandakumar, and Arun Ross
Preface, Bartłomiej W. Papież, Mohammad Yaqub, Jianbo Jiao, Ana I.L. Namburete, and J. Alison Noble
Orthogonal Projection Loss, Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman Khan, and Fahad Shahbaz Khan
Self-supervised video transformer, Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, and Michael Ryoo
Self-supervised predictive convolutional attentive block for anomaly detection, Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah
CyTran: Cycle-consistent transformers for non-contrast to contrast CT translation, Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, and Radu Tudor Ionescu
Exploring complementary strengths of invariant and equivariant representations for few-shot learning, Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Efficient encrypted inference on ensembles of decision trees, Kanthi Kiran Sarpatwar, Karthik Nandakumar, Nalini K. Ratha, James T. Rayfield, Karthikeyan Shanmugam, Roman Vaculin, and Sharath U. Pankanti
Continual domain incremental learning for chest X-ray classification in low-resource clinical settings, Shikhar Srivastava, Mohammad Yaqub, Karthik Nandakumar, Zongyuan Ge, and Dwarikanath Mahapatra
Background/foreground separation: guided attention based adversarial modeling (GAAM) versus robust subspace learning methods, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung
A human ear reconstruction autoencoder, Hao Sun, Nick Pears, and Hang Dai
Spatio-temporal relation modeling for few-shot action recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Bernard Ghanem
MineGAN++: Mining generative models for efficient knowledge transfer to limited data domains, Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, and Joost van de Weijer
Automatic extraction of hiatal dimensions in 3-D transperineal pelvic ultrasound recordings, Helena Williams, Laura Cattani, Dominique Van Schoubroeck, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan D'Hooge, and Jan Deprest
An Anomaly Detection System via Moving Surveillance Robots with Human Collaboration, Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Marcella Astrid, and Seung-Ik Lee
Restormer: Efficient transformer for High-resolution image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Multi-stage progressive image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao
Learning digital camera pipeline for extreme low-light imaging, Syed Waqas Zamir, Aditya Arora, Salman Khan, Fahad Shahbaz Khan, and Ling Shao
Deep learning predicts EBV status in gastric cancer based on spatial patterns of lymphocyte infiltration, Baoyi Zhang, Kevin Yao, Min Xu, Jia Wu, and Chao Cheng
Submissions from 2020
SipMask: spatial information preservation for fast image and video instance segmentation, Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao
Question-agnostic attention for visual question answering, Moshiur Farazi, Salman Khan, and Nick Barnes
Attention guided semantic relationship parsing for visual question answering, Moshiur R. Farazi, Salman Khan, and Nick Barnes
From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts, Moshiur R. Farazi, Salman Khan, and Nick Barnes
Trainable structure tensors for autonomous baggage threat detection under extreme occlusion, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi
3D IoU-Net: IoU Guided 3D object detector for point clouds, Jiale Li, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao
D2-Net: Weakly-supervised action localization via discriminative embeddings and denoised activations, Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao
Preface, Bartłomiej W. Papież, Ana I.L. Namburete, Mohammad Yaqub, and J. Alison Noble
Any-shot Object Detection, Shafin Rahman, Salman Khan, Nick Barnes, and Fahad Shahbaz Khan
Zero-shot object detection: joint recognition and localization of novel concepts, Shafin Rahman, Salman H. Khan, and Fatih Porikli
Conditional generative modeling via learning the latent space, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould
Rethinking conditional GAN training: An approach using geometrically structured latent manifolds, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould
Fixing localization errors to improve image classification, Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, and Luc Van Gool
Automatic C-plane detection in pelvic floor transperineal volumetric ultrasound, Helena Williams, Laura Cattani, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan Deprest, and Jan D’hooge
Count- and Similarity-Aware R-CNN for pedestrian detection, Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao
Mask-guided attention network and occlusion-sensitive hard example mining for occluded pedestrian detection, Jin Xie, Yanwei Pang, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Ling Shao
Learning Enriched Features for Real Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao