Follow

Submissions from 2022

Link

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

PDF

Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

Link

End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub

Link

Fusion and Orthogonal Projection for Improved Face-Voice Association, Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, and Alessio Del Bue

PDF

An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub

PDF

Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub

Link

TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction, Numan Saeed, Ikboljon Sobirov, Roba Al Majzoub, and Mohammad Yaqub

PDF

Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu

Classification of Cumin, Fennel and Carom Using Transfer Learning, Abdullah Ajaz Siddiqui, Shahab Saquib Sohail, Qazi Areeb, Wathiq Mansoor, and Md Tabrez Nafis

PDF

Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub

PDF

Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub

Link

Moving objects segmentation using generative adversarial modeling, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

Link

Unsupervised moving object segmentation using background subtraction and optimal adversarial noise sample search, Maryam Sultana, Arif Mahmood, and Soon Ki Jung

PDF

Self-Distilled Vision Transformer for Domain Generalization, Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, and Fahad Shahbaz Khan

Link

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan

Link

Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik

Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection, Jin Xie, Rao Muhammad Anwer, Hisham Cholakkal, Jing Nie, Jiale Cao, Jorma Laaksonen, and Fahad Shahbaz Khan

Link

Domain Adaptive Video Segmentation via Temporal Pseudo Supervision, Yun Xing, Dayan Guan, Jiaxing Huang, and Shijian Lu

Link

Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies, Muhammad Zaigham Zaheer, Jin Ha Lee, Arif Mahmood, Marcella Astrid, and Seung Ik Lee

Link

Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee

PDF

Learning Enriched Features for Fast Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao

Link

Deep RGB-D Saliency Detection Without Depth, Yuan Fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, and Xiangjian He

Submissions from 2021

Link

UBnormal: New benchmark for supervised open-set video anomaly detection, Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, and Mubarak Shah

Link

Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan

PDF

Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang

PDF

Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah

Link

From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao

Link

Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub

PDF

Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao

Link

Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao

Link

Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan

Link

Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer

PDF

Learning to fuse asymmetric feature maps in Siamese trackers, Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, and Jianbing Shen

Link

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items, Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Tensor pooling-driven instance segmentation framework for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Unsupervised anomaly instance segmentation for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

Link

Synthesizing the Unseen for Zero-Shot Object Detection, Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman Khan, Syed Waqas Zamir, and Fahad Shahbaz Khan

PDF

Efficient CNN building blocks for encrypted data, Nayna Jain, Karthik Nandakumar, Nalini K. Ratha, Sharath U. Pankanti, and Uttam Kumar

Link

CryptInfer: enabling encrypted inference on skin lesion images for melanoma detection, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Deep Gaussian Processes for Few-shot Segmentation, Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, and Fahad Shahbaz Khan

PDF

Towards open world object detection, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

Link

Dynamically Decoding Source Domain Knowledge for Domain Generalization, Cuicui Kang and Karthik Nandakumar

Link

CpT: Convolutional point transformer for 3D point cloud processing, Chaitanya Kaul, Joshua Mitton, Hang Dai, and Roderick Murray-Smith

Link

Focusnet++: Attentive aggregated transformations for efficient and accurate medical image segmentation, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

Penalizing small errors using an adaptive logarithmic loss, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

The Ninth Visual Object Tracking VOT2021 Challenge Results, Fahad Shahbaz Khan

Link

Adversarially robust deepfake media detection using fused convolutional neural network predictions, Sohail Ahmed Khan, Alessandro Artusi, and Hang Dai

Link

Video transformer for deepfake detection with incremental learning, Sohail Ahmed Khan and Hang Dai

Link

Incremental Object Detection via Meta-Learning, Joseph Kj, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

PDF

Edge detail analysis of wear particles, Mohammad Shakeel Laghari, Ahmed Hassan, and Mubashir Noman

Link

Understanding more about human and machine attention in deep neural networks, Qiuxia Lai, Salman Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, and Ling Shao

Link

Anchor-free 3D single stage detector with mask-guided attention for point cloud, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

Link

From voxel to point: IoU-guided 3D object detection for point cloud with voxel-to-point decoder, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

PDF

P2V-RCNN: point to voxel feature learning for 3D object detection from point clouds, Jiale Li, Yu Sun, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao

Link

C4AV: learning cross-modal representations from transformers, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

Link

M3DSSD: monocular 3D single stage object detector, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

PDF

Multi-modal Transformers Excel at Class-agnostic Object Detection, Muhammad Maaz, Hanoona Bangalath Rasheed, Salman Hameed Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Ming-Hsuan Yang

Link

SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali

PDF

Discriminative region-based multi-label zero-shot learning, Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Mubarak Shah

PDF

On Generating Transferable Targeted Perturbations, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli

Link

Intriguing properties of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

On improving adversarial transferability of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Fahad Shahbaz Khan, and Fatih Porikli

Link

Cancelable biometrics vault: A secure key-binding biometric cryptosystem based on chaffing and winnowing, Osama Ouda, Karthik Nandakumar, and Arun Ross

Link

Preface, Bartłomiej W. Papież, Mohammad Yaqub, Jianbo Jiao, Ana I.L. Namburete, and J. Alison Noble

PDF

Orthogonal Projection Loss, Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman Khan, and Fahad Shahbaz Khan

Link

Self-supervised video transformer, Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, and Michael Ryoo

Link

Self-supervised predictive convolutional attentive block for anomaly detection, Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Link

CyTran: Cycle-consistent transformers for non-contrast to contrast CT translation, Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, and Radu Tudor Ionescu

PDF

Exploring complementary strengths of invariant and equivariant representations for few-shot learning, Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Efficient encrypted inference on ensembles of decision trees, Kanthi Kiran Sarpatwar, Karthik Nandakumar, Nalini K. Ratha, James T. Rayfield, Karthikeyan Shanmugam, Roman Vaculin, and Sharath U. Pankanti

Link

Continual domain incremental learning for chest X-ray classification in low-resource clinical settings, Shikhar Srivastava, Mohammad Yaqub, Karthik Nandakumar, Zongyuan Ge, and Dwarikanath Mahapatra

Link

Background/foreground separation: guided attention based adversarial modeling (GAAM) versus robust subspace learning methods, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

Link

A human ear reconstruction autoencoder, Hao Sun, Nick Pears, and Hang Dai

PDF

Spatio-temporal relation modeling for few-shot action recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Bernard Ghanem

Link

MineGAN++: Mining generative models for efficient knowledge transfer to limited data domains, Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, and Joost Van de Weijer

Link

Automatic extraction of hiatal dimensions in 3-D transperineal pelvic ultrasound recordings, Helena Williams, Laura Cattani, Dominique Van Schoubroeck, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan D'Hooge, and Jan Deprest

Link

An Anomaly Detection System via Moving Surveillance Robots with Human Collaboration, Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Marcella Astrid, and Seung-Ik Lee

PDF

Restormer: Efficient transformer for High-resolution image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

Multi-stage progressive image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming Hsuan Yang, and Ling Shao

Link

Learning digital camera pipeline for extreme low-light imaging, Syed Waqas Zamir, Aditya Arora, Salman Khan, Fahad Shahbaz Khan, and Ling Shao

PDF

Deep learning predicts EBV status in gastric cancer based on spatial patterns of lymphocyte infiltration, Baoyi Zhang, Kevin Yao, Min Xu, Jia Wu, and Chao Cheng

Submissions from 2020

Link

SipMask: spatial information preservation for fast image and video instance segmentation, Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao

Link

Question-agnostic attention for visual question answering, Moshiur Farazi, Salman Khan, and Nick Barnes

Link

Attention guided semantic relationship parsing for visual question answering, Moshiur R. Farazi, Salman Khan, and Nick Barnes

Link

From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts, Moshiur R. Farazi, Salman Khan, and Nick Barnes

PDF

Trainable structure tensors for autonomous baggage threat detection under extreme occlusion, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

Link

3D IoU-Net: IoU Guided 3D object detector for point clouds, Jiale Li, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao

Link

D2-Net: Weakly-supervised action localization via discriminative embeddings and denoised activations, Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Minghsuan Yang, and Ling Shao

Link

Preface, Bartłomiej W. Papież, Ana I.L. Namburete, Mohammad Yaqub, and J. Alison Noble

PDF

Any-shot Object Detection, Shafin Rahman, Salman Khan, Nick Barnes, and Fahad Shahbaz Khan

Link

Zero-shot object detection: joint recognition and localization of novel concepts, Shafin Rahman, Salman H. Khan, and Fatih Porikli

Link

Conditional generative modeling via learning the latent space, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould

Link

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould

Link

Fixing localization errors to improve image classification, Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, and Luc Van Gool

Link

Automatic C-plane detection in pelvic floor transperineal lolumetric ultrasound, Helena Williams, Laura Cattani, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan Deprest, and Jan D’hooge

Link

Count- and Similarity-Aware R-CNN for pedestrian detection, Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao

Link

Mask-guided attention network and occlusion-sensitive hard example mining for occluded pedestrian detection, Jin Xie, Yanwei Pang, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Ling Shao