Follow

Submissions from 2022

PDF

Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey, Mohammad Z. Atwany, Abdulwahab H. Sahyoun, and Mohammad Yaqub

PDF

Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification, Hussam Azzuni, Muhammad Ridzuan, Min Xu, and Mohammad Yaqub

Link

SipMaskv2: Enhanced Fast Image and Video Instance Segmentation, Jiale Cao, Yanwei Pang, Rao Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, and Ling Shao

Link

PSTR: End-to-End One-Step Person Search With Transformers, Jiale Cao, Pang Yanwei, Rao Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, and Fahad Shahbaz Khan

PDF

Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans, Sevim Cengiz and Mohammad Yaqub

Link

Automatic schelling points detection from meshes, Geng Chen, Hang Dai, Tao Zhou, Jianbing Shen, and Ling Shao

PDF

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving, Yi-Nan Chen, Hang Dai, and Yong Ding

PDF

Deep-precognitive diagnosis: preventing future pandemics by novel disease detection With biologically-inspired conv-fuzzy network, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Chao Cheng, Jing Zhang, Tianyang Wang, and Min Xu

Link

Towards partial supervision for generic object counting in natural scenes, Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Luc Van Gool

PDF

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shabaz Khan

Link

On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain

PDF

Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub

PDF

SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub

PDF

Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian

Link

Scribble-based Boundary-aware Network for Weakly Supervised Salient Object Detection in Remote Sensing Images, Zhou Huang, Xiang Tianzhu, Huaixin Chen, and Hang Dai

Link

High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao

Link

Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

Link

PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian

PDF

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan

PDF

Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah

Link

MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar

PDF

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan

PDF

COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan

Link

HMFS: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia

Link

Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao

Link

Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao

Link

Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes

PDF

Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan

PDF

Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

PDF

An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub

PDF

Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub

PDF

Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu

PDF

Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub

PDF

Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub

PDF

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan

Link

Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik

PDF

Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee

Submissions from 2021

Link

UBnormal: New benchmark for supervised open-set video anomaly detection, Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, and Mubarak Shah

Link

Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan

PDF

Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang

PDF

DoodleFormer: Creative Sketch Drawing with Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, and Michael Felsberg

PDF

Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah

Link

From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao

Link

Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub

PDF

Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao

Link

Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao

PDF

Burst image restoration and enhancement, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan

Link

Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer

PDF

Learning to fuse asymmetric feature maps in Siamese trackers, Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, and Jianbing Shen

Link

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items, Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Tensor pooling-driven instance segmentation framework for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Unsupervised anomaly instance segmentation for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

Link

Synthesizing the Unseen for Zero-Shot Object Detection, Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman Khan, Syed Waqas Zamir, and Fahad Shahbaz Khan

PDF

Efficient CNN building blocks for encrypted data, Nayna Jain, Karthik Nandakumar, Nalini K. Ratha, Sharath U. Pankanti, and Uttam Kumar

Link

CryptInfer: enabling encrypted inference on skin lesion images for melanoma detection, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas

PDF

Deep Gaussian Processes for Few-shot Segmentation, Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, and Fahad Shahbaz Khan

PDF

Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan

PDF

Towards open world object detection, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

Link

Dynamically Decoding Source Domain Knowledge for Domain Generalization, Cuicui Kang and Karthik Nandakumar

Link

CpT: Convolutional point transformer for 3D point cloud processing, Chaitanya Kaul, Joshua Mitton, Hang Dai, and Roderick Murray-Smith

Link

Focusnet++: Attentive aggregated transformations for efficient and accurate medical image segmentation, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

Penalizing small errors using an adaptive logarithmic loss, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

The Ninth Visual Object Tracking VOT2021 Challenge Results, Fahad Shahbaz Khan

Link

Adversarially robust deepfake media detection using fused convolutional neural network predictions, Sohail Ahmed Khan, Alessandro Artusi, and Hang Dai

Link

Video transformer for deepfake detection with incremental learning, Sohail Ahmed Khan and Hang Dai

Link

Incremental Object Detection via Meta-Learning, Joseph Kj, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

Link

Understanding more about human and machine attention in deep neural networks, Qiuxia Lai, Salman Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, and Ling Shao

Link

Anchor-free 3D single stage detector with mask-guided attention for point cloud, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

Link

From voxel to point: IoU-guided 3D object detection for point cloud with voxel-to-point decoder, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

PDF

P2V-RCNN: point to voxel feature learning for 3D object detection from point clouds, Jiale Li, Yu Sun, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao

Link

C4AV: learning cross-modal representations from transformers, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

Link

M3DSSD: monocular 3D single stage object detector, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

PDF

Multi-modal Transformers Excel at Class-agnostic Object Detection, Muhammad Maaz, Hanoona Bangalath Rasheed, Salman Hameed Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Ming-Hsuan Yang

PDF

Context-conditional adaptation for recognizing unseen classes in unseen domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan

Link

SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali

PDF

Discriminative region-based multi-label zero-shot learning, Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Mubarak Shah

PDF

On Generating Transferable Targeted Perturbations, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli

Link

Intriguing properties of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

On improving adversarial transferability of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Fahad Shahbaz Khan, and Fatih Porikli

Link

Cancelable biometrics vault: A secure key-binding biometric cryptosystem based on chaffing and winnowing, Osama Ouda, Karthik Nandakumar, and Arun Ross

Link

Preface, Bartłomiej W. Papież, Mohammad Yaqub, Jianbo Jiao, Ana I.L. Namburete, and J. Alison Noble

PDF

Orthogonal Projection Loss, Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman Khan, and Fahad Shahbaz Khan

Link

Self-supervised video transformer, Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, and Michael Ryoo

Link

Self-supervised predictive convolutional attentive block for anomaly detection, Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Link

CyTran: Cycle-consistent transformers for non-contrast to contrast CT translation, Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, and Radu Tudor Ionescu

PDF

Exploring complementary strengths of invariant and equivariant representations for few-shot learning, Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Efficient encrypted inference on ensembles of decision trees, Kanthi Kiran Sarpatwar, Karthik Nandakumar, Nalini K. Ratha, James T. Rayfield, Karthikeyan Shanmugam, Roman Vaculin, and Sharath U. Pankanti

Link

Continual domain incremental learning for chest X-ray classification in low-resource clinical settings, Shikhar Srivastava, Mohammad Yaqub, Karthik Nandakumar, Zongyuan Ge, and Dwarikanath Mahapatra

Link

Background/foreground separation: guided attention based adversarial modeling (GAAM) versus robust subspace learning methods, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

Link

A human ear reconstruction autoencoder, Hao Sun, Nick Pears, and Hang Dai

PDF

Spatio-temporal relation modeling for few-shot action recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Bernard Ghanem

Link

MineGAN++: Mining generative models for efficient knowledge transfer to limited data domains, Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, and Joost Van de Weijer

Link

Automatic extraction of hiatal dimensions in 3-D transperineal pelvic ultrasound recordings, Helena Williams, Laura Cattani, Dominique Van Schoubroeck, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan D'Hooge, and Jan Deprest