Follow

Submissions from 2022

PDF

On the Robustness of 3D Object Detectors, Fatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Anwer, Hisham Cholakkal, and Fahad Shahbaz Khan

PDF

Transformers in Remote Sensing: A Survey, Abdulaziz Amer Aleissaee, Amandeep Kumar, Rao Anwer, Salman Khan, Hisham Cholakkal, Gui-Song Xia, and Fahad Shahbaz Khan

Link

Suppressing Poisoning Attacks on Federated Learning for Medical Imaging, Naif Alkhunaizi, Dmitry Kamzolov, Martin Takac, and Karthik Nandakumar

Link

GARDNet: Robust Multi-view Network for Glaucoma Classification in Color Fundus Images, Ahmed Al-Mahrooqi, Dmitrii Medvedev, Rand Muhtaseb, and Mohammad Yaqub

Link

Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification, Faris Almalik, Mohammad Yaqub, and Karthik Nandakumar

Link

DRGen: Domain Generalization in Diabetic Retinopathy Classification, Mohammad Zeyad Atwany and Mohammad Yaqub

PDF

Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey, Mohammad Z. Atwany, Abdulwahab H. Sahyoun, and Mohammad Yaqub

PDF

Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification, Hussam Azzuni, Muhammad Ridzuan, Min Xu, and Mohammad Yaqub

Link

Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks, Yunqing Bao, Hang Dai, and Abdulmotaleb Elsaddik

Link

SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection, Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Link

SipMaskv2: Enhanced Fast Image and Video Instance Segmentation, Jiale Cao, Yanwei Pang, Rao Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, and Ling Shao

Link

PSTR: End-to-End One-Step Person Search With Transformers, Jiale Cao, Pang Yanwei, Rao Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, and Fahad Shahbaz Khan

Link

Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images, Sevim Cengiz, Ibrahim Hamdi, and Mohammad Yaqub

PDF

Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans, Sevim Cengiz and Mohammad Yaqub

Link

Automatic schelling points detection from meshes, Geng Chen, Hang Dai, Tao Zhou, Jianbing Shen, and Ling Shao

PDF

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving, Yi-Nan Chen, Hang Dai, and Yong Ding

PDF

Deep-precognitive diagnosis: preventing future pandemics by novel disease detection With biologically-inspired conv-fuzzy network, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Chao Cheng, Jing Zhang, Tianyang Wang, and Min Xu

Link

Towards partial supervision for generic object counting in natural scenes, Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Luc Van Gool

Link

A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video, Mariana Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shabaz Khan

Link

On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain

PDF

Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub

PDF

SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub

PDF

Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian

Link

Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images, Zhou Huang, Tian-Zhu Xiang, Huai-Xin Chen, and Hang Dai

Link

High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao

Link

Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

Link

PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian

Link

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan

PDF

Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah

Link

MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar

PDF

3D Vision with Transformers: A Survey, Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Anwer, Salman Khan, and Ming-Hsuan Yang

PDF

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan

PDF

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Nasser, Salman Khan, and Fahad Shahbaz Khan

PDF

COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan

Link

D3Former: Debiased Dual Distilled Transformer for Incremental Learning, Abdelrahman Mohamed, Rushali Grandhe, K.J. Joseph, Salman Khan, and Fahad Shahbaz Khan

Link

HMFS: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia

Link

EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography, Rand Muhtaseb and Mohammad Yaqub

Link

Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao

Link

Guidance Through Surrogate: Toward a Generic Diagnostic Attack, Muzammal Naseer, Salman Khan, Fatih Porikli, and Fahad Shahbaz Khan

PDF

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility, Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Hisham Cholakkal, Salman Khan, Luc Van Gool, and Fahad Shahbaz Khan

Link

Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao

Link

Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes

PDF

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection, Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, and Fahad Shahbaz Khan

PDF

Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

Self-supervision and Multi-task Learning: Challenges in Fine-Grained COVID-19 Multi-class Classification from Chest X-rays, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan

PDF

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

PDF

Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

Link

End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub

Link

Fusion and Orthogonal Projection for Improved Face-Voice Association, Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, and Alessio Del Bue

PDF

An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub

PDF

Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub

Link

TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction, Numan Saeed, Ikboljon Sobirov, Roba Al Majzoub, and Mohammad Yaqub

PDF

Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu

PDF

Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub

PDF

Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub

Link

Moving objects segmentation using generative adversarial modeling, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

PDF

Self-Distilled Vision Transformer for Domain Generalization, Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, and Fahad Shahbaz Khan

PDF

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan

Link

Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik

PDF

Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee

PDF

Learning Enriched Features for Fast Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao

Submissions from 2021

Link

UBnormal: New benchmark for supervised open-set video anomaly detection, Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, and Mubarak Shah

Link

Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan

PDF

Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang

PDF

DoodleFormer: Creative Sketch Drawing with Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, and Michael Felsberg

PDF

Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah

Link

From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao

Link

Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub

PDF

Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao

Link

Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao

PDF

Burst image restoration and enhancement, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan

Link

Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer

PDF

Learning to fuse asymmetric feature maps in Siamese trackers, Wencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, and Jianbing Shen

Link

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items, Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Tensor pooling-driven instance segmentation framework for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

PDF

Unsupervised anomaly instance segmentation for baggage threat recognition, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

Link

Synthesizing the Unseen for Zero-Shot Object Detection, Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman Khan, Syed Waqas Zamir, and Fahad Shahbaz Khan

PDF

Efficient CNN building blocks for encrypted data, Nayna Jain, Karthik Nandakumar, Nalini K. Ratha, Sharath U. Pankanti, and Uttam Kumar

Link

CryptInfer: enabling encrypted inference on skin lesion images for melanoma detection, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas

PDF

Deep Gaussian Processes for Few-shot Segmentation, Joakim Johnander, Johan Edstedt, Martin Danelljan, Michael Felsberg, and Fahad Shahbaz Khan

PDF

Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan

PDF

Towards open world object detection, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

Link

Dynamically Decoding Source Domain Knowledge for Domain Generalization, Cuicui Kang and Karthik Nandakumar

Link

CpT: Convolutional point transformer for 3D point cloud processing, Chaitanya Kaul, Joshua Mitton, Hang Dai, and Roderick Murray-Smith

Link

Focusnet++: Attentive aggregated transformations for efficient and accurate medical image segmentation, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

Penalizing small errors using an adaptive logarithmic loss, Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, and Suresh Manandhar

Link

The Ninth Visual Object Tracking VOT2021 Challenge Results, Fahad Shahbaz Khan

Link

Adversarially robust deepfake media detection using fused convolutional neural network predictions, Sohail Ahmed Khan, Alessandro Artusi, and Hang Dai

Link

Video transformer for deepfake detection with incremental learning, Sohail Ahmed Khan and Hang Dai

Link

Incremental Object Detection via Meta-Learning, Joseph Kj, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, and Vineeth N. Balasubramanian

Link

Understanding more about human and machine attention in deep neural networks, Qiuxia Lai, Salman Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, and Ling Shao