Follow

Submissions from 2022

Link

Underactuated digital twin's robotic hands with tactile sensing capabilities for well-being, Mohd Faisal, Roberto Alejandro Martinez Velazquez, Fedwa Laamarti, and Abdulmotaleb El Saddik

Link

Artificial intelligence models in digital twins for health and well-being, Rahatara Ferdousi, Fedwa Laamarti, and Abdulmotaleb El Saddik

Link

RailTwin: A Digital Twin Framework For Railway, Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb Elsaddik

Link

Non-invasive Anemia Detection from Conjunctival Images, Rahatara Ferdousi, Nabila Mabruba, Fedwa Laamarti, Abdulmotaleb El Saddik, and Chunsheng Yang

PDF

PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Anwer, and Fahad Shahbaz Khan

Link

Video Object Segmentation Based on Guided Feature Transfer Learning, Mustansar Fiaz, Arif Mahmood, Sehar Shahzad Farooq, Kamran Ali, Muhammad Shaheryar, and Soon Ki Jung

PDF

CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection, Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Anwer, and Hisham Cholakkal

PDF

How to Train Vision Transformer on Small-scale Datasets?, Hanan Gani, Muzammal Naseer, and Mohammad Yaqub

Link

A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video, Mariana Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shabaz Khan

Link

On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain

PDF

Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub

PDF

SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub

PDF

Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian

Link

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection, Yu Hong, Hang Dai, and Yong Ding

Link

Scribble-Supervised Video Object Segmentation, Peiliang Huang, Junwei Han, Nian Liu, Jun Ren, and Dingwen Zhang

Link

Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images, Zhou Huang, Tian-Zhu Xiang, Huai-Xin Chen, and Hang Dai

PDF

An Attention-Based ResNet Architecture for Acute Hemorrhage Detection and Classification: Toward a Health 4.0 Digital Twin Study, Aftab Hussain, Muhammad Usman Yaseen, Muhammad Imran, Muhammad Waqar, Adnan Akhunzada, Mohammad Al-Ja'afreh, and Abdulmotaleb El Saddik

Link

UAV-based Multi-scale Features Fusion Attention for Fire Detection in Smart City Ecosystems, Tanveer Hussain, Hang Dai, Wail Gueaieb, Marco Sicklinger, and Giulia De Masi

Link

High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao

PDF

Face Pyramid Vision Transformer, Khawar Islam, Muhammad Zaigham Zaheer, and Arif Mahmood

Link

Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

Link

PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas

Link

Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan

PDF

Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian

PDF

Eureka: EUphemism Recognition Enhanced Through KNN-based Methods and Augmentation, Sedrick Scott Keh, Rohit Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, and Roberto Navigli

Object Detection in Aerial Images: A Case Study on Performance Improvement, Adnan Khan, Muhammad Uzair Khattak, and Khaled Dawoud

Link

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan

PDF

Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah

PDF

MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar

PDF

Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features, Gokul Karthik Kumar and Karthik Nandakumar

PDF

3D Vision with Transformers: A Survey, Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Anwer, Salman Khan, and Ming-Hsuan Yang

Link

Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving, Jiale Li, Hang Dai, and Yong Ding

Link

Disentangled Capsule Routing for Fast Part-Object Relational Saliency, Yi Liu, Dingwen Zhang, Nian Liu, Shoukun Xu, and Jungong Han

PDF

Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation, Yuanwei Liu, Nian Liu, Xiwen Yao, and Junwei Han

Link

Multi-View Brain Network Analysis with Cross-View Missing Network Generation, Gongxu Luo, Chenyang Li, Hejie Cui, Lichao Sun, Lifang He, and Carl Yang

Link

Class-Agnostic Object Detection with Multi-modal Transformer, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Anwer, and Ming Hsuan Yang

PDF

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan

PDF

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Nasser, Salman Khan, and Fahad Shahbaz Khan

PDF

COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan

Link

COVIDMe: a digital twin for COVID-19 self-assessment and detection, Roberto Martinez-Velazquez, Fernando Ceballos, Alejandro Sanchez, Abdulmotaleb El Saddik, and Emil Petriu

Link

Adaptive Feature Consolidation Network for Burst Super-Resolution, Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan

Link

D3Former: Debiased Dual Distilled Transformer for Incremental Learning, Abdelrahman Mohamed, Rushali Grandhe, K.J. Joseph, Salman Khan, and Fahad Shahbaz Khan

Link

HM: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia

Link

EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography, Rand Muhtaseb and Mohammad Yaqub

PDF

Towards Improving Calibration in Object Detection Under Domain Shift, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali

Link

Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao

Link

Stylized Adversarial Defense, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli

Link

Guidance Through Surrogate: Toward a Generic Diagnostic Attack, Muzammal Naseer, Salman Khan, Fatih Porikli, and Fahad Shahbaz Khan

PDF

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility, Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Hisham Cholakkal, Salman Khan, Luc Van Gool, and Fahad Shahbaz Khan

PDF

Camelira: An Arabic Multi-Dialect Morphological Disambiguator, Ossama Obeid, Go Inoue, and Nizar Habash

Link

Multi‐frame based adversarial learning approach for video surveillance, Prashant W. Patil, Akshay Dudhane, Sachin Chaudhary, and Subrahmanyam Murala

Link

Untrained Neural Network Priors for Inverse Imaging Problems: A Survey, Adnan Qayyum, Inaam Ilahi, Fahad Shamshad, Farid Boussaid, Mohammed Bennamoun, and Junaid Qadir

Link

Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao

Link

Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes

PDF

A Robust Normalizing Flow using Bernstein-type Polynomials, Sameera Ramasinghe, Kasun Fernando, Salman Khan, and Nick Barnes

PDF

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection, Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, and Fahad Shahbaz Khan

PDF

Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

Self-supervision and Multi-task Learning: Challenges in Fine-Grained COVID-19 Multi-class Classification from Chest X-rays, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan

Link

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

PDF

Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

Link

End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub

Link

Fusion and Orthogonal Projection for Improved Face-Voice Association, Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, and Alessio Del Bue

PDF

An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub

PDF

Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub

Link

TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction, Numan Saeed, Ikboljon Sobirov, Roba Al Majzoub, and Mohammad Yaqub

PDF

Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu

PDF

TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting, Muhammad Hamza Sharif, Dmitry Demidov, Asif Hanif, Mohammad Yaqub, and Min Xu

PDF

Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub

PDF

Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub

Link

Moving objects segmentation using generative adversarial modeling, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

Link

Unsupervised moving object segmentation using background subtraction and optimal adversarial noise sample search, Maryam Sultana, Arif Mahmood, and Soon Ki Jung

PDF

Self-Distilled Vision Transformer for Domain Generalization, Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, and Fahad Shahbaz Khan

Link

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan

Link

Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik

PDF

An Investigation into Whitening Loss for Self-supervised Learning, Xi Weng, Lei Huang, Lei Zhao, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan

PDF

PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds, Aoran Xiao, Jiaxing Huang, Dayan Guan, Kaiwen Cui, Shijian Lu, and Ling Shao

Link

Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection, Jin Xie, Rao Muhammad Anwer, Hisham Cholakkal, Jing Nie, Jiale Cao, Jorma Laaksonen, and Fahad Shahbaz Khan

Link

Domain Adaptive Video Segmentation via Temporal Pseudo Supervision, Yun Xing, Dayan Guan, Jiaxing Huang, and Shijian Lu

Link

Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies, Muhammad Zaigham Zaheer, Jin Ha Lee, Arif Mahmood, Marcella Astrid, and Seung Ik Lee

Link

Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee

PDF

Learning Enriched Features for Fast Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao

Link

Deep RGB-D Saliency Detection Without Depth, Yuan Fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, and Xiangjian He

Submissions from 2021

Link

Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan

PDF

Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang

PDF

Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah

Link

From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao

Link

Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub

PDF

Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao

Link

Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao

Link

Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan

4G-VOS: Video Object Segmentation using guided context embedding, Mustansar Fiaz, Muhammad Zaigham Zaheer, Arif Mahmood, Seung Ik Lee, and Soon Ki Jung

Link

Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

Renewable energy management system for Saudi Arabia: Methodology and preliminary results, Imen Gherboudj, Mohamed Zorgati, Phani Kumar Chamarthi, Arttu Tuomiranta, Baraa Mohandes, Naseema S. Beegum, Jood Al-Sudairi, and Omar Al-Owain

PDF

OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer