Submissions from 2022
Underactuated digital twin's robotic hands with tactile sensing capabilities for well-being, Mohd Faisal, Roberto Alejandro Martinez Velazquez, Fedwa Laamarti, and Abdulmotaleb El Saddik
Artificial intelligence models in digital twins for health and well-being, Rahatara Ferdousi, Fedwa Laamarti, and Abdulmotaleb El Saddik
RailTwin: A Digital Twin Framework For Railway, Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb El Saddik
Non-invasive Anemia Detection from Conjunctival Images, Rahatara Ferdousi, Nabila Mabruba, Fedwa Laamarti, Abdulmotaleb El Saddik, and Chunsheng Yang
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Anwer, and Fahad Shahbaz Khan
Video Object Segmentation Based on Guided Feature Transfer Learning, Mustansar Fiaz, Arif Mahmood, Sehar Shahzad Farooq, Kamran Ali, Muhammad Shaheryar, and Soon Ki Jung
CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection, Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Anwer, and Hisham Cholakkal
How to Train Vision Transformer on Small-scale Datasets?, Hanan Gani, Muzammal Naseer, and Mohammad Yaqub
A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video, Mariana Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shahbaz Khan
On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain
Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub
SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub
Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection, Yu Hong, Hang Dai, and Yong Ding
Scribble-Supervised Video Object Segmentation, Peiliang Huang, Junwei Han, Nian Liu, Jun Ren, and Dingwen Zhang
Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images, Zhou Huang, Tian-Zhu Xiang, Huai-Xin Chen, and Hang Dai
An Attention-Based ResNet Architecture for Acute Hemorrhage Detection and Classification: Toward a Health 4.0 Digital Twin Study, Aftab Hussain, Muhammad Usman Yaseen, Muhammad Imran, Muhammad Waqar, Adnan Akhunzada, Mohammad Al-Ja'afreh, and Abdulmotaleb El Saddik
UAV-based Multi-scale Features Fusion Attention for Fire Detection in Smart City Ecosystems, Tanveer Hussain, Hang Dai, Wail Gueaieb, Marco Sicklinger, and Giulia De Masi
High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao
Face Pyramid Vision Transformer, Khawar Islam, Muhammad Zaigham Zaheer, and Arif Mahmood
Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar
PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar
Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas
Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan
Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian
Eureka: EUphemism Recognition Enhanced Through KNN-based Methods and Augmentation, Sedrick Scott Keh, Rohit Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, and Roberto Navigli
Object Detection in Aerial Images: A Case Study on Performance Improvement, Adnan Khan, Muhammad Uzair Khattak, and Khaled Dawoud
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan
Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah
Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar
Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features, Gokul Karthik Kumar and Karthik Nandakumar
3D Vision with Transformers: A Survey, Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Anwer, Salman Khan, and Ming-Hsuan Yang
Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving, Jiale Li, Hang Dai, and Yong Ding
Disentangled Capsule Routing for Fast Part-Object Relational Saliency, Yi Liu, Dingwen Zhang, Nian Liu, Shoukun Xu, and Jungong Han
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation, Yuanwei Liu, Nian Liu, Xiwen Yao, and Junwei Han
Multi-View Brain Network Analysis with Cross-View Missing Network Generation, Gongxu Luo, Chenyang Li, Hejie Cui, Lichao Sun, Lifang He, and Carl Yang
Class-Agnostic Object Detection with Multi-modal Transformer, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Anwer, and Ming-Hsuan Yang
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, and Fahad Shahbaz Khan
COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan
COVIDMe: a digital twin for COVID-19 self-assessment and detection, Roberto Martinez-Velazquez, Fernando Ceballos, Alejandro Sanchez, Abdulmotaleb El Saddik, and Emil Petriu
Adaptive Feature Consolidation Network for Burst Super-Resolution, Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan
D3Former: Debiased Dual Distilled Transformer for Incremental Learning, Abdelrahman Mohamed, Rushali Grandhe, K.J. Joseph, Salman Khan, and Fahad Shahbaz Khan
HM: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography, Rand Muhtaseb and Mohammad Yaqub
Towards Improving Calibration in Object Detection Under Domain Shift, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali
Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao
Stylized Adversarial Defense, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli
Guidance Through Surrogate: Toward a Generic Diagnostic Attack, Muzammal Naseer, Salman Khan, Fatih Porikli, and Fahad Shahbaz Khan
AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility, Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Hisham Cholakkal, Salman Khan, Luc Van Gool, and Fahad Shahbaz Khan
Camelira: An Arabic Multi-Dialect Morphological Disambiguator, Ossama Obeid, Go Inoue, and Nizar Habash
Multi-frame based adversarial learning approach for video surveillance, Prashant W. Patil, Akshay Dudhane, Sachin Chaudhary, and Subrahmanyam Murala
Untrained Neural Network Priors for Inverse Imaging Problems: A Survey, Adnan Qayyum, Inaam Ilahi, Fahad Shamshad, Farid Boussaid, Mohammed Bennamoun, and Junaid Qadir
Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao
Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes
A Robust Normalizing Flow using Bernstein-type Polynomials, Sameera Ramasinghe, Kasun Fernando, Salman Khan, and Nick Barnes
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection, Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, and Fahad Shahbaz Khan
Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub
Self-supervision and Multi-task Learning: Challenges in Fine-Grained COVID-19 Multi-class Classification from Chest X-rays, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub
SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan
OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub
Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub
End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub
Fusion and Orthogonal Projection for Improved Face-Voice Association, Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, and Alessio Del Bue
An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, and Mohammad Yaqub
Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models?, Numan Saeed, Shahad Hardan, Kudaibergen Abutalip, and Mohammad Yaqub
TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction, Numan Saeed, Ikboljon Sobirov, Roba Al Majzoub, and Mohammad Yaqub
Transformers in Medical Imaging: A Survey, Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, and Huazhu Fu
TransResNet: Integrating the Strengths of ViTs and CNNs for High Resolution Medical Image Segmentation via Feature Grafting, Muhammad Hamza Sharif, Dmitry Demidov, Asif Hanif, Mohammad Yaqub, and Min Xu
Automatic Segmentation of Head and Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, and Mohammad Yaqub
Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis, Ikboljon Sobirov, Numan Saeed, and Mohammad Yaqub
Moving objects segmentation using generative adversarial modeling, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung
Unsupervised moving object segmentation using background subtraction and optimal adversarial noise sample search, Maryam Sultana, Arif Mahmood, and Soon Ki Jung
Self-Distilled Vision Transformer for Domain Generalization, Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, and Fahad Shahbaz Khan
Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer, Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, and Fahad Shahbaz Khan
Deep learning in multimedia healthcare applications: a review, Diana P. Tobón V, M. Shamim Hossain, Ghulam Muhammad, Josu Bilbao, and Abdulmotaleb El Saddik
An Investigation into Whitening Loss for Self-supervised Learning, Xi Weng, Lei Huang, Lei Zhao, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan
PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds, Aoran Xiao, Jiaxing Huang, Dayan Guan, Kaiwen Cui, Shijian Lu, and Ling Shao
Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection, Jin Xie, Rao Muhammad Anwer, Hisham Cholakkal, Jing Nie, Jiale Cao, Jorma Laaksonen, and Fahad Shahbaz Khan
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision, Yun Xing, Dayan Guan, Jiaxing Huang, and Shijian Lu
Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies, Muhammad Zaigham Zaheer, Jin Ha Lee, Arif Mahmood, Marcella Astrid, and Seung Ik Lee
Generative Cooperative Learning for Unsupervised Video Anomaly Detection, M. Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segù, Fisher Yu, and Seung-Ik Lee
Learning Enriched Features for Fast Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, and Ling Shao
Deep RGB-D Saliency Detection Without Depth, Yuan Fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, and Xiangjian He
Submissions from 2021
Rich Semantics Improve Few-Shot Learning, Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, and Fahad Shahbaz Khan
Low light image enhancement via global and local context modeling, Aditya Arora, Muhammad Haris, Syed Waqas Zamir, Munawar Hayat, Fahad Shahbaz Khan, Ling Shao, and Ming-Hsuan Yang
Handwriting Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Mubarak A. Shah
From handcrafted to deep features for pedestrian detection: A survey, Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, and Ling Shao
Automatic fetal gestational age estimation from first trimester scans, Sevim Cengiz and Mohammad Yaqub
Structured latent embeddings for recognizing unseen classes in unseen domains, Shivam Chandhok, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Vineeth N. Balasubramanian, Fahad Shahbaz Khan, and Ling Shao
Commands for autonomous vehicles by progressively stacking visual-linguistic representations, Hang Dai, Shujie Luo, Yong Ding, and Ling Shao
Accuracy vs. complexity: A trade-off in visual question answering models, Moshiur Farazi, Nick Barnes, and Salman Khan
4G-VOS: Video Object Segmentation using guided context embedding, Mustansar Fiaz, Muhammad Zaigham Zaheer, Arif Mahmood, Seung Ik Lee, and Soon Ki Jung
Anomaly detection in video via self-supervised and multi-task learning, Mariana Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah
Renewable energy management system for Saudi Arabia: Methodology and preliminary results, Imen Gherboudj, Mohamed Zorgati, Phani Kumar Chamarthi, Arttu Tuomiranta, Baraa Mohandes, Naseema S. Beegum, Jood Al-Sudairi, and Omar Al-Owain
OW-DETR: Open-world detection transformer, Akshita Gupta, Sanath Narayan, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Generative multi-label zero-shot learning, Akshita Gupta, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Joost van de Weijer