Follow

Submissions from 2023

Link

Context Matters: Distilling Knowledge Graph for Enhanced Object Detection, Aijia Yang, Sihao Lin, Chung Hsing Yeh, Minglei Shu, Yi Yang, and Xiaojun Chang

PDF

Class-Independent Regularization for Learning with Noisy Labels, Rumeng Yi, Dayan Guan, Yaping Huang, and Shijian Lu

Link

Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos, Muhammad Zaigham Zaheer, Arif Mahmood, Marcella Astrid, and Seung Ik Lee

Link

Attribute-Guided Collaborative Learning for Partial Person Re-Identification, Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, and Liqiang Nie

Link

A Multimodal Coupled Graph Attention Network for Joint Traffic Event Detection and Sentiment Classification, Yazhou Zhang, Prayag Tiwari, Qian Zheng, Abdulmotaleb El Saddik, and M. Shamim Hossain

Link

Harnessing Web3 on Carbon Offset Market for Sustainability: Framework and A Case Study, Chenyu Zhou, Hongzhou Chen, Shiman Wang, Xinyao Sun, Abdulmotaleb El Saddik, and Wei Cai

PDF

Vision Language Navigation with Knowledge-driven Environmental Dreamer, Fengda Zhu, Vincent C.S. Lee, Xiaojun Chang, and Xiaodan Liang

Submissions from 2022

It’s Your Turn, Are You Ready to Get Vaccinated? Towards an Exploration of Vaccine Hesitancy Using Sentiment Analysis of Instagram Posts, Mohammed Talha Alam, Shahab Saquib Sohail, Syed Ubaid, Shakil, Zafar Ali, Mohammad Hijji, Abdul Khader Jilani Saudagar, and Khan Muhammad

Link

Boosting the training of neural networks through hybrid metaheuristics, Mohammed Azmi Al-Betar, Mohammed A. Awadallah, Iyad Abu Doush, Osama Ahmad Alomari, Ammar Kamal Abasi, Sharif Naser Makhadmeh, and Zaid Abdi Alkareem Alyasseri

Link

On the Robustness of 3D Object Detectors, Fatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Anwer, Hisham Cholakkal, and Fahad Shahbaz Khan

PDF

Transformers in Remote Sensing: A Survey, Abdulaziz Amer Aleissaee, Amandeep Kumar, Rao Anwer, Salman Khan, Hisham Cholakkal, Gui-Song Xia, and Fahad Shahbaz Khan

TransformNet: Self-supervised Representation Learning Through Predicting Geometric Transformations, Muhammad Ali and Sayed Hashim

Link

A Review on Industrial Blockchain, Mohammad Al Jaafreh, Wassim El Ahmar, Mohamad Hoda, and Ali Karime

Link

GARDNet: Robust Multi-view Network for Glaucoma Classification in Color Fundus Images, Ahmed Al-Mahrooqi, Dmitrii Medvedev, Rand Muhtaseb, and Mohammad Yaqub

Link

Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification, Faris Almalik, Mohammad Yaqub, and Karthik Nandakumar

PDF

Towards a Machine Learning-Based Digital Twin for Non-Invasive Human Bio-Signal Fusion, Izaldein Al-Zyoud, Fedwa Laamarti, Xiaocong Ma, Diana Tobón, and Abdulmotaleb Elsaddik

Link

DRGen: Domain Generalization in Diabetic Retinopathy Classification, Mohammad Zeyad Atwany and Mohammad Yaqub

PDF

Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey, Mohammad Z. Atwany, Abdulwahab H. Sahyoun, and Mohammad Yaqub

Link

Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification, Hussam Azzuni, Muhammad Ridzuan, Min Xu, and Mohammad Yaqub

Link

Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks, Yunqing Bao, Hang Dai, and Abdulmotaleb Elsaddik

Link

SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection, Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Link

DoodleFormer: Creative Sketch Drawing with Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, and Michael Felsberg

Link

AI-based Blockchain for the Metaverse: Approaches and Challenges, Ouns Bouachir, Moayad Aloqaily, Fakhri Karray, and Abdulmotaleb Elsaddik

Link

SipMaskv2: Enhanced Fast Image and Video Instance Segmentation, Jiale Cao, Yanwei Pang, Rao Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, and Ling Shao

PDF

PSTR: End-to-End One-Step Person Search With Transformers, Jiale Cao, Pang Yanwei, Rao Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, and Fahad Shahbaz Khan

Link

Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images, Sevim Cengiz, Ibrahim Hamdi, and Mohammad Yaqub

PDF

Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans, Sevim Cengiz and Mohammad Yaqub

Link

ORYX-MRSI: A fully-automated open-source software for proton magnetic resonance spectroscopic imaging data analysis, Sevim Cengiz, Muhammed Yildirim, Abdullah Bas, and Esin Ozturk-Isik

Link

Deep Network for Extremely Low-Resolution Human Action Recognition, Sachin Chaudhary, Prashant W. Patil, Akshay Dudhane, and Subrahmanyam Murala

Link

Automatic schelling points detection from meshes, Geng Chen, Hang Dai, Tao Zhou, Jianbing Shen, and Ling Shao

Link

Learning Disentanglement with Decoupled Labels for Vision-Language Navigation, Wenhao Cheng, Xingping Dong, Salman Khan, and Jianbing Shen

PDF

Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving, Yi-Nan Chen, Hang Dai, and Yong Ding

PDF

Deep-precognitive diagnosis: preventing future pandemics by novel disease detection With biologically-inspired conv-fuzzy network, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Chao Cheng, Jing Zhang, Tianyang Wang, and Min Xu

Link

Towards partial supervision for generic object counting in natural scenes, Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Luc Van Gool

Link

Burst image restoration and enhancement, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

Applications of 3D photography in craniofacial surgery, Christian Duncan, Nick E. Pears, Hang Dai, Will A.P. Smith, and Paul O′higgins

Link

Underactuated digital twin's robotic hands with tactile sensing capabilities for well-being, Mohd Faisal, Roberto Alejandro Martinez Velazquez, Fedwa Laamarti, and Abdulmotaleb El Saddik

Link

Artificial intelligence models in digital twins for health and well-being, Rahatara Ferdousi, Fedwa Laamarti, and Abdulmotaleb El Saddik

Link

RailTwin: A Digital Twin Framework For Railway, Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb Elsaddik

Link

Non-invasive Anemia Detection from Conjunctival Images, Rahatara Ferdousi, Nabila Mabruba, Fedwa Laamarti, Abdulmotaleb El Saddik, and Chunsheng Yang

PDF

PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Anwer, and Fahad Shahbaz Khan

Link

Video Object Segmentation Based on Guided Feature Transfer Learning, Mustansar Fiaz, Arif Mahmood, Sehar Shahzad Farooq, Kamran Ali, Muhammad Shaheryar, and Soon Ki Jung

PDF

CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection, Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Anwer, and Hisham Cholakkal

PDF

How to Train Vision Transformer on Small-scale Datasets?, Hanan Gani, Muzammal Naseer, and Mohammad Yaqub

Link

A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video, Mariana Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah

PDF

Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shabaz Khan

Link

On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain

PDF

Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub

PDF

SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub

PDF

Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian

Link

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection, Yu Hong, Hang Dai, and Yong Ding

Link

Scribble-Supervised Video Object Segmentation, Peiliang Huang, Junwei Han, Nian Liu, Jun Ren, and Dingwen Zhang

Link

Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images, Zhou Huang, Tian-Zhu Xiang, Huai-Xin Chen, and Hang Dai

An Attention-Based ResNet Architecture for Acute Hemorrhage Detection and Classification: Toward a Health 4.0 Digital Twin Study, Aftab Hussain, Muhammad Usman Yaseen, Muhammad Imran, Muhammad Waqar, Adnan Akhunzada, Mohammad Al-Ja'afreh, and Abdulmotaleb El Saddik

Link

UAV-based Multi-scale Features Fusion Attention for Fire Detection in Smart City Ecosystems, Tanveer Hussain, Hang Dai, Wail Gueaieb, Marco Sicklinger, and Giulia De Masi

Link

High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao

PDF

Face Pyramid Vision Transformer, Khawar Islam, Muhammad Zaigham Zaheer, and Arif Mahmood

Link

Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

Link

PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar

PDF

Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas

Link

Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan

PDF

Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian

PDF

Eureka: EUphemism Recognition Enhanced Through KNN-based Methods and Augmentation, Sedrick Scott Keh, Rohit Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, and Roberto Navigli

Link

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan

PDF

Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah

PDF

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah

PDF

MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar

PDF

Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features, Gokul Karthik Kumar and Karthik Nandakumar

PDF

3D Vision with Transformers: A Survey, Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Anwer, Salman Khan, and Ming-Hsuan Yang

Link

Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving, Jiale Li, Hang Dai, and Yong Ding

Link

Disentangled Capsule Routing for Fast Part-Object Relational Saliency, Yi Liu, Dingwen Zhang, Nian Liu, Shoukun Xu, and Jungong Han

PDF

Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation, Yuanwei Liu, Nian Liu, Xiwen Yao, and Junwei Han

Multi-View Brain Network Analysis with Cross-View Missing Network Generation, Gongxu Luo, Chenyang Li, Hejie Cui, Lichao Sun, Lifang He, and Carl Yang

Link

Class-Agnostic Object Detection with Multi-modal Transformer, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Anwer, and Ming Hsuan Yang

PDF

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan

PDF

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Nasser, Salman Khan, and Fahad Shahbaz Khan

PDF

COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan

Link

COVIDMe: a digital twin for COVID-19 self-assessment and detection, Roberto Martinez-Velazquez, Fernando Ceballos, Alejandro Sanchez, Abdulmotaleb El Saddik, and Emil Petriu

Link

Adaptive Feature Consolidation Network for Burst Super-Resolution, Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan

Link

D3Former: Debiased Dual Distilled Transformer for Incremental Learning, Abdelrahman Mohamed, Rushali Grandhe, K.J. Joseph, Salman Khan, and Fahad Shahbaz Khan

Link

HM: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia

Link

EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography, Rand Muhtaseb and Mohammad Yaqub

PDF

Towards Improving Calibration in Object Detection Under Domain Shift, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali

Link

Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao

Link

Stylized Adversarial Defense, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli

Link

Guidance Through Surrogate: Toward a Generic Diagnostic Attack, Muzammal Naseer, Salman Khan, Fatih Porikli, and Fahad Shahbaz Khan

PDF

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility, Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Hisham Cholakkal, Salman Khan, Luc Van Gool, and Fahad Shahbaz Khan

Camelira: An Arabic Multi-Dialect Morphological Disambiguator, Ossama Obeid, Go Inoue, and Nizar Habash

Link

Multi‐frame based adversarial learning approach for video surveillance, Prashant W. Patil, Akshay Dudhane, Sachin Chaudhary, and Subrahmanyam Murala

Link

Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao

Link

Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes

PDF

A Robust Normalizing Flow using Bernstein-type Polynomials, Sameera Ramasinghe, Kasun Fernando, Salman Khan, and Nick Barnes

PDF

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection, Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, and Fahad Shahbaz Khan

PDF

Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

Self-supervision and Multi-task Learning: Challenges in Fine-Grained COVID-19 Multi-class Classification from Chest X-rays, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub

Link

SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan

Link

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

PDF

Is Contrastive Learning Suitable for Left Ventricular Segmentation in Echocardiographic Images?, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub

Link

End-to-End Myocardial Infarction Classification from Echocardiographic Scans, Mohamed Saeed and Mohammad Yaqub