Submissions from 2023
Context Matters: Distilling Knowledge Graph for Enhanced Object Detection, Aijia Yang, Sihao Lin, Chung Hsing Yeh, Minglei Shu, Yi Yang, and Xiaojun Chang
Class-Independent Regularization for Learning with Noisy Labels, Rumeng Yi, Dayan Guan, Yaping Huang, and Shijian Lu
Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos, Muhammad Zaigham Zaheer, Arif Mahmood, Marcella Astrid, and Seung Ik Lee
Attribute-Guided Collaborative Learning for Partial Person Re-Identification, Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, and Liqiang Nie
A Multimodal Coupled Graph Attention Network for Joint Traffic Event Detection and Sentiment Classification, Yazhou Zhang, Prayag Tiwari, Qian Zheng, Abdulmotaleb El Saddik, and M. Shamim Hossain
Harnessing Web3 on Carbon Offset Market for Sustainability: Framework and A Case Study, Chenyu Zhou, Hongzhou Chen, Shiman Wang, Xinyao Sun, Abdulmotaleb El Saddik, and Wei Cai
Vision Language Navigation with Knowledge-driven Environmental Dreamer, Fengda Zhu, Vincent C.S. Lee, Xiaojun Chang, and Xiaodan Liang
Submissions from 2022
UBnormal: New benchmark for supervised open-set video anomaly detection, Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, and Mubarak Shah
It’s Your Turn, Are You Ready to Get Vaccinated? Towards an Exploration of Vaccine Hesitancy Using Sentiment Analysis of Instagram Posts, Mohammed Talha Alam, Shahab Saquib Sohail, Syed Ubaid, Shakil, Zafar Ali, Mohammad Hijji, Abdul Khader Jilani Saudagar, and Khan Muhammad
Boosting the training of neural networks through hybrid metaheuristics, Mohammed Azmi Al-Betar, Mohammed A. Awadallah, Iyad Abu Doush, Osama Ahmad Alomari, Ammar Kamal Abasi, Sharif Naser Makhadmeh, and Zaid Abdi Alkareem Alyasseri
On the Robustness of 3D Object Detectors, Fatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Anwer, Hisham Cholakkal, and Fahad Shahbaz Khan
Transformers in Remote Sensing: A Survey, Abdulaziz Amer Aleissaee, Amandeep Kumar, Rao Anwer, Salman Khan, Hisham Cholakkal, Gui-Song Xia, and Fahad Shahbaz Khan
TransformNet: Self-supervised Representation Learning Through Predicting Geometric Transformations, Muhammad Ali and Sayed Hashim
A Review on Industrial Blockchain, Mohammad Al Jaafreh, Wassim El Ahmar, Mohamad Hoda, and Ali Karime
GARDNet: Robust Multi-view Network for Glaucoma Classification in Color Fundus Images, Ahmed Al-Mahrooqi, Dmitrii Medvedev, Rand Muhtaseb, and Mohammad Yaqub
Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification, Faris Almalik, Mohammad Yaqub, and Karthik Nandakumar
Towards a Machine Learning-Based Digital Twin for Non-Invasive Human Bio-Signal Fusion, Izaldein Al-Zyoud, Fedwa Laamarti, Xiaocong Ma, Diana Tobón, and Abdulmotaleb Elsaddik
DRGen: Domain Generalization in Diabetic Retinopathy Classification, Mohammad Zeyad Atwany and Mohammad Yaqub
Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey, Mohammad Z. Atwany, Abdulwahab H. Sahyoun, and Mohammad Yaqub
Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification, Hussam Azzuni, Muhammad Ridzuan, Min Xu, and Mohammad Yaqub
Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks, Yunqing Bao, Hang Dai, and Abdulmotaleb Elsaddik
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection, Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah
DoodleFormer: Creative Sketch Drawing with Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, and Michael Felsberg
AI-based Blockchain for the Metaverse: Approaches and Challenges, Ouns Bouachir, Moayad Aloqaily, Fakhri Karray, and Abdulmotaleb Elsaddik
SipMaskv2: Enhanced Fast Image and Video Instance Segmentation, Jiale Cao, Yanwei Pang, Rao Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, and Ling Shao
PSTR: End-to-End One-Step Person Search With Transformers, Jiale Cao, Pang Yanwei, Rao Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, and Fahad Shahbaz Khan
Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images, Sevim Cengiz, Ibrahim Hamdi, and Mohammad Yaqub
Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans, Sevim Cengiz and Mohammad Yaqub
ORYX-MRSI: A fully-automated open-source software for proton magnetic resonance spectroscopic imaging data analysis, Sevim Cengiz, Muhammed Yildirim, Abdullah Bas, and Esin Ozturk-Isik
Deep Network for Extremely Low-Resolution Human Action Recognition, Sachin Chaudhary, Prashant W. Patil, Akshay Dudhane, and Subrahmanyam Murala
Automatic schelling points detection from meshes, Geng Chen, Hang Dai, Tao Zhou, Jianbing Shen, and Ling Shao
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation, Wenhao Cheng, Xingping Dong, Salman Khan, and Jianbing Shen
Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving, Yi-Nan Chen, Hang Dai, and Yong Ding
Deep-precognitive diagnosis: preventing future pandemics by novel disease detection With biologically-inspired conv-fuzzy network, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Chao Cheng, Jing Zhang, Tianyang Wang, and Min Xu
Towards partial supervision for generic object counting in natural scenes, Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Luc Van Gool
Burst image restoration and enhancement, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Applications of 3D photography in craniofacial surgery, Christian Duncan, Nick E. Pears, Hang Dai, Will A.P. Smith, and Paul O′higgins
Underactuated digital twin's robotic hands with tactile sensing capabilities for well-being, Mohd Faisal, Roberto Alejandro Martinez Velazquez, Fedwa Laamarti, and Abdulmotaleb El Saddik
Artificial intelligence models in digital twins for health and well-being, Rahatara Ferdousi, Fedwa Laamarti, and Abdulmotaleb El Saddik
RailTwin: A Digital Twin Framework For Railway, Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb Elsaddik
Non-invasive Anemia Detection from Conjunctival Images, Rahatara Ferdousi, Nabila Mabruba, Fedwa Laamarti, Abdulmotaleb El Saddik, and Chunsheng Yang
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Anwer, and Fahad Shahbaz Khan
Video Object Segmentation Based on Guided Feature Transfer Learning, Mustansar Fiaz, Arif Mahmood, Sehar Shahzad Farooq, Kamran Ali, Muhammad Shaheryar, and Soon Ki Jung
CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection, Dhanalaxmi Gaddam, Jean Lahoud, Fahad Shahbaz Khan, Rao Anwer, and Hisham Cholakkal
How to Train Vision Transformer on Small-scale Datasets?, Hanan Gani, Muzammal Naseer, and Mohammad Yaqub
A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video, Mariana Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, and Mubarak Shah
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution, Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron, Olivian Savencu, Nicolae Verga, Nicolae-Cătălin Ristea, and Fahad Shabaz Khan
On Demographic Bias in Fingerprint Recognition, Akash Godbole, Steven A. Grosz, Karthik Nandakumar, and Anil K. Jain
Hyperparameter Optimization for COVID-19 Chest X-Ray Classification, Ibraheem Hamdi, Muhammad Ridzuan, and Mohammad Yaqub
SubOmiEmbed: Self-supervised representation learning of multi-omics data for cancer type classification, Sayed Hashim, Muhammad Ali, Karthik Nandakumar, and Mohammad Yaqub
Visual Attention Methods in Deep Learning: An In-Depth Survey, Mohammed Hassanin, Anwar Saeed, Ibrahim Radwan, Fahad Shahbaz Khan, and Ajmal Mian
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection, Yu Hong, Hang Dai, and Yong Ding
Scribble-Supervised Video Object Segmentation, Peiliang Huang, Junwei Han, Nian Liu, Jun Ren, and Dingwen Zhang
Scribble-based boundary-aware network for weakly supervised salient object detection in remote sensing images, Zhou Huang, Tian-Zhu Xiang, Huai-Xin Chen, and Hang Dai
An Attention-Based ResNet Architecture for Acute Hemorrhage Detection and Classification: Toward a Health 4.0 Digital Twin Study, Aftab Hussain, Muhammad Usman Yaseen, Muhammad Imran, Muhammad Waqar, Adnan Akhunzada, Mohammad Al-Ja'afreh, and Abdulmotaleb El Saddik
UAV-based Multi-scale Features Fusion Attention for Fire Detection in Smart City Ecosystems, Tanveer Hussain, Hang Dai, Wail Gueaieb, Marco Sicklinger, and Giulia De Masi
High-resolution Iterative Feedback Network for Camouflaged Object Detection, Xiaobin Hu, Deng-Ping Fan, Xuebin Qin, Hang Dai, Wenqi Ren, Ying Tai, Chengjie Wang, and Ling Shao
Face Pyramid Vision Transformer, Khawar Islam, Muhammad Zaigham Zaheer, and Arif Mahmood
Optimizing Homomorphic Encryption based Secure Image Analytics, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar
PPDL - privacy preserving deep learning using homomorphic encryption, Nayna Jain, Karthik Nandakumar, Nalini Ratha, Sharath Pankanti, and Uttam Kumar
Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey And Outlook, Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, and Jiri Matas
Dense Gaussian processes for few-shot segmentation, Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, and Martin Danelljan
Energy-based Latent Aligner for Incremental Learning, K.J. Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Vineeth N. Balasubramanian
Eureka: EUphemism Recognition Enhanced Through KNN-based Methods and Augmentation, Sedrick Scott Keh, Rohit Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, and Roberto Navigli
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results, Fahad Shahbaz Khan and Salman Khan
Transformers in Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, and Mubarak Shah
Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging, Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, and Mubarak Shah
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages, Gokul Karthik Kumar, Abhishek Singh Gehlot, Sahal Shaji Mullappilly, and Karthik Nandakumar
Hate-CLIPper: Multimodal Hateful Meme Classification based on Cross-modal Interaction of CLIP Features, Gokul Karthik Kumar and Karthik Nandakumar
3D Vision with Transformers: A Survey, Jean Lahoud, Jiale Cao, Fahad Shahbaz Khan, Hisham Cholakkal, Rao Anwer, Salman Khan, and Ming-Hsuan Yang
Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving, Jiale Li, Hang Dai, and Yong Ding
Disentangled Capsule Routing for Fast Part-Object Relational Saliency, Yi Liu, Dingwen Zhang, Nian Liu, Shoukun Xu, and Jungong Han
Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation, Yuanwei Liu, Nian Liu, Xiwen Yao, and Junwei Han
Multi-View Brain Network Analysis with Cross-View Missing Network Generation, Gongxu Luo, Chenyang Li, Hejie Cui, Lichao Sun, Lifang He, and Carl Yang
Class-Agnostic Object Detection with Multi-modal Transformer, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Anwer, and Ming Hsuan Yang
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, and Fahad Shahbaz Khan
Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Nasser, Salman Khan, and Fahad Shahbaz Khan
COCOA: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains, Puneet Mangla, Shivam Chandhok, Vineeth N. Balasubramanian, and Fahad Shahbaz Khan
COVIDMe: a digital twin for COVID-19 self-assessment and detection, Roberto Martinez-Velazquez, Fernando Ceballos, Alejandro Sanchez, Abdulmotaleb El Saddik, and Emil Petriu
Adaptive Feature Consolidation Network for Burst Super-Resolution, Nancy Mehta, Akshay Dudhane, Subrahmanyam Murala, Syed Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan
D3Former: Debiased Dual Distilled Transformer for Incremental Learning, Abdelrahman Mohamed, Rushali Grandhe, K.J. Joseph, Salman Khan, and Fahad Shahbaz Khan
HM: Hybrid Masking for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia
EchoCoTr: Estimation of the Left Ventricular Ejection Fraction from Spatiotemporal Echocardiography, Rand Muhtaseb and Mohammad Yaqub
Towards Improving Calibration in Object Detection Under Domain Shift, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali
Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification, Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, and Ling Shao
Stylized Adversarial Defense, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli
Guidance Through Surrogate: Toward a Generic Diagnostic Attack, Muzammal Naseer, Salman Khan, Fatih Porikli, and Fahad Shahbaz Khan
AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility, Mubashir Noman, Wafa Al Ghallabi, Daniya Najiha, Christoph Mayer, Hisham Cholakkal, Salman Khan, Luc Van Gool, and Fahad Shahbaz Khan
Camelira: An Arabic Multi-Dialect Morphological Disambiguator, Ossama Obeid, Go Inoue, and Nizar Habash
Multi‐frame based adversarial learning approach for video surveillance, Prashant W. Patil, Akshay Dudhane, Sachin Chaudhary, and Subrahmanyam Murala
Untrained Neural Network Priors for Inverse Imaging Problems: A Survey, Adnan Qayyum, Inaam Ilahi, Fahad Shamshad, Farid Boussaid, Mohammed Bennamoun, and Junaid Qadir
Highly Accurate Dichotomous Image Segmentation, Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Luc Van Gool, and Ling Shao
Polarity Loss: Improving Visual-Semantic Alignment for Zero-Shot Detection, Shafin Rahman, Salman Khan, and Nick Barnes
A Robust Normalizing Flow using Bernstein-type Polynomials, Sameera Ramasinghe, Kasun Fernando, Salman Khan, and Nick Barnes
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection, Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, and Fahad Shahbaz Khan
Challenges in COVID-19 Chest X-Ray Classification: Problematic Data or Ineffective Approaches?, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub
Self-supervision and Multi-task Learning: Challenges in Fine-Grained COVID-19 Multi-class Classification from Chest X-rays, Muhammad Ridzuan, Ameera Ali Bawazir, Ivo Gollini Navarrete, Ibrahim Almakky, and Mohammad Yaqub
SepTr: Separable Transformer for Audio Spectrogram Processing, Nicolae-Cătălin Ristea, Radu Tudor Ionescu, and Fahad Shahbaz Khan
OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning, Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Contrastive Pretraining for Echocardiography Segmentation with Limited Data, Mohamed Saeed, Rand Muhtaseb, and Mohammad Yaqub