Submissions from 2023
Collaborative Contrastive Refining for Weakly Supervised Person Search, Chengyou Jia, Minnan Luo, Caixia Yan, Linchao Zhu, Xiaojun Chang, and Qinghua Zheng
Is Synthetic Dataset Reliable for Benchmarking Generalizable Person Re-Identification?, Cuicui Kang
Convolutional Point Transformer, Chaitanya Kaul, Joshua Mitton, Hang Dai, and Roderick Murray-Smith
Why ORB-SLAM is missing commonly occurring loop closures?, Saran Khaliq, Muhammad Latif Anjum, Wajahat Hussain, Muhammad Uzair Khattak, and Momen Rasool
AAD-Net: Advanced end-to-end signal processing system for human emotion detection & recognition using attention-based deep echo state network, Mustaqeem Khan, Abdulmotaleb El Saddik, Fahd Saleh Alotaibi, and Nhat Truong Pham
Metaverse Key Technologies and Blockchains: Impacts & Considerations, Mustaqeem Khan, Abdulmotaleb El Saddik, and Wail Gueaieb
PD-Net: Multi-Stream Hybrid Healthcare System for Parkinson's Disease Detection using Multi Learning Trick Approach, Mustaqeem Khan, Ufaq Khan, and Alice Othmani
ARTriViT: Automatic Face Recognition System Using ViT-Based Siamese Neural Networks with a Triplet Loss, Mustaqeem Khan, Muhammad Saeed, Abdulmotaleb El Saddik, and Wail Gueaieb
Guest Editorial Introduction to the Special Section on Transformer Models in Vision, Salman Khan, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming Hsuan Yang, and Mubarak Shah
DDNet: Diabetic Retinopathy Detection System Using Skip Connection-based Upgraded Feature Block, Ufaq Khan, Mustaqeem Khan, Abdulmotaleb Elsaddik, and Wail Gueaieb
MaPLe: Multi-modal Prompt Learning, Muhammad Uzair Khattak, Hanoona Rasheed, Muhammad Maaz, Salman Khan, and Fahad Shahbaz Khan
Self-regulating Prompts: Foundational Model Adaptation without Forgetting, Muhammad Uzair Khattak, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming Hsuan Yang, and Fahad Shahbaz Khan
Generative Multiplane Neural Radiance for 3D-Aware Image Generation, Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Ming Hsuan Yang, and Fahad Shahbaz Khan
Cross-Modulated Few-Shot Image Generation for Colorectal Tissue Classification, Amandeep Kumar, Ankan Kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, and Fahad Shahbaz Khan
Person Image Synthesis via Denoising Diffusion Model, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Mubarak Shah, and Fahad Shahbaz Khan
Towards Building Text-to-Speech Systems for the Next Billion Users, Gokul Karthik Kumar, S. V. Praveen, Pratyush Kumar, Mitesh M. Khapra, and Karthik Nandakumar
Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt, Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Chao Zhang, Xinggang Wang, and Junwei Han
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection, Long Li, Junwei Han, Ni Zhang, Nian Liu, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, and Fahad Shahbaz Khan
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation, Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, and Xiaojun Chang
3D-Aware Multi-Class Image-to-Image Translation with NeRFs, Senmao Li, Joost Van De Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, and Jian Yang
Composable Text Controls in Latent Space with ODEs, Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, and Shuguang Cui
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation, Nian Liu, Kepan Nan, Wangbo Zhao, Yuanwei Liu, Xiwen Yao, Salman Khan, Hisham Cholakkal, and Rao Muhammad Anwer
Learning Complementary Spatial–Temporal Transformer for Video Salient Object Detection, Nian Liu, Kepan Nan, Wangbo Zhao, Xiwen Yao, and Junwei Han
DGM-DR: Domain Generalization with Mutual Information Regularized Diabetic Retinopathy Classification, Aleksandr Matsun, Dana O. Mohamed, Sharon Chokuwa, Muhammad Ridzuan, and Mohammad Yaqub
MSI: Maximize Support-Set Information for Few-Shot Segmentation, Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, and Mubbasir Kapadia
Adversarial Attacks and Batch Normalization: A Batch Statistics Perspective, Awais Muhammad, Fahad Shamshad, and Sung Ho Bae
Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM, Sahal Shaji Mullappilly, Abdelrahman Shaker, Omkar Thawkar, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan
Guest Editorial Digital Twins for Mobile Networks - Part I, Shahid Mumtaz, Soumaya Cherkaoui, Mohsen Guizani, Joel J.P.C. Rodrigues, Abdulmotaleb El Saddik, Sabita Maharjan, Yang Xiao, and Ikram Ashraf
Cal-DETR: Calibrated Detection Transformer, Muhammad Akhtar Munir, Salman Khan, Muhammad Haris Khan, Mohsen Ali, and Fahad Shahbaz Khan
Development of machine learning models for the prediction of binary diffusion coefficients of gases, Ismail Adewale Olumegbon, Ibrahim Olanrewaju Alade, Mojeed Opeyemi Oyedeji, Talal F. Qahtan, and Aliyu Bagudu
Multiclass Confidence and Localization Calibration for Object Detection, Bimsara Pathiraja, Malitha Gunawardhana, and Muhammad Haris Khan
Laplacian ICP for Progressive Registration of 3D Human Head Meshes, Nick Pears, Hang Dai, Will Smith, and Hao Sun
Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition, Sara Pieri, Jose Renato Restom, Samuel Horvath, and Hisham Cholakkal
PromptIR: Prompting for All-in-One Blind Image Restoration, Vaishnav Potlapalli, Syed Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan
A Spatial-Temporal Deformable Attention Based Framework for Breast Lesion Detection in Videos, Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, and Fahad Shahbaz Khan
How Good is Google Bard’s Visual Understanding? An Empirical Study on Open Challenges, Haotong Qin, Ge Peng Ji, Salman Khan, Deng Ping Fan, Fahad Shahbaz Khan, and Luc Van Gool
Fine-tuned CLIP Models are Efficient Video Learners, Hanoona Rasheed, Muhammad Uzair Khattak, Muhammad Maaz, Salman Khan, and Fahad Shahbaz Khan
Combating Counterfeit Products in Smart Cities with Digital Twin Technology, Muhammad Saad, Mustaqeem Khan, Muhammad Saeed, Abdulmotaleb ElSaddik, and Wail Gueaieb
Gaming-Based Education System for Children on Road Safety in Metaverse Towards Smart Cities, Muhammad Saeed, Abbas Khan, Mustaqeem Khan, Muhammad Saad, Abdulmotaleb El Saddik, and Wail Gueaieb
Single-branch Network for Multimodal Training, Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, and Arif Mahmood
MGMT promoter methylation status prediction using MRI scans? An extensive experimental evaluation of deep learning models, Numan Saeed, Muhammad Ridzuan, Hussain Alasmawi, Ikboljon Sobirov, and Mohammad Yaqub
Prompt-Based Tuning of Transformer Models for Multi-Center Medical Image Segmentation of Head and Neck Cancer, Numan Saeed, Muhammad Ridzuan, Roba Al Majzoub, and Mohammad Yaqub
Hybrid Flexible (HyFlex) learning space design and implementation at graduate level: An iterative process, David Santandreu Calonge, Mark Thompson, Leisa Hassock, and Mohammad Yaqub
CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search, Fahad Shamshad, Muzammal Naseer, and Karthik Nandakumar
PSM-PS: Part-Based Signal Modulation for Person Search, Reem Abdalla Sharif, Mustansar Fiaz, and Rao Anwer
Camera Coach: Activity Recognition and Assessment Using Thermal and RGB Videos, Ahmed Sharshar, Ahmed Hesham Aboeitta, Ahmed Fayez, Mohamed A. Khamis, Ahmed B. Zaky, and Walid Gomaa
Classification of Cumin, Fennel and Carom Using Transfer Learning, Abdullah Ajaz Siddiqui, Shahab Saquib Sohail, Qazi Areeb, Wathiq Mansoor, and Md Tabrez Nafis
Lifelong Learning of Task-Parameter Relationships for Knowledge Transfer, Shikhar Srivastava, Mohammad Yaqub, and Karthik Nandakumar
FLIP: Cross-domain Face Anti-spoofing with Language Guidance, Koushik Srivatsan, Muzammal Naseer, and Karthik Nandakumar
Multilevel Feature Representation for Hybrid Transformers-based Emotion Recognition, Monorama Swain, Bubai Maji, Mustaqeem Khan, Abdulmotaleb El Saddik, and Wail Gueaieb
3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers, Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, and Fahad Shahbaz Khan
Fast Video Instance Segmentation via Recurrent Encoder-Based Transformers, Omkar Thawakar, Alexandre Rivkind, Ehud Ahissar, and Fahad Shahbaz Khan
Intelligent digital twin reference architecture models for medical and healthcare industry, Zhi Wang and Abdulmotaleb El Saddik
DTITD: An Intelligent Insider Threat Detection Framework Based on Digital Twin and Self-Attention Based Deep Learning Models, Zhi Qiang Wang and Abdulmotaleb El Saddik
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition, Syed Talal Wasim, Muhammad Uzair Khattak, Muzammal Naseer, Salman Khan, Mubarak Shah, and Fahad Shahbaz Khan
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting, Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah
Hardware Resilience Properties of Text-Guided Image Classifiers, Syed Talal Wasim, Kabila Haile Soboka, Abdulrahman Mahmoud, Salman Khan, David Brooks, and Gu Yeon Wei
Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey, Aoran Xiao, Jiaxing Huang, Dayan Guan, Xiaoqin Zhang, Shijian Lu, and Ling Shao
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds, Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, and Eric Xing
GraphPrompt: Graph-Based Prompt Templates for Biomedical Synonym Prediction, Hanwen Xu, Jiayou Zhang, Zhirui Wang, Shizhuo Zhang, Megh Bhalerao, Yucong Liu, Dawei Zhu, and Sheng Wang
Context Matters: Distilling Knowledge Graph for Enhanced Object Detection, Aijia Yang, Sihao Lin, Chung Hsing Yeh, Minglei Shu, Yi Yang, and Xiaojun Chang
Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation, Chenxu Yang, Zheng Lin, Lanrui Wang, Chong Tian, Liang Pang, Jiangnan Li, Qirong Ho, and Yanan Cao
Class-Independent Regularization for Learning with Noisy Labels, Rumeng Yi, Dayan Guan, Yaping Huang, and Shijian Lu
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications, Abdelrahman Shaker Youssief, Muhammad Maaz, Hanoona Rasheed, Salman Khan, Ming Hsuan Yang, and Fahad Shahbaz Khan
Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos, Muhammad Zaigham Zaheer, Arif Mahmood, Marcella Astrid, and Seung Ik Lee
Attribute-Guided Collaborative Learning for Partial Person Re-Identification, Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, and Liqiang Nie
A Multimodal Coupled Graph Attention Network for Joint Traffic Event Detection and Sentiment Classification, Yazhou Zhang, Prayag Tiwari, Qian Zheng, Abdulmotaleb El Saddik, and M. Shamim Hossain
Harnessing Web3 on Carbon Offset Market for Sustainability: Framework and A Case Study, Chenyu Zhou, Hongzhou Chen, Shiman Wang, Xinyao Sun, Abdulmotaleb El Saddik, and Wei Cai
Vision Language Navigation with Knowledge-driven Environmental Dreamer, Fengda Zhu, Vincent C.S. Lee, Xiaojun Chang, and Xiaodan Liang
Submissions from 2022
Gradient Boosting and Linear Regression for Estimating Coastal Bathymetry Based on Sentinel-2 Images, Fahim Abdul Gafoor, Maryam R. Al-Shehhi, Chung Suk Cho, and Hosni Ghedira
UBnormal: New benchmark for supervised open-set video anomaly detection, Andra Acsintoae, Andrei Florescu, Mariana-Iuliana Georgescu, Tudor Mare, Paul Sumedrea, Radu Tudor Ionescu, Fahad Shahbaz Khan, and Mubarak Shah
It’s Your Turn, Are You Ready to Get Vaccinated? Towards an Exploration of Vaccine Hesitancy Using Sentiment Analysis of Instagram Posts, Mohammed Talha Alam, Shahab Saquib Sohail, Syed Ubaid, Shakil, Zafar Ali, Mohammad Hijji, Abdul Khader Jilani Saudagar, and Khan Muhammad
Boosting the training of neural networks through hybrid metaheuristics, Mohammed Azmi Al-Betar, Mohammed A. Awadallah, Iyad Abu Doush, Osama Ahmad Alomari, Ammar Kamal Abasi, Sharif Naser Makhadmeh, and Zaid Abdi Alkareem Alyasseri
On the Robustness of 3D Object Detectors, Fatima Albreiki, Sultan Abu Ghazal, Jean Lahoud, Rao Anwer, Hisham Cholakkal, and Fahad Shahbaz Khan
Transformers in Remote Sensing: A Survey, Abdulaziz Amer Aleissaee, Amandeep Kumar, Rao Anwer, Salman Khan, Hisham Cholakkal, Gui-Song Xia, and Fahad Shahbaz Khan
TransformNet: Self-supervised Representation Learning Through Predicting Geometric Transformations, Muhammad Ali and Sayed Hashim
A Review on Industrial Blockchain, Mohammad Al Jaafreh, Wassim El Ahmar, Mohamad Hoda, and Ali Karime
GARDNet: Robust Multi-view Network for Glaucoma Classification in Color Fundus Images, Ahmed Al-Mahrooqi, Dmitrii Medvedev, Rand Muhtaseb, and Mohammad Yaqub
Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification, Faris Almalik, Mohammad Yaqub, and Karthik Nandakumar
Towards a Machine Learning-Based Digital Twin for Non-Invasive Human Bio-Signal Fusion, Izaldein Al-Zyoud, Fedwa Laamarti, Xiaocong Ma, Diana Tobón, and Abdulmotaleb Elsaddik
DRGen: Domain Generalization in Diabetic Retinopathy Classification, Mohammad Zeyad Atwany and Mohammad Yaqub
Deep Learning Techniques for Diabetic Retinopathy Classification: A Survey, Mohammad Z. Atwany, Abdulwahab H. Sahyoun, and Mohammad Yaqub
Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification, Hussam Azzuni, Muhammad Ridzuan, Min Xu, and Mohammad Yaqub
Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks, Yunqing Bao, Hang Dai, and Abdulmotaleb Elsaddik
SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection, Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah
DoodleFormer: Creative Sketch Drawing with Transformers, Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, and Michael Felsberg
AI-based Blockchain for the Metaverse: Approaches and Challenges, Ouns Bouachir, Moayad Aloqaily, Fakhri Karray, and Abdulmotaleb Elsaddik
SipMaskv2: Enhanced Fast Image and Video Instance Segmentation, Jiale Cao, Yanwei Pang, Rao Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, and Ling Shao
PSTR: End-to-End One-Step Person Search With Transformers, Jiale Cao, Pang Yanwei, Rao Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, and Fahad Shahbaz Khan
Automatic Quality Assessment of First Trimester Crown-Rump-Length Ultrasound Images, Sevim Cengiz, Ibrahim Hamdi, and Mohammad Yaqub
Deep Learning-based Quality Assessment of Clinical Protocol Adherence in Fetal Ultrasound Dating Scans, Sevim Cengiz and Mohammad Yaqub
ORYX-MRSI: A fully-automated open-source software for proton magnetic resonance spectroscopic imaging data analysis, Sevim Cengiz, Muhammed Yildirim, Abdullah Bas, and Esin Ozturk-Isik
Deep Network for Extremely Low-Resolution Human Action Recognition, Sachin Chaudhary, Prashant W. Patil, Akshay Dudhane, and Subrahmanyam Murala
Automatic schelling points detection from meshes, Geng Chen, Hang Dai, Tao Zhou, Jianbing Shen, and Ling Shao
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation, Wenhao Cheng, Xingping Dong, Salman Khan, and Jianbing Shen
Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving, Yi-Nan Chen, Hang Dai, and Yong Ding
Deep-precognitive diagnosis: preventing future pandemics by novel disease detection With biologically-inspired conv-fuzzy network, Aviral Chharia, Rahul Upadhyay, Vinay Kumar, Chao Cheng, Jing Zhang, Tianyang Wang, and Min Xu
Towards partial supervision for generic object counting in natural scenes, Hisham Cholakkal, Guolei Sun, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Luc Van Gool
Burst image restoration and enhancement, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming-Hsuan Yang
Applications of 3D photography in craniofacial surgery, Christian Duncan, Nick E. Pears, Hang Dai, Will A.P. Smith, and Paul O′higgins