Follow

Submissions from 2024

Enabling Consumer UAVs for Precision Agriculture Applications: A Case Study of Yield Estimation, Jamil Ahmad, Wail Gueaieb, Abdulmotaleb El Saddik, Giulia De Masi, and Fakhri Karray

FUSC: Fetal Ultrasound Semantic Clustering of Second-Trimester Scans Using Deep Self-Supervised Learning, Hussain Alasmawi, Leanne Bricker, and Mohammad Yaqub

Mixat: A Data Set of Bilingual Emirati-English Speech, Maryam Al Ali and Hanan Aldarmaki

A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection, Anas Al-Lahham, Nurbek Tastan, Muhammad Zaigham Zaheer, and Karthik Nandakumar

DDAM-PS: Diligent Domain Adaptive Mixer for Person Search, Mohammed Khaleed Almansoori, Mustansar Fiaz, and Hisham Cholakkal

A Multi-Head Approach with Shuffled Segments for Weakly-Supervised Video Anomaly Detection, Salem Almarri, Muhammad Zaigham Zaheer, and Karthik Nandakumar

UTalk: Bridging the Gap between Humans and AI, Hussam Azzuni, Sharim Jamal, and Abdulmotaleb Elsaddik

Smartphone Video-based Monocular 3D Reconstruction, Hussam Azzuni, Mustaqeem Khan, Abdulmotaleb Elsaddik, and Wail Gueaieb

: Edge-Aware Multimodal Transformer for RGB-D Salient Object Detection, Geng Chen, Qingyue Wang, Bo Dong, Ruitao Ma, Nian Liu, Huazhu Fu, and Yong Xia

TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation, Yahia Dalbah, Jean Lahoud, and Hisham Cholakkal

Exploring User Perceptions of Virtual Reality Scene Design in Metaverse Learning Environments, Rahatara Ferdousi, Mohd Faisal, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb El Saddik

Generalizing to Unseen Domains in Diabetic Retinopathy Classification, Chamuditha Jayanga Galappaththige, Gayal Kuruppu, and Muhammad Haris Khan

PDF

CoNIC Challenge: Pushing the frontiers of nuclear detection, segmentation, classification and counting, Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, and Min Xu

Visual Out-of-Distribution Detection in Open-Set Noisy Environments, Rundong He, Zhongyi Han, Xiushan Nie, Yilong Yin, and Xiaojun Chang

Artificial Intelligence-based intrusion detection system for V2V communication in vehicular adhoc networks, Abizar Khalil, Haleem Farman, Moustafa M. Nasralla, Bilal Jan, and Jamil Ahmad

CamoFocus: Enhancing Camouflage Object Detection with Split-Feature Focal Modulation and Context Refinement, Abbas Khan, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El Saddik, Giulia De Masi, and Fakhri Karray

SpotCrack: Leveraging a Lightweight Framework for Crack Segmentation in Infrastructure, Abbas Khan, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El Saddik, Guilia De Masi, and Fakhri Karray

Link

Improving Pseudo-Labelling and Enhancing Robustness for Semi-Supervised Domain Generalization, Adnan Khan, Mai A. Shaaban, and Muhammad Haris Khan

Skin-Former: Mobile-Friendly Transformer for Skin Lesion Diagnosis, Mustaqeem Khan, Jamil Ahmad, Abdulmotaleb El Saddik, and Wail Gueaieb

Link

MSER: Multimodal speech emotion recognition using cross-attention with deep fusion, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El Saddik, and Soonil Kwon

Action Knowledge Graph for Violence Detection Using Audiovisual Features, Mustaqeem Khan, Muhammad Saad, Abbas Khan, Wail Gueaieb, Abdulmotaleb El Saddik, Giulia De Masi, and Fakhri Karray

STT-Net: Simplified Temporal Transformer for Emotion Recognition, Mustaqeem Khan, Abdulmotaleb El Saddik, Mohamed Deriche, and Wail Gueaieb

VD-Net: An Edge Vision-Based Surveillance System for Violence Detection, Mustaqeem Khan, Abdulmotaleb El Saddik, Wail Gueaieb, Giulia De Masi, and Fakhri Karray

Multiclass Alignment of Confidence and Certainty for Network Calibration, Vinith Kugathasan and Muhammad Haris Khan

Link

Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection, Long Li, Junwei Han, Nian Liu, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, and Fahad Shahbaz Khan

PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping, Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, and Xiaochun Cao

VST++: Efficient and Stronger Visual Saliency Transformer, Nian Liu, Ziyang Luo, Ni Zhang, and Junwei Han

Link

Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning, Zhe Liu, Yun Li, Lina Yao, Xiaojun Chang, Wei Fang, Xiaojun Wu, and Abdulmotaleb El Saddik

Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation, Yixing Lu, Zhaoxin Fan, and Min Xu

Link

Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection, Neelu Madan, Nicolae Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Sensor Allocation and Online-Learning-Based Path Planning for Maritime Situational Awareness Enhancement: A Multi-Agent Approach, Bach Long Nguyen, Anh Dzung Doan, Tat Jun Chin, Christophe Guettier, Surabhi Gupta, Estelle Parra, Ian Reid, and Markus Wagner

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection, Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Salman Khan, and Fahad Shahbaz Khan

Remote Sensing Change Detection With Transformers Trained From Scratch, Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Sanath Narayan, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan

Dataset for Automatic Region-based Coronary Artery Disease Diagnostics Using X-Ray Angiography Images, Maxim Popov, Akmaral Amanturdieva, Nuren Zhaksylyk, Alsabir Alkanov, Adilbek Saniyazbekov, Temirgali Aimyshev, Eldar Ismailov, and Ablay Bulegenov

Multi-task Learning Approach for Unified Biometric Estimation from Fetal Ultrasound Anomaly Scans, Mohammad Areeb Qazi, Mohammed Talha Alam, Ibrahim Almakky, Werner Gerhard Diehl, Leanne Bricker, and Mohammad Yaqub

Link

PECon: Contrastive Pretraining to Enhance Feature Alignment Between CT and EHR Data for Improved Pulmonary Embolism Diagnosis, Santosh Sanjeev, Salwa K. Al Khatib, Mai A. Shaaban, Ibrahim Almakky, Vijay Ram Papineni, and Mohammad Yaqub

RespiroDynamics: A Multifaceted Dataset for Enhanced Lung Health Assessment Using Deep Learning, Ahmed Sharshar, Muhammad Sharshar, Hosam Elhady, Ahmed Aboeitta, Youssef Nafea, Yasser Ashraf, Mohammad Yaqub, and Mohsen Guizani

Create Your World: Lifelong Text-to-Image Diffusion, Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, and Yang Cong

Link

Smart Contract as a Service: A Paradigm of Reusing Smart Contract in Web3 Ecosystem, Jinghan Sun, Abdulmotaleb El Saddik, and Wei Cai

TransGOP: Transformer-Based Gaze Object Prediction, Binglu Wang, Chenxi Guo, Yang Jin, Haisheng Xia, and Nian Liu

BEVRefiner: Improving 3D Object Detection in Bird’s-Eye-View via Dual Refinement, Binglu Wang, Haowen Zheng, Lei Zhang, Nian Liu, Rao Muhammad Anwer, Hisham Cholakkal, Yongqiang Zhao, and Zhijun Li

MedSegDiff-V2: Diffusion-Based Medical Image Segmentation with Transformer, Junde Wu, Wei Ji, Huazhu Fu, Min Xu, Yueming Jin, and Yanwu Xu

Link

Contextual Dependency Vision Transformer for spectrogram-based multivariate time series analysis, Jieru Yao, Longfei Han, Kaihui Yang, Guangyu Guo, Nian Liu, Xiankai Huang, Zhaohui Zheng, Dingwen Zhang, and Junwei Han

Category-Contextual Relation Encoding Network for Few-Shot Object Detection, Ating Yin, Yaonan Wang, Jianxu Mao, Hui Zhang, Xiuyi Chen, and Jiahua Dong

Link

CADC++: Advanced Consensus-Aware Dynamic Convolution for Co-Salient Object Detection, Ni Zhang, Nian Liu, Fang Nan, and Junwei Han

Submissions from 2023

Link

On the Effects of Filtering Methods on Adversarial Timeseries Data, Mubarak G. Abdu-Aguye and Karthik Nandakumar

Link

Recurrence-based Disentanglement for Detecting Adversarial Attacks on Timeseries Classifiers, Mubarak G. Abdu-Aguye and Karthik Nandakumar

Link

Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing, Daniya Najiha Abdul Kareem, Mustansar Fiaz, Noa Novershtern, Jacob Hanna, and Hisham Cholakkal

PDF

Metaverse Key Requirements and Platforms Survey, Akbobek Abilkaiyrkyzy, Ahmed Elhagry, Fedwa Laamarti, and Abdulmotaleb El Saddik

PDF

Digital Twin of Atmospheric Environment: Sensory Data Fusion for High-Resolution PM2.5 Estimation and Action Policies Recommendation, Kudaibergen Abutalip, Anas Al-lahham, and Abdulmotaleb Elsaddik

Improving Stain Invariance of CNNs for Segmentation by Fusing Channel Attention and Domain-Adversarial Training, Kudaibergen Abutalip, Numan Saeed, Mustaqeem Khan, and Abdulmotaleb El Saddik

PDF

Suitability of SDN and MEC to facilitate digital twin communication over LTE-A, Hikmat Adhami, Mohammad Alja'afreh, Mohamed Hoda, Jiaqi Zhao, Yong Zhou, and Abdulmotaleb Elsaddik

PDF

Smart Street Light Control: A Review on Methods, Innovations, and Extended Applications, Fouad Agramelal, Mohamed Sadik, Youssef Moubarak, and Saad Abouzahir

PDF

ARL-Wavelet-BPF optimization using PSO algorithm for bearing fault diagnosis, Muhammad Ahsan, Dariusz Bismor, and Muhammad Arslan Manzoor

Link

Accelerated MRI Reconstruction via Dynamic Deformable Alignment Based Transformer, Wafa Alghallabi, Akshay Dudhane, Waqas Zamir, Salman Khan, and Fahad Shahbaz Khan

Link

CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representations, Muhammad Ali and Salman Khan

Link

Underwater Object Detection Enhancement via Channel Stabilization, Muhammad Ali and Salman Khan

Link

Cybersecurity in the Metaverse: Challenges and Approaches, Mohammad Alja'Afreh, Ranwa Al Mallah, Ali Karime, and Abdulmotaleb El Saddik

Link

Metaverse through Blockchain and Intelligent Networking: A Comprehensive Survey, Mohammad Alja'Afreh, Sahel Alouneh, Muath Obaidat, Ali Karime, and Abdulmotaleb Elsaddik

Link

A Detailed Analysis of Qualitative and Quantitative Factors in Realization of 6G Communication, Mohammad Alja'afreh, Ali Karime, Sahel Alouneh, and Abdulmotaleb El Saddik

Link

Incorporating MPLS for Better SoC Utilization and Traffic Engineering, Mohammad Alja'Afreh, Muath Obaidat, and Sahel Alouneh

Link

Optimizing System-on-Chip Performance Using AI and SDN: Approaches and Challenges, Mohammad Alja'afreh, Muath Obaidat, Ali Karime, and Sahel Alouneh

3D Instance Segmentation via Enhanced Spatial and Semantic Supervision, Salwa Al Khatib, Mohamed El Amine Boudjoghra, Jean Lahoud, and Fahad Shahbaz Khan

FedSIS: Federated Split Learning with Intermediate Representation Sampling for Privacy-preserving Generalized Face Presentation Attack Detection, Naif Alkhunaizi, Koushik Srivatsan, Faris Almalik, Ibrahim Almakky, and Karthik Nandakumar

Link

FeSViBS: Federated Split Learning of Vision Transformer with Block Sampling, Faris Almalik, Naif Alkhunaizi, Ibrahim Almakky, and Karthik Nandakumar

Link

Artificial Intelligence f or Cybersecurity in IoT-enabled Avionics: Challenges and Solutions, Ranwa Al Mallah, Talal Halabi, Mohammad Alja'afreh, and Ali Karime

Link

Anchor-ReID: A Test Time Adaptation for Person Re-identification, Mohammed Almansoori, Mustansar Fiaz, and Hisham Cholakkal

Link

A Unified Model for Face Matching and Presentation Attack Detection using an Ensemble of Vision Transformer Features, Rouqaiah Al-Refai and Karthik Nandakumar

PDF

Transformer-Based Feature Fusion Approach for Multimodal Visual Sentiment Recognition Using Tweets in the Wild, Fatimah Alzamzami and Abdulmotaleb El Saddik

PDF

Towards Enabling Haptic Communications over 6G: Issues and Challenges, Muhammad Awais, Fasih Ullah Khan, Muhammad Zafar, Muhammad Mudassar, Muhammad Zaigham Zaheer, Khalid Mehmood Cheema, Muhammad Kamran, and Woo Sung Jung

Link

Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy, Sanoojan Baliah, Fadillah A. Maani, Santosh Sanjeev, and Muhammad Haris Khan

XMem++: Production-level Video Segmentation From Few Annotated Frames, Maksym Bekuzarov, Ariana Bermudez, Joon Young Lee, and Hao Li

Link

Leveraging Self-supervised Learning for Fetal Cardiac Planes Classification Using Ultrasound Scan Videos, Joseph Geo Benjamin, Mothilal Asokan, Amna Alhosani, Hussain Alasmawi, Werner Gerhard Diehl, Leanne Bricker, Karthik Nandakumar, and Mohammad Yaqub

Link

Deteriorated image classification model for malayalam palm leaf manuscripts, B. J. Bipin Nair, N. Shobha Rani, and Mustaqeem Khan

FUSQA: Fetal Ultrasound Segmentation Quality Assessment, Sevim Cengiz, Ibrahim Almakky, and Mohammad Yaqub

Link

Web3 Metaverse: State-of-the-Art and Vision, Hongzhou Chen, Haihan Duan, Maha Abdallah, Yufeng Zhu, Yonggang Wen, Abdulmotaleb El Saddik, and Wei Cai

Link

Generalizing Across Domains in Diabetic Retinopathy via Variational Autoencoders, Sharon Chokuwa and Muhammad H. Khan

Link

RadarFormer: Lightweight and Accurate Real-Time Radar Object Detection Model, Yahia Dalbah, Jean Lahoud, and Hisham Cholakkal

Link

Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes, Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, and Fahad Khan

Salient Mask-Guided Vision Transformer for Fine-Grained Classification, Dmitry Demidov, Muhammad Hamza Sharif, Aliakbar Abdurahimov, Hisham Cholakkal, and Fahad Shahbaz Khan

Link

Fama–French three versus five, which model is better? A machine learning approach, Boubacar Diallo, Aliyu Bagudu, and Qi Zhang

PDF

Burstormer: Burst Image Restoration and Enhancement Transformer, Akshay Dudhane, Syed Waqas Zamir, Salman Khan, Fahad Shahbaz Khan, and Ming Hsuan Yang

3D Indoor Instance Segmentation in an Open-World, Mohamed El Amine Boudjoghra, Salwa K. Al Khatib, Jean Lahoud, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, and Fahad Shahbaz Khan

Link

Text-to-Metaverse: Towards a Digital Twin-Enabled Multimodal Conditional Generative Metaverse, Ahmed Elhagry

Link

CEAFFOD: Cross-Ensemble Attention-based Feature Fusion Architecture Towards a Robust and Real-time UAV-based Object Detection in Complex Scenarios, Ahmed Elhagry, Hang Dai, Abdulmotaleb El Saddik, Wail Gueaieb, and Giulia De Masi

Link

The Integration of ChatGPT with the Metaverse for Medical Consultations, Abdulmotaleb El Saddik and Sara Ghaboura

PDF

Digital Twin Haptic Robotic Arms: Towards Handshakes in the Metaverse, Mohd Faisal, Fedwa Laamarti, and Abdulmotaleb El Saddik

Towards Instance-adaptive Inference for Federated Learning, Chun Mei Feng, Kai Yu, Nian Liu, Xinxing Xu, Salman Khan, and Wangmeng Zuo

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning, Chun Mei Feng, Kai Yu, Yong Liu, Salman Khan, and Wangmeng Zuo

Link

SAT: Scale-Augmented Transformer for Person Search, Mustansar Fiaz, Hisham Cholakkal, Rao Muhammad Anwer, and Fahad Shahbaz Khan

PDF

Digital Twin for Railway: A Comprehensive Survey, Sara Ghaboura, Rahatara Ferdousi, Fedwa Laamarti, Chunsheng Yang, and Abdulmotaleb El Saddik

Link

Breaking down the Hierarchy: A New Approach to Leukemia Classification, Ibraheem Hamdi, Hosam El-Gendy, Ahmed Sharshar, Mohamed Saeed, Muhammad Ridzuan, Shahrukh K. Hashmi, Naveed Syed, Imran Mirza, Shakir Hussain, Amira Mahmoud Abdalla, and Mohammad Yaqub

Link

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation, Asif Hanif, Muzammal Naseer, Salman Khan, Mubarak Shah, and Fahad Shahbaz Khan

PDF

Self-omics: A Self-supervised Learning Framework for Multi-omics Cancer Data, Sayed Hashim, Karthik Nandakumar, and Mohammad Yaqub

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization, Jameel Hassan, Hanan Gani, Noor Hussein, Muhammad Uzair Khattak, Muzammal Naseer, Fahad Shahbaz Khan, and Salman Khan

Link

Cascaded structure tensor for robust baggage threat detection, Taimur Hassan, Samet Akcay, Bilal Hassan, Mohammed Bennamoun, Salman Khan, Jorge Dias, and Naoufel Werghi

Link

Strong Gravitational Lensing Parameter Estimation with Vision Transformer, Kuan Wei Huang, Geoff Chih Fan Chen, Po Wen Chang, Sheng Chieh Lin, Chia Jung Hsu, Vishal Thengane, and Joshua Yao Yu Lin

Link

SEDA: Self-ensembling ViT with Defensive Distillation and Adversarial Training for Robust Chest X-Rays Classification, Raza Imam, Ibrahim Almakky, Salma Alrashdi, Baketah Alrashdi, and Mohammad Yaqub

Link

StrokeNet: An automated approach for segmentation and rupture risk prediction of intracranial aneurysm, Muhammad Irfan, Khalid Mahmood Malik, Jamil Ahmad, and Ghaus Malik

PDF

TC-Net: A Modest & Lightweight Emotion Recognition System Using Temporal Convolution Network, Muhammad Ishaq, Mustaqeem Khan, and Soonil Kwon