Follow

Submissions from 2021

Link

Understanding more about human and machine attention in deep neural networks, Qiuxia Lai, Salman Khan, Yongwei Nie, Hanqiu Sun, Jianbing Shen, and Ling Shao

Link

Anchor-free 3D single stage detector with mask-guided attention for point cloud, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

Link

From voxel to point: IoU-guided 3D object detection for point cloud with voxel-to-point decoder, Jiale Li, Hang Dai, Ling Shao, and Yong Ding

PDF

P2V-RCNN: point to voxel feature learning for 3D object detection from point clouds, Jiale Li, Yu Sun, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao

Link

C4AV: learning cross-modal representations from transformers, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

Link

M3DSSD: monocular 3D single stage object detector, Shujie Luo, Hang Dai, Ling Shao, and Yong Ding

PDF

Multi-modal Transformers Excel at Class-agnostic Object Detection, Muhammad Maaz, Hanoona Bangalath Rasheed, Salman Hameed Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, and Ming-Hsuan Yang

Link

SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection, Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, and Mohsen Ali

PDF

Discriminative region-based multi-label zero-shot learning, Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, and Mubarak Shah

PDF

On Generating Transferable Targeted Perturbations, Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Fatih Porikli

Link

Intriguing properties of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

On improving adversarial transferability of vision transformers, Muzammal Naseer, Kanchana Ranasinghe, Salman Khan, Fahad Shahbaz Khan, and Fatih Porikli

Link

Cancelable biometrics vault: A secure key-binding biometric cryptosystem based on chaffing and winnowing, Osama Ouda, Karthik Nandakumar, and Arun Ross

Link

Preface, Bartłomiej W. Papież, Mohammad Yaqub, Jianbo Jiao, Ana I.L. Namburete, and J. Alison Noble

PDF

Orthogonal Projection Loss, Kanchana Ranasinghe, Muzammal Naseer, Munawar Hayat, Salman Khan, and Fahad Shahbaz Khan

Link

Self-supervised video transformer, Kanchana Ranasinghe, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, and Michael Ryoo

Link

Self-supervised predictive convolutional attentive block for anomaly detection, Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, and Mubarak Shah

Link

CyTran: Cycle-consistent transformers for non-contrast to contrast CT translation, Nicolae-Catalin Ristea, Andreea-Iuliana Miron, Olivian Savencu, Mariana-Iuliana Georgescu, Nicolae Verga, Fahad Shahbaz Khan, and Radu Tudor Ionescu

PDF

Exploring complementary strengths of invariant and equivariant representations for few-shot learning, Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, and Mubarak Shah

Link

Efficient encrypted inference on ensembles of decision trees, Kanthi Kiran Sarpatwar, Karthik Nandakumar, Nalini K. Ratha, James T. Rayfield, Karthikeyan Shanmugam, Roman Vaculin, and Sharath U. Pankanti

Link

Continual domain incremental learning for chest X-ray classification in low-resource clinical settings, Shikhar Srivastava, Mohammad Yaqub, Karthik Nandakumar, Zongyuan Ge, and Dwarikanath Mahapatra

Link

Background/foreground separation: guided attention based adversarial modeling (GAAM) versus robust subspace learning methods, Maryam Sultana, Arif Mahmood, Thierry Bouwmans, Muhammad Haris Khan, and Soon Ki Jung

Link

A human ear reconstruction autoencoder, Hao Sun, Nick Pears, and Hang Dai

PDF

Spatio-temporal relation modeling for few-shot action recognition, Anirudh Thatipelli, Sanath Narayan, Salman Hameed Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Bernard Ghanem

Link

MineGAN++: Mining generative models for efficient knowledge transfer to limited data domains, Yaxing Wang, Abel Gonzalez-Garcia, Chenshen Wu, Luis Herranz, Fahad Shahbaz Khan, Shangling Jui, and Joost Van de Weijer

Link

Automatic extraction of hiatal dimensions in 3-D transperineal pelvic ultrasound recordings, Helena Williams, Laura Cattani, Dominique Van Schoubroeck, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan D'Hooge, and Jan Deprest

Link

An Anomaly Detection System via Moving Surveillance Robots with Human Collaboration, Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Marcella Astrid, and Seung-Ik Lee

PDF

Restormer: Efficient transformer for High-resolution image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang

Link

Multi-stage progressive image restoration, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming Hsuan Yang, and Ling Shao

Link

Learning digital camera pipeline for extreme low-light imaging, Syed Waqas Zamir, Aditya Arora, Salman Khan, Fahad Shahbaz Khan, and Ling Shao

PDF

Deep learning predicts EBV status in gastric cancer based on spatial patterns of lymphocyte infiltration, Baoyi Zhang, Kevin Yao, Min Xu, Jia Wu, and Chao Cheng

Submissions from 2020

Link

SipMask: spatial information preservation for fast image and video instance segmentation, Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao

Link

Question-agnostic attention for visual question answering, Moshiur Farazi, Salman Khan, and Nick Barnes

Link

Attention guided semantic relationship parsing for visual question answering, Moshiur R. Farazi, Salman Khan, and Nick Barnes

Link

From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts, Moshiur R. Farazi, Salman Khan, and Nick Barnes

PDF

Trainable structure tensors for autonomous baggage threat detection under extreme occlusion, Taimur Hassan, Samet Akçay, Mohammed Bennamoun, Salman Khan, and Naoufel Werghi

Link

3D IoU-Net: IoU Guided 3D object detector for point clouds, Jiale Li, Shujie Luo, Ziqi Zhu, Hang Dai, Andrey S. Krylov, Yong Ding, and Ling Shao

Link

D2-Net: Weakly-supervised action localization via discriminative embeddings and denoised activations, Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Minghsuan Yang, and Ling Shao

Link

Preface, Bartłomiej W. Papież, Ana I.L. Namburete, Mohammad Yaqub, and J. Alison Noble

PDF

Any-shot Object Detection, Shafin Rahman, Salman Khan, Nick Barnes, and Fahad Shahbaz Khan

Link

Zero-shot object detection: joint recognition and localization of novel concepts, Shafin Rahman, Salman H. Khan, and Fatih Porikli

Link

Conditional generative modeling via learning the latent space, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould

Link

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds, Sameera Ramasinghe, Moshiur Farazi, Salman Khan, Nick Barnes, and Stephen Gould

Link

Fixing localization errors to improve image classification, Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, and Luc Van Gool

Link

Automatic C-plane detection in pelvic floor transperineal lolumetric ultrasound, Helena Williams, Laura Cattani, Mohammad Yaqub, Carole Sudre, Tom Vercauteren, Jan Deprest, and Jan D’hooge

Link

Count- and Similarity-Aware R-CNN for pedestrian detection, Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, and Ling Shao

Link

Mask-guided attention network and occlusion-sensitive hard example mining for occluded pedestrian detection, Jin Xie, Yanwei Pang, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, and Ling Shao

Link

Learning Enriched Features for Real Image Restoration and Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shabaz Khan, Ming-Hsuan Yang, and Ling Shao