Authors

* External authors

Venue

Date

Share

Open-Set Object Detection By Aligning Known Class Representations

Vishal Chudasama

Naoyuki Onoe*

Pankaj Wasnik

Hiran Sarkar

Vineeth N Balasubramanian

* External authors

WACV-24

2025

Abstract

Open-Set Object Detection (OSOD) has emerged as a contemporary research direction to address the detection of unknown objects. Recently, few works have achieved remarkable performance in the OSOD task by employing contrastive clustering to separate unknown classes. In contrast, we propose a new semantic clustering-based approach to facilitate a meaningful alignment of clusters in semantic space and introduce a class decorrelation module to enhance inter-cluster separation. Our approach further incorporates an object focus module to predict objectness scores, which enhances the detection of unknown objects. Further, we employ i) an evaluation technique that penalizes lowconfidence outputs to mitigate the risk of misclassification of the unknown objects and ii) a new metric called HMP that combines known and unknown precision using harmonic mean. Our extensive experiments demonstrate that the proposed model achieves significant improvement on the MS-COCO & PASCAL VOC dataset for the OSOD task.

Related Publications

Precise Event Spotting in Sports Videos: Solving Long-Range Dependency and Class Imbalance

CVPR, 2025
Sanchayan Santra, Vishal Chudasama, Pankaj Wasnik, Vineeth N Balasubramanian

Precise Event Spotting (PES) aims to identify events and their class from long, untrimmed videos, particularly in sports. The main objective of PES is to detect the event at the exact moment it occurs. Existing methods mainly rely on features from a large pre-trained network…

Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction

NAACL, 2025
Kritarth Prasad, Mohammadi Zaki, Pratik Singh, Pankaj Wasnik

Ensembling neural machine translation (NMT) models to produce higher-quality translations than the $L$ individual models has been extensively studied. Recent methods typically employ a candidate selection block (CSB) and an encoder-decoder fusion block (FB), requiring infere…

Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection

CVPRW, 2025
Ayush Ghadiya, Purbayan Kar, Vishal Chudasama, Pankaj Wasnik

Recently, weakly supervised video anomaly detection (WS-VAD) has emerged as a contemporary research direction to identify anomaly events like violence and nudity in videos using only video-level labels. However, this task has substantial challenges, including addressing imba…

  • HOME
  • Publications
  • Open-Set Object Detection By Aligning Known Class Representations

JOIN US

Shape the Future of AI with Sony AI

We want to hear from those of you who have a strong desire
to shape the future of AI.