Zhenqi He , Yuanpei Liu , Kai Han
This repository serves as a supplementary resource for our survey paper on Category Discovery (CD) methods. It includes a comprehensive collection of key papers, frameworks, and approaches in the field of CD, summarizing the most recent advancements and techniques. The materials here aim to provide researchers with an accessible overview of current trends and methodologies in CD, along with references and additional insights to support further exploration.
We will continue to maintain and update this repository with new papers and resources as the field evolves. Contributions are welcome, and we encourage pull requests (PRs) to help expand and improve the content for the community.
Category Discovery (CD) addresses the limitations of the closed-world assumption by embracing an open-world setting. As shown in above figure, CD differs from semi-supervised learning and OSR&OOD by clustering unlabelled data that contains unseen categories. It is motivated by that human beings are capable of discovering unknown species by transferring existing knowledge on explored species. CD proves highly applicable across various real-world scenarios. For example, in autonomous driving, vehicles must continuously detect and classify new objects—such as unfamiliar road signs or obstacles—beyond their initial training to ensure safe navigation. In retail, CD can automatically recognize newly introduced products in supermarkets without the need for manual labeling.
In recent years, CD has garnered increasing attention, leading to a proliferation of research exploring various methodologies and settings. It was initially introduced as Novel Category Discovery (NCD) in to cluster unlabelled novel categories by leveraging knowledge from labelled base categories. This concept was later expanded into Generalized Category Discovery (GCD), which relaxed earlier constraints by assuming that the unlabelled data contains both novel and base categories, thereby more closely mirroring real-world scenarios. Further advancing the field, Han *etal.* proposed Semantic Category Discovery (SCD), aiming to assign semantic labels to unlabelled samples from an unconstrained vocabulary space. Additionally, CD has been applied to complex scenarios such as continual learning, where models learn incrementally over time, and federated learning, which focuses on training models across decentralized devices while ensuring data privacy. CD methods have also been explored in challenging settings, including few-shot learning, where limited labelled data is available, and with imbalanced distribution and domain-shifted data, making CD more applicable to real-world problems.The concept of NCD aims to transfer the knowledge learned from base categories to cluster unlabelled unseen categories, motivated by the observation where a child could easily distinguish novel categories (e.g., birds and elephants) after learning to classify base categories (e.g., dogs and cats).
Formally, given a dataset
This setting presumes that the label spaces of the labelled and unlabelled data are disjoint, i.e.,
| Year | Method | Pub. | Backbone | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|
| 2018 | KCL | ICLR | ResNet | Parametric Classifier | Over-estimate | Omniglot, ImageNeg-1K, Office31 |
| 2019 | MCL | ICLR | ResNet, VGG, LeNet | Parametric Classifier | Over-estimate | Omniglot, CIFAR-10&100, ImageNet-1K, MNIST |
| DTC | ICCV | ResNet, VGG | Soft Assignment |
|
Omniglot, CIFAR-10&100, ImageNet-1K, SVHN | |
| 2020 | RS, RS+ | ICLR | ResNet | Parametric Classifier | Known | Omniglot, CIFAR-10&100, ImageNet-1K, SVHN |
| 2021 | Qing etal. | Neural Networks | ResNet | Parametric Classifier | Known | CIFAR-10&100, SVHN |
| OpenMix | CVPR | ResNet, VGG | Parametric Classifier | Known | CIFAR-10&100, ImageNet-1K | |
| NCL | CVPR | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-1K | |
| JOINT | ICCV | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-1K | |
| UNO | ICCV | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-1K | |
| DualRS | ICCV | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100&1K, SSB | |
| 2022 | SMI | ICASSP | VGG-16 |
|
Known | CIFAR-10&100, ImageNet-1K |
| PSSCNNCD | T'CYB | N/A | BKBH |
Progressive label propagation | Coil20, Yeast, MSRA25, PalmData25, Abalone, USPS, Letter, MNIST | |
| Li etal. | NeurIPSW | ResNet |
|
|
CIFAR-100, ImageNet-1K | |
| 2023 | ResTune | T'NNLS | ResNet |
|
Known | CIFAR-10&100, TinyImageNet |
| SK-Hurt | TMLR | ResNet |
|
|
CIFAR-100, ImageNet-1K | |
| IIC | CVPR | ResNet | Parametric Classifier |
|
CIFAR-10&100, ImageNet-1K | |
| NSCL | ICML | ResNet |
|
|
CIFAR-100, ImageNet-1K | |
| CRKD | ICCV | ResNet, ViT | Parametric Classifier | Known | CIFAR-100, SSB | |
| Feng etal. | MICCAI | ResNet | Parametric Classifier | Known | ISIC2019 | |
| 2024 | RAPL | CVPR | ResNet |
|
Known | SoyAgeing |
| SCKD | ECCV | ResNet, ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, SSB | |
| APL | T'PAMI | ResNet | Parametric Classifier | Known | CIFAR-10&100, Omniglot, ImageNet-1K | |
| PrePrint | Hasan etal. | ArXiv | ResNet | Parametric Classifier |
|
CIFAR-10&100 |
Extending the NCD paradigm, Generalized Category Discovery relaxes the disjointness assumption between the base and novel categories, thereby presenting a more challenging and realistic scenario.
In GCD, the labelled and unlabelled datasets may share common categories, i.e.,
Notably, an equivalent formulation has been introduced by Cao etal. under the designation of Open-World Semi-Supervised Learning. In the following context, we refer to both formulations under the umbrella term Generalized Category Discovery.
| Year | Method | Pub. | Backbone | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|
| 2022 | GCD | CVPR | ViT | Semi-$k$-Means |
|
CIFAR-10&100, ImageNet-100, SSB, Herb19 |
| ORCA | CVPR | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, Single-Cell | |
| ComEx | CVPR | ResNet | Parametric Classifier | Known | CIFAR-10&100 | |
| OpenLDN | ECCV | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, TinyImage, Oxford Pets | |
| TRSSL | ECCV | ResNet | Parametric Classifier |
|
CIFAR-10&100, ImageNet-100, TinyImage, Oxford Pets, Scars, Aircrafts | |
| NACH | NeurIPS | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100 | |
| XCon | BMVC | ViT | Semi-$k$-Means |
|
CIFAR-10&100, ImageNet-100, SSB, Oxford Pets | |
| 2023 | OpenCon | TMLR | ResNet | Prototype-based |
|
CIFAR-10&100, ImageNet-100 |
| PromptCAL | CVPR | ViT | Semi-$k$-Means | Known | CIFAR-10&100, ImageNet-100, SSB | |
| DCCL | CVPR | ViT | Infomap | Infomap | CIFAR-10&100, ImageNet-100, CUB, Scars, Oxford Pets | |
| OpenNCD | IJCAI | ResNet | Prototype-based | Prototype Grouping | CIFAR-10&100, ImageNet-100 | |
| SimGCD | ICCV | ViT | Parametric Classifier |
|
CIFAR-10&100, ImageNet-100, SSB, Herb19 | |
| GPC | ICCV | ViT | GMM | GMM | CIFAR-10&100, ImageNet-100, SSB | |
| PIM | ICCV | ViT | Parametric Classifier |
|
CIFAR-10&100, ImageNet-100, CUB, Scars, Herb19 | |
| TIDA | NeurIPS | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, TinyImageNet, Scars, Aircraft | |
| $\mu$GCD | NeurIPS | ResNet, ViT, ViT |
|
Known | Clevr-4 | |
| InfoSieve | NeurIPS | ViT |
|
|
CIFAR-10&100, ImageNet-100, SSB, Oxford Pets, Herb19 | |
| SORL | NeurIPS | ResNet |
|
Known | CIFAR-10&100 | |
| Yang etal. | ICONIP | ViT | Louvain | Louvain | CIFAR-10&100, ImageNet-100, CUB, Scars, Herb19 | |
| 2024 | AMEND | WACV | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, SSB, Herb19 |
| GCA | WACV | ViT | Guided Cluster Aggregation |
|
CIFAR-10&100, ImageNet-100, SSB | |
| SPT-Net | ICLR | ViT, ViT | Parametric Classifier |
|
CIFAR-10&100, ImageNet-100, SSB | |
| LegoGCD | CVPR | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, -1K, SSB, Herb19 | |
| CMS | CVPR | ViT | Agglomerative Clustering | Agglomerative Clustering | CIFAR-100, ImageNet-100, SSB, Herb19 | |
| ActiveGCD | CVPR | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, SSB | |
| TextGCD | ECCV | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, -1K, SSB, Oxford Pets, Flowers102 | |
| LPS | IJCAI | ResNet | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100 | |
| Contextuality-GCD | ICIP | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, -1K, SSB, Herb19 | |
| 2025 | MSGCD | Information Fusion | ViT | Parametric Classifier | Known | CIFAR-100, SSB |
| CPT | IJCV | ViT | Similarity-Based |
|
CIFAR-10&100, ImageNet-100,CUB, Scars, Herb19 | |
| PAL-GCD | AAAI | ViT | Parametric Classifier | DBSCAN | CIFAR-100,ImageNet-100,SSB, Herb19 | |
| DebGCD | ICLR | ViT | Parametric Classifier | DBSCAN | CIFAR-10&100,ImageNet-100&1K,SSB, Herb19,Oxford-Pets | |
| ProtoGCD | T'PAMI | ViT | Parametric Classifier |
|
CIFAR-10&100,ImageNet-100&1K,SSB, Herb19 | |
| MOS | CVPR | ViT | Parametric Classifier | Known | SSB, Oxford-Pets | |
| GET | CVPR | ViT | Parametric Classifier | Known | CIFAR-10&100,ImageNet-100,SSB,Herb19 | |
| AptGCD | CVPR | ViT | Parametric Classifier | Known | CIFAR-10&100,ImageNet-100,SSB,Herb19 | |
| Dai et al | CVPR | ViT | - | Known | SSB, Herb19 | |
| HypCD | CVPR | ViT | - | Known | CIFAR-10&100,ImageNet-100,SSB,Herb19 | |
| PrePrint | CLIP-GCD | ArXiv | ViT | Semi-$k$-Means |
|
CIFAR-10&100, ImageNet-100, -1K, SSB, Flowers102, DomainNet |
| MCDL | ArXiv | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, -1K, CUB, SCars, Herb19 | |
| PNP | ArXiv | ViT | Infomap | Infomap | CIFAR-10&100, ImageNet-100, -1K, SSB, Herb19 | |
| RPIM | ArXiv | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, -1K, CUB, Scars, Herb19 | |
| OpenGCD | ArXiv | ViT | Parametric Classifier |
|
CIFAR-10&100, CUB | |
| ConceptGCD | ArXiv | ViT, ViT | Parametric Classifier |
|
CIFAR-100, ImageNet-100, -1K, SSB, Herb19 | |
| GET | ArXiv | ViT | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100, SSB, Herb19 |
CCD provides a continual setting of category discovery in which new categories are identified sequentially while retaining previously acquired knowledge. CCD presents several distinct scenarios based on the structure of the incoming data.
In the Class Incremental Scenario, the training set
| Year | Method | Pub. | Backbone | Scenario | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|---|
| 2022 | NCDwF | ECCV | ResNet | Class Incremental | Parametric Classifier | Known | CIFAR-10/100, ImageNet-1K |
| FRoST | ECCV | ResNet | Class Incremental | Parametric Classifier | Known | CIFAR-10/100, TinyImageNet | |
| GM | NeurIPS | ResNet | All | Parametric Classifier | Known | CIFAR-100, ImageNet-100, CUB | |
| 2023 | PA-GCD | ICCV | ViT, Resnet | Mixed Incremental | Parametric Classifier | Affinity Propagation | CUB, MIT67, Stanford Dogs, Aircraft |
| MetaGCD | ICCV | ViT | Mixed Incremental |
|
|
CIFAR-10/100, TinyImageNet | |
| iGCD | ICCV | ResNet | Self-Supervised Mixed Incremental | Soft Nearest Neighbor | Density Peaks | CUB, Aircraft, CIFAR-100 | |
| 2024 | Msc-iNCD | ICPR | ViT | Class Incremental | Parametric Classifier | Known | CIFAR-100, ImageNet-100/1K |
| ADM | AAAI | ResNet | Class Incremental | Parametric Classifier | Known | CIFAR-10/100, TinyImageNet | |
| PromptCCD | ECCV | ViT | Mixed Incremental | GMM | GMP | CIFAR-100, ImageNet-100, TinyImageNet | |
| DEAN | ECCV | ViT | Mixed Incremental | Parametric Classifier | Affinity Propagation | CUB, Aircraft, CIFAR-100 | |
| CAMP | ECCV | ViT | Self-Supervised Mixed Incremental | Nearest Centroid Classifier | Known | CUB, Aircraft, SCars, DomainNet, CIFAR-100 | |
| Happy | NeurIPS | ViT | Mixed Incremental | Parametric Classifier | Silhouette Score | CIFAR-100, ImageNet-100, TinyImageNet, CUB | |
| Preprint | FEA | ArXiv | ViT | Class Incremental | Parametric Classifier | Known | CIFAR-10/100, TinyImageNet |
OCD extends conventional category discovery to an inductive learning paradigm with streaming inference. It trains on a labelled support set
| Year | Method | Pub. | Backbone | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|
| 2023 | SMILE | CVPR | ViT | Hash-based | Hash-coding | CIFAR-10&100, ImageNet-100, CUB, Scars, Herb19 |
| 2024 | PHE | NeurIPS | ViT | Hamming Ball-Based | Hamming Ball-Based | CUB, Scars, Oxford Pets, Food-101, iNaturalist |
This setting relaxes the conventional assumption that both labelled and unlabelled data are drawn from the same semantic domain. Formally, let
| Year | Method | Pub. | Backbone | $ \Omega_{\mathcal{U}} $ | Label Assignment | # Unlabelled categories | Dataset | $ \mathcal{Y_L} \cap \mathcal{Y_U} $ |
|---|---|---|---|---|---|---|---|---|
| 2022 | Yu etal. | AAAI | ResNet | Single New Domain | Parametric Classifier |
|
Office, OfficeHome, VisDA | $ \varnothing $ |
| SCDA | ICME | ResNet | Multiple New Domains | Parametric Classifier |
|
Office, OfficeHome, DomainNet | $ \varnothing $ | |
| 2023 | SAN | ICCV | ResNet | Single New Domain | Parametric Classifier | N/A | Office, OfficeHome, VisDA, DomainNet | $ \varnothing $ |
| 2024 | CDAD-Net | CVPRW | ViT | Single New Domain | Semi-$k$-Means | Elbow | OfficeHome, PACS, DomainNet, CIFAR-10&100, ImageNet-100 | $ \neq \varnothing $ |
| 2025 | HiLo | ICLR | ViT | Multiple new Domains | Parametric Classifier |
|
DomainNet, SSB-C | $ \neq \varnothing $ |
| ArXiv | Wang etal. | ArXiv | ViT | Single New Domain | Parametric Classifier | Known | CIFAR-10, OfficeHome, DomainNet | $ \varnothing $ |
DA-CD eliminates the requirement for a balanced distribution imposed on both labelled and unlabelled data in conventional category discovery. Instead, it acknowledges that the data may follow a skewed distribution, such that for certain categories
| Year | Method | Pub. | Backbone | Scenario | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|---|
| 2023 | NCDLR | TMLR | ViT | Long-tailed Distribution for $ \mathcal{Y_L} & \mathcal{Y_U} $ | Parametric Classifier |
|
CIFAR-10, ImageNet-100, Herb19, iNaturalist18 |
| ImbaGCD | CVPRW | Resnet | Imbalanced Distribution for $ \mathcal{Y_U} $ | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100 | |
| GCDLR | ICCVW | Resnet | Imbalanced Distribution for $ \mathcal{Y_U} $ | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100 | |
| BYOP | CVPR | ResNet | Imbalanced Distribution for $ \mathcal{Y_U} $ | Parametric Classifier | Known | CIFAR-10&100, TinyImageNet | |
| BaCon | NeurIPS | ViT | Long-tailed Distribution for $ \mathcal{Y_L} & \mathcal{Y_U} $ |
|
Known | CIFAR-10&100-LT, ImageNet-100-LT, Places-LT | |
| 2024 | Fan etal | CVPR | ViT | Long-tailed Distribution for $ \mathcal{Y_L} & \mathcal{Y_U} $ |
|
Spectral graph | BioMedical Datasets |
In contrast to NCD and GCD, which focus solely on grouping visually similar images without considering their semantic meaning, SCD extends these paradigms by also assigning a semantic label to each unlabelled instance. Specifically, SCD leverages an open vocabulary label space to achieve this goal. In this context, WordNet, comprising approximately 68,000 labels, is employed as a comprehensive and unconstrained vocabulary, facilitating the assignment of meaningful semantic labels.
| Year | Method | Pub. | Backbone | Word Space | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|---|
| 2024 | SCD | CVPRW | ViT | ~Open | KMeans+Top-$k$ Voting | Known | ImageNet-100&1K, SCars, CUB |
| SNCD | AAAI | ResNet | $\mathcal{C}{base} + \mathcal{C}{novel}$ | Parametric Classifier | Known | CIFAR-10&100, ImageNet-100 |
FS-CD addresses the challenge of identifying novel classes when only a very limited amount of labelled data is available. This setting extends traditional category discovery by integrating the principles of few-shot learning. In particular, FSCD adopts an
Chi etal. extend NCD to a few-shot setting by linking it to meta-learning, based on the shared assumption that base and novel categories possess high-level semantic features. By adapting meta-learning techniques such as Model-Agnostic Meta-Learning and Prototypical Networks (ProtoNet), their approach shifts the focus from classification to clustering tasks—a critical adjustment for few-shot category discovery. A key innovation is the introduction of the Clustering-rule-aware Task Sampler, which ensures that training tasks adhere to consistent clustering rules, thereby enabling the model to generalize better to novel categories despite the limited labelled data. However, this method assumes that the number of novel categories is known in advance.
FCD extends Category Discovery in a federated learning setting, facilitating decentralized and collaborative model training among clients while safeguarding data privacy.
| Year | Method | Pub. | Backbone | Label Assignment | # Unlabelled categories | Dataset |
|---|---|---|---|---|---|---|
| 2023 | FedoSSL | ICML | ResNet | Parametric Classifier | Known | CIFAR-10/100, CINIC-10 |
| 2024 | FedGCD | CVPR | ViT | GMM | Semi-FINCH | CIFAR-10/100, ImageNet-100, CUB, SCars, Pets |
| Preprint | GAL | ArXiv | ResNet&34 | Parametric Classifier | Potential Prototype Merge | CIFAR-100, TinyImageNet, ImageNet-100 |


