paint-brush
A Practical Approach to Novel Class Discovery in Tabular Data: Full training procedureby@dataology

A Practical Approach to Novel Class Discovery in Tabular Data: Full training procedure

by Dataology: Study of Data in Computer Science
Dataology: Study of Data in Computer Science HackerNoon profile picture

Dataology: Study of Data in Computer Science

@dataology

Dataology is the study of data. We publish the highest...

May 26th, 2024
Read on Terminal Reader
Read this story in a terminal
Print this story
Read this story w/o Javascript
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

The complete training procedure for Novel Class Discovery (NCD) integrates models, hyperparameter optimization, and novel class estimation through k-fold Cross-Validation. Algorithm 1 provides a simplified overview of this complex process for better understanding.
featured image - A Practical Approach to Novel Class Discovery in Tabular Data: Full training procedure
1x
Read by Dr. One voice-avatar

Listen to this story

Dataology: Study of Data in Computer Science HackerNoon profile picture
Dataology: Study of Data in Computer Science

Dataology: Study of Data in Computer Science

@dataology

Dataology is the study of data. We publish the highest quality university papers & blog posts about the essence of data.

About @dataology
LEARN MORE ABOUT @DATAOLOGY'S
EXPERTISE AND PLACE ON THE INTERNET.
0-item

STORY’S CREDIBILITY

Academic Research Paper

Academic Research Paper

Part of HackerNoon's growing list of open-source research papers, promoting free access to academic material.

Authors:

(1) Troisemaine Colin, Department of Computer Science, IMT Atlantique, Brest, France., and Orange Labs, Lannion, France;

(2) Reiffers-Masson Alexandre, Department of Computer Science, IMT Atlantique, Brest, France.;

(3) Gosselin Stephane, Orange Labs, Lannion, France;

(4) Lemaire Vincent, Orange Labs, Lannion, France;

(5) Vaton Sandrine, Department of Computer Science, IMT Atlantique, Brest, France.

Abstract and Intro

Related work

Approaches

Hyperparameter optimization

Estimating the number of novel classes

Full training procedure

Experiments

Conclusion

Declarations

References

Appendix A: Additional result metrics

Appendix B: Hyperparameters

Appendix C: Cluster Validity Indices numerical results

Appendix D: NCD k-means centroids convergence study

6 Full training procedure

In the previous sections, we presented the models, the hyperparameter optimization and the estimation procedure of the number of novel classes independently. In this section, these components are brought together to form a complete training procedure. To ensure that no prior knowledge about the novel classes is ever used in this process, the number of novel classes is naturally estimated during the k-fold CV introduced in Section 4. As the whole process is quite complex, we try to summarize it in clear terms in this section and in Algorithm 1.


image


image


This paper is available on arxiv under CC 4.0 license.


L O A D I N G
. . . comments & more!

About Author

Dataology: Study of Data in Computer Science HackerNoon profile picture
Dataology: Study of Data in Computer Science@dataology
Dataology is the study of data. We publish the highest quality university papers & blog posts about the essence of data.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Also published here
X REMOVE AD