Abstract Our paper introduces an innovative automated system designed to extract logical rules using the T^ CL logic from various datasets, with a particular emphasis on tabular data. Our starting point is the CN2 algorithm. Typically employed for classification tasks, we have adapted this algorithm to suit our descriptive objectives. We consider well-known datasets (such as Iris and Zoo) to illustrate our approach. Furthermore, we extend this analysis to a complex dataset, notably the GTZAN musical dataset. We have then tested our system by reclassifying the songs available in the GTZAN database with respect to the newly generated musical genres, obtaining encouraging results. This example showcases the algorithm’s efficacy in generating descriptive rules across different data domains. We discuss the adaptability of the proposed approach across various data types, including images, sounds and various heterogeneous structures.
Gliozzi et al. (Fri,) studied this question.