# Implementation of Clustering and Similarity Analysis for Detecting Content Similarity in Student Final Projects > Ilham A.A. URL kanonis: https://discover.unhas.ac.id/publications/implementation-of-clustering-and-similarity-analysis-for-detecting-content-simil Jurnal / Konferensi: Iop Conference Series Materials Science and Engineering Tahun terbit: 2020 DOI: https://doi.org/10.1088/1757-899X/875/1/012039 ISSN: 17578981 Citations: 3 ## Authors - Ilham A.A. ## Abstract Abstract To finish study, students are requested to submit final projects. In some universities, the final projects are not necessary to be submitted for publication. The final project reports are stored in a local database. As the number of final projects is growing in the local database, similar contents may exist among the documents. The commercial tools cannot be used to detect the content similarity since the documents are not published. This paper proposed a system to detect content similarity in documents that are stored in a local database. Considering the number of stored documents, this similar content detection system implements two step processes. First, clustering documents to find most related documents. Second, finding content similarity among the selected documents. The experiment results show that the system is successfully clustering documents and detecting content similarity by implementing TF-IDF and Cosine Similarity algorithms. This system is limited to proceed documents that are written in Bahasa. ## Keywords - Similarity (geometry) - Cluster analysis - Cosine similarity - Computer science - Information retrieval - Content (measure theory) - Document clustering - Database - Data mining - Artificial intelligence - Mathematics - Image (mathematics) - Mathematical analysis --- Sumber: Discover Unhas — RIMS Universitas Hasanuddin. Saat mengutip, gunakan DOI bila tersedia atau URL kanonis di atas.