Performance of a K-Means Algorithm Driven by Careful Seeding

Libero Nigro; Franco Cicirelli

SciTePress - Publication Details

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Performance of a K-Means Algorithm Driven by Careful Seeding

Topics: Data Analytics and Simulation; Domain-Specific Tools; Optimization; Performance Analysis

In Proceedings of the 13th International Conference on Simulation and Modeling Methodologies, Technologies and Applications SIMULTECH - Volume 1, 27-36, 2023 , Rome, Italy

Authors: Libero Nigro ¹ and Franco Cicirelli ²

Affiliations: ¹ Engineering Department of Informatics Modelling Electronics and Systems Science, University of Calabria, Rende, Italy ; ² CNR-National Research Council of Italy–Inst. for High Performance Computing and Networking (ICAR), Rende, Italy

Keyword(s): K-Means Clustering, Seeding Procedure, Greedy K-Means++, Clustering Accuracy Indexes, Java Parallel Streams, Benchmark and Real-World Datasets, Execution Performance.

Abstract: This paper proposes a variation of the K-Means clustering algorithm, named Population-Based K-Means (PB-K-MEANS), which founds its behaviour on careful seeding. The new K-Means algorithm rests on a greedy version of the K-Means++ seeding procedure (g_kmeans++), which proves effective in the search for an accurate clustering solution. PB-K-MEANS first builds a population of candidate solutions by independent runs of K-Means with g_kmeans++. Then the reservoir is used for recombining the stored solutions by Repeated K-Means toward the attainment of a final solution which minimizes the distortion index. PB-K-MEANS is currently implemented in Java through parallel streams and lambda expressions. The paper first recalls basic concepts of clustering and of K-Means together with the role of the seeding procedure, then it goes on by describing basic design and implementation issues of PB-K-MEANS. After that, simulation experiments carried out both on synthetic and real-world datasets are rep orted, confirming good execution performance and careful clustering. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 8.209.245.224

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Nigro, L. and Cicirelli, F. (2023). Performance of a K-Means Algorithm Driven by Careful Seeding. In Proceedings of the 13th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - SIMULTECH; ISBN 978-989-758-668-2; ISSN 2184-2841, SciTePress, pages 27-36. DOI: 10.5220/0012045000003546

@conference{simultech23,
author={Libero Nigro and Franco Cicirelli},
title={Performance of a K-Means Algorithm Driven by Careful Seeding},
booktitle={Proceedings of the 13th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - SIMULTECH},
year={2023},
pages={27-36},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012045000003546},
isbn={978-989-758-668-2},
issn={2184-2841},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Simulation and Modeling Methodologies, Technologies and Applications - SIMULTECH
TI - Performance of a K-Means Algorithm Driven by Careful Seeding
SN - 978-989-758-668-2
IS - 2184-2841
AU - Nigro, L.
AU - Cicirelli, F.
PY - 2023
SP - 27
EP - 36
DO - 10.5220/0012045000003546
PB - SciTePress